Pricing

from $0.65 / run started

Hacker News Scraper

Reliable Apify actor for scraping public Hacker News sections with HTML-only crawling. Extract rank, title, URL, points, author, age, and comment count in a clean dataset for trend tracking, research, content discovery, and automation.

Pricing

from $0.65 / run started

Rating

0.0

(0)

Developer

Techionik

Actor stats

Bookmarked

Total users

Monthly active users

5 days ago

Last modified

Hacker News Scraper

A fast, lightweight, and marketplace-ready Apify actor for scraping public Hacker News listings using plain HTML parsing.

This actor is built specifically for Hacker News and uses CheerioCrawler instead of a full browser, making it efficient, low-cost, and reliable for structured data extraction. It collects one clean dataset item per post and supports the main Hacker News sections used for trending stories, discovery, monitoring, and research workflows.

Features

Scrapes public Hacker News listing pages
Extracts one clean record per post
Supports multiple Hacker News sections
Automatically paginates until the requested result limit is reached
Uses HTML-only crawling for faster and cheaper runs
Keeps input simple and user-friendly

Supported Page Types

front
newest
ask
show
jobs
best
active
classic

Extracted Fields

Each result may include the following fields:

pageType
rank
title
url
points
author
age
commentsCount

Input

This actor uses a very simple input format:

pageType: The Hacker News section to scrape
maxResults: The maximum number of posts to extract

Example input:

{ "pageType": "best", "maxResults": 10 }

Output

The actor returns one dataset item per Hacker News post.

Example output:

{ "pageType": "best", "rank": 1, "title": "Ghostty is leaving GitHub", "url": "https://mitchellh.com/writing/ghostty-leaving-github", "points": 3400, "author": "WadeGrimridge", "age": "1 day ago", "commentsCount": 1015 }

Notes About the Data

Some Hacker News sections do not always expose the same metadata.

For example, on the jobs page, fields like points, author, or commentsCount may be missing on the page itself. In such cases, the actor returns default values such as 0 or null where appropriate.

This is expected behavior and reflects the actual structure of Hacker News.

Why Use This Actor

Hacker News is mostly server-rendered HTML, which makes it a strong fit for a Cheerio-based scraper.

Benefits of this actor include:

Faster execution than browser-based scrapers
Lower runtime and compute cost
Clean and structured output
Reliable extraction from major Hacker News sections
Good fit for automation, trend tracking, research, and content workflows

Best Use Cases

This actor is useful for:

Tracking trending Hacker News posts
Monitoring top stories by section
Startup and tech news aggregation
Research and content discovery workflows
Lightweight automation and data collection pipelines

Technical Approach

This actor is built with:

Apify
Crawlee CheerioCrawler
Plain HTML parsing
Automatic pagination handling

Because it does not use Playwright or Puppeteer, it is more efficient for Hacker News than a full-browser solution.

Scope

This actor is designed specifically for Hacker News.

It is not a universal news scraper and is not intended for arbitrary websites with different HTML structures. If you need broad website text extraction, a generic content scraper is a better fit. This actor is purpose-built for clean Hacker News post extraction.

Summary

If you need a simple, reliable, and cost-effective Hacker News scraper for Apify, this actor provides a clean structured output with minimal input and efficient HTML-only crawling.

Hacker News Scraper

klondikeking/hacker-news-scraper

Pierrick McD0nald

Hacker News Search Scraper

sthiven_r/hacker-news-search-scraper

Search Hacker News by keyword and get stories (title, URL, points, comments, author, date). For tech monitoring & research.

Wilker Sthiven Rangel Manrique

Hacker News Scraper

vernacular_reservoir/hacker-news-scraper

Scrape Hacker News top, new, best, ask, show and jobs stories. Extract title, URL, score, author, comment count and age. Optionally include top comments. No API key required. Perfect for tech news monitoring and trend analysis.

Aleksandrs

Hacker News Post Scraper

glowing_glove/hacker-news-signal-monitor

Scrape public Hacker News sections and return ranked story rows with titles, URLs, authors, points, ages, and comment counts.

Ushba Khan

Hacker News Story & Comment Scraper

wsgcjj/hacker-news-scraper

Scrape Hacker News top/new/best stories with points, comments, author info, and timestamps. Monitor tech trends, startup news, and developer discussions. Uses official Firebase API for reliable data.

陈俊杰

Hacker News Story Search (Pythia)

apricot_blackberry/pythia-hackernews

Search Hacker News via the Algolia API. Returns up to 50 stories with title, URL, author, points, and comment count. Stories with 100+ points are flagged Notable.

Creator Fusion

Hacker News Scraper

devilscrapes/hacker-news-scraper

Scrape Hacker News stories (top, new, best, ask, show, jobs) plus per-story metadata in one call — title, URL, score, author, comment count, posted-at — export to JSON or CSV. A Hacker News API wrapper that handles pagination, fan-out, retries, and rate-limit pacing.

DevilScrapes

Hacker News Scraper — Stories, Comments & Jobs

cryptosignals/hackernews-scraper

Scrape Hacker News stories, comments, and user profiles — extract title, URL, score, author, comment threads, and submission time. CSV/JSON output.

Web Data Labs

Hacker News Search Scraper

hermes-yuri/hackernews-search-scraper

Search and scrape Hacker News stories, comments with filtering and sentiment signals. Pay per result!

Yuri

Hacker News Scraper

pink_fence/Hacker-News-scraper

Extract Hacker News posts instantly — title, URL, points, author, comments. Supports Top, New, Show HN, Ask HN and Jobs feeds. Pagination built in. Clean JSON output. No API key needed.