Hacker News Scraper — Stories, Comments & Jobs avatar

Hacker News Scraper — Stories, Comments & Jobs

Pricing

Pay per usage

Go to Apify Store
Hacker News Scraper — Stories, Comments & Jobs

Hacker News Scraper — Stories, Comments & Jobs

Extract Hacker News stories, comments, and job posts. Full-text search via Algolia with date filtering. Scrape top, new, best, Ask HN, Show HN, jobs. Filter by score and author. Uses official Firebase + Algolia APIs. No API key needed. Export JSON, CSV, Excel.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

CryptoSignals Agent

CryptoSignals Agent

Maintained by Community

Actor stats

0

Bookmarked

6

Total users

2

Monthly active users

5 minutes ago

Last modified

Share

Extract Hacker News stories, comments, and job posts with full-text search. Filter by score, comment count, and keywords. No API key needed.

Uses the official HN Firebase API and Algolia Search API for fast, reliable results.


Free Trial Ending April 3

This actor is free while we collect feedback. Starting April 3, 2026, it moves to $4.99/month. Add a payment method at apify.com/billing to keep access.


Features

FeatureDetails
Front Page StoriesScrape top, new, and best stories with scores and metadata
Full-Text SearchSearch all HN content via Algolia — posts, comments, titles
Ask HN / Show HNFilter community questions and project showcases
Job PostsYC startup job listings from "Who is Hiring?" threads
CommentsThreaded comments with author, text, and timestamps
FiltersMinimum score, minimum comment count, keyword matching
User ProfilesKarma, about text, account age, submission history

Input Parameters

ParameterTypeDefaultDescription
categorystring"top"top, new, best, ask, show, jobs, or search
searchQuerystringKeyword for search mode (Algolia full-text search)
scrapeTypestring"stories"stories, comments, or users
maxItemsinteger100Maximum items to return (1–500)
includeCommentsbooleanfalseFetch comment threads for each story
maxCommentsPerStoryinteger50Max comments per story
minScoreinteger0Minimum story score filter
minCommentsinteger0Minimum comment count filter
keywordstringAdditional keyword filter within results

Example Inputs

Get today's top stories

{
"category": "top",
"maxItems": 50
}

Search by keyword

{
"category": "search",
"searchQuery": "large language models",
"maxItems": 100,
"minScore": 50
}

Get Show HN posts with comments

{
"category": "show",
"maxItems": 30,
"includeComments": true,
"maxCommentsPerStory": 20
}

Filter by keyword and comment count

{
"category": "top",
"keyword": "rust",
"minComments": 50,
"maxItems": 25
}

Output Format

Story

{
"title": "Show HN: Open-source LLM that runs on a Raspberry Pi",
"url": "https://github.com/example/tiny-llm",
"score": 487,
"by": "techbuilder",
"descendants": 203,
"time": 1711540800,
"type": "story",
"hnUrl": "https://news.ycombinator.com/item?id=12345678"
}

Comment

{
"text": "This is impressive. I tested it on my Pi 5 and inference takes about 2s per token...",
"by": "commenter",
"time": 1711544400,
"parent": 12345678,
"storyId": 12345678,
"type": "comment"
}

Use Cases

  • Tech trend monitoring — track what developers discuss and upvote
  • Startup research — find YC companies, Show HN launches, and job openings
  • Content curation — build newsletters from high-scoring posts by topic
  • Competitive intelligence — monitor mentions of your product or competitors
  • Recruiting — extract job posts from "Who is Hiring?" threads
  • Market research — analyze sentiment around technologies and frameworks
  • Academic research — study developer opinions and technology adoption

Why This Scraper?

HN has two official APIs (Firebase and Algolia), but neither is optimized for bulk extraction. This Actor combines both:

  • Firebase API for real-time front page, new, and best stories
  • Algolia Search API for full-text search across the full HN archive

FAQ

Do I need an API key? No. Both APIs are public. No authentication needed.

How far back can I search? The full HN archive back to 2006 via Algolia search.

How fresh is the data? Real-time. Firebase reflects the live front page; Algolia indexes new posts within minutes.

What export formats are supported? JSON, CSV, Excel, XML, HTML. Connect via API, webhooks, Zapier, Make, or Google Sheets.

Integrations

  • REST API — trigger runs and fetch results programmatically
  • Webhooks — get notified when scraping completes
  • Zapier / Make — connect to 5,000+ apps
  • Google Sheets — export directly to spreadsheets
  • Slack / Email — set up alerts for trending topics

See all scrapers by CryptoSignals