Hacker News Scraper - Stories, Comments & Search
Pricing
from $0.30 / 1,000 results
Scrape Hacker News stories, comments, polls, jobs, and Ask/Show HN posts. Search by keyword and date range, pull a user's full activity, or fetch any HN list (front page, new, best, ask, show, job). No auth required. Built for AI/ML datasets, tech trend research, and news monitoring.
Developer
NIJ KANANI
Hacker News Scraper
Scrape Hacker News stories, comments, polls, jobs, and Ask/Show HN posts. Search by keyword + date range, pull full HN lists (front page, new, best, ask, show, jobs), or fetch any user's complete activity.
Uses HN's official Algolia and Firebase APIs. No login, no rate-limit nightmares, no HTML scraping.
What you can do
- Keyword search with date range, tag filter, min-points threshold, and sorting by relevance, date, or points
- Pull any HN list: top, new, best, Ask HN, Show HN, jobs
- Get any user's full activity (stories and comments) by username
- Optionally fetch full comment trees, flattened with parent IDs (great for AI training)
Quick start
- Click Try for free
- Pick a mode: `search`, `list`, or `user`
- Enter targets
- Click Start
Input examples
Search for "claude" stories from the last month with 50+ points
{"mode": "search","searchQueries": ["claude"],"tags": "story","sortBy": "points","since": "2026-04-01","minPoints": 50,"maxItems": 500,"includeComments": true}
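For readers curious how such an input could map onto the public HN Algolia search API, here is a minimal sketch. It is not the actor's actual implementation: the function name `build_search_url` and its parameters are hypothetical, and it assumes `since`/`until` are `YYYY-MM-DD` dates interpreted as UTC. The Algolia endpoints expose relevance and date ordering only, so sorting by points is assumed to happen client-side after the results are fetched.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode

ALGOLIA_BASE = "https://hn.algolia.com/api/v1"


def _to_unix(date_str):
    """Convert a YYYY-MM-DD string to a Unix timestamp at 00:00 UTC."""
    dt = datetime.strptime(date_str, "%Y-%m-%d").replace(tzinfo=timezone.utc)
    return int(dt.timestamp())


def build_search_url(query, tags=None, since=None, until=None,
                     min_points=None, sort_by="relevance", hits_per_page=100):
    """Build an HN Algolia search URL from actor-style input (hypothetical helper)."""
    # `search` ranks by relevance; `search_by_date` returns newest first.
    endpoint = "search_by_date" if sort_by == "date" else "search"

    # Date range and points threshold become Algolia numeric filters.
    filters = []
    if since:
        filters.append(f"created_at_i>={_to_unix(since)}")
    if until:
        filters.append(f"created_at_i<={_to_unix(until)}")
    if min_points is not None:
        filters.append(f"points>={min_points}")

    params = {"query": query, "hitsPerPage": hits_per_page}
    if tags:
        params["tags"] = tags
    if filters:
        params["numericFilters"] = ",".join(filters)
    return f"{ALGOLIA_BASE}/{endpoint}?{urlencode(params)}"


print(build_search_url("claude", tags="story", since="2026-04-01", min_points=50))
```

Building the URL separately from the HTTP call keeps the filter logic easy to inspect before any request is sent.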
Pull current front page
{"mode": "list","listType": "topstories","maxItems": 30,"includeComments": true,"maxCommentsPerStory": 50}
Get recent activity for pg and dang
{"mode": "user","users": ["pg", "dang"],"maxItems": 100}
Output (per item)
{"type": "story","id": 12345678,"title": "Show HN: A new way to scrape data","text": null,"author": "username","points": 234,"numComments": 87,"createdAt": "2026-04-15T12:34:56.000Z","url": "https://example.com/article","hnUrl": "https://news.ycombinator.com/item?id=12345678","tags": ["story", "front_page"],"comments": [{"type": "comment","id": 12345679,"text": "Great post!","author": "commenter","points": 12,"createdAt": "2026-04-15T13:00:00.000Z","parentId": 12345678}]}
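Because comments arrive as a flat list with a `parentId` on each entry, a nested thread can be rebuilt on the consumer side if needed. The sketch below assumes only the `id`, `parentId`, and `comments` fields shown in the sample output; the helper name `build_comment_tree` and the added `replies` key are illustrative, not part of the actor's output.

```python
from collections import defaultdict


def build_comment_tree(item):
    """Nest a story's flat `comments` list using each comment's `parentId`."""
    # Group comments by the ID of the item they reply to.
    children = defaultdict(list)
    for comment in item.get("comments", []):
        children[comment["parentId"]].append(comment)

    def attach(node_id):
        # Copy each child and add its own replies under a new `replies` key.
        return [
            {**child, "replies": attach(child["id"])}
            for child in children.get(node_id, [])
        ]

    # Top-level comments are the ones whose parent is the story itself.
    return attach(item["id"])


story = {
    "id": 1,
    "comments": [
        {"id": 2, "parentId": 1, "text": "first"},
        {"id": 3, "parentId": 2, "text": "reply to first"},
        {"id": 4, "parentId": 1, "text": "second"},
    ],
}
tree = build_comment_tree(story)
print([c["id"] for c in tree])  # top-level comment IDs
```

Comments whose parent is missing from the list (for example, a filtered dead comment) are simply not attached anywhere in this sketch.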
Use cases
| Who | Why |
|---|---|
| AI / LLM teams | High-signal tech-discussion training data, filterable by topic and quality (points threshold) |
| Tech journalists | Track what's trending in the dev/startup community |
| Engineers | Dataset of "Show HN" launches, open-source releases, new tools |
| VCs / scouts | Monitor early-stage signals across founders posting on HN |
| SEO researchers | Track tech keywords surfacing in HN search trends |
Tech notes
- Search uses HN's Algolia API: fast, fielded, and supports complex queries
- Lists and item details use HN's Firebase API: official and real-time
- No auth or API key required, and no rate limiting at reasonable request volumes
- Comments are fetched recursively from Firebase for accuracy and full trees
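To illustrate why full comment trees cost one Firebase call per item, here is a rough sketch of a depth-first walk over the public `item/{id}.json` endpoint. It is an assumption about the general approach, not the actor's code: the names `fetch_item` and `flatten_comments` are hypothetical, the walk uses an explicit stack rather than literal recursion, and replies under a deleted or dead comment are skipped along with it.

```python
import json
from urllib.request import urlopen

FIREBASE_ITEM = "https://hacker-news.firebaseio.com/v0/item/{id}.json"


def fetch_item(item_id):
    """Fetch one item from the HN Firebase API (one HTTP call per item)."""
    with urlopen(FIREBASE_ITEM.format(id=item_id), timeout=10) as resp:
        return json.load(resp)  # the API returns null (None) for unknown IDs


def flatten_comments(story_id, fetch=fetch_item, limit=None):
    """Walk a story's `kids` depth-first and return a flat list of comments."""
    flat = []
    root = fetch(story_id) or {}
    # Reverse so that popping from the end visits children in display order.
    stack = list(reversed(root.get("kids", [])))
    while stack and (limit is None or len(flat) < limit):
        item = fetch(stack.pop())
        # Skip missing, deleted, and dead items (and, here, their replies).
        if not item or item.get("deleted") or item.get("dead"):
            continue
        flat.append({
            "id": item["id"],
            "author": item.get("by"),
            "text": item.get("text"),
            "parentId": item.get("parent"),
        })
        stack.extend(reversed(item.get("kids", [])))
    return flat
```

Passing `fetch` as a parameter makes it possible to swap in a cached or mocked fetcher, for example `flatten_comments(1, fetch=my_dict.get)` against a dictionary of pre-downloaded items.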
FAQ
Will I get rate-limited? Unlikely. HN is generous, and the Algolia API is unmetered for reasonable use. Large-scale comment-tree scraping is slower because each item requires a separate Firebase call.
Are deleted/dead items included? No, they're filtered out automatically.
Can I scrape historical data? Yes. Use `since` and `until` in search mode to pull any historical date range. HN's archive goes back to 2007.
Can I schedule it? Yes. Set up an Apify Schedule for daily or hourly trend tracking.