Hacker News Scraper - Stories, Comments & Search
Pricing
from $0.30 / 1,000 results
Hacker News Scraper - Stories, Comments & Search
Scrape Hacker News stories, comments, polls, jobs, and Ask/Show HN posts. Search by keyword and date range, pull a user's full activity, or fetch any HN list (front page, new, best, ask, show, job). No auth required. Built for AI/ML datasets, tech trend research, and news monitoring.
Pricing
from $0.30 / 1,000 results
Rating
0.0
(0)
Developer
NIJ KANANI
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
22 days ago
Last modified
Share
๐ Hacker News Scraper
Scrape Hacker News stories, comments, polls, jobs, and Ask/Show HN posts. Search by keyword + date range, pull full HN lists (front page, new, best, ask, show, jobs), or fetch any user's complete activity.
โก Uses HN's official Algolia + Firebase APIs. No login, no rate-limit nightmares, no scraping HTML.
โจ What you can do
- ๐ Keyword search with date range, tag filter, min-points threshold, and sort by relevance/date/points
- ๐ Pull any HN list โ top, new, best, Ask HN, Show HN, jobs
- ๐ค Get any user's full activity โ stories AND comments, by username
- ๐ฌ Optionally fetch full comment trees โ flattened with parent IDs (great for AI training)
๐ Quick start
- Click Try for free
- Pick mode:
search,list, oruser - Enter targets
- Click Start
๐ฅ Input examples
Search for "claude" stories from the last month with 50+ points
{"mode": "search","searchQueries": ["claude"],"tags": "story","sortBy": "points","since": "2026-04-01","minPoints": 50,"maxItems": 500,"includeComments": true}
Pull current front page
{"mode": "list","listType": "topstories","maxItems": 30,"includeComments": true,"maxCommentsPerStory": 50}
Get pg's recent activity
{"mode": "user","users": ["pg", "dang"],"maxItems": 100}
๐ค Output (per item)
{"type": "story","id": 12345678,"title": "Show HN: A new way to scrape data","text": null,"author": "username","points": 234,"numComments": 87,"createdAt": "2026-04-15T12:34:56.000Z","url": "https://example.com/article","hnUrl": "https://news.ycombinator.com/item?id=12345678","tags": ["story", "front_page"],"comments": [{"type": "comment","id": 12345679,"text": "Great post!","author": "commenter","points": 12,"createdAt": "2026-04-15T13:00:00.000Z","parentId": 12345678}]}
๐ฏ Use cases
| Who | Why |
|---|---|
| ๐ค AI / LLM teams | High-signal tech-discussion training data, filterable by topic and quality (points threshold) |
| ๐ฐ Tech journalists | Track what's trending in dev/startup community |
| ๐งโ๐ป Engineers | Dataset of "Show HN" launches, open-source releases, new tools |
| ๐ VCs / scouts | Monitor early-stage signals across founders posting on HN |
| ๐ SEO researchers | Track tech keywords surfacing in HN search trends |
โ๏ธ Tech notes
- Search uses HN's Algolia API โ fast, fielded, supports complex queries
- Lists & item details use HN's Firebase API โ official, real-time
- No auth required, no API key needed, no rate-limit (within reason)
- Comments are fetched recursively from Firebase for accuracy and full trees
โ FAQ
Will I get rate-limited? HN is generous โ the Algolia API is unmetered for reasonable use. Comment-tree scraping at large scale is slower because each item is a separate Firebase call.
Are deleted/dead items included? No โ they're filtered automatically.
Can I scrape historical data?
Yes. Use since and until in search mode to pull any historical date range. HN's archive goes back to 2007.
Schedule it? Yes โ set up an Apify Schedule for daily/hourly trend tracking.