Pricing

From $2.00 / 1,000 Hacker News records

Hacker News Watchlist and Story Monitor

Monitor Hacker News stories and keywords with scores, comments, URLs, authors, and discussion metadata for startup research, tech trend monitoring, and AI agents.

Developer: Skootle

Fast answer: what this Actor is for

Track Hacker News stories and keywords for startup, AI, and developer trend signals with clean records and discussion links.

Run it from the Apify UI for one-off exports.
Schedule it or call it by API for recurring monitoring.
Use the dataset output directly in spreadsheets, automations, and AI agents.

TL;DR

Monitor Hacker News stories and comments across top, new, best, ask, show, and jobs streams. Returns clean structured JSON with story-type enum, ISO timestamps, author karma + account age, and a 300-500 character markdown summary per story. Watchlist mode emits only NEW records since the previous run. Built on HN's official Firebase API. Zero authentication, zero anti-bot, no rate-limit issues in practice.

Try it on a small dataset, then let us know what you think in a review.

What does Hacker News Watchlist do?

Hacker News Watchlist extracts stories and comments from any Hacker News stream (top, new, best, ask, show, jobs). For each story you get: title, URL, external domain, score, comment count, author, author's karma + account age, rank in the stream, story-type enum (story, ask_hn, show_hn, job, poll), and ISO 8601 timestamps.

With fetchComments: true, the actor walks the comment tree per story (configurable cap, max 500 per story) and emits one record per comment with depth, parent ID, body, and author.

Watchlist mode (watchlistMode: true) makes this scraper schedulable. State persists across runs in the actor's key-value store, so a daily cron only emits stories and comments NEW since the last run.
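
The diff behind watchlist mode can be pictured as a set difference against stored IDs. The snippet below is a minimal sketch of that idea, not the actor's actual implementation: `diff_new_records` is a hypothetical helper, and the in-memory `seen_ids` set stands in for the state the actor keeps in its key-value store.

```python
from typing import Iterable


def diff_new_records(records: Iterable[dict], seen_ids: set[str]) -> tuple[list[dict], set[str]]:
    """Return the records whose recordId is not yet seen, plus the updated seen set.

    Hypothetical helper: the real actor persists its state in a key-value store.
    """
    new_records = []
    updated = set(seen_ids)  # copy so the caller's set is not mutated
    for record in records:
        record_id = record["recordId"]
        if record_id not in updated:
            new_records.append(record)
            updated.add(record_id)
    return new_records, updated


# First run: nothing has been seen, so every record is emitted.
run1 = [{"recordId": "hn:story:1"}, {"recordId": "hn:story:2"}]
emitted1, state = diff_new_records(run1, set())

# Second run: only the story missing from the stored state is emitted.
run2 = [{"recordId": "hn:story:2"}, {"recordId": "hn:story:3"}]
emitted2, state = diff_new_records(run2, state)
print([r["recordId"] for r in emitted2])  # ['hn:story:3']
```

Because the state is keyed on the stable record ID, running the same input twice against the same state emits nothing the second time.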

Why scrape Hacker News?

HN moves fast: important threads age out in 12 hours. Watch top, new, best, Show HN, Ask HN, and jobs across the day without keeping 30 tabs open. Useful for founders watching for product mentions, VC scouts watching Show HN for early signal, journalists watching for breaking tech news, and recruiters scanning the monthly Who Is Hiring? thread.

Daily AI-driven HN summary digests, brand-mention alerts on competitor domains, and labeled tech-discourse corpora for LLM training all run off one watchlist feed.

Who needs this?

DevRel teams monitoring HN for new posts about competitor or category technologies
Recruiters scanning the monthly Who Is Hiring? thread plus the jobs stream
Brand and product teams watching for unexpected HN posts about their tools
VC and tech-scouting analysts filtering Show HN for new product launches
AI / LLM teams building training corpora from high-quality tech discourse
AI agents consuming a daily filtered HN digest as a topic-of-interest feed

How to use Hacker News Watchlist

Open the Input tab on the actor page
Pick streams in the streams field (top, new, best, ask, show, jobs). One run handles many.
Set storiesPerStream (default 20)
Leave fetchAuthorProfile enabled (default true) for author karma + account age, or disable it to skip the author lookups
Optionally enable fetchComments and set commentsPerStory (max 500 per story tree)
Optionally set domainAllowlist to filter to specific external domains
Optionally set minScore to ignore low-engagement posts
Optionally enable watchlistMode for daily diffs
Click Start

How much will scraping Hacker News cost?

This actor is priced per event:

Actor Start: $0.002 once per run
Hacker News record (story or comment): tiered, charged per record written

Apify plan	Price per 1,000 records (USD)
FREE	$2.00
BRONZE	$2.50
SILVER	$2.80
GOLD	$3.00
PLATINUM	$3.00
DIAMOND	$3.00

For example, a daily watchlist on the top stream with storiesPerStream: 30 and fetchAuthorProfile: true writes at most 30 story records per run, and you are charged only for the records written. With watchlistMode enabled, repeat runs can emit fewer records because already-seen stories are not written again.
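
As a rough worked example, the estimate for a single run can be computed from the start fee and the tier table. This is a sketch that assumes charges scale linearly with the number of records written and ignores any rounding applied by the platform; `estimate_run_cost` is a hypothetical helper name.

```python
# Prices per 1,000 records in USD, copied from the pricing table.
PRICE_PER_1000 = {
    "FREE": 2.00, "BRONZE": 2.50, "SILVER": 2.80,
    "GOLD": 3.00, "PLATINUM": 3.00, "DIAMOND": 3.00,
}
ACTOR_START_FEE = 0.002  # USD, charged once per run


def estimate_run_cost(records_written: int, plan: str = "FREE") -> float:
    """Start fee plus a linear per-record charge (assumption: no rounding)."""
    return ACTOR_START_FEE + records_written * PRICE_PER_1000[plan] / 1000


# A run that writes 30 story records on the FREE plan.
print(f"${estimate_run_cost(30):.3f}")  # $0.062
```

A watchlist run that finds nothing new would, under the same assumption, cost only the start fee.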

Is it legal to scrape Hacker News?

Yes. Hacker News's Firebase API is explicitly published as a public read-only API for developers (hacker-news.firebaseio.com/v0/). HN encourages programmatic access. There is no authentication, no terms-of-service block on commercial use, and the data (titles, URLs, author handles, scores, public comments) is freely visible to anyone in a browser.

Use the data for research, AI training, brand monitoring, recruiting, and internal analytics. Standard practice is to attribute HN as the source if you republish content, but the API itself is unrestricted.

Maintenance status

Verified 2026-07-29: the published pricing explanation was reconciled to the current per-event price configuration, and the maintained source is checked with its declared TypeScript build and bounded Cloud canary before deployment. A canary is considered successful only when it returns one or more structured story records with no reported run errors.

Examples

Example 1: Daily top-30 digest

{
  "streams": ["top"],
  "storiesPerStream": 30,
  "fetchAuthorProfile": true,
  "watchlistMode": true,
  "maxItems": 30
}

Example 2: New posts above 100 score, last 24h

{
  "streams": ["new"],
  "storiesPerStream": 100,
  "minScore": 100,
  "fetchAuthorProfile": true,
  "maxItems": 30
}

Example 3: Show HN product-launch tracker

{
  "streams": ["show"],
  "storiesPerStream": 50,
  "watchlistMode": true,
  "fetchAuthorProfile": true,
  "maxItems": 50
}

Example 4: Ask HN community question feed

{
  "streams": ["ask"],
  "storiesPerStream": 30,
  "fetchComments": true,
  "commentsPerStory": 30,
  "watchlistMode": true,
  "maxItems": 1000
}

Example 5: Job-listing watchlist

{
  "streams": ["jobs"],
  "storiesPerStream": 100,
  "watchlistMode": true,
  "maxItems": 100
}

Example 6: Brand monitoring (filter by domain)

{
  "streams": ["top", "new"],
  "storiesPerStream": 100,
  "domainAllowlist": ["yourcompany.com", "competitor1.com", "competitor2.com"],
  "watchlistMode": true,
  "maxItems": 50
}

Example 7: AI / LLM corpus build

{
  "streams": ["top"],
  "storiesPerStream": 500,
  "fetchComments": true,
  "commentsPerStory": 100,
  "fetchAuthorProfile": true,
  "maxItems": 50000
}

Run weekly to accumulate a labeled tech-discourse dataset for fine-tuning.

Example 8: Author-trust filter

{
  "streams": ["new"],
  "storiesPerStream": 200,
  "fetchAuthorProfile": true,
  "minScore": 5,
  "maxItems": 100
}

Filter the output downstream for authorAccountAge != 'today' and authorKarma > 100 to skip brand-new spam accounts.
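
The downstream filter described above could look like the following sketch. It assumes the dataset items have already been loaded as dictionaries; `is_trusted_author` is a hypothetical helper, and treating a missing profile as untrusted is a design choice made for this example.

```python
def is_trusted_author(record: dict, min_karma: int = 100) -> bool:
    """Keep stories from accounts that are not brand new and have enough karma."""
    karma = record.get("authorKarma")
    age = record.get("authorAccountAge")
    if karma is None or age is None:
        return False  # profile missing, e.g. fetchAuthorProfile was disabled
    return age != "today" and karma > min_karma


# Illustrative records, not real output.
stories = [
    {"recordType": "hn_story", "author": "a", "authorKarma": 5099, "authorAccountAge": "9y"},
    {"recordType": "hn_story", "author": "b", "authorKarma": 3, "authorAccountAge": "today"},
    {"recordType": "hn_story", "author": "c", "authorKarma": 250, "authorAccountAge": "5mo"},
]
trusted = [s for s in stories if is_trusted_author(s)]
print([s["author"] for s in trusted])  # ['a', 'c']
```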

Input parameters

Field	Type	Default	Description
`streams`	enum[]	`["top"]`	`top`, `new`, `best`, `ask`, `show`, `jobs`. One run handles many.
`storiesPerStream`	int	`20`	Stories to fetch per stream (1-500)
`fetchAuthorProfile`	bool	`true`	Adds author karma + account age. One extra API call per unique author, cached.
`fetchComments`	bool	`false`	Walks the comment tree per story
`commentsPerStory`	int	`0`	Cap on comments fetched per story when `fetchComments` is true (max 500)
`domainAllowlist`	string[]	`[]`	Only emit stories whose external URL matches
`minScore`	int	`0`	Minimum score; stories below it are skipped
`watchlistMode`	bool	`false`	Idempotent diff against KV-stored seen IDs
`maxItems`	int	`50`	Hard cap on records (stories + comments)

Story-type enum

Value	Meaning
`story`	Standard linked story
`ask_hn`	Ask HN: question to the community
`show_hn`	Show HN: project/product launch
`job`	YC company job posting
`poll`	HN poll

Hacker News output format

The dataset has two record types. Filter by recordType.

`hn_story`

Field	Type	Description
`outputSchemaVersion`, `recordType`, `recordId`	string	Discriminated identity
`itemId`, `url`, `hnUrl`	int/string	HN ID + external URL + HN comment-page URL
`storyType`	enum	See enum table
`title`, `text`, `textPlain`	string	Title + body (HTML + stripped)
`externalUrl`, `domain`	string	External link + parsed domain
`author`, `authorKarma`, `authorAccountAge`	string/int/string	Author profile (when `fetchAuthorProfile: true`); accountAge as `'12y'`, `'5mo'`, etc.
`score`, `descendants`, `rank`	int	Score, comment count, position in stream
`stream`	enum	Source stream
`createdAt`, `scrapedAt`	string (ISO 8601)	Story creation time + scrape time
`fieldCompletenessScore`, `agentMarkdown`	int / string	Quality + LLM-ready summary

`hn_comment`

Field	Type	Description
`outputSchemaVersion`, `recordType`, `recordId`	string	Discriminated identity
`itemId`, `url`	int/string	Comment ID + URL
`storyId`, `parentId`, `depth`	int	Tree linkage
`text`, `textPlain`	string	Body (HTML + stripped)
`author`, `createdAt`, `scrapedAt`	string / string (ISO 8601)	Comment author + creation time + scrape time
`fieldCompletenessScore`, `agentMarkdown`	int / string	Quality + LLM-ready summary
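
Since comments arrive as flat records, a consumer usually has to rebuild the thread from the linkage fields. The sketch below shows one way to do that. It assumes a top-level comment carries the story's `itemId` as its `parentId` (as the underlying HN API does); `build_threads` and `render` are hypothetical helpers, and the sample records are illustrative.

```python
from collections import defaultdict


def build_threads(records: list[dict]) -> dict[int, list[dict]]:
    """Map each parent itemId to its direct child comments, in dataset order."""
    children = defaultdict(list)
    for record in records:
        if record.get("recordType") == "hn_comment":
            children[record["parentId"]].append(record)
    return dict(children)


def render(parent_id: int, children: dict[int, list[dict]], indent: int = 0) -> list[str]:
    """Depth-first walk returning one indented line per comment."""
    lines = []
    for comment in children.get(parent_id, []):
        lines.append("  " * indent + f"{comment['author']}: {comment['textPlain']}")
        lines.extend(render(comment["itemId"], children, indent + 1))
    return lines


records = [
    {"recordType": "hn_story", "itemId": 100, "title": "Example story"},
    {"recordType": "hn_comment", "itemId": 101, "storyId": 100, "parentId": 100,
     "author": "alice", "textPlain": "First"},
    {"recordType": "hn_comment", "itemId": 102, "storyId": 100, "parentId": 101,
     "author": "bob", "textPlain": "Reply"},
    {"recordType": "hn_comment", "itemId": 103, "storyId": 100, "parentId": 100,
     "author": "carol", "textPlain": "Second"},
]
thread_lines = render(100, build_threads(records))
print("\n".join(thread_lines))
```

If a comment's parent was cut off by the commentsPerStory cap, its subtree is simply not reachable from the story root in this sketch.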

Hacker News scraper output example (story)

{
  "outputSchemaVersion": "2026-05-08",
  "recordType": "hn_story",
  "recordId": "hn:story:48067119",
  "itemId": 48067119,
  "stream": "top",
  "rank": 1,
  "storyType": "story",
  "title": "Google broke reCAPTCHA for de-googled Android users",
  "url": "https://reclaimthenet.org/google-broke-recaptcha-for-de-googled-android-users",
  "domain": "reclaimthenet.org",
  "score": 656,
  "descendants": 234,
  "author": "anonymousiam",
  "authorKarma": 5099,
  "authorAccountAge": "9y",
  "createdAt": "2026-05-08T14:22:53.000Z",
  "fieldCompletenessScore": 100,
  "agentMarkdown": "**📰 HN · Google broke reCAPTCHA for de-googled Android users**\n- ⬆ 656 · 💬 234 · #1\n- 👤 u/anonymousiam · 5099 karma · 9y\n- 🌐 reclaimthenet.org\n- 🔗 https://reclaimthenet.org/..."
}

During the Actor run

The actor pulls stories, comments, and author profiles from HN's official Firebase API with respectful pacing. No authentication required, no rate-limit issues in practice; author lookups are cached per run so a busy thread doesn't multiply API calls. Alongside the dataset, three artifacts land in the actor's key-value store: OUTPUT (run summary), AGENT_BRIEFING (markdown digest with top stories by score), and WATCHLIST_STATE (seen story + comment IDs, when watchlistMode: true).

FAQ

Is there a rate limit?

HN's Firebase API doesn't publish a hard rate limit and is generous in practice. The actor paces requests respectfully so a daily run never trips a soft cap.

Can I monitor for new stories only?

Yes. Set watchlistMode: true. The first run captures everything; subsequent runs only emit records new since the previous run.

Can I get author karma and account age?

Yes, that's the default (fetchAuthorProfile: true). Adds one HN API call per unique author, cached within the run.

Can I get comments along with stories?

Yes. Set fetchComments: true and commentsPerStory to your cap.

Can I filter by domain?

Yes. Set domainAllowlist to a list of allowed domains (e.g., ["yourcompany.com"]). Only stories whose external URL matches will be emitted.
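
The exact matching rule is not documented here, so the following is only a sketch of one plausible interpretation: a story passes when its host equals an allowed domain or is a subdomain of it, and an empty allowlist disables the filter. `domain_allowed` is a hypothetical helper you could also use to re-check results downstream.

```python
from urllib.parse import urlparse


def domain_allowed(url: str, allowlist: list[str]) -> bool:
    """True when the URL's host equals an allowed domain or is a subdomain of it."""
    if not allowlist:
        return True  # assumption: an empty allowlist means "no filtering"
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    for domain in allowlist:
        domain = domain.lower()
        if host == domain or host.endswith("." + domain):
            return True
    return False


allow = ["yourcompany.com"]
print(domain_allowed("https://blog.yourcompany.com/post", allow))  # True
print(domain_allowed("https://notyourcompany.com/", allow))        # False
```

Matching on a leading dot rather than a plain suffix keeps look-alike domains such as notyourcompany.com out of the results.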

Can I filter by score?

Yes. Set minScore to your threshold. Stories below it are skipped.

Can I use this with the Apify API?

Yes. POST to https://api.apify.com/v2/acts/skootle~hackernews-watchlist/runs.
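
A minimal sketch of that call using only the Python standard library is shown below. It builds the request without sending it; the token placeholder and the `build_run_request` helper are assumptions for illustration, and the JSON body is treated as the actor input.

```python
import json
import urllib.request

ACTOR_RUNS_URL = "https://api.apify.com/v2/acts/skootle~hackernews-watchlist/runs"


def build_run_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that starts a run with the given input."""
    return urllib.request.Request(
        ACTOR_RUNS_URL,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


request = build_run_request(
    "YOUR_APIFY_TOKEN",  # placeholder, replace with your own API token
    {"streams": ["top"], "storiesPerStream": 30, "watchlistMode": True},
)
# To actually start the run: urllib.request.urlopen(request)
```

Keeping the request construction separate from the network call makes it easy to log or inspect the payload before a scheduled job sends it.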

Can I integrate with Make / Zapier / n8n / Slack?

Yes. Click Integrations on the actor page.

Why use this when HN's API is free?

The free API requires per-item fetches and gives you no schema. This actor handles all the orchestration (stream → IDs → items → authors → comments → watchlist diff), normalizes into a versioned typed schema, joins author profile data per story, computes ranks, and ships agent-ready markdown summaries. If you're feeding this into a daily Slack digest or an AI agent, it pays back the per-record cost in saved engineering time.
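
To give a sense of the orchestration involved, here is a rough do-it-yourself sketch of the stream → IDs → items → authors steps against the public HN Firebase API. It is not the actor's code: `fetch_json` and `top_stories` are hypothetical helpers, comments and the watchlist diff are left out, and there is no pacing or retry logic.

```python
import json
import urllib.request
from typing import Callable

HN_API = "https://hacker-news.firebaseio.com/v0"


def fetch_json(url: str):
    """Fetch and decode one JSON document from the HN API."""
    with urllib.request.urlopen(url, timeout=10) as response:
        return json.load(response)


def top_stories(limit: int, fetch: Callable[[str], object] = fetch_json) -> list[dict]:
    """Stream -> IDs -> items -> authors: one request per item and per unique author."""
    story_ids = fetch(f"{HN_API}/topstories.json")[:limit]
    author_cache: dict[str, dict] = {}
    stories = []
    for rank, story_id in enumerate(story_ids, start=1):
        item = fetch(f"{HN_API}/item/{story_id}.json")
        if not item:
            continue  # deleted or missing item
        author = item.get("by")
        if author and author not in author_cache:
            author_cache[author] = fetch(f"{HN_API}/user/{author}.json") or {}
        stories.append({
            "itemId": item["id"],
            "title": item.get("title"),
            "score": item.get("score"),
            "author": author,
            "authorKarma": author_cache.get(author, {}).get("karma"),
            "rank": rank,
        })
    return stories
```

Calling `top_stories(5)` performs live requests: one for the ID list, five for the items, and one per unique author. The `fetch` parameter exists so the function can be exercised with a stub instead of the network.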

Your feedback

Hit a bug or want a feature? Open an issue on the Issues tab rather than the reviews page, and we'll fix it fast (typically within 48 hours).

Why choose Hacker News Watchlist

Watchlist mode emits only what's new since the last run, so a daily Slack digest or AI agent feed never replays yesterday's stories
All 6 streams in one run (top, new, best, ask, show, jobs), instead of refreshing HN tab by tab through the day
Author profile join: authorKarma and authorAccountAge per story, so brand monitors and VC scouts can separate spam from trusted contributors immediately
Story-type filter without grep: a typed enum (story, ask_hn, show_hn, job, poll) means Show HN trackers and job watchlists work in one downstream query
Sub-minute typical runtime: built on HN's official public Firebase API, with no anti-bot, no auth, and no rate-limit issues in practice
Agent-ready markdown per record drops straight into an LLM context window
Stories and comments in one dataset, filterable by recordType
Re-runs are safe to dedupe by ID: stable hn:story:<id> and hn:comment:<id> keys
Schema doesn't break your pipeline: versioned and bumped on breaking change

Other Skootle actors you might want to check

Reddit Subreddit Scraper: same pattern for subreddit monitoring
GitHub Trending Repos: daily trending dev repos with API enrichment
SEC EDGAR Filings Monitor: public-company filings stream
Apple App Store Reviews Monitor: App Store reviews + metadata
Shopify App Store Scraper: Shopify app listings + pricing tiers

Support and contact

File issues on this actor's page; we reply within 48 hours. Feature requests are welcome: tag them with enhancement.
