Hackernews Intelligence

Scrape Hacker News stories, comments, and discussions. Track tech trends, startup news, and developer community sentiment.

Pricing: from $15.75 / 1,000 stories scraped

Rating: 5.0 (3 ratings)

Developer: viralanalyzer (Maintained by Community)

Actor stats: 0 bookmarks, 2 total users, 1 monthly active user, last modified 20 days ago

📰 Hacker News Intelligence — Stories, Comments & Trends

Scrape Hacker News stories and comments using the public Algolia HN API and Firebase API. Search by keyword, browse the front page, get top stories, or explore Show HN and Ask HN submissions. No API key needed.
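
As a rough illustration of the two public APIs named above, the sketch below maps each scraping mode to a request URL. The base URLs, the `search` and `search_by_date` endpoints, and the tag names belong to the public Algolia HN Search and Hacker News Firebase APIs. The mapping itself and the function name `build_url` are assumptions made for illustration, not the actor's confirmed implementation.

```python
from urllib.parse import urlencode

ALGOLIA_BASE = "https://hn.algolia.com/api/v1"
FIREBASE_BASE = "https://hacker-news.firebaseio.com/v0"

# Algolia tag used for each tag-based mode (assumed mapping).
MODE_TAGS = {
    "search": "story",
    "front_page": "front_page",
    "show_hn": "show_hn",
    "ask_hn": "ask_hn",
}


def build_url(mode: str, query: str = "", sort_by: str = "relevance", hits: int = 50) -> str:
    """Return the URL a scraper might request for the given mode."""
    if mode == "top_stories":
        # Firebase returns a JSON array of story IDs for the top stories list.
        return f"{FIREBASE_BASE}/topstories.json"
    if mode not in MODE_TAGS:
        raise ValueError(f"unknown mode: {mode}")
    # Algolia offers /search (relevance order) and /search_by_date (newest first).
    endpoint = "search_by_date" if sort_by == "date" else "search"
    params = {"tags": MODE_TAGS[mode], "hitsPerPage": hits}
    if query:
        params["query"] = query
    return f"{ALGOLIA_BASE}/{endpoint}?{urlencode(params)}"


print(build_url("search", query="open source LLM", sort_by="date"))
print(build_url("top_stories"))
```

Neither endpoint needs an authentication header, which matches the "no API key" claim.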

✨ Features

  • 🔍 5 scraping modes — Search, Front Page, Top Stories, Show HN, Ask HN
  • 🔑 Keyword search — Find stories by topic with relevance or date sorting
  • 🕐 Time range filters — Past 24h, week, month, year, or all time
  • 💬 Comment extraction — Fetch top comments per story (optional)
  • 📊 Full metrics — Points, comment count, author, timestamps
  • 🔗 Direct links — Original URL + HN discussion URL
  • 🛡️ Anti-placeholder guardrails — Every result validated as real data
  • API-based — Fast and reliable via Algolia HN Search + Firebase
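
The time range filter in the list above could be expressed as an Algolia `numericFilters` value on the `created_at_i` field (a Unix timestamp). This is a minimal sketch: the helper name `numeric_filter` and the choice of 30 days for "month" and 365 days for "year" are assumptions, since the actor's exact cutoffs are not documented here.

```python
import time
from typing import Optional

# Seconds covered by each timeRange value; "all" applies no filter.
# The month and year lengths are assumed approximations.
RANGE_SECONDS = {
    "24h": 24 * 3600,
    "week": 7 * 24 * 3600,
    "month": 30 * 24 * 3600,
    "year": 365 * 24 * 3600,
}


def numeric_filter(time_range: str, now: Optional[float] = None) -> Optional[str]:
    """Build a numericFilters string, or None when no time filter applies."""
    if time_range == "all":
        return None
    if time_range not in RANGE_SECONDS:
        raise ValueError(f"unknown timeRange: {time_range}")
    current = int(time.time() if now is None else now)
    return f"created_at_i>{current - RANGE_SECONDS[time_range]}"


print(numeric_filter("week"))
```

Passing `now` explicitly keeps the helper deterministic, which is convenient in tests.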

📥 Input

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| mode | string | No | "search" | Scraping mode: search, front_page, top_stories, show_hn, ask_hn |
| searchQueries | string[] | In "search" mode | | Keywords to search (required in "search" mode, up to 10) |
| sortBy | string | No | "relevance" | Sort: relevance, date |
| timeRange | string | No | "all" | Time filter: all, 24h, week, month, year |
| maxItems | integer | No | 50 | Max stories to scrape (1-500) |
| includeComments | boolean | No | false | Fetch top comments for each story |
| maxCommentsPerStory | integer | No | 5 | Max comments per story (1-20) |
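
Before starting a run, the constraints in the table above can be checked on the client side. The sketch below applies the documented defaults and ranges. The function name `normalize_input` and the error messages are hypothetical, and the actor may validate its input differently.

```python
MODES = {"search", "front_page", "top_stories", "show_hn", "ask_hn"}
SORTS = {"relevance", "date"}
TIME_RANGES = {"all", "24h", "week", "month", "year"}

# Defaults taken from the input table.
DEFAULTS = {
    "mode": "search",
    "searchQueries": [],
    "sortBy": "relevance",
    "timeRange": "all",
    "maxItems": 50,
    "includeComments": False,
    "maxCommentsPerStory": 5,
}


def normalize_input(raw: dict) -> dict:
    """Fill in defaults and raise ValueError listing every violated constraint."""
    cfg = {**DEFAULTS, **raw}
    cfg["searchQueries"] = list(cfg["searchQueries"])
    errors = []
    if cfg["mode"] not in MODES:
        errors.append(f"mode must be one of {sorted(MODES)}")
    if cfg["sortBy"] not in SORTS:
        errors.append(f"sortBy must be one of {sorted(SORTS)}")
    if cfg["timeRange"] not in TIME_RANGES:
        errors.append(f"timeRange must be one of {sorted(TIME_RANGES)}")
    if cfg["mode"] == "search" and not cfg["searchQueries"]:
        errors.append("searchQueries is required in search mode")
    if len(cfg["searchQueries"]) > 10:
        errors.append("searchQueries accepts at most 10 keywords")
    if not 1 <= cfg["maxItems"] <= 500:
        errors.append("maxItems must be between 1 and 500")
    if not 1 <= cfg["maxCommentsPerStory"] <= 20:
        errors.append("maxCommentsPerStory must be between 1 and 20")
    if errors:
        raise ValueError("; ".join(errors))
    return cfg


print(normalize_input({"searchQueries": ["AI startup"], "maxItems": 100}))
```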

Input Example

{
  "mode": "search",
  "searchQueries": [
    "AI startup",
    "machine learning",
    "open source LLM"
  ],
  "sortBy": "date",
  "timeRange": "week",
  "maxItems": 100,
  "includeComments": true,
  "maxCommentsPerStory": 5
}

📤 Output

Every story includes these fields:

| Field | Type | Description |
|---|---|---|
| storyId | string | Hacker News story ID |
| title | string | Story title |
| url | string | Original link URL |
| author | string | Submitter username |
| points | number | Upvote count |
| numComments | number | Total comment count |
| createdAt | string | Submission timestamp (ISO 8601) |
| storyText | string | Story body text (for Ask HN / Show HN) |
| tags | string[] | HN tags (story, show_hn, ask_hn, etc.) |
| hnUrl | string | Hacker News discussion URL |
| platform | string | Always "hackernews" |
| scrapedAt | string | Scrape timestamp (ISO 8601) |
| topComments | object[] | Top comments (when includeComments is enabled) |

Comment Fields

| Field | Type | Description |
|---|---|---|
| commentId | string | Comment ID |
| author | string | Commenter username |
| text | string | Comment text (HTML stripped) |
| createdAt | string | Comment timestamp |
| points | number | Comment points |

Output Example

{
  "storyId": "39847261",
  "title": "Show HN: We built an open-source alternative to Notion with local-first sync",
  "url": "https://github.com/AppFlowy-IO/AppFlowy",
  "author": "annieflowy",
  "points": 847,
  "numComments": 213,
  "createdAt": "2026-03-04T16:42:31.000Z",
  "storyText": "",
  "tags": ["story", "show_hn", "front_page"],
  "hnUrl": "https://news.ycombinator.com/item?id=39847261",
  "platform": "hackernews",
  "scrapedAt": "2026-03-06T14:15:22.108Z",
  "topComments": [
    {
      "commentId": "39848102",
      "author": "tptacek",
      "text": "This is impressive work. The offline-first approach with CRDT sync is the right way to build collaborative tools.",
      "createdAt": "2026-03-04T17:08:14.000Z",
      "points": 142
    }
  ]
}
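
A dataset item shaped like the example above can be loaded into a typed record for downstream analysis. This is a minimal sketch: the `Story` dataclass and `parse_story` helper are hypothetical names, and the sample item is a trimmed copy of the output example rather than live data.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Story:
    story_id: str
    title: str
    points: int
    num_comments: int
    created_at: datetime
    hn_url: str
    comment_count_fetched: int


def parse_story(item: dict) -> Story:
    """Convert one dataset item into a Story, parsing the ISO 8601 timestamp."""
    # Replace the trailing "Z" so datetime.fromisoformat accepts it on older Pythons.
    created = datetime.fromisoformat(item["createdAt"].replace("Z", "+00:00"))
    return Story(
        story_id=item["storyId"],
        title=item["title"],
        points=item["points"],
        num_comments=item["numComments"],
        created_at=created,
        hn_url=item["hnUrl"],
        # topComments is only present when includeComments was enabled.
        comment_count_fetched=len(item.get("topComments", [])),
    )


sample_item = {
    "storyId": "39847261",
    "title": "Show HN: We built an open-source alternative to Notion with local-first sync",
    "points": 847,
    "numComments": 213,
    "createdAt": "2026-03-04T16:42:31.000Z",
    "hnUrl": "https://news.ycombinator.com/item?id=39847261",
}
story = parse_story(sample_item)
print(story.title, story.created_at.isoformat())
```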

📋 Use Cases

  • Tech Trend Detection — Track which technologies and startups are gaining traction
  • Content Marketing — Find trending topics in your niche for blog inspiration
  • Competitive Intelligence — Monitor mentions of competitors or products
  • Developer Relations — Track Show HN launches and community feedback
  • Research — Analyze discussion patterns and sentiment in tech communities
  • News Aggregation — Build curated feeds of top stories by topic
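
For the trend detection use case above, one simple starting point is to count how many scraped stories mention each keyword in the title and to sum their points. The sketch below assumes items with the `title` and `points` fields described in the output section. The function name `keyword_trends` and the sample titles are invented for illustration.

```python
import re
from collections import Counter
from typing import Dict, Iterable, List


def keyword_trends(stories: Iterable[dict], keywords: List[str]) -> Dict[str, Dict[str, int]]:
    """Return, per keyword, the number of matching stories and their total points."""
    story_counts: Counter = Counter()
    point_totals: Counter = Counter()
    for story in stories:
        title = story.get("title", "").lower()
        for kw in keywords:
            # Whole-word match; keywords ending in symbols (such as "c++")
            # would need a different pattern.
            if re.search(rf"\b{re.escape(kw.lower())}\b", title):
                story_counts[kw] += 1
                point_totals[kw] += story.get("points", 0)
    return {kw: {"stories": story_counts[kw], "points": point_totals[kw]} for kw in keywords}


# Illustrative items only, not real scraped data.
sample = [
    {"title": "Show HN: A Rust LLM runtime", "points": 120},
    {"title": "Why Rust compile times matter", "points": 80},
    {"title": "Ask HN: Best LLM for code review?", "points": 45},
]
print(keyword_trends(sample, ["rust", "llm"]))
```

Running the same keywords over successive scrapes and comparing the totals gives a crude trend signal.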

❓ FAQ

Q: Does this actor need an API key?
A: No. It uses the public Algolia HN Search API and Firebase API, both freely accessible without authentication.

Q: What is the difference between "front_page" and "top_stories" modes?
A: "front_page" returns the stories currently displayed on the Hacker News homepage (around 30 items). "top_stories" uses Firebase's canonical top stories list, which can return up to 500 of the highest-ranked active stories.

Q: Will enabling comments slow down the scraper?
A: Yes. Each story with comments requires an additional API call. For large runs (100+ stories), expect significantly longer execution times. Use maxCommentsPerStory to control the volume.

Q: Can I search for multiple keywords in one run?
A: Yes. The searchQueries parameter accepts up to 10 keywords. Results from all queries are deduplicated by story ID.

Q: Why did the actor return fewer items than maxItems?
A: This happens when fewer stories match your search criteria. The actor deduplicates results and removes stories with titles shorter than 3 characters. If 0 results are found, the actor throws an error instead of exiting silently.
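
The filtering described in the last two answers (deduplication by story ID, dropping titles shorter than 3 characters, and failing on an empty result) could look roughly like the sketch below. The function name `clean_results` and the exception type are assumptions, and the actor's real logic may differ in detail.

```python
from typing import Iterable, List


def clean_results(batches: Iterable[List[dict]], max_items: int) -> List[dict]:
    """Merge per-query result batches into one deduplicated, validated list."""
    seen = set()
    kept = []
    for batch in batches:
        for story in batch:
            story_id = story.get("storyId")
            title = (story.get("title") or "").strip()
            # Skip repeats across queries and stories with near-empty titles.
            if story_id is None or story_id in seen or len(title) < 3:
                continue
            seen.add(story_id)
            kept.append(story)
    if not kept:
        # Mirrors the documented behaviour of raising instead of exiting silently.
        raise RuntimeError("no stories matched the search criteria")
    return kept[:max_items]


# Two illustrative query batches that share one story.
batches = [
    [{"storyId": "1", "title": "Alpha story"}, {"storyId": "2", "title": "ok"}],
    [{"storyId": "1", "title": "Alpha story"}, {"storyId": "3", "title": "Gamma story"}],
]
print([s["storyId"] for s in clean_results(batches, max_items=50)])
```

Because duplicates and short titles are dropped after fetching, the final count can fall below maxItems even when each query returned plenty of hits.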

💰 Pricing

This actor uses Pay Per Event (PPE) pricing:

| Metric | Cost |
|---|---|
| Per story scraped | $0.03 |
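
As a worked example of the per-story rate above, the sketch below estimates the cost of a run. It assumes a flat $0.03 per story with no tiers or discounts, so it does not model the "from $15.75 / 1,000" rate shown in the store listing. The helper name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Flat per-story rate from the pricing table (assumed to apply to every story).
PRICE_PER_STORY = Decimal("0.03")


def estimate_cost(stories: int) -> Decimal:
    """Return the estimated charge in USD for a run that scrapes `stories` items."""
    if stories < 0:
        raise ValueError("stories must be non-negative")
    return PRICE_PER_STORY * stories


for count in (50, 100, 500):
    print(f"{count} stories: ${estimate_cost(count)}")
```

`Decimal` is used instead of `float` so that the totals stay exact to the cent.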

📝 Changelog

v1.0 (Current)

  • ✅ 5 scraping modes (search, front_page, top_stories, show_hn, ask_hn)
  • ✅ Keyword search with relevance and date sorting
  • ✅ Time range filtering (24h, week, month, year, all)
  • ✅ Optional top comment extraction per story
  • ✅ Dual API support (Algolia HN Search + Firebase)
  • ✅ Deduplication by story ID
  • ✅ Anti-placeholder guardrails
  • ✅ Retry logic with rate limit handling
