Hacker News Scraper

Pricing

$4.99/month + usage

Extract Hacker News stories, Ask HN, Show HN & jobs. Multi-category scraping. Structured JSON output. Fast & reliable. Export to CSV/Excel/JSON.

Rating

0.0

(0)

Developer

Zoro

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 7 days ago

Hacker News Scraper - Fast & Reliable Data Extraction

Extract stories, discussions, and job postings from Hacker News with this powerful scraper. Get real-time data from multiple categories including front page news, newest submissions, Ask HN, Show HN, and past stories.

🚀 Key Features

  • Multi-Category Scraping: Scrape from default news, newest, past news, Ask HN, and Show HN simultaneously
  • Asynchronous Processing: Fast concurrent scraping of multiple categories at once
  • Structured Data: Get clean, structured JSON data with titles, URLs, and scores
  • Real-time Updates: Access the latest Hacker News content as it's published
  • No Rate Limiting Issues: Built-in request throttling to respect Hacker News servers
  • Easy to Use: Simple configuration with just a few parameters
  • Scalable: Handle large-scale data extraction efficiently

📊 What Data Can You Extract?

This Hacker News scraper extracts the following data points from each post:

  • Title: Full story or discussion title
  • URL: Link to the external article or HN discussion
  • Score: Number of upvotes/points
  • Category: Source category (default, new, ask, show, past news)

Perfect for:

  • 📈 Tech trend analysis and monitoring
  • 🔍 Content discovery and curation
  • 📰 News aggregation
  • 🎯 Competitive intelligence
  • 📊 Data analysis and research
  • 🤖 Training ML models on tech discussions
  • 💼 Job market analysis

🎯 Use Cases

Content Creators & Marketers

  • Monitor trending tech topics for content ideas
  • Track what stories resonate with the tech community
  • Identify popular discussion themes
  • Discover viral content patterns

Researchers & Analysts

  • Analyze Hacker News trends over time
  • Study community interests and engagement patterns
  • Extract data for academic research
  • Track technology adoption and sentiment

Developers & Startups

  • Track Show HN launches and feedback
  • Monitor competitor mentions
  • Identify emerging technologies and tools
  • Analyze community reception of products

Job Seekers & Recruiters

  • Find job-related threads such as "Who is hiring?" via the ask category
  • Monitor hiring trends in tech
  • Find companies actively recruiting
  • Track in-demand skills and technologies

💡 How to Use

Input Configuration

Configure the scraper with these simple parameters:

{
"categories": ["default", "show", "ask"],
"page_count": 2
}

Input Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| categories | Array | No | ["default"] | Categories to scrape from Hacker News |
| page_count | Integer | No | 1 | Number of pages per category (1-10) |

Available Categories

Choose from these Hacker News sections:

  • default: Front page news and popular stories
  • new: Newest submissions across all categories
  • past news: Previously popular front page stories
  • ask: Ask HN discussions and questions
  • show: Show HN project launches and demos
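
A minimal Python sketch of checking an input object against the parameters and categories listed above before starting a run. The function name `validate_input` is hypothetical and not part of the actor; the allowed values and defaults are the ones documented on this page, and matching is case-sensitive on the assumption that the actor treats category names that way.

```python
ALLOWED_CATEGORIES = {"default", "new", "past news", "ask", "show"}


def validate_input(run_input: dict) -> dict:
    """Return a normalized copy of run_input, or raise ValueError."""
    # Fall back to the documented defaults when a key is missing.
    categories = run_input.get("categories", ["default"])
    page_count = run_input.get("page_count", 1)

    unknown = [c for c in categories if c not in ALLOWED_CATEGORIES]
    if unknown:
        raise ValueError(f"Unknown categories: {unknown}")

    # bool is a subclass of int in Python, so reject it explicitly.
    is_int = isinstance(page_count, int) and not isinstance(page_count, bool)
    if not is_int or not 1 <= page_count <= 10:
        raise ValueError("page_count must be an integer from 1 to 10")

    return {"categories": list(categories), "page_count": page_count}


print(validate_input({"categories": ["show", "ask"], "page_count": 3}))
```

Running a check like this locally catches a misspelled category or an out-of-range page count before any platform usage is spent.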

Example Configurations

Scrape front page stories (1 page)

{
"categories": ["default"],
"page_count": 1
}

Scrape Show HN and Ask HN (3 pages each)

{
"categories": ["show", "ask"],
"page_count": 3
}

Comprehensive scrape (all categories, 2 pages each)

{
"categories": ["default", "new", "past news", "ask", "show"],
"page_count": 2
}

Maximum data extraction

{
"categories": ["default", "new", "past news", "ask", "show"],
"page_count": 10
}

📤 Output Format

Each item in the dataset contains:

{
"title": "Show HN: I built a tool for analyzing Hacker News trends",
"url": "https://example.com/hn-analytics",
"score": "324 points",
"category": "show"
}

Output Fields

| Field | Type | Description |
|---|---|---|
| title | String | Story or discussion title |
| url | String | Link to content or discussion |
| score | String | Upvotes/points (e.g., "324 points") |
| category | String | Source category |
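
Because `score` is returned as a string such as "324 points", numeric analysis needs a conversion step. The helper below is a sketch only: `parse_score` is a hypothetical name, and it assumes the score always contains the point count as its first run of digits. Items with a missing or empty score yield `None`.

```python
import re
from typing import Optional


def parse_score(score: Optional[str]) -> Optional[int]:
    """Extract the integer from strings like "324 points" or "1 point"."""
    if not score:
        return None
    match = re.search(r"\d+", score)
    return int(match.group()) if match else None


item = {
    "title": "Show HN: I built a tool for analyzing Hacker News trends",
    "url": "https://example.com/hn-analytics",
    "score": "324 points",
    "category": "show",
}
print(parse_score(item["score"]))  # 324
```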

🎬 Quick Start Guide

Step 1: Open the Actor

Click the "Try for free" or "Start" button.

Step 2: Configure Input

Select your desired categories and page count:

  • Choose one or more categories from the dropdown
  • Set pages per category (1-10)

Step 3: Run the Actor

Click "Start" and wait for the scraper to complete

Step 4: Download Your Data

Export your data in multiple formats:

  • JSON
  • CSV
  • Excel
  • HTML Table
  • RSS Feed

📈 Performance & Limits

  • Items per Page: ~30 items
  • Pages per Category: 1-10 pages
  • Maximum Items per Category: Up to 300 items
  • Concurrent Processing: All categories scraped simultaneously
  • Average Speed: ~1-2 seconds per page
  • Built-in Delays: Automatic rate limiting for reliability

| Use Case | Categories | Pages | Est. Items |
|---|---|---|---|
| Quick check | ["default"] | 1 | ~30 |
| Daily monitoring | ["default", "new"] | 2 | ~120 |
| Comprehensive analysis | All categories | 5 | ~750 |
| Maximum extraction | All categories | 10 | ~1,500 |
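
The estimates above follow from categories × pages × ~30 items per page. A small Python sketch of that arithmetic follows. The function name `estimate_items` is illustrative, and 30 is the approximate per-page figure quoted in this section, so real runs may return fewer items.

```python
ITEMS_PER_PAGE = 30  # approximate figure quoted above

ALL_CATEGORIES = ["default", "new", "past news", "ask", "show"]


def estimate_items(categories: list[str], page_count: int) -> int:
    """Upper-bound estimate: one batch of ~30 items per page per category."""
    return len(categories) * page_count * ITEMS_PER_PAGE


print(estimate_items(ALL_CATEGORIES, 5))  # 750
```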

🔄 Integration & Automation

Scheduled Runs

Set up automatic scraping:

  • Hourly updates for breaking tech news
  • Daily digests of top stories
  • Weekly trend analysis

Connect to Your Tools

Export data directly to:

  • Google Sheets: Automatic spreadsheet updates
  • Slack: Get notifications of new content
  • Webhooks: Send data to your applications
  • Zapier/Make: Integrate with 1000+ apps
  • Email: Receive data reports

API Access

Access your scraped data via:

  • Apify API
  • Direct dataset download
  • Webhook delivery
  • RSS feed

💎 Pro Tips

Optimize Your Scraping

For Real-time Monitoring

{
"categories": ["new"],
"page_count": 1
}

Run every 15-30 minutes to catch new submissions.
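
Frequent runs over the "new" category will return many of the same items each time. One way to keep only fresh submissions is to remember the URLs already seen. The sketch below assumes the `url` field is a usable unique key, which this page does not guarantee; `new_items` is a hypothetical helper name.

```python
def new_items(items: list[dict], seen_urls: set[str]) -> list[dict]:
    """Return items whose URL has not been seen yet and record them."""
    fresh = []
    for item in items:
        url = item.get("url")
        if url and url not in seen_urls:
            seen_urls.add(url)
            fresh.append(item)
    return fresh


# Made-up sample data standing in for two consecutive runs.
seen: set[str] = set()
run_1 = [{"title": "A", "url": "https://example.com/a"}]
run_2 = [
    {"title": "A", "url": "https://example.com/a"},
    {"title": "B", "url": "https://example.com/b"},
]
print(len(new_items(run_1, seen)), len(new_items(run_2, seen)))  # 1 1
```

In practice the `seen` set would need to be persisted between scheduled runs, for example in a file or key-value store.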

For Trend Analysis

{
"categories": ["default"],
"page_count": 5
}

Run daily to track what's trending.

For Research Projects

{
"categories": ["default", "ask", "show"],
"page_count": 10
}

Run weekly for comprehensive datasets.

For Job Hunting

Monitor "Who is hiring?" threads in the Ask HN category.

Data Analysis Ideas

  • Track keyword frequency in titles
  • Analyze score distribution patterns
  • Monitor domain popularity
  • Identify trending topics over time
  • Compare engagement across categories
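
As a starting point for the first and third ideas, the Python sketch below counts title keywords and link domains over a list of output items. The function names, the stopword list, and the sample items are all illustrative assumptions, not part of the actor.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Tiny illustrative stopword list; extend it for real analysis.
STOPWORDS = {"the", "and", "for", "with", "show", "ask"}


def keyword_counts(items: list[dict]) -> Counter:
    """Count lowercase title words longer than two characters."""
    counts: Counter = Counter()
    for item in items:
        words = re.findall(r"[a-z0-9+#]+", item.get("title", "").lower())
        counts.update(w for w in words if len(w) > 2 and w not in STOPWORDS)
    return counts


def domain_counts(items: list[dict]) -> Counter:
    """Count link hostnames, ignoring a leading "www." and relative URLs."""
    counts: Counter = Counter()
    for item in items:
        host = urlparse(item.get("url", "")).netloc.lower()
        if host.startswith("www."):
            host = host[4:]
        if host:
            counts[host] += 1
    return counts


# Made-up sample items in the documented output shape.
items = [
    {"title": "Show HN: A Rust parser for logs", "url": "https://www.example.com/a"},
    {"title": "Rust in production", "url": "https://example.com/b"},
    {"title": "Ask HN: Best Python tools?", "url": "https://news.ycombinator.com/item?id=1"},
]
print(keyword_counts(items).most_common(1))  # [('rust', 2)]
print(domain_counts(items).most_common(1))   # [('example.com', 2)]
```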

🔒 Privacy & Compliance

  • ✅ Scrapes only publicly available data
  • ✅ No authentication required
  • ✅ Respects Hacker News rate limits
  • ✅ GDPR compliant (no personal data)
  • ✅ No cookies or tracking
  • ✅ Ethical scraping practices

🐛 Troubleshooting

No Results Returned

Possible causes:

  • Category name misspelled (check spelling and case)
  • Page count set to 0
  • Network connectivity issues

Solution: Verify input parameters and try again

Partial Results

Possible causes:

  • Some categories may have fewer items
  • Pages beyond available content return empty

Solution: This is normal behavior - you'll get all available items

Slow Performance

Possible causes:

  • Scraping many categories with high page counts
  • High platform load

Solution: Reduce page_count or split into multiple runs
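
Splitting into multiple runs can be as simple as generating one input object per category. This is a hedged sketch: `split_runs` is a hypothetical helper, and it only builds the input objects. Starting each run is left to whichever method you already use.

```python
def split_runs(run_input: dict) -> list[dict]:
    """Build one input per category, keeping the same page_count."""
    page_count = run_input.get("page_count", 1)
    categories = run_input.get("categories", ["default"])
    return [{"categories": [c], "page_count": page_count} for c in categories]


for single in split_runs({"categories": ["default", "ask", "show"], "page_count": 10}):
    print(single)
```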

📊 Sample Use Cases with Code

Analyze Top Stories

Get roughly the top 120 front-page stories (4 pages × ~30 items) and analyze trends:

{
"categories": ["default"],
"page_count": 4
}

Monitor Show HN Launches

Track new product launches:

{
"categories": ["show"],
"page_count": 3
}

Research Community Discussions

Study Ask HN questions and discussions:

{
"categories": ["ask"],
"page_count": 5
}

🎯 What Makes This Scraper Different?

✨ Multi-Category Support: Scrape multiple sections simultaneously
⚡ Fast & Efficient: Asynchronous processing for speed
🎨 Clean Data: Structured, ready-to-use JSON output
🔄 Reliable: Built-in error handling and retries
📊 Scalable: Handle small to large-scale extraction
🚀 Easy to Use: No coding required, just configure and run

❓ FAQ

Q: How many items can I scrape?
A: Up to 300 items per category (10 pages × ~30 items)

Q: How often can I run this scraper?
A: As often as needed, but we recommend respecting rate limits

Q: Can I scrape historical data?
A: The "past news" category shows previously popular stories

Q: Is this free to use?
A: Free tier available. Check Apify pricing for usage limits

Q: Can I export to CSV/Excel?
A: Yes! Export in JSON, CSV, Excel, HTML, or RSS

Q: How fresh is the data?
A: Data is scraped in real-time when you run the actor

💡 Need Help?

  • 📖 Read the documentation above
  • 💬 Contact support through Apify platform
  • 🐛 Report issues via actor feedback
  • ⭐ Rate and review if you find this useful!

🏷️ Keywords

hacker news scraper, hn scraper, tech news extractor, news aggregator, hacker news data, show hn, ask hn, y combinator news, tech trends, startup news, discussion forum scraper, web scraping, data extraction, news api, tech community, developer news, startup stories


Ready to extract Hacker News data? Click "Start" to begin scraping!