Pricing

$8.00/month + usage

Hacker News Scraper & API - Export Stories, Comments, Data

Extract top stories with their points, comment counts, and authors from the Hacker News front page. Export results to JSON or CSV to monitor tech trends, analyze viral content, and track HN activity. Built on Playwright.

Rating

0.0

(0)

Developer

Brennan Crawford

Last modified

2 months ago

Hacker News Scraper for Apify

A production-ready Apify actor that scrapes stories from the Hacker News front page using Playwright.

🚀 Features

Scrapes Hacker News front page stories
Extracts comprehensive story data:
- Title and URL
- Points (upvotes)
- Author username
- Number of comments
- Time posted
- Story rank
- Hacker News discussion URL
Configurable number of stories to scrape
Option to include/exclude job posts
Built with Playwright for reliable scraping
Production-ready for Apify platform
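
The fields listed above arrive from the page as display text and need normalizing before they match the output schema. A minimal sketch of that step, assuming Hacker News renders counts as text such as "342 points" or "127 comments" and shows "discuss" when a story has no comments yet. The helper names `parse_count` and `discussion_url` are illustrative and are not taken from the actor's source:

```python
import re
from typing import Optional


def parse_count(text: Optional[str]) -> int:
    """Return the first integer found in text like '342 points', or 0 if none."""
    if not text:
        return 0
    # \d+ picks out the number regardless of the separator that follows it
    # (a regular space or a non-breaking space).
    match = re.search(r"\d+", text)
    return int(match.group()) if match else 0


def discussion_url(item_id: str) -> str:
    """Build the Hacker News discussion URL from a story's item id."""
    return f"https://news.ycombinator.com/item?id={item_id}"


print(parse_count("342 points"))
print(parse_count("127\xa0comments"))
print(parse_count("discuss"))
print(discussion_url("12345678"))
```

Returning 0 for missing text keeps `points` and `comments` numeric even for rows that carry no score, which is one way to keep the dataset schema consistent.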

📁 Project Structure

hackernews-scraper/
├── .actor/
│   ├── actor.json              # Actor metadata and configuration
│   └── dataset_schema.json     # Output data schema
├── apify_actor.py              # Main actor entry point
├── hackernews_scraper.py       # Core scraper implementation
├── Dockerfile                  # Docker configuration for Apify
├── requirements.txt            # Python dependencies
├── INPUT_SCHEMA.json           # Input configuration schema
└── README.md                   # This file

🔧 Local Testing

Prerequisites

Python 3.11+
pip

Installation

Install dependencies:

$ pip install -r requirements.txt

Install Playwright browsers:

$ playwright install chromium

Test the scraper locally:

$ python hackernews_scraper.py

🌐 Deploy to Apify

Prerequisites

Create an Apify account
Install the Apify CLI: npm install -g apify-cli
Log in: apify login

Deployment Steps

Navigate to project directory:

$ cd hackernews-scraper

Deploy to Apify:

$ apify push

Access your actor in the Apify Console

Running on Apify

Navigate to your actor in the Apify Console
Click "Run"
Configure input options (optional)
Click "Start" to run the actor
View results in the "Dataset" tab
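
A run can also be started programmatically instead of through the Console. The sketch below calls Apify's `run-sync-get-dataset-items` REST endpoint using only the standard library. The actor ID `your-username~hackernews-scraper` is a placeholder, since this page does not state the deployed actor's ID, and the API token is assumed to be available in an `APIFY_TOKEN` environment variable:

```python
import json
import os
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_url(actor_id: str) -> str:
    """URL that runs an actor synchronously and returns its dataset items."""
    return f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"


def run_actor(actor_id: str, run_input: dict, token: str) -> list:
    """POST the input, wait for the run to finish, and return the items."""
    request = urllib.request.Request(
        build_run_url(actor_id),
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.load(response)


if __name__ == "__main__":
    token = os.environ.get("APIFY_TOKEN")
    if token:
        # Placeholder ID: replace with "<username>~<actor-name>" from the Console.
        items = run_actor(
            "your-username~hackernews-scraper",
            {"maxStories": 30, "includeJobPosts": False},
            token,
        )
        for story in items:
            print(story["rank"], story["title"])
    else:
        print("Set APIFY_TOKEN to run this example.")
```

The synchronous endpoint suits this actor because a run is short. For longer jobs, starting the run asynchronously and polling the dataset is the more robust pattern.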

⚙️ Input Configuration

Field	Type	Default	Description
`maxStories`	integer	30	Maximum number of stories to scrape (1-100)
`includeJobPosts`	boolean	false	Include "Who is hiring?" job posts

Example Input

{
  "maxStories": 30,
  "includeJobPosts": false
}
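
How the actor treats missing or out-of-range values is not documented here. One plausible approach, shown as a sketch and not as the actor's actual behavior, is to fill in the defaults from the table above and reject values outside the documented 1-100 range. `normalize_input` is a hypothetical helper name:

```python
from typing import Optional

# Defaults as documented in the input table.
DEFAULTS = {"maxStories": 30, "includeJobPosts": False}


def normalize_input(raw: Optional[dict]) -> dict:
    """Apply the documented defaults, then validate types and ranges."""
    merged = {**DEFAULTS, **(raw or {})}

    max_stories = merged["maxStories"]
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(max_stories, bool) or not isinstance(max_stories, int):
        raise ValueError("maxStories must be an integer")
    if not 1 <= max_stories <= 100:
        raise ValueError("maxStories must be between 1 and 100")

    include_jobs = merged["includeJobPosts"]
    if not isinstance(include_jobs, bool):
        raise ValueError("includeJobPosts must be a boolean")

    return {"maxStories": max_stories, "includeJobPosts": include_jobs}


print(normalize_input({}))
print(normalize_input({"maxStories": 10}))
```

Raising on bad input, as opposed to silently clamping, makes a misconfigured run fail early instead of returning fewer stories than the caller expected.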

📊 Output Format

Each story is returned as a JSON object with the following structure:

{
  "rank": 1,
  "title": "Show HN: I built a tool for...",
  "url": "https://example.com/article",
  "points": 342,
  "author": "username",
  "comments": 127,
  "timeAgo": "2024-01-15T10:30:00.000Z",
  "hackerNewsUrl": "https://news.ycombinator.com/item?id=12345678"
}

Output Fields

Field	Type	Description
`rank`	number	Story position on front page
`title`	string	Story title
`url`	string	Link to the story/article
`points`	number	Number of upvotes
`author`	string	Username who posted the story
`comments`	number	Number of comments
`timeAgo`	string	ISO 8601 timestamp of when the story was posted
`hackerNewsUrl`	string	URL to Hacker News discussion
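
Because every item has the same flat shape, the dataset is easy to post-process. A small sketch, assuming the items have already been loaded as a list of dictionaries (for example from a JSON export of the dataset). The sample records are made up for illustration:

```python
import csv
import io

FIELDS = [
    "rank", "title", "url", "points",
    "author", "comments", "timeAgo", "hackerNewsUrl",
]


def to_csv(stories: list) -> str:
    """Serialize stories to CSV text, highest-scoring first."""
    ordered = sorted(stories, key=lambda s: s.get("points") or 0, reverse=True)
    buffer = io.StringIO()
    # extrasaction="ignore" drops any keys that are not in FIELDS.
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(ordered)
    return buffer.getvalue()


sample = [
    {"rank": 1, "title": "First story", "url": "https://example.com/a",
     "points": 120, "author": "alice", "comments": 40,
     "timeAgo": "2024-01-15T10:30:00.000Z",
     "hackerNewsUrl": "https://news.ycombinator.com/item?id=1"},
    {"rank": 2, "title": "Second story", "url": "https://example.com/b",
     "points": 342, "author": "bob", "comments": 127,
     "timeAgo": "2024-01-15T09:00:00.000Z",
     "hackerNewsUrl": "https://news.ycombinator.com/item?id=2"},
]
print(to_csv(sample))
```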

🛠️ Built With

Python 3.11 - Programming language
Playwright - Browser automation
Apify SDK - Actor framework
Following Apify best practices and patterns

📝 Use Cases

Monitor trending tech stories
Track specific topics on HN
Build custom HN readers/aggregators
Research what content performs well
Create HN analytics dashboards
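
As an illustration of the topic-tracking use case, the output can be filtered by keyword after a run. This sketch matches whole words in story titles, case-insensitively. The function name and the sample titles are hypothetical:

```python
import re


def filter_by_topic(stories: list, keywords: list) -> list:
    """Keep stories whose title contains any keyword as a whole word."""
    if not keywords:
        return []
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(k) for k in keywords) + r")\b",
        re.IGNORECASE,
    )
    return [s for s in stories if pattern.search(s.get("title", ""))]


stories = [
    {"rank": 1, "title": "Show HN: A Rust web framework", "points": 210},
    {"rank": 2, "title": "Why trust matters in open source", "points": 95},
    {"rank": 3, "title": "Python 3.13 released", "points": 480},
]
matches = filter_by_topic(stories, ["rust", "python"])
print([s["rank"] for s in matches])  # [1, 3]
```

Whole-word matching keeps "trust" from counting as a hit for "rust". Keywords that end in punctuation, such as "C++", would need a different boundary rule.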

🔒 Rate Limiting

The scraper is designed to be respectful of Hacker News:

Single page load per run
No aggressive pagination
Configurable limits on stories scraped

📄 License

This actor is provided as-is for use on the Apify platform.

🤝 Support

For issues or questions:

Check the Apify documentation
Open an issue in the repository
Contact via Apify platform

Ready to deploy in under 10 minutes! 🎉
