
HackerNoon Scraper
Pricing
$20.00/month + usage

Last modified
14 days ago
Extract articles from HackerNoon (https://hackernoon.com/) across 22 different categories (AI, Web3, Business, Finance, and many more). Discover high-quality tech stories from more than 45,000 contributing writers.
🚀 Features
- Multi-Category Support: Scrape from 22 different HackerNoon categories
- Rich Data Extraction: Extracts comprehensive article metadata including content, author information, images, and engagement metrics
- Configurable Limits: Set maximum number of articles to scrape
📊 Supported Categories
The scraper supports all major HackerNoon categories:
- Technology: AI, Programming, Tech Companies, Tech Stories, Cybersecurity, Cloud, Data Science
- Business: Business, Finance, Startups, Management, Product Management
- Lifestyle: Life Hacking, Remote Work, Gaming, Writing
- Science & Innovation: Science, Futurism, Web3
- Community: HackerNoon, Society, Media
- Special: Top Stories (trending articles)
📋 Data Fields Extracted
Each scraped article includes the following information:
Article Metadata
- id: Unique article identifier
- title: Article title
- slug: URL slug
- link: Full article URL
- excerpt: Article excerpt/summary
- tldr: Too Long; Didn't Read summary
- articleBody: Full article content
- createdAt: Publication date and time
- parentCategory: Primary category
- tags: Array of article tags
- commentsCount: Number of comments
- pageViews: Number of page views/reads
- arweave: Arweave blockchain reference
Media Information
- mainImage: Main article image URL
- mainImageHeight: Main image height (pixels)
- mainImageWidth: Main image width (pixels)
- socialPreviewImage: Social media preview image
Author Information
- author_name: Author display name
- author_handle: Author username/handle
- author_avatar: Author profile picture URL
- author_bio: Author biography
- author_isBrand: Whether the author is a brand account
- author_isTrusted: Whether the author is verified/trusted
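Because every author field shares the `author_` prefix, a flat record can be regrouped into a nested structure when that is more convenient downstream. The following is a minimal sketch, assuming records are available as plain Python dicts with the field names listed above; the function name `nest_author` is hypothetical and not part of the scraper.

```python
def nest_author(record: dict) -> dict:
    """Move every `author_*` key of a flat record into a nested `author` dict."""
    prefix = "author_"
    # Keep all non-author fields at the top level.
    nested = {k: v for k, v in record.items() if not k.startswith(prefix)}
    # Collect author fields under one key, with the prefix stripped.
    nested["author"] = {
        k[len(prefix):]: v for k, v in record.items() if k.startswith(prefix)
    }
    return nested


# Illustrative record containing only a few of the documented fields.
flat = {
    "id": "rxhvxiLNxsRwnMGivMFc",
    "title": "The Future of AI in Healthcare",
    "author_name": "Dr. Sarah Johnson",
    "author_isBrand": False,
}
print(nest_author(flat))
```

The original record is left unmodified, so both the flat and the nested form remain available.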
⚙️ Configuration
Input Parameters
The scraper accepts the following input parameters:
{
  "category": "AI",
  "max_posts": 100
}
- category (required): Choose from available categories
  - Default: "Top Stories"
  - Options: See the full list of supported categories above
- max_posts (optional): Maximum number of articles to scrape
  - Default: No limit (up to 500 articles max)
  - Range: 50-500
  - ⚠️ Note: For the "Top Stories" category, output is limited to 150 articles because it takes longer to render than the other categories.
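The limits above can be checked before a run is started. The following is a minimal sketch in Python, not part of the scraper itself: the helper name `build_input` is hypothetical, and it assumes that out-of-range values should be rejected rather than silently clamped, which the documentation does not specify.

```python
from typing import Optional

MIN_POSTS = 50          # lower bound of the documented range
MAX_POSTS = 500         # upper bound of the documented range
TOP_STORIES_MAX = 150   # documented cap for the "Top Stories" category


def build_input(category: str = "Top Stories", max_posts: Optional[int] = None) -> dict:
    """Return an input dict, validating max_posts against the documented limits."""
    run_input = {"category": category}
    if max_posts is None:
        # No explicit limit requested; the scraper applies its own maximum.
        return run_input
    upper = TOP_STORIES_MAX if category == "Top Stories" else MAX_POSTS
    if not MIN_POSTS <= max_posts <= upper:
        raise ValueError(
            f"max_posts must be between {MIN_POSTS} and {upper} for {category!r}"
        )
    run_input["max_posts"] = max_posts
    return run_input


print(build_input("AI", 100))  # {'category': 'AI', 'max_posts': 100}
```

Calling `build_input("Top Stories", 200)` raises a `ValueError`, since 200 exceeds the 150-article cap for that category.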
📊 Output Format
The scraper outputs data in JSON format, with each article as a separate record:
{
  "id": "rxhvxiLNxsRwnMGivMFc",
  "title": "The Future of AI in Healthcare",
  "slug": "the-future-of-ai-in-healthcare",
  "link": "https://hackernoon.com/the-future-of-ai-in-healthcare",
  "excerpt": "Exploring how AI is revolutionizing medical diagnosis...",
  "tldr": "AI is transforming healthcare through improved diagnostics and personalized treatment.",
  "articleBody": "Full article content here...",
  "createdAt": "2023-09-30T21:36:56.367Z",
  "mainImage": "https://hackernoon.imgix.net/images/...",
  "mainImageHeight": 1024,
  "mainImageWidth": 1536,
  "socialPreviewImage": "https://hackernoon.imgix.net/images/...",
  "parentCategory": "ai",
  "tags": ["artificial-intelligence", "healthcare", "machine-learning"],
  "commentsCount": 15,
  "pageViews": 348080,
  "arweave": "QuKn6Hew8wrwpJ9Zt0OFoeVt5yQwBQyZf30TtejOOno",
  "author_name": "Dr. Sarah Johnson",
  "author_handle": "sarahj_ai",
  "author_avatar": "https://cdn.hackernoon.com/images/...",
  "author_bio": "AI researcher and healthcare innovation expert",
  "author_isBrand": false,
  "author_isTrusted": true
}
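Once exported, records like the one above can be processed with standard tooling. The following is a minimal sketch in Python that parses the `createdAt` timestamp and ranks articles by `pageViews`. It assumes the records have been collected into a single JSON array, which is an assumption about the export rather than documented behavior; the helper names and the second sample record are made up for illustration.

```python
import json
from datetime import datetime


def parse_created_at(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as "2023-09-30T21:36:56.367Z"."""
    # Replace the trailing "Z" so fromisoformat also works on Python < 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def top_by_views(records: list, n: int = 3) -> list:
    """Return the n records with the highest pageViews, highest first."""
    return sorted(records, key=lambda r: r.get("pageViews", 0), reverse=True)[:n]


# Assumed export shape: a JSON array of article records (fields trimmed here).
raw = """[
  {"title": "Second article (made-up)", "createdAt": "2023-10-01T08:00:00.000Z", "pageViews": 120},
  {"title": "The Future of AI in Healthcare", "createdAt": "2023-09-30T21:36:56.367Z", "pageViews": 348080}
]"""

records = json.loads(raw)
for record in top_by_views(records):
    created = parse_created_at(record["createdAt"])
    print(created.date(), record["pageViews"], record["title"])
```

Records without a `pageViews` field are treated as having zero views, so they sort last instead of raising an error.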