Product Hunt Scraper
Pricing
Pay per usage
Product Hunt Scraper
Scrape Product Hunt launches and products. Extract names, taglines, votes, comments, makers, topics, ratings. Export to JSON, CSV, Excel.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Glass Ventures
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
4 days ago
Last modified
Categories
Share
Scrape products and launches from Product Hunt. Extract names, taglines, votes, comments, makers, topics, ratings, and more.
What does Product Hunt Scraper do?
Product Hunt Scraper is an Apify actor that extracts product launch data from Product Hunt, the leading platform where startups and makers showcase their products. It collects comprehensive data about products including names, taglines, descriptions, vote counts, comment counts, maker information, topics, and ratings.
The actor supports scraping from the homepage (daily launches), search results, topic pages, and individual product pages. It uses CheerioCrawler for fast, efficient HTTP-based scraping by extracting embedded NEXT_DATA and Apollo cache from Product Hunt pages.
Whether you need to track trending products, analyze startup launches, or monitor competitors, this actor provides structured data ready for analysis.
Use Cases
- Market researchers — Track trending products and startup launches across categories
- Venture capitalists — Monitor new product launches and identify promising startups early
- Product managers — Analyze competitor products, features, and community reception
- Data analysts — Build datasets of product launches for trend analysis and market insights
- Developers — Discover new developer tools and APIs as they launch
Features
- Scrape Product Hunt homepage, search results, topic pages, and individual product pages
- Extract comprehensive product data: name, tagline, votes, comments, makers, topics, rating
- Automatic URL classification — detects page type from URL pattern
- Multiple data extraction strategies: NEXT_DATA, Apollo cache, HTML fallback
- Proxy support with automatic session rotation
- Deduplication to avoid duplicate products in output
- Handles pagination and large datasets automatically
- Exports to JSON, CSV, Excel, or connect via API
How much will it cost?
| Results | Estimated Cost |
|---|---|
| 100 | ~$0.10 |
| 1,000 | ~$0.50 |
| 10,000 | ~$3.00 |
| Cost Component | Per 1,000 Results |
|---|---|
| Platform compute | ~$0.25 |
| Proxy (datacenter) | ~$0.25 |
| Total | ~$0.50 |
How to use
- Go to the Product Hunt Scraper page on Apify Store
- Click "Start" or "Try for free"
- Enter Product Hunt URLs, search terms, or topics
- Set the maximum number of items
- Click "Start" and wait for the results
Input parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
| startUrls | array | Product Hunt URLs to scrape | - |
| searchTerms | array | Search queries to find products | - |
| topics | array | Topic slugs (e.g., "artificial-intelligence") | - |
| maxItems | number | Max results to return | 100 |
| maxConcurrency | number | Parallel page limit | 10 |
| debugMode | boolean | Enable verbose logging | false |
| proxyConfig | object | Proxy settings | Apify Proxy |
Output
The actor produces a dataset with the following fields:
{"name": "ChatGPT","tagline": "An AI-powered chatbot by OpenAI","description": "ChatGPT is a conversational AI model...","url": "https://www.producthunt.com/posts/chatgpt","websiteUrl": "https://chat.openai.com","votesCount": 3200,"commentsCount": 450,"reviewsCount": 120,"rating": 4.8,"topics": ["Artificial Intelligence", "Productivity"],"makers": [{"name": "Sam Altman","username": "sama","profileUrl": "https://www.producthunt.com/@sama"}],"thumbnailUrl": "https://ph-files.imgix.net/...","launchDate": "2022-12-01T00:00:00.000Z","isFeatured": true,"scrapedAt": "2024-01-15T10:30:00.000Z"}
| Field | Type | Description |
|---|---|---|
| name | string | Product name |
| tagline | string | Product tagline |
| description | string | Product description |
| url | string | Product Hunt page URL |
| websiteUrl | string | Product website URL |
| votesCount | integer | Number of upvotes |
| commentsCount | integer | Number of comments |
| reviewsCount | integer | Number of reviews |
| rating | number | Average rating |
| topics | array | Product topics/categories |
| makers | array | Product makers with name, username, profile URL |
| thumbnailUrl | string | Product thumbnail image URL |
| launchDate | string | Date the product was launched |
| isFeatured | boolean | Whether the product was featured |
| scrapedAt | string | ISO 8601 scrape timestamp |
Integrations
Connect Product Hunt Scraper with other tools:
- Apify API — REST API for programmatic access
- Webhooks — get notified when a run finishes
- Zapier / Make — connect to 5,000+ apps
- Google Sheets — export directly to spreadsheets
API Example (Node.js)
import { ApifyClient } from 'apify-client';const client = new ApifyClient({ token: 'YOUR_TOKEN' });const run = await client.actor('YOUR_USERNAME/product-hunt-scraper').call({startUrls: [{ url: 'https://www.producthunt.com/' }],maxItems: 100,});const { items } = await client.dataset(run.defaultDatasetId).listItems();
API Example (Python)
from apify_client import ApifyClientclient = ApifyClient('YOUR_TOKEN')run = client.actor('YOUR_USERNAME/product-hunt-scraper').call(run_input={'startUrls': [{'url': 'https://www.producthunt.com/'}],'maxItems': 100,})items = client.dataset(run['defaultDatasetId']).list_items().items
API Example (cURL)
curl "https://api.apify.com/v2/acts/YOUR_USERNAME~product-hunt-scraper/runs" \-X POST \-H "Content-Type: application/json" \-H "Authorization: Bearer YOUR_TOKEN" \-d '{"startUrls": [{"url": "https://www.producthunt.com/"}], "maxItems": 100}'
Tips and tricks
- Start with a small
maxItems(10-20) to test before running large scrapes - Use topics to scrape products in specific categories like "artificial-intelligence" or "developer-tools"
- Combine search terms and topics for comprehensive data collection
- The actor automatically deduplicates products across different input sources
FAQ
Q: Does this actor require login credentials? A: No. Product Hunt product data is publicly accessible without authentication.
Q: How fast is the scraping? A: Approximately 50-100 products per minute using CheerioCrawler with default concurrency.
Q: What should I do if I get blocked? A: Switch to residential proxies in the Proxy Configuration settings and lower the concurrency.
Q: Can I scrape specific topics? A: Yes. Use the Topics input with slugs like "artificial-intelligence", "saas", "developer-tools".
Is it legal to scrape Product Hunt?
Web scraping of publicly available data is generally legal based on precedents like the LinkedIn v. HiQ Labs case. This actor only accesses publicly available data. Always review and respect the target site's Terms of Service and robots.txt. For more information, see Apify's blog on web scraping legality.
Limitations
- Product Hunt may change their page structure, which could require actor updates
- Rate limiting may apply — use proxies for large-scale scraping
- Some detailed product information may only be available on individual product pages
- Historical data beyond what's currently visible on the site is not accessible
Changelog
- v0.1 (2026-04-23) — Initial release