
Producthunt Scraper
Pricing
$29.00/month + usage

Producthunt Scraper
A web scraper that extracts comprehensive product information from Product Hunt using Apify.
5.0 (1)
Pricing
$29.00/month + usage
3
54
12
Last modified
15 days ago
Product Hunt Scraper
A web scraper that extracts comprehensive product information from Product Hunt using Apify and Crawlee.
Quick Start
- Input: Configure your scraping parameters in the input field
- Run: Click "Start" to begin scraping
- Output: Download results from the Dataset tab
Input Configuration
Note: If you provide a Start Date (and/or End Date), the Start URLs field will be ignored. Only one method (date range OR Start URLs) will be used per run.
Basic Configuration
Daily Leaderboard Scraping
Input Parameters
| Parameter | Type | Default | Description | 
|---|---|---|---|
| startUrls | Array | ["https://www.producthunt.com/"] | Starting URLs for scraping | 
| maxRequestRetries | Number | 3 | Maximum retry attempts for failed requests | 
| maxConcurrency | Number | 5 | Number of concurrent requests | 
| maxRequestsPerCrawl | Number | 100 | Maximum pages to crawl | 
| scrapeComingSoon | Boolean | true | Whether to scrape "Coming Soon" products | 
| scrapeDailyLeaderboard | Boolean | false | Whether to scrape daily leaderboard | 
| sortByDate | Boolean | false | Whether to sort output by date | 
| sortOrder | String | "desc" | Sort order: "asc"(oldest first) or"desc"(newest first) | 
| startDate | String | null | Start date for daily leaderboard range (format: YYYY-MM-DDorYYYY/MM/DD) | 
| endDate | String | null | End date for daily leaderboard range (defaults to yesterday if not specified) | 
Input Method Exclusivity
- If you provide a Start Date (and/or End Date), the Start URLs field will be ignored.
- If you do not provide a Start Date, the scraper will use the Start URLs.
- If neither is provided, the scraper will default to scraping the daily leaderboard for yesterday (today - 1 day).
Examples
Default (no input, scrapes yesterday's leaderboard):
// Will scrape the daily leaderboard for yesterday
Date range only (Start URLs ignored):
Start URLs only (date fields empty):
Both provided (date range takes precedence):
Output Format
Regular Products
Daily Leaderboard Products
Team Extraction Output
- makers: Array of all team members with the "Maker" role. Each object contains:
- username: Product Hunt username (string)
- name: Display name (string)
- roles: Array of roles (e.g.- ["Maker"]or- ["Hunter", "Maker"])
 
- hunter: Object with the first team member who has the "Hunter" role, with:
- username: Product Hunt username (string)
- name: Display name (string)
- If no hunter is found, this field is null.
 
Usage Examples
Scrape Today's Products
Scrape Daily Leaderboard
Scrape Coming Soon Products
Scrape Specific Categories
Scrape with Date Sorting
Scrape Daily Leaderboard with Date Sorting (Oldest First)
Scrape Daily Leaderboard Date Range
Scrape Daily Leaderboard from Date to Yesterday
Combine Custom URLs with Date Range
Data Fields
| Field | Description | Available For | 
|---|---|---|
| name | Product name | All products | 
| tagline | Short product description | All products | 
| description | Detailed product description | Regular products | 
| upvotes | Number of upvotes | All products | 
| categories | Array of product categories | All products | 
| maker | Product creator name | All products | 
| launchDate | Product launch date | All products | 
| imageUrl | Product image URL | All products | 
| productUrl | Direct product website link | All products | 
| pricing | Pricing information | Regular products | 
| metaKeywords | Meta keywords | Regular products | 
| socialLinks | Social media links | Regular products | 
| scrapedFrom | Data source identifier | All products | 
| scrapedAt | Timestamp of scraping | All products | 
| sourceUrl | Original Product Hunt URL | All products | 
Performance Tips
- Concurrency: Increase maxConcurrencyfor faster scraping (be mindful of rate limits)
- Retries: Higher maxRequestRetriesvalues improve reliability but slow down scraping
- Request Limits: Adjust maxRequestsPerCrawlbased on your needs
- Proxy: The actor uses Camoufox proxy for anti-detection
- Sorting: Enable sortByDateto get chronologically ordered results (adds processing time for large datasets)
- Date Ranges: Large date ranges will generate many URLs; increase maxRequestsPerCrawlaccordingly
Troubleshooting
Common Issues
- No products found: Product Hunt may have changed their HTML structure
- Rate limiting: Reduce maxConcurrencyor add delays
- Missing data: Some products may not have all fields available
- Daily leaderboard issues: Check if the URL format is correct
Debug Mode
Enable debug logging by checking the actor logs in the Apify console.
Legal Notice
- Respect Product Hunt's robots.txt and terms of service
- Use reasonable request rates
- Use scraped data responsibly and in accordance with applicable laws
- This scraper is for educational and research purposes
Support
For issues and questions:
- Check the troubleshooting section
- Review the actor logs
- Contact Apify support
Note: Always respect website terms of service and use data responsibly.
Related Actors
- CNN Top Headlines Scraper Actor: Scrape the latest top news headlines and full article details from CNN.
On this page
- 
- 
- Scrape Today's Products
- Scrape Daily Leaderboard
- Scrape Coming Soon Products
- Scrape Specific Categories
- Scrape with Date Sorting
- Scrape Daily Leaderboard with Date Sorting (Oldest First)
- Scrape Daily Leaderboard Date Range
- Scrape Daily Leaderboard from Date to Yesterday
- Combine Custom URLs with Date Range
 
 
Share Actor:


















