Advanced Product Hunt Scraper avatar

Advanced Product Hunt Scraper

Try for free

3 days trial then $15.00/month - No credit card required now

Go to Store
Advanced Product Hunt Scraper

Advanced Product Hunt Scraper

danpoletaev/product-hunt-scraper
Try for free

3 days trial then $15.00/month - No credit card required now

Scrape product hunt "Top Products Launching Today" section. Actor crawls products and extracts information about the product: title, description, categories, images, maker info with contact links and website info with raw text and email. Export scraped datasets in JSON, csv, etc. Run via API.

SR

Suggestions

Closed
saravana.kumar opened this issue
2 months ago

You can improve the functionality with a few little tweaks.

Currently, you are scraping everything. Instead, you can add a couple more parameters.

  1. Products with more than 100 votes.
  2. Top 10 products of the day.
  3. Return only contact details not the entire website content - this makes the dataset bigger.
danpoletaev avatar

Hi 👋

Thank you for your valuable feedback! I’ve reviewed your suggestions and implemented the changes to improve the scraping functionality. Below are the updates:

  1. You can now filter products by the minimum amount of upvotes. Use "filter.minVotes" to set minimum amount of upvotes.
  2. You can now crawl only top N amount of products. Use "filter.topNProducts" to set the amount of products to crawl.
  3. If you set "saveWebsiteContent" to false - we'll not save website content to the dataset. We'll only save email.

Here’s the updated configuration for your reference:

{ "scrapeMakers": true, "scrapeWebsite": true, "archiveDate": "2024-05-15", // Date of the day you want to scrape in the format YYYY-MM-DD. If not provided, the scraper will scrape the current day. "filter": { // Filter is optional - if you don't pass it, we'll crawl all products for the following day "minVotes": 100, // Filter products by the minimum amount of upvotes "topNProducts": 10, // Take top N amount of products }, "saveWebsiteContent": true, // Set to true, if you want to set the content of the product's website to websiteRawText. This leads to a much bigger dataset. }

Also, you can check the documentation here: https://apify.com/danpoletaev/product-hunt-scraper#input

Important note: to avoid making breaking changes. By default we scrape all products and save website content to the dataset.

I am closing the issue. Let me know if you'll face any problems. 🙏

Developer
Maintained by Community

Actor Metrics

  • 49 monthly users

  • 13 bookmarks

  • >99% runs succeeded

  • 1.1 days response time

  • Created in Feb 2024

  • Modified 11 days ago