Advanced Product Hunt Scraper avatar
Advanced Product Hunt Scraper

Pricing

$15.00/month + usage

Go to Store
Advanced Product Hunt Scraper

Advanced Product Hunt Scraper

Developed by

Danil Poletaev

Danil Poletaev

Maintained by Community

Scrape product hunt "Top Products Launching Today" section. Actor crawls products and extracts information about the product: title, description, categories, images, maker info with contact links and website info with raw text and email. Export scraped datasets in JSON, csv, etc. Run via API.

5.0 (2)

Pricing

$15.00/month + usage

24

Total users

400

Monthly users

62

Runs succeeded

>99%

Issues response

16 hours

Last modified

4 days ago

BH

Yearly run stops early

Closed

bilingual_hulusi opened this issue
2 months ago

I tried running the yearly run for the year 2024, with a very high "topNProducts" limit, and it stopped after only 168 products. Any idea why it terminated so early?

danpoletaev avatar

Hi 👋

I’ve checked it, and it seems that sometimes it takes a while for the "loading" to finish (when you scroll down on the yearly page: https://www.producthunt.com/leaderboard/yearly/2024/all, you’ll see the loading issue).

I increased the timeout to 7.5 seconds, so hopefully it works better now. I also fixed another related issue.

I tried scraping 1000 products and it took about 10 minutes just to fetch them — so scraping 100,000 would take quite a while.

If you're planning to scrape that many, I’d really recommend going day by day. That way, you can resume from a specific date if something fails.

Unfortunately, Product Hunt doesn’t have pagination, so if scraping fails midway, there’s no simple way to continue from the "last" item.

P.S. I've also added information to log, where you can see how much items were already scrolled to. It will look like this "Found posts, keep scrolling: 342".

Build is still running and will finish in like 10-15 minutes. New version 2.0.3

Let me know if there’s anything else I can help with!

Thanks, Dan

BH

bilingual_hulusi

a month ago

Thanks for looking into it. Is there a way to crawl the Weekly or Monthly pages? Those would be a good compromise btw daily and yearly, which I agree could be impractically slow to crawl.

danpoletaev avatar

That's a good point! I'll add this functionality to crawl launches by week and let you know once it's ready 😉

BH

bilingual_hulusi

a month ago

Hey Danil - did you get a chance to update the crawler for Weekly pages? Would love to try it out when it's ready, thanks!

danpoletaev avatar

Hi 👋

I've just added a new feature that lets you crawl Weekly pages! To use it, please provide both the year and the week number. Also, make sure to set filter.topNProducts — if you don't, we'll default it to 100.

Let me know if everything works smoothly for you!

Thanks, Dan

BH

bilingual_hulusi

a month ago

Thanks! It worked for some runs but seems to be scraping the wrong items for other runs. For ex run YvjGHeJItnJH4FhGC

danpoletaev avatar

I would expect this to happen only very rarely. It looks like there were no products on the opened page — possibly due to infinite loading or a similar issue. I've added a fix that will automatically try again in such cases.

I'll go ahead and close the issue, but please let me know if you need anything else!