Clutch.co Scraper avatar

Clutch.co Scraper

Try for free

3 days trial then $25.00/month - No credit card required now

View all Actors
Clutch.co Scraper

Clutch.co Scraper

curious_coder/clutch-scraper
Try for free

3 days trial then $25.00/month - No credit card required now

Scrape clutch.co and get companies information including website, name, hourly rate, reviews, logo and many more details

TJ

Fails with timeouts and didn't resurrect correctly

Closed

technological_jujube opened this issue
5 months ago

I spent a day and about $100 trying your parser, but it fails a lot, on some timeouts, when I press resurrect it actually starting from first page not from the place it stopped, so a lot of duplicates in the result file. It's unable to parse more 1k records from any page.

curious_coder avatar

Looks like most of the duplicates are "featured providers" (or ads) which show up in all pages . I didn't notice that. Will update the actor to avoid duplicates records and use caching or skip featured providers as an option. Will update here once done. My sincere apologies for the overage. and thanks for all your efforts to use my actor.

curious_coder avatar

Just sent an update which makes sure no featured listings are scraped more than once

Developer
Maintained by Community

Actor Metrics

  • 47 monthly users

  • 5 stars

  • >99% runs succeeded

  • 15 days response time

  • Created in Jun 2023

  • Modified 9 days ago

Categories