Similarweb Scraper avatar

Similarweb Scraper

Try for free

No credit card required

View all Actors
Similarweb Scraper

Similarweb Scraper

tri_angle/similarweb-scraper
Try for free

No credit card required

A simple but powerful scraper for similarweb.com. Retrieve website popularity information and get it in a JSON/XML/CSV/Excel/HTML table format. Get data such as total visits, traffic sources, competitors, top countries, company info, etc..

Do you want to learn more about this Actor?

Get a demo
NR

After 25 result it starts to show no results at all even though there is data available in Similar Web

Closed

noah_rue opened this issue
10 months ago

After 25 result it starts to show no results at all even though there is data available in Similar Web. Please see the attached screenshots. The first 25 results were perfectly fine. After the 25th it started to show almost no results

NR

noah_rue

10 months ago

i tested it again, this time with 8 gb ram. the same story but a little bit different: until the 25th result everything was fine. After that some of the results with supposedly 0 traffic were wrong, there was traffic (as seen in the screenshots). Though there were more correct answers than in the first run. Beginning from the 77th result there were again 0 results. This started later this time then the test before

NX

natural_xerocopier

10 months ago

i have the same problem. i get empty results right at the beginning - e.g. 31% of 50 requests were completed successfully; from web page #217 onwards there are only NULL values. According to the log, there are probably several problems. Firstly, the website is marked as successfully crawled, but returns NULL values. Secondly, cookie problems are probably responsible for the NULL result.

NR

noah_rue

10 months ago

just look at the other issue, i made a comment which other similar web scraper works good. its the one from curius_coder, this works way better. Only fails when the domains are redirected.

EV

eventvesta

9 months ago

We are also experiencing this issue, but it isn't happening predictably. It looks like we'll have a bunch of sites that will be scraped with correct results, then randomly the sites will pass or fail until our run is finished (we are currently attempting to do runs of > 500 urls).

It would be great if this was fixed, so that we don't have to refactor our code to split everything into 10-url chunks...

Developer
Maintained by Apify
Actor metrics
  • 122 monthly users
  • 24 stars
  • 99.9% runs succeeded
  • 6 hours response time
  • Created in May 2022
  • Modified 2 days ago