Similarweb Scraper avatar
Similarweb Scraper
Try for free

7 days trial then $25.00/month - No credit card required now

View all Actors
Similarweb Scraper

Similarweb Scraper

tri_angle/similarweb-scraper
Try for free

7 days trial then $25.00/month - No credit card required now

A simple but powerful scraper for similarweb.com. Retrieve website popularity information and get it in a JSON/XML/CSV/Excel/HTML table format. Get data such as total visits, traffic sources, competitors, top countries, company info, etc..

AS

I am still getting now results or output

Closed

advantageous_spruce opened this issue
2 years ago

I am working on an urgent project I need help quickly for this to work. I am still not getting any output from the call it used to work a few weeks ago. Here are the logs I am also using the latest build etc.

2022-08-15T14:24:45.050Z ACTOR: Pulling Docker image from repository. 2022-08-15T14:24:46.138Z ACTOR: Creating Docker container. 2022-08-15T14:24:46.255Z ACTOR: Starting Docker container. 2022-08-15T14:24:51.340Z INFO System info {"apifyVersion":"2.3.2","apifyClientVersion":"2.3.1","osType":"Linux","nodeVersion":"v16.15.0"} 2022-08-15T14:24:53.302Z INFO Starting the crawl. 2022-08-15T14:24:53.396Z INFO CheerioCrawler:AutoscaledPool: state {"currentConcurrency":0,"desiredConcurrency":2,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":null},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.7,"actualRatio":null},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":null},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":null}}} 2022-08-15T14:24:56.943Z ERROR CheerioCrawler: handleRequestFunction failed, reclaiming failed request back to the list or queue {"url":"https://similarweb.com/website/Turing.com","retryCount":1,"id":"28FDXH31se7sSjj"} 2022-08-15T14:24:56.945Z App data JSON parse failed! Error: SyntaxError: Unexpected end of JSON input -- https://similarweb.com/website/Turing.com 2022-08-15T14:25:21.205Z ERROR CheerioCrawler: handleRequestFunction failed, reclaiming failed request back to the list or queue {"url":"https://similarweb.com/website/addepar.com","retryCount":1,"id":"FVhxj4SAdOaJc3d"} 2022-08-15T14:25:21.208Z App data JSON parse failed! Error: SyntaxError: Unexpected end of JSON input -- https://similarweb.com/website/addepar.com 2022-08-15T14:25:30.280Z ERROR CheerioCrawler: handleRequestFunction failed, reclaiming failed request back to the list or queue {"url":"https://similarweb.com/website/Turing.com","retryCount":2,"id":"28FDXH31se7sSjj"} 2022-08-15T14:25:30.283Z Error: request timed out after 30 seconds. 2022-08-15T14:25:30.285Z at Timeout._onTimeout (/usr/src/app/node_modules/@apify/timeout/index.js:84:74) 2022-08-15T14:25:30.288Z at listOnTimeout (node:internal/timers:559:17) 2022-08-15T14:25:30.290Z at processTimers (node:internal/timers:502:7) 2022-08-15T14:25:41.284Z ERROR CheerioCrawler: handleRequestFunction failed, reclaiming failed request back to the list or queue {"url":"https://similarweb.com/website/Turing.com","retryCount":3,"id":"28FDXH31se7sSjj"} 2022-08-15T14:25:41.287Z App data JSON parse failed! Error: SyntaxError: Unexpected end of JSON input -- https://similarweb.com/website/Turing.com 2022-08-15T14:25:50.784Z ERROR CheerioCrawler: Request failed and reached maximum retries {"id":"28FDXH31se7sSjj","url":"https://similarweb.com/website/Turing.com","method":"GET","uniqueKey":"https://similarweb.com/website/Turing.com"} 2022-08-15T14:25:50.786Z App data JSON parse failed! Error: SyntaxError: Unexpected end of JSON input -- https://similarweb.com/website/Turing.com 2022-08-15T14:25:53.443Z INFO Statistics: CheerioCrawler request statistics: {"requestAvgFailedDurationMillis":6374,"requestAvgFinishedDurationMillis":null,"requestsFinishedPerMinute":0,"requestsFailedPerMinute":0,"requestTotalDurationMillis":6374,"requestsTotal":1,"crawlerRuntimeMillis":60141,"retryHistogram":[null,null,null,1]} 2022-08-15T14:25:53.474Z INFO CheerioCrawler:AutoscaledPool: state {"currentConcurrency":1,"desiredConcurrency":4,"systemStatus":{"isSystemIdle":true,"memInfo":{"isOverloaded":false,"limitRatio":0.2,"actualRatio":0},"eventLoopInfo":{"isOverloaded":false,"limitRatio":0.7,"actualRatio":0.075},"cpuInfo":{"isOverloaded":false,"limitRatio":0.4,"actualRatio":0.026},"clientInfo":{"isOverloaded":false,"limitRatio":0.3,"actualRatio":0}}} 2022-08-15T14:25:54.622Z ERROR CheerioCrawler: handleRequestFunction failed, reclaiming failed request back to the list or queue {"url":"https://similarweb.com/website/addepar.com","retryCount":2,"id":"FVhxj4SAdOaJc3d"} 2022-08-15T14:25:54.624Z Error: request timed out after 30 seconds. 2022-08-15T14:25:54.626Z at Timeout._onTimeout (/usr/src/app/node_modules/@apify/timeout/index.js:84:74) 2022-08-15T14:25:54.628Z at listOnTimeout (node:internal/timers:559:17) 2022-08-15T14:25:54.630Z at processTimers (node:internal/timers:502:7) 2022-08-15T14:26:28.044Z ERROR CheerioCrawler: handleRequestFunction failed, reclaiming failed request back to the list or queue {"url":"https://similarweb.com/website/addepar.com","retryCount":3,"id":"FVhxj4SAdOaJc3d"} 2022-08-15T14:26:28.046Z Error: request timed out after 30 seconds. 2022-08-15T14:26:28.047Z at Timeout._onTimeout (/usr/src/app/node_modules/@apify/timeout/index.js:84:74) 2022-08-15T14:26:28.049Z at listOnTimeout (node:internal/timers:559:17) 2022-08-15T14:26:28.052Z at processTimers (node:internal/timers:502:7) 2022-08-15T14:26:34.562Z ERROR CheerioCrawler: Request failed and reached maximum retries {"id":"FVhxj4SAdOaJc3d","url":"https://similarweb.com/website/addepar.com","method":"GET","uniqueKey":"https://similarweb.com/website/addepar.com"} 2022-08-15T14:26:34.564Z App data JSON parse failed! Error: SyntaxError: Unexpected end of JSON input -- https://similarweb.com/website/addepar.com 2022-08-15T14:26:35.216Z INFO CheerioCrawler: All the requests from request list and/or request queue have been processed, the crawler will shut down. 2022-08-15T14:26:35.412Z INFO CheerioCrawler: Final request statistics: {"requestsFinished":0,"requestsFailed":2,"retryHistogram":[null,null,null,2],"requestAvgFailedDurationMillis":4844,"requestAvgFinishedDurationMillis":null,"requestsFinishedPerMinute":0,"requestsFailedPerMinute":1,"requestTotalDurationMillis":9687,"requestsTotal":2,"crawlerRuntimeMillis":102111} 2022-08-15T14:26:35.414Z INFO Crawl finished.

Developer
Maintained by Apify
Actor metrics
  • 56 monthly users
  • 8 stars
  • 99.9% runs succeeded
  • Created in May 2022
  • Modified about 1 month ago