Zalando Scraper
Under maintenance
This Actor is under maintenance.

This Actor may be unreliable while under maintenance. Would you like to try a similar Actor instead?

lhotanova/zalando-scraper

Scrape product data from Zalando, such as images, prices, brands or product attributes. You can extract data from any of the available Zalando domains - zalando.co.uk, zalando.de, zalando.fr, zalando.it and others. Search products by categories or provide URLs of concrete products.

Crawler Not Working

Open

umermansoor opened this issue
11 days ago

First I ran the crawler on "https://www.zalando.co.uk/mens-clothing-t-shirts/". It scraped 8268 products, but the website lists 11461 in total, so it missed about 3,200 products (11461 - 8268 = 3193).

Then I tried to scrape "https://www.zalando.co.uk/womens-clothing-tops/", but the crawler did not work at all.

umermansoor

11 days ago

2024-04-02T07:10:49.403Z ACTOR: Pulling Docker image of build OhXlBnLMsSCfpjqUG from repository.
2024-04-02T07:10:51.169Z ACTOR: Creating Docker container.
2024-04-02T07:10:51.510Z ACTOR: Starting Docker container.
2024-04-02T07:10:53.964Z INFO System info {"apifyVersion":"3.1.14","apifyClientVersion":"2.8.4","crawleeVersion":"3.7.1","osType":"Linux","nodeVersion":"v16.20.2"}
2024-04-02T07:10:54.844Z INFO CheerioCrawler: Starting the crawler.
2024-04-02T07:10:57.363Z INFO CheerioCrawler: Opened category page: Women's Tops | Logo T-Shirts | Zalando {"url":"https://www.zalando.co.uk/womens-clothing-tops/"}
2024-04-02T07:10:57.381Z WARN CheerioCrawler: Reclaiming failed request back to the list or queue. Response could not be parsed
2024-04-02T07:10:57.382Z     at tryParseReponse (file:///usr/src/app/dist/utils.js:23:15) {"id":"BSSn2gMCCrYT1LQ","url":"https://www.zalando.co.uk/womens-clothing-tops/","retryCount":1}
2024-04-02T07:11:02.269Z INFO CheerioCrawler: Opened category page: Women's Tops | Logo T-Shirts | Zalando {"url":"https://www.zalando.co.uk/womens-clothing-tops/"}
2024-04-02T07:11:02.271Z WARN CheerioCrawler: Reclaiming failed request back to the list or queue. Response could not be parsed
2024-04-02T07:11:02.272Z     at tryParseReponse (file:///usr/src/app/dist/utils.js:23:15) {"id":"BSSn2gMCCrYT1LQ","url":"https://www.zalando.co.uk/womens-clothing-tops/","retryCount":2}
2024-04-02T07:11:07.849Z INFO CheerioCrawler: Opened category page: Women's Tops | Logo T-Shirts | Zalando {"url":"https://www.zalando.co.uk/womens-clothing-tops/"}
2024-04-02T07:11:07.858Z WARN CheerioCrawler: Reclaiming failed request back to the list or queue. Response could not be parsed
2024-04-02T07:11:07.859Z     at tryParseReponse (file:///usr/src/app/dist/utils.js:23:15) {"id":"BSSn2gMCCrYT1LQ","url":"https://www.zalando.co.uk/womens-clothing-tops/","retryCount":3}
2024-04-02T07:11:12.893Z INFO CheerioCrawler: Opened category page: Women's Tops | Logo T-Shirts | Zalando {"url":"https://www.zalando.co.uk/womens-clothing-tops/"}
2024-04-02T07:11:12.981Z ERROR CheerioCrawler: Request failed and reached maximum retries. Error: Response could not be parsed
2024-04-02T07:11:12.982Z     at tryParseReponse (file:///usr/src/app/dist/utils.js:23:15)
2024-04-02T07:11:12.983Z     at parseGraphqlProductUrls (file:///usr/src/app/dist/utils.js:44:30)
2024-04-02T07:11:12.984Z     at categoryRoute (file:///usr/src/app/dist/routes/categoryRoute.js:7:32)
2024-04-02T07:11:12.985Z     at CheerioCrawler.func [as requestHandler] (/usr/src/app/node_modules/@crawlee/core/router.js:172:44)
2024-04-02T07:11:12.986Z     at /usr/src/app/node_modules/@crawlee/http/internals/http-crawler.js:347:87
2024-04-02T07:11:12.986Z     at wrap (/usr/src/app/node_modules/@apify/timeout/index.js:52:27)
2024-04-02T07:11:12.988Z     at /usr/src/app/node_modules/@apify/timeout/index.js:66:7
2024-04-02T07:11:12.989Z     at AsyncLocalStorage.run (node:async_hooks:319:14)
2024-04-02T07:11:12.990Z     at /usr/src/app/node_modules/@apify/timeout/index.js:65:13
2024-04-02T07:11:12.991Z     at new Promise (
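
For anyone triaging a similar run, the log above can be summarized with a small script. This is only a sketch and is not part of the Actor: the function name summarizeRetries and the sample excerpt are hypothetical, and it assumes the log keeps the JSON shape shown above ("id", "url", "retryCount") and the "reached maximum retries" wording.

```javascript
// Hypothetical helper, not part of the Actor: reports the highest retryCount
// logged per URL and whether the run reported exhausted retries.
function summarizeRetries(logText) {
  const retries = new Map(); // url -> highest retryCount seen

  // Assumption: reclaimed requests are logged with this JSON shape, as in the log above.
  const pattern = /\{"id":"[^"]*","url":"([^"]+)","retryCount":(\d+)\}/g;
  for (const match of logText.matchAll(pattern)) {
    const url = match[1];
    const count = Number(match[2]);
    retries.set(url, Math.max(retries.get(url) ?? 0, count));
  }

  // Assumption: this message marks a request that was given up on.
  const exhausted = logText.includes('Request failed and reached maximum retries');

  return { retries: Object.fromEntries(retries), exhausted };
}

// Short excerpt modeled on the log above (stack frames omitted).
const sampleLog = [
  'WARN CheerioCrawler: Reclaiming failed request back to the list or queue. Response could not be parsed',
  '{"id":"BSSn2gMCCrYT1LQ","url":"https://www.zalando.co.uk/womens-clothing-tops/","retryCount":1}',
  '{"id":"BSSn2gMCCrYT1LQ","url":"https://www.zalando.co.uk/womens-clothing-tops/","retryCount":2}',
  '{"id":"BSSn2gMCCrYT1LQ","url":"https://www.zalando.co.uk/womens-clothing-tops/","retryCount":3}',
  'ERROR CheerioCrawler: Request failed and reached maximum retries. Error: Response could not be parsed',
].join('\n');

const summary = summarizeRetries(sampleLog);
console.log(summary);
```

In the run above, the same category URL is retried three times and every attempt fails inside tryParseReponse, which suggests the page response no longer matches what the parser expects rather than a transient network error.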

Developer
Maintained by Community
Actor metrics
  • 9 monthly users
  • 78.8% runs succeeded
  • 9.8 days response time
  • Created in May 2023
  • Modified about 1 month ago
Categories