Allegro Product Detail Scraper

Pricing

$1.50 / 1,000 Products

tri_angle/allegro-product-detail-scraper

Developed by

Tri⟁angle

Maintained by Apify

Allegro Scraper lets you scrape data from allegro.pl, allegro.cz and allegro.sk. Allegro is one of the most popular online shopping platforms in Europe and the largest e-commerce platform of European origin.

5.0 (5)

5

Monthly users: 16

Runs succeeded: >99%

Response time: 24 days

Last modified: 24 days ago

Can't scrape anything, bot protection

Closed
MAA opened this issue
9 months ago

Hi, I was trying to scrape data for specific products, and in every case the scraper gets blocked before it can do its job.

2024-06-27T16:27:22.220Z INFO CheerioCrawler: [PRODUCT] Processing request: https://allegro.pl/oferta/namiot-turystyczny-kempingowy-3-osobowy-wodoodporny-przedsionek-peme-15213870689
2024-06-27T16:27:22.820Z INFO CheerioCrawler: [PRODUCT] Processing request: https://allegro.pl/oferta/namiot-turystyczny-monsun-4-pro-acamper-3500mm-gratis-15479700634
2024-06-27T16:27:23.865Z INFO CheerioCrawler: [PRODUCT] Processing request: https://allegro.pl/oferta/namiot-kempingowy-quechua-2-seconds-3-osobowy-15472227114
2024-06-27T16:27:24.352Z INFO CheerioCrawler: [PRODUCT] Processing request: https://allegro.pl/oferta/przedsionek-kempingowy-quechua-arpenaz-6-osobowy-13888999931
2024-06-27T16:27:24.365Z INFO CheerioCrawler: [PRODUCT] Processing request: https://allegro.pl/oferta/namiot-abarqs-vigo-4-os-3000mm-przedsionek-tropik-wodoodporny-abarqs-15745234567
2024-06-27T16:27:25.887Z INFO [PRODUCT] Request failed, retrying... {"url":"https://allegro.pl/oferta/namiot-turystyczny-kempingowy-3-osobowy-wodoodporny-przedsionek-peme-15213870689","statusCode":200,"msg":"[PRODUCT] Detected bot protection, retrying..."}
2024-06-27T16:27:27.070Z INFO [PRODUCT] Request failed {"url":"https://allegro.pl/oferta/namiot-kempingowy-quechua-2-seconds-3-osobowy-15472227114","statusCode":200,"msg":"[PRODUCT] Detected bot protection, retrying..."}
2024-06-27T16:27:27.376Z INFO [PRO... [trimmed]

tri_angle

Hi, yes, this unfortunately happens sometimes to users on the free plan because of the small trial proxy pool. The team is already brainstorming possible improvements. I adjusted the settings of your account a bit and started a new run on your account with your input. Please check it out and let me know if you have any other issues. Have a nice weekend! :)

MAA

6 months ago

Hi. Two questions:

  1. I'm about to use your Actor at scale (hundreds of thousands of URLs), and I see that many of the reported issues involve the proxy. Will the starter pack be enough for this operation? If not, what can we do about it?
  2. Is it possible to scrape URLs like https://archiwum.allegro.pl/oferta/061-piornik-duzy-twardy-czarny-c-kemer-i14357383989.html ?

For example, this run completed 34 of 39 URLs; the rest failed (bot protection). I'm willing to pay the price, but I need to be sure that I can use it for around 1 million URLs :) https://console.apify.com/actors/Ctugh61azcuDl4nDh/runs/zyS94U2dTqeS6egOz
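For a rough sense of scale, the 34-of-39 result can be extrapolated to the planned volume. The sketch below is only back-of-the-envelope arithmetic: the function name `project_failures` is made up for illustration, and it assumes the small run's block rate carries over unchanged, which a 39-URL sample and a different proxy pool may well not support.

```python
def project_failures(succeeded: int, total: int, planned_urls: int) -> tuple[float, int]:
    """Extrapolate a small run's success rate to a larger planned scrape.

    Assumption: the block rate seen in the sample run stays the same at scale.
    """
    if total <= 0 or not 0 <= succeeded <= total:
        raise ValueError("need 0 <= succeeded <= total and total > 0")
    success_rate = succeeded / total
    # Scale the failed share of the sample up to the planned number of URLs.
    expected_failures = round(planned_urls * (total - succeeded) / total)
    return success_rate, expected_failures


# Figures from the run mentioned above: 34 of 39 URLs succeeded.
rate, failures = project_failures(34, 39, 1_000_000)
print(f"success rate: {rate:.1%}, expected blocked URLs: {failures:,}")
# prints: success rate: 87.2%, expected blocked URLs: 128,205
```

Under that assumption, roughly one URL in eight would need a retry or would stay blocked.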

tri_angle

Hi,

  1. You can scrape 1 million URLs for sure, but as you say, the issue might be blocking. A bigger proxy pool is a good start, but even with one we can't guarantee the success rate. There are retries, but even those are sometimes not enough. Are you looking for a one-time scrape of the 1M URLs, or do you want to re-scrape them on a regular basis?

Here you can check the results which I got with your input (with a much bigger proxy pool) - https://api.apify.com/v2/datasets/VRT5NN2D8dubTWFaU/items?clean=true&format=json

  2. The archiwum links are not supported.

MAA

6 months ago

Hi, thanks for the fast response!

  1. I will have about 1M URLs for a one-time scrape. I was planning to scrape them in batches of roughly 100k each. I'm prepared for part of each batch to be blocked, but I'm trying to figure out how big that part will be :).
  2. Damn :/ If you're looking for a new Actor to build, archiwum is a good idea ;)
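The batch plan described above could be sketched roughly as follows. This is an illustration, not the Actor's own logic: `scrape_in_batches`, `chunked`, and the `run_batch` callback are hypothetical names, and `run_batch` stands in for whatever starts a run on one batch and reports which URLs came back with data. The Actor's input format is not shown in this thread, so it is deliberately left out. Blocked URLs are collected and re-queued for a limited number of extra passes.

```python
from typing import Callable, Iterator


def chunked(urls: list[str], size: int) -> Iterator[list[str]]:
    """Yield consecutive slices of at most `size` URLs."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def scrape_in_batches(
    urls: list[str],
    batch_size: int,
    run_batch: Callable[[list[str]], set[str]],  # hypothetical: returns URLs that succeeded
    max_passes: int = 3,
) -> tuple[set[str], list[str]]:
    """Run batches, then re-queue whatever was blocked, up to `max_passes` passes."""
    done: set[str] = set()
    pending = list(urls)
    for _ in range(max_passes):
        if not pending:
            break
        failed: list[str] = []
        for batch in chunked(pending, batch_size):
            ok = run_batch(batch)
            done.update(ok)
            failed.extend(url for url in batch if url not in ok)
        pending = failed
    return done, pending


# Demo with a fake runner and placeholder URLs (not real offers):
# every URL ending in "0" is "blocked" on its first attempt only.
attempts: dict[str, int] = {}


def fake_run_batch(batch: list[str]) -> set[str]:
    ok = set()
    for url in batch:
        attempts[url] = attempts.get(url, 0) + 1
        if url.endswith("0") and attempts[url] == 1:
            continue
        ok.add(url)
    return ok


urls = [f"https://allegro.pl/oferta/example-{i}" for i in range(25)]
done, still_failed = scrape_in_batches(urls, batch_size=10, run_batch=fake_run_batch)
print(len(done), len(still_failed))  # prints: 25 0
```

Keeping the still-blocked URLs as a separate list makes it easy to measure the blocked share per batch, which is the number being asked about here.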

tri_angle

Ok, thanks for the info, I'm sending you an email.

Pricing

Pricing model

Pay per result 

This Actor is paid per result. You are not charged for Apify platform usage; you pay only a fixed price for every 1,000 items in the Actor's output dataset.

Price per 1,000 items

$1.50
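As a quick illustration of what this price means for the volumes discussed in the thread, here is a minimal sketch. It assumes the cost scales linearly with the number of items returned; how partial thousands are billed is not stated on this page, and `estimate_cost` is just an illustrative helper name.

```python
from decimal import Decimal

# Price taken from the pricing section above: $1.50 per 1,000 items.
PRICE_PER_1000_ITEMS = Decimal("1.50")


def estimate_cost(items: int) -> Decimal:
    """Estimate the pay-per-result cost in USD, assuming linear scaling."""
    if items < 0:
        raise ValueError("items must be non-negative")
    return (PRICE_PER_1000_ITEMS * items / 1000).quantize(Decimal("0.01"))


print(estimate_cost(100_000))    # one 100k batch -> 150.00
print(estimate_cost(1_000_000))  # the full 1M URLs -> 1500.00
```

Since the charge is per result, URLs that stay blocked and produce no item would presumably not add to this figure, though that is an inference from the pricing model rather than something the page spells out.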