AliExpress Scraper
3 days trial then $30.00/month - No credit card required now

epctex/aliexpress-scraper

Effortlessly extract descriptions, images, feedback, questions, prices, and shipping information from AliExpress. Customize country, language, and region preferences for enhanced data gathering.

Scraper keeps getting blocked

Closed

impressive_rainforest opened this issue
a year ago

All recent runs are being blocked and fail to scrape any product information.

tugkan

a year ago

Hey Matt Payne,

Can you please send us one or two run IDs so that we can investigate?

Best

impressive_rainforest

a year ago

IQWJXg8EsBYVMdWJk 0WpeK3TF0NC8imd6K mhZn6mwrgUZvFxGT0 WTfrpp8KU3ubofjfm

impressive_rainforest

a year ago

A more recent run: qovTyrNWbcOHCtcMb

We are still encountering this error: "Reclaiming failed request back to the list or queue. We got blocked. Retrying."

impressive_rainforest

a year ago

I went back through my run history. The last successful run was "6llGGtPE3UEcKA9Fi" on 2023-03-01 22:47, using version 1.0.162. I tried that older version again (new run ID: "hPoCgXzXU7ssYCgqQ"), but it also failed.

On top of these requests failing, the runs are all falsely reported as successful by the exit codes you are using.
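One way a caller could guard against runs that report success but return nothing is to check the result count as well as the status. The sketch below is a minimal illustration, not part of the actor: `run_produced_data` and `min_items` are hypothetical names, and it assumes the caller has already fetched the run's status string and the number of items in its dataset, and that a finished run is reported with the status "SUCCEEDED".

```python
def run_produced_data(status: str, item_count: int, min_items: int = 1) -> bool:
    """Return True only if the run finished cleanly AND returned data.

    status:     run status string, assumed to be "SUCCEEDED" for a clean finish
    item_count: number of items the run wrote to its dataset
    min_items:  hypothetical threshold below which the run counts as failed
    """
    finished_cleanly = status == "SUCCEEDED"
    has_enough_data = item_count >= min_items
    return finished_cleanly and has_enough_data


if __name__ == "__main__":
    # A run that "succeeded" but scraped nothing is treated as a failure.
    print(run_produced_data("SUCCEEDED", 0))
    print(run_produced_data("SUCCEEDED", 25))
```

Checking the item count separately means a blocked run cannot pass just because the process exited normally.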

tugkan

a year ago

Hey Matt Payne!

Thank you very much for using this public actor. If possible, can you please add https:// at the beginning of the startUrls and try it again?

Awaiting your response. Best

impressive_rainforest

a year ago

Yep, that works. Why is there no error handling, better error messages, or documentation for this? It feels like an easy one to catch.
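A check of this kind could look roughly like the sketch below, which adds a missing https:// prefix and rejects anything that is still not an http(s) URL. It is an illustration under assumptions, not the actor's code: `normalize_start_url` and `default_scheme` are hypothetical names, and the example URL is made up.

```python
from urllib.parse import urlparse


def normalize_start_url(url: str, default_scheme: str = "https") -> str:
    """Return the URL with a scheme, adding default_scheme if it is missing.

    Raises ValueError with a readable message when the input cannot be
    turned into an http(s) URL, instead of failing later during scraping.
    """
    cleaned = url.strip()
    if not cleaned:
        raise ValueError("start URL is empty")

    if cleaned.startswith("//"):
        # Protocol-relative URL such as //www.aliexpress.com/...
        cleaned = f"{default_scheme}:{cleaned}"
    elif "://" not in cleaned:
        # Bare host/path such as www.aliexpress.com/...
        cleaned = f"{default_scheme}://{cleaned}"

    parsed = urlparse(cleaned)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not a valid http(s) URL: {url!r}")
    return cleaned


if __name__ == "__main__":
    print(normalize_start_url("www.aliexpress.com/item/123.html"))
```

Running this over every entry in startUrls before the crawl begins would turn a silent block-and-retry loop into either a corrected URL or an immediate, descriptive error.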

impressive_rainforest

a year ago

I am still getting intermittent issues after adding https:// at the beginning of the startUrls. Here are two run IDs: Wdmk6seNwWlHxIGj5 and XAHVWIyZdUi3DykS4. This error greatly increases the time it takes to scrape these pages.

tugkan

a year ago

Hey Matt Payne,

The missing error handling is completely our fault. I'll pass this on to the team as a ticket. As for the intermittent issues, those are expected: all public actors can get blocked, which is why this one has retry mechanisms and proxies in place. As for the delay, I checked the logs of the run, and it seems that scraping takes 40 seconds while the build time of the actor takes 2 minutes. Unfortunately, that is entirely up to the Apify platform, and there is not much we can do on our side.

Best
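For readers unfamiliar with the retry mechanism mentioned above, the general idea can be sketched as retrying a blocked request with exponential backoff. This is a generic illustration, not the actor's actual implementation: `BlockedError`, `fetch_with_retries`, and all parameters are hypothetical.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class BlockedError(Exception):
    """Hypothetical error raised when the target site blocks a request."""


def fetch_with_retries(
    fetch: Callable[[], T],
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fetch(), retrying on BlockedError with exponential backoff.

    The delay doubles after each blocked attempt: base_delay, 2x, 4x, ...
    After max_retries retries, the last BlockedError is re-raised.
    """
    if max_retries < 0:
        raise ValueError("max_retries must be non-negative")
    for attempt in range(max_retries + 1):
        try:
            return fetch()
        except BlockedError:
            if attempt == max_retries:
                raise  # out of retries: surface the failure to the caller
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")
```

Each blocked attempt adds waiting time, which is one reason a run that gets blocked intermittently takes noticeably longer than one that succeeds on the first try.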

impressive_rainforest

a year ago

Is there a private option?

tugkan

a year ago

Do you mean getting the same service from somewhere else? If so, unfortunately not.

Developer
Maintained by Community
Actor metrics
  • 23 monthly users
  • 99.9% runs succeeded
  • 0.8 days response time
  • Created in Oct 2019
  • Modified about 2 hours ago