3 days trial then $30.00/month - No credit card required now

AliExpress Scraper

epctex/aliexpress-scraper

3 days trial then $30.00/month - No credit card required now

Effortlessly extract descriptions, images, feedback, questions, prices, and shipping information from AliExpress. Customize country, language, and region preferences for enhanced data gathering.

All issues Create new issue

Scraper keeps getting blocked

Closed

impressive_rainforest opened this issue

All recent runs are being blocked and not successfully scraping any product information

tugkan

Hey Matt Payne,

Can you please send us one/two run IDs so that we can investigate?

Best

impressive_rainforest

IQWJXg8EsBYVMdWJk 0WpeK3TF0NC8imd6K mhZn6mwrgUZvFxGT0 WTfrpp8KU3ubofjfm

impressive_rainforest

A more recent run: qovTyrNWbcOHCtcMb

we are still encountering this error: Reclaiming failed request back to the list or queue. We got blocked. Retrying.

impressive_rainforest

I went back in my run history, on 2023-03-01 22:47 run number "6llGGtPE3UEcKA9Fi" was the last successful run I had. This was using version 1.0.162. I tried using this older version again (new run ID: "hPoCgXzXU7ssYCgqQ"), but this also failed.

On top of these requests failing, they are all fasely reporting as successful with the exit codes you are using

tugkan

Hey Matt Payne!

Thank you very much for using this public actor. If possible, can you please add https:// at the beginning of the startUrls and try it again?

Awaiting your response. Best

impressive_rainforest

yep that works, why is there no error handling / better error messages or documentation on this? feel like that is an easy one to catch

impressive_rainforest

I am still getting intermittent issues after adding the https:// at the beginning of the startUrls. Here are two run numbers Wdmk6seNwWlHxIGj5. XAHVWIyZdUi3DykS4. This error is causing great delay in the time it takes to scrape these pages

tugkan

Hey Matt Payne,

The error handling is completely our fault. I'll pass this problem as a ticket to the team. About the intermittent issues, that is expected. All the public actors have the potential to get blocked. That's why it contains the retry mechanisms and proxies in place. About the delay on the scrape, I checked the logs of the run and it seems like; scraping takes 40 seconds but the build time of the actor takes 2 minutes. Unfortunately, this is completely up to the Apify platform and there is not much to do on our side.

Best

impressive_rainforest

Is there a private option?

tugkan

Do you mean getting the same service from somewhere else? If so, unfortunately not.

Add comment

Developer

epctex

Actor metrics

23 monthly users
99.9% runs succeeded
0.8 days response time
Created in Oct 2019
Modified about 2 hours ago

Categories

E-commerce

Business

Google Maps Scraper

compass/crawler-google-places

Extract data from hundreds of Google Maps locations and businesses. Get Google Maps data including reviews, images, contact info, opening hours, location, popular times, prices & more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Compass

62.4k

Website Content Crawler

apify/website-content-crawler

Automatically crawl and extract text content from websites with documentation, knowledge bases, help centers, or blogs. This Actor is designed to provide data to feed, fine-tune, or train large language models such as ChatGPT or LLaMA.

Apify

12.9k

GPT Scraper

drobnikj/gpt-scraper

Extract data from any website and feed it into GPT via the OpenAI API. Use ChatGPT to proofread content, analyze sentiment, summarize reviews, extract contact details, and much more.

Jakub Drobník

4.2k

Amazon Product Scraper

junglee/Amazon-crawler

Use this Amazon scraper to collect data based on URL and country from the Amazon website. Extract product information without using the Amazon API, including reviews, prices, descriptions, and Amazon Standard Identification Numbers (ASINs). Download data in various structured formats.

Junglee

5.7k

AI Product Matcher

equidem/ai-product-matcher

Match products across multiple e-commerce websites. Use this AI product matching Actor whenever you need to find matching pairs of products from different online shops for dynamic pricing, competitor analysis or market research.

Matěj Sochor

308

Facebook Ads Scraper

apify/facebook-ads-scraper

Extract advertising data from one or multiple Facebook Pages. Get page details, reach estimates, publisher platforms, report count, number of impressions, ad IDs, timestamps, and more. Download Facebook ads data in JSON, CSV, and Excel and use it in apps, spreadsheets, and reports.

Apify

3.9k

📩📍 Google Maps Email Extractor

lukaskrivka/google-maps-with-contact-details

Extract Google Maps contact details. Scrape websites of Google Maps places for contact details and get email addresses, website, location, address, zipcode, phone number, social media links. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

Lukáš Křivka

3.2k

Amazon Scraper

junglee/free-amazon-product-scraper

Gets you product data from Amazon. Unofficial API. Scrapes and downloads product information without using the Amazon API, including reviews, prices, descriptions, and ASIN.

Junglee

2.6k

Fast Booking Scraper

voyager/fast-booking-scraper

Scrape Booking with this hotel scraper and get data about accommodation on Booking.com. Extract data by keywords or URLs for hotel prices, ratings, location, number of reviews, stars. Scrape and download data from Booking.com in JSON, Excel, HTML ,and CSV.

Voyager

616

Fast Google Scraper

hooli/easy-google-scraper

Make collecting data from Google easy with Fast Google Scraper. Extract organic and paid Google search engine results pages, then download your data as HTML table, JSON, CSV, Excel, or XML.