Amazon Scraper

Pay $10.00 for 1,000 results

junglee/free-amazon-product-scraper
Gets you product data from Amazon. Unofficial API. Scrapes and downloads product information without using the Amazon API, including reviews, prices, descriptions, and ASIN.

Developer
Maintained by Apify

Actor Metrics

  • 513 Monthly users

  • 4.5 / 5 (13)

  • 99 bookmarks

  • 97% runs succeeded

  • 1.9 days response time

  • Created in May 2022

  • Modified 2 days ago

One ASIN being scraped MULTIPLE times while others are skipped.

Closed
adrianbe opened this issue
2 months ago

Hello and good day to you,

in this data set: 0kD7XirigWz13KbEj

you can see that B08XY78L17 shows up 3 times in the output, which is very strange...

Could you tell me why this happened? It's messing up my data on the backend quite a lot :(

// Adrian

ruocco-l

Hello, thank you for opening this issue.

The problem comes from Amazon: when you scrape a product that has multiple "variants", like https://www.amazon.com/dp/B08XY78L17, the page defaults to the first option.

What you can do is append ?th=1 to the URL, like this: https://www.amazon.com/dp/B0B1HL48P2?th=1. This guarantees that you land on the option matching the ASIN in your URL.
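If you build your input URLs in code, the ?th=1 suggestion above could be applied with something like the following minimal Python sketch. The helper name `with_th_param` is made up for illustration and is not part of the actor; it only uses the standard library to set `th=1` while keeping any query parameters already on the URL.

```python
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse


def with_th_param(url: str) -> str:
    """Return the URL with th=1 set, keeping any other query parameters.

    Hypothetical helper for illustration only; not part of the actor.
    """
    parts = urlparse(url)
    # Parse the existing query so that an existing th value is overwritten
    # rather than duplicated.
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query["th"] = "1"
    return urlunparse(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    print(with_th_param("https://www.amazon.com/dp/B0B1HL48P2"))
```

Running it on the product URL above should give back the same URL with ?th=1 at the end.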

This run shows the result on the product you indicated: https://console.apify.com/view/runs/jwLwdIj9oBv8JuO1o

Hope this helps! Let me know if you have any more questions. Happy scraping!

lukas.prusa

Hi, we've had a second look at this and figured out the actual problem.

Some of the ASINs in your input are redirected to the same product variant. In the case of B08XY78L17, two other ASINs redirect to it: B08XY82YM6 and B08XY8247F.

Do not mistake them for product variants such as B0B1HL48P2. Those lead to a different product variant with its own ASIN, whereas the redirected ASINs all lead to the same ASIN/product.

Please use the originalAsin or input fields in your backend to tell your records apart, instead of the loaded ASIN, which can be the result of a redirect.
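On the backend side, keying records by the requested ASIN could look roughly like the Python sketch below. It assumes each output item is a dict with an `originalAsin` field (named above) and that the loaded ASIN sits in a field called `asin`; that second field name, the helper names, and the sample items are assumptions for illustration, not the actor's documented output.

```python
def index_by_requested_asin(items):
    """Key each item by the ASIN that was requested, not the one that loaded.

    Assumes 'originalAsin' holds the requested ASIN and 'asin' the loaded
    one; 'asin' is an assumed field name, so check it against your dataset.
    """
    indexed = {}
    for item in items:
        key = item.get("originalAsin") or item.get("asin")
        indexed[key] = item
    return indexed


def redirected_asins(indexed):
    """List requested ASINs whose loaded ASIN differs, i.e. redirects."""
    return [key for key, item in indexed.items() if item.get("asin") != key]


# Illustrative items built from the ASINs mentioned in this thread;
# they are not real actor output.
sample_items = [
    {"originalAsin": "B08XY78L17", "asin": "B08XY78L17"},
    {"originalAsin": "B08XY82YM6", "asin": "B08XY78L17"},
    {"originalAsin": "B08XY8247F", "asin": "B08XY78L17"},
]

if __name__ == "__main__":
    indexed = index_by_requested_asin(sample_items)
    print(sorted(indexed))
    print(redirected_asins(indexed))
```

Keyed this way, the three rows stay distinct even though they share the same loaded ASIN, and the redirected inputs can be flagged separately.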

I hope this helps, thanks and happy scraping!