Amazon Product Scraper

14 days trial then $40.00/month - No credit card required now

junglee/amazon-crawler

Use this Amazon scraper to collect data based on URL and country from the Amazon website. Extract product information without using the Amazon API, including reviews, prices, descriptions, and Amazon Standard Identification Numbers (ASINs). Download data in various structured formats.

Some Product Descriptions are empty

Closed

frank_acront opened this issue
5 months ago

Why is it that the description is often null? Sometimes it has a value but more often than not I don't get the actual description.

Example:

INPUT:
{
  "categoryOrProductUrls": [
    { "url": "https://www.amazon.com/dp/B08KDJL5CK" },
    { "url": "https://www.amazon.com/dp/B0B28G5Y4R" },
    { "url": "https://www.amazon.com/dp/B006RQ8TY0" }
  ],
  "maxItemsPerStartUrl": 100,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [ "RESIDENTIAL" ],
    "apifyProxyCountry": "US"
  },
  "useCaptchaSolver": true
}
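For more than a handful of products, an input like the one above could be generated instead of typed by hand. This is only a minimal sketch: the field names are copied from the INPUT above, `build_run_input` is a hypothetical helper (not part of the Actor), and nothing here calls the Apify platform.

```python
import json


def build_run_input(asins, country="US", max_items=100):
    """Build an input dict mirroring the INPUT example above.

    Hypothetical helper: field names are taken from the example input,
    and the URL pattern assumes amazon.com product pages.
    """
    return {
        "categoryOrProductUrls": [
            {"url": f"https://www.amazon.com/dp/{asin}"} for asin in asins
        ],
        "maxItemsPerStartUrl": max_items,
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
            "apifyProxyCountry": country,
        },
        "useCaptchaSolver": True,
    }


if __name__ == "__main__":
    run_input = build_run_input(["B08KDJL5CK", "B0B28G5Y4R", "B006RQ8TY0"])
    # Print JSON that can be pasted into the Actor's input editor.
    print(json.dumps(run_input, indent=2))
```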

OUTPUT:
{
  "title": "FANGOR WiFi Projector Bluetooth 8400mAh Battery, Rechargeable Portable Home Projector,1080P Supported Movie Projector with Sync Smartphone Screen via WiFi/USB Cable, Compatible with iPhone, Laptop",
  "asin": "B08KDJL5CK",
  "brand": "FANGOR",
  "stars": 4.1,
  "reviewsCount": 4755,
  "thumbnailImage": "https://m.media-amazon.com/images/I/81IZWRi2zhL.__AC_SX300_SY300_QL70_FMwebp_.jpg",
  "breadCrumbs": "Electronics › Video Projectors",
  "description": null,
  "price": { "value": 159.98, "currency": "$" },
  "url": "https://www.amazon.com/dp/B08KDJL5CK"
}
...

This is one of the 3 results. It has "description": null - but if you go to the page you can see it does indeed have a product description.
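To see how often this happens across a run, the results can be checked for empty descriptions. A minimal sketch, assuming the dataset items have been loaded as a list of dicts shaped like the OUTPUT above; `missing_descriptions` is a hypothetical helper and the second sample item is made up for illustration.

```python
def missing_descriptions(items):
    """Return the ASINs of items whose description is null or blank."""
    missing = []
    for item in items:
        description = item.get("description")
        # Treat None, "" and whitespace-only strings as missing.
        if description is None or not str(description).strip():
            missing.append(item.get("asin"))
    return missing


if __name__ == "__main__":
    items = [
        # Trimmed copy of the item from the OUTPUT above.
        {"asin": "B08KDJL5CK", "description": None},
        # Placeholder item, not real scraper output.
        {"asin": "EXAMPLE0001", "description": "A plain text description."},
    ]
    print(missing_descriptions(items))
```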

If it is null, then maybe you need to look in alternate places for this data?

Or maybe there is an issue when descriptions contain images, in which case only having the text would be OK for me.

Whatever the cause - I'd really like to have this data, thanks.

Hi Frank, thanks a lot for reporting this! Yes, we currently support only simple text descriptions. Extracting the full range of product description types is very hard because there are a LOT of them and they can still vary a bit depending on the seller. (By the way, if you are interested, here is a video I found showing just how many description types there are.) I've added this to our backlog and will keep you updated here, thanks!

frank_acront

5 months ago

Thanks for looking into that. Yeah I've noticed there must be many templates for product descriptions.

One more thing - if you do have time to do a more robust product description scrape - would you also consider grabbing all image links in the product descriptions? Thanks again.

Hi, thanks for the suggestion, we'll look into it as well and let you know!

Kumon

5 months ago

I am also interested in the FULL product description of each product.

Hi Kumon, thanks for joining this issue! Seeing more users join in, we will prioritize this a bit more ;)

Also for the images, yes we will definitely scrape those as well. Thanks!

Hi all,

I'm happy to inform you that we've just updated the Actor with this feature ;)

Quick summary from the changelog:

  • Added extraction of Amazon's A+ content under the aPlusContent field
    • content like text, images and videos is scraped under rawText, rawImages, and rawVideos fields
    • specific A+ content modules are scraped fully under the modules field
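With those fields in place, a consumer could fall back to the A+ content when the plain description is null. The sketch below is only an illustration: the field names aPlusContent, rawText and rawImages come from the changelog above, but their exact shapes are an assumption (here rawText may be a string or a list of strings, and rawImages a list of URL strings), and `describe` is a hypothetical helper. Check a real run's output before relying on it.

```python
def _as_list(value):
    """Normalize None, a single value, or a list/tuple into a list."""
    if value is None:
        return []
    if isinstance(value, (list, tuple)):
        return list(value)
    return [value]


def describe(item):
    """Return (text, image_urls) for one scraped item.

    Prefers the plain `description`; if it is empty, falls back to
    `aPlusContent.rawText`. The shapes of these fields are assumed.
    """
    a_plus = item.get("aPlusContent") or {}
    text = item.get("description")
    if not text:
        parts = [str(p).strip() for p in _as_list(a_plus.get("rawText"))]
        # Join non-empty text fragments; None if nothing usable was found.
        text = "\n".join(p for p in parts if p) or None
    images = _as_list(a_plus.get("rawImages"))
    return text, images


if __name__ == "__main__":
    # Made-up item for illustration, not real scraper output.
    sample = {
        "asin": "EXAMPLE0001",
        "description": None,
        "aPlusContent": {
            "rawText": ["First fragment", "Second fragment"],
            "rawImages": ["https://example.com/image-1.jpg"],
        },
    }
    text, images = describe(sample)
    print(text)
    print(images)
```

Keeping the plain description as the first choice means items that already worked before the update behave the same way.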

Try it out and let me know how it works, thanks!

Developer
Maintained by Apify
Actor metrics
  • 170 monthly users
  • 91.3% runs succeeded
  • 4.5 days response time
  • Created in Oct 2018
  • Modified 7 days ago
Categories