Amazon Review Scraper

Pricing

from $5.00 / 1,000 results

Amazon Reviews Scraper extracts structured customer reviews from Amazon product pages, including ratings, text, titles, dates, variants, helpful votes, images, and review URLs. It supports pagination, proxy use, live dataset updates, and marketplace URL normalization.

Rating: 5.0 (1)

Developer: Lorenzo Cerqua (maintained by Community)

Actor stats

  • Bookmarked: 1
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 16 hours ago

What does Amazon Reviews Scraper do?

Amazon Reviews Scraper extracts customer reviews from Amazon product review pages such as https://www.amazon.it/product-reviews/B0DT17JXF2/. It collects review text, ratings, titles, reviewer names, dates, verified purchase badges, variants, helpful vote counts, images, review URLs, ASINs, marketplace domains, and source page metadata.

The Actor accepts either Amazon product URLs or Amazon review URLs from any supported Amazon marketplace. It extracts the ASIN, opens the matching /product-reviews/{ASIN}/ page on amazon.it, uses Italian Amazon cookies when bundled, and requests the page with English language settings. Running it on the Apify platform gives you API access, scheduling, integrations, proxy rotation, monitoring, and downloadable datasets.

Why use Amazon Reviews Scraper?

Amazon reviews are useful for product research, competitor monitoring, customer sentiment analysis, quality tracking, marketplace audits, and Voice of Customer workflows. Instead of manually browsing page after page, this Actor builds a structured dataset that can be exported to JSON, CSV, Excel, HTML, or consumed directly through the Apify API.

Common use cases include:

  • Tracking what customers like or dislike about a product.
  • Comparing verified purchase feedback across competing ASINs.
  • Monitoring review volume and review quality over time.
  • Collecting examples for support, product, marketing, or listing optimization teams.
  • Building downstream NLP or sentiment analysis pipelines.

How to use Amazon Reviews Scraper

  1. Open the Actor on Apify.
  2. Add one or more Amazon product or review URLs in the Input tab.
  3. Choose optional limits such as maximum reviews or maximum pages per product.
  4. Keep the residential proxy enabled for better reliability on Amazon.
  5. Run the Actor.
  6. Download the dataset from the Output tab or access it through the Apify API.

You can paste product links from any supported Amazon marketplace; the Actor extracts the ASIN and always opens the Italian Amazon review page in English.
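
The ASIN extraction and URL normalization described above can be sketched roughly as follows. This is an illustration only: the path patterns (`/dp/`, `/gp/product/`, `/product-reviews/`) and the helper names `extract_asin` and `to_italian_reviews_url` are assumptions, not the Actor's actual code.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Path segments that commonly precede an ASIN in Amazon URLs.
# Assumed for illustration; the Actor may recognize other URL shapes.
_ASIN_PATTERN = re.compile(r"/(?:dp|gp/product|product-reviews)/([A-Z0-9]{10})(?:/|$)")


def extract_asin(url: str) -> Optional[str]:
    """Return the 10-character ASIN found in the URL path, or None."""
    match = _ASIN_PATTERN.search(urlparse(url).path)
    return match.group(1) if match else None


def to_italian_reviews_url(url: str) -> Optional[str]:
    """Map any product or review URL to the amazon.it review page for its ASIN."""
    asin = extract_asin(url)
    if asin is None:
        return None
    return f"https://www.amazon.it/product-reviews/{asin}/"
```

For example, under these assumptions a link such as https://www.amazon.in/product-reviews/B0DT17JXF2 would map to https://www.amazon.it/product-reviews/B0DT17JXF2/.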

Input

The main input field is start_urls, which accepts Amazon product URLs or review URLs.

Example input:

{
  "start_urls": [
    {
      "url": "https://www.amazon.it/product-reviews/B0DT17JXF2/?reviewerType=all_reviews"
    }
  ],
  "max_reviews_per_product": 100,
  "max_pages_per_product": 10,
  "reviewer_type": "all_reviews",
  "filter_by_star": "",
  "fallback_marketplaces": "amazon.it",
  "use_proxy": true,
  "proxy_country": "IT",
  "max_retries": 5,
  "max_concurrency": 3,
  "delay_min_seconds": 1,
  "delay_max_seconds": 3
}

Input fields:

Field | Description
start_urls | Amazon product or review URLs to scrape. The ASIN is extracted and opened on /product-reviews/{ASIN}/ on amazon.it.
max_reviews_per_product | Maximum reviews per product. Use 0 to continue until no next page is available.
max_pages_per_product | Maximum review pages per product. Use 0 for unlimited pages.
reviewer_type | Amazon reviewer type, such as all_reviews or avp_only_reviews.
filter_by_star | Optional star filter, such as five_star, one_star, positive, or critical.
fallback_marketplaces | Deprecated. The Actor always opens ASINs on amazon.it.
use_proxy | Enables Apify Residential Proxy. Recommended for Amazon.
proxy_country | Deprecated. The Actor always uses IT because scraping is fixed to amazon.it.
max_retries | Number of retry attempts per URL.
max_concurrency | Number of products scraped in parallel.
delay_min_seconds / delay_max_seconds | Random delay range between review pages for the same product.
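
As a rough sketch, the input above can also be built and submitted from Python with the `apify-client` package. The helpers `build_run_input` and `run_actor` are hypothetical names, the deprecated fields are left out, and the Actor ID is not stated in this README, so it is passed in as a parameter (copy it from the Actor's API tab).

```python
from typing import Any


def build_run_input(urls: list[str], max_reviews: int = 100, max_pages: int = 10) -> dict[str, Any]:
    """Build an input object using the documented field names (0 means unlimited)."""
    if max_reviews < 0 or max_pages < 0:
        raise ValueError("limits must be 0 (unlimited) or a positive number")
    return {
        "start_urls": [{"url": url} for url in urls],
        "max_reviews_per_product": max_reviews,
        "max_pages_per_product": max_pages,
        "reviewer_type": "all_reviews",
        "filter_by_star": "",
        "use_proxy": True,
        "max_retries": 5,
        "max_concurrency": 3,
        "delay_min_seconds": 1,
        "delay_max_seconds": 3,
    }


def run_actor(token: str, actor_id: str, run_input: dict[str, Any]) -> list[dict[str, Any]]:
    """Start a run, wait for it to finish, and return the dataset items."""
    # Imported lazily so build_run_input works without apify-client installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A small test run might call `build_run_input([...], max_reviews=20, max_pages=2)` first and only raise the limits once the output looks right.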

Output

Each dataset item represents one Amazon review. On Apify, reviews are pushed to the dataset page by page while the Actor is still running, so you can inspect partial results before the full product finishes. The Actor also stores a per-ASIN summary in the default key-value store.

Example output item:

{
  "scraped_at": "2026-06-18T21:30:00.000000+00:00",
  "asin": "B0DT17JXF2",
  "marketplace": "amazon.in",
  "review_id": "R123EXAMPLE",
  "reviewer_name": "Alex",
  "rating": 5,
  "rating_text": "5.0 out of 5 stars",
  "title": "Great product",
  "review_country": "Italy",
  "review_date": "18 June 2026",
  "verified_purchase": true,
  "verified_purchase_text": "Verified Purchase",
  "variant": "Colour Name: Black",
  "body": "The customer review is saved as clean text.",
  "helpful_votes": 3,
  "images": ["https://m.media-amazon.com/images/I/example.jpg"],
  "review_url": "https://www.amazon.in/gp/customer-reviews/R123EXAMPLE",
  "page_number": 1,
  "source_url": "https://www.amazon.in/product-reviews/B0DT17JXF2/?reviewerType=all_reviews&pageNumber=1"
}

You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Data table

Field | Description
asin | Amazon product ASIN.
product_title | Product title detected on the reviews page.
marketplace | Marketplace domain from the user's input URL, such as amazon.in.
review_id | Unique Amazon review ID when available.
reviewer_name | Public reviewer display name.
rating | Numeric star rating.
title | Review title.
review_country | Country/marketplace text extracted from the review date line.
review_date | Review date text as shown by Amazon.
verified_purchase | Whether Amazon shows a verified purchase badge.
variant | Product variant or format strip shown on the review.
body | Cleaned review body text.
helpful_votes | Number of helpful votes when shown.
images | Review image URLs.
review_url | Direct URL to the individual review when available.
page_number | Review page number where the item was found.
source_url | Original user-provided URL or the equivalent input-marketplace URL.
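
For downstream analysis it can help to load dataset items into a typed record. The sketch below is one possible approach, not part of the Actor: the `Review` class and both helper functions are hypothetical, and the date parser assumes the English "18 June 2026" format shown in the example item (`%B` depends on the process locale, which is English by default in Python).

```python
from dataclasses import dataclass
from datetime import date, datetime
from typing import Any, Optional


@dataclass
class Review:
    review_id: str
    asin: str
    rating: int
    title: str
    body: str
    review_date: Optional[date]
    verified_purchase: bool
    helpful_votes: int


def parse_review_date(text: str) -> Optional[date]:
    """Parse dates like '18 June 2026'; return None for any other format."""
    try:
        return datetime.strptime(text.strip(), "%d %B %Y").date()
    except ValueError:
        return None


def to_review(item: dict[str, Any]) -> Review:
    """Convert one dataset item, tolerating fields that are only present 'when available'."""
    return Review(
        review_id=item.get("review_id") or "",
        asin=item.get("asin") or "",
        rating=int(item.get("rating") or 0),
        title=item.get("title") or "",
        body=item.get("body") or "",
        review_date=parse_review_date(item.get("review_date") or ""),
        verified_purchase=bool(item.get("verified_purchase")),
        helpful_votes=int(item.get("helpful_votes") or 0),
    )
```

Fields that are missing or empty fall back to neutral defaults, so a review without a helpful vote count is recorded with `helpful_votes` set to 0.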

Pricing / Cost estimation

How much does it cost to scrape Amazon reviews? The main cost drivers are the number of review pages, proxy usage, retries, and run duration. A product with a small number of reviews may finish in seconds, while large ASINs with thousands of reviews will require more requests.

To control cost, set max_reviews_per_product or max_pages_per_product. New Apify users can usually test small runs within the free tier, but larger Amazon review exports may require paid platform usage and proxy traffic.
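
A back-of-the-envelope estimate of the per-result charge can be sketched as below. It assumes the listed starting price of $5.00 per 1,000 results applies to every review and ignores platform usage and proxy traffic, so treat the result as a rough lower bound. Both function names are hypothetical.

```python
def capped_review_count(available: int, max_reviews_per_product: int) -> int:
    """Reviews expected for one product; a limit of 0 means no cap."""
    if max_reviews_per_product == 0:
        return available
    return min(available, max_reviews_per_product)


def estimate_result_cost(num_reviews: int, usd_per_1000: float = 5.00) -> float:
    """Estimated per-result charge in USD, rounded to cents."""
    if num_reviews < 0:
        raise ValueError("num_reviews must not be negative")
    return round(num_reviews * usd_per_1000 / 1000, 2)
```

With max_reviews_per_product set to 100, a product with 2,500 reviews would yield at most 100 results, or about $0.50 at the assumed rate.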

Tips and advanced options

  • Keep use_proxy enabled for production Amazon scraping.
  • Links from other Amazon marketplaces, such as amazon.co.uk or amazon.in, are converted to amazon.it before scraping. The proxy country is fixed to IT.
  • If the ASIN is not available on Amazon Italy, the Actor will not try other marketplaces automatically.
  • If a filtered /product-reviews/ page is blocked, the Actor can fall back to the product detail page and collect the reviews visible there. This fallback is partial because Amazon product pages usually expose fewer reviews than the dedicated review pages.
  • Cookie handling is automatic and not exposed in the input form. If amazon_cookies.json is bundled with the Actor, the scraper loads those sessions internally and warms them up on the Amazon marketplace homepage. During the run it refreshes cookie values in memory from valid Amazon Set-Cookie responses and skips sessions that Amazon redirects to login.
  • Requests use marketplace-aware language headers. Requests to amazon.it prefer an English locale so badges, dates, labels, and product metadata are returned in English where Amazon supports it.
  • Watch the live log for page-level progress: each page reports found reviews, newly pushed reviews, cumulative total, and whether a next page token was found.
  • When no star filter is selected, the Actor automatically runs extra passes for five_star, four_star, three_star, two_star, and one_star to work around Amazon's 100-review cap on a single review view, then deduplicates review IDs.
  • The Actor uses the Italian marketplace, Italian Amazon cookies, and English language settings for consistent output.
  • Start with max_pages_per_product: 2 or max_reviews_per_product: 20 while testing selectors and output.
  • Use filter_by_star when you only need positive, critical, or specific star reviews.
  • Lower max_concurrency if Amazon starts returning CAPTCHA or blocked pages.
  • Increase delays between pages for large runs to reduce block risk.
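
The per-star passes and deduplication mentioned in the tips can be pictured with the minimal sketch below. It only shows the merge step; `merge_star_passes` is a hypothetical helper, and keeping items that lack a review_id (rather than dropping them) is an assumption about how such items should be handled.

```python
from typing import Any, Iterable

# Star filters named in the tips above, in the order the passes are listed.
STAR_FILTERS = ["five_star", "four_star", "three_star", "two_star", "one_star"]


def merge_star_passes(passes: Iterable[Iterable[dict[str, Any]]]) -> list[dict[str, Any]]:
    """Merge reviews from several filtered passes, keeping the first copy of each review_id."""
    seen: set[str] = set()
    merged: list[dict[str, Any]] = []
    for reviews in passes:
        for review in reviews:
            review_id = review.get("review_id")
            if review_id is None:
                # Without an ID there is nothing to deduplicate on, so keep the item.
                merged.append(review)
                continue
            if review_id in seen:
                continue
            seen.add(review_id)
            merged.append(review)
    return merged
```

Because the first occurrence wins, the order of the passes decides which copy of a duplicated review is kept.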

FAQ, disclaimers, and support

Scraping publicly available web data can be legal depending on your jurisdiction, purpose, and how you use the data. You are responsible for making sure your use complies with applicable laws, Amazon terms, privacy rules, and platform policies. Do not collect or store sensitive personal data unless you have a lawful basis and explicit permission.

Why did the Actor stop before collecting every review?

The Actor stops when Amazon no longer exposes a next page token, when it reaches your configured limits, or when Amazon returns a blocked/CAPTCHA page. Check the key-value store summary for stop_reason and page diagnostics.

Amazon can cap each individual review view at 100 items, even when the product has more text reviews. When possible, this Actor expands the crawl through star filters and deduplicates the results. Some very large star buckets can still remain partially hidden if Amazon caps that filtered view too.

Does the Actor use nextPageToken?

Yes. For each page, it extracts Amazon's nextPageToken from hidden inputs, links, AJAX fragments, or embedded markup. The Actor requests tokenized review pages with nextPageToken, pageNumber, asin, and reftag, then falls back to the next review page URL when needed.
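
The stop conditions and token-based pagination described in the two answers above can be sketched as a simple loop. Everything here is hypothetical: the `fetch_page` callable, the page shape (`reviews`, `next_page_token`, `blocked`), and the stop reason strings are illustrative and may not match the values the Actor writes to its summary.

```python
from typing import Any, Callable, Optional

# Hypothetical page shape: {"reviews": [...], "next_page_token": str or None, "blocked": bool}
Page = dict[str, Any]


def crawl_reviews(
    fetch_page: Callable[[int, Optional[str]], Page],
    max_pages: int = 0,
    max_reviews: int = 0,
) -> tuple[list[dict[str, Any]], str]:
    """Follow next-page tokens until a limit, a block, or the last page is reached."""
    reviews: list[dict[str, Any]] = []
    token: Optional[str] = None
    page_number = 1
    while True:
        page = fetch_page(page_number, token)
        if page.get("blocked"):
            return reviews, "blocked"
        reviews.extend(page.get("reviews", []))
        # A limit of 0 means unlimited, matching the input field semantics.
        if max_reviews and len(reviews) >= max_reviews:
            return reviews[:max_reviews], "max_reviews_reached"
        token = page.get("next_page_token")
        if not token:
            return reviews, "no_next_page"
        if max_pages and page_number >= max_pages:
            return reviews, "max_pages_reached"
        page_number += 1
```

Passing the fetcher in as a parameter keeps the loop testable with canned pages, without any network access.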

Where can I report issues?

Use the Actor's Issues tab on Apify for bugs, selector changes, or feature requests. Custom scraping solutions and extensions can be built for additional Amazon fields, review media, scheduling, or downstream analysis.