Amazon Review Scraper
Pricing
from $5.00 / 1,000 results
Amazon Reviews Scraper extracts structured customer reviews from Amazon product pages, including ratings, text, titles, dates, variants, helpful votes, images, and review URLs. It supports pagination, proxy use, live dataset updates, and marketplace URL normalization.
Rating: 5.0 (1)
Developer: Lorenzo Cerqua
Maintained by Community
Actor stats: 1 bookmarked, 2 total users, 1 monthly active user, last modified 16 hours ago
What does Amazon Reviews Scraper do?
Amazon Reviews Scraper extracts customer reviews from Amazon product review pages such as https://www.amazon.it/product-reviews/B0DT17JXF2/. It collects review text, ratings, titles, reviewer names, dates, verified purchase badges, variants, helpful vote counts, images, review URLs, ASINs, marketplace domains, and source page metadata.
The Actor accepts either Amazon product URLs or Amazon review URLs from any supported Amazon marketplace. It extracts the ASIN, opens the matching /product-reviews/{ASIN}/ page on amazon.it, uses Italian Amazon cookies when bundled, and requests the page with English language settings. Running it on the Apify platform gives you API access, scheduling, integrations, proxy rotation, monitoring, and downloadable datasets.
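The ASIN extraction and URL normalization described above could be sketched as follows. This is a minimal illustration, not the Actor's actual code: the function names and the set of recognized path shapes (`/dp/`, `/gp/product/`, `/product-reviews/`) are assumptions.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Assumption: ASINs are 10 uppercase alphanumeric characters and appear after
# one of these common path segments. Real Amazon URLs may use other shapes.
ASIN_PATTERN = re.compile(r"/(?:dp|gp/product|product-reviews)/([A-Z0-9]{10})(?:/|$)")


def extract_asin(url: str) -> Optional[str]:
    """Return the ASIN found in an Amazon product or review URL, or None."""
    match = ASIN_PATTERN.search(urlparse(url).path)
    return match.group(1) if match else None


def to_italian_review_url(url: str) -> Optional[str]:
    """Map any recognized Amazon URL to the amazon.it review page for its ASIN."""
    asin = extract_asin(url)
    if asin is None:
        return None
    return f"https://www.amazon.it/product-reviews/{asin}/"
```

With this sketch, a link such as `https://www.amazon.co.uk/Some-Product/dp/B0DT17JXF2?th=1` maps to `https://www.amazon.it/product-reviews/B0DT17JXF2/`, and a URL without a recognizable ASIN yields `None`.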
Why use Amazon Reviews Scraper?
Amazon reviews are useful for product research, competitor monitoring, customer sentiment analysis, quality tracking, marketplace audits, and Voice of Customer workflows. Instead of manually browsing page after page, this Actor builds a structured dataset that can be exported to JSON, CSV, Excel, HTML, or consumed directly through the Apify API.
Common use cases include:
- Tracking what customers like or dislike about a product.
- Comparing verified purchase feedback across competing ASINs.
- Monitoring review volume and review quality over time.
- Collecting examples for support, product, marketing, or listing optimization teams.
- Building downstream NLP or sentiment analysis pipelines.
How to use Amazon Reviews Scraper
- Open the Actor on Apify.
- Add one or more Amazon product or review URLs in the Input tab.
- Choose optional limits such as maximum reviews or maximum pages per product.
- Keep the residential proxy enabled for better reliability on Amazon.
- Run the Actor.
- Download the dataset from the Output tab or access it through the Apify API.
You can paste product links from any supported Amazon marketplace; the Actor extracts the ASIN and always opens the Italian Amazon review page in English.
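For API access instead of the web console, the steps above might look like the sketch below, which uses only the Python standard library and the Apify API's synchronous run endpoint. The Actor ID shown in the usage note is a placeholder, not this Actor's real identifier, and the helper names are hypothetical.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Prepare a POST request that runs an Actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict) -> list:
    """Send the request and decode the JSON array of review items."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

A call would look like `run_actor("username~actor-name", "<YOUR_APIFY_TOKEN>", {"start_urls": [...]})`, with the placeholder replaced by the Actor ID shown on its Apify page. The synchronous endpoint suits small test runs; larger exports are usually better started asynchronously and read from the dataset afterwards.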
Input
The main input field is start_urls, which accepts Amazon product URLs or review URLs.
Example input:
{
  "start_urls": [
    {"url": "https://www.amazon.it/product-reviews/B0DT17JXF2/?reviewerType=all_reviews"}
  ],
  "max_reviews_per_product": 100,
  "max_pages_per_product": 10,
  "reviewer_type": "all_reviews",
  "filter_by_star": "",
  "fallback_marketplaces": "amazon.it",
  "use_proxy": true,
  "proxy_country": "IT",
  "max_retries": 5,
  "max_concurrency": 3,
  "delay_min_seconds": 1,
  "delay_max_seconds": 3
}
Input fields:
| Field | Description |
|---|---|
| start_urls | Amazon product or review URLs to scrape. The ASIN is extracted and opened on /product-reviews/{ASIN}/ on amazon.it. |
| max_reviews_per_product | Maximum reviews per product. Use 0 to continue until no next page is available. |
| max_pages_per_product | Maximum review pages per product. Use 0 for unlimited pages. |
| reviewer_type | Amazon reviewer type, such as all_reviews or avp_only_reviews. |
| filter_by_star | Optional star filter, such as five_star, one_star, positive, or critical. |
| fallback_marketplaces | Deprecated. The Actor always opens ASINs on amazon.it. |
| use_proxy | Enables Apify Residential Proxy. Recommended for Amazon. |
| proxy_country | Deprecated. The Actor always uses IT because scraping is fixed to amazon.it. |
| max_retries | Number of retry attempts per URL. |
| max_concurrency | Number of products scraped in parallel. |
| delay_min_seconds / delay_max_seconds | Random delay range between review pages for the same product. |
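When building the input programmatically, a small helper can catch inconsistent values before a run starts. The sketch below is an illustration only: the helper name and the specific checks are assumptions, not the Actor's own validation, and the deprecated fields are left out.

```python
def build_run_input(urls, max_reviews=100, max_pages=10,
                    reviewer_type="all_reviews", filter_by_star="",
                    delay_min=1, delay_max=3):
    """Assemble an input dict using the field names from the table above."""
    if not urls:
        raise ValueError("start_urls needs at least one URL")
    if max_reviews < 0 or max_pages < 0:
        raise ValueError("limits must be 0 (no limit) or a positive number")
    if delay_min < 0 or delay_min > delay_max:
        raise ValueError("delay range must satisfy 0 <= min <= max")
    return {
        "start_urls": [{"url": url} for url in urls],
        "max_reviews_per_product": max_reviews,
        "max_pages_per_product": max_pages,
        "reviewer_type": reviewer_type,
        "filter_by_star": filter_by_star,
        "use_proxy": True,  # recommended for Amazon, per the table above
        "max_retries": 5,
        "max_concurrency": 3,
        "delay_min_seconds": delay_min,
        "delay_max_seconds": delay_max,
    }
```

For a quick test run, `build_run_input([url], max_reviews=20, max_pages=2)` keeps the crawl small.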
Output
Each dataset item represents one Amazon review. On Apify, reviews are pushed to the dataset page by page while the Actor is still running, so you can inspect partial results before the full product finishes. The Actor also stores a per-ASIN summary in the default key-value store.
Example output item:
{
  "scraped_at": "2026-06-18T21:30:00.000000+00:00",
  "asin": "B0DT17JXF2",
  "marketplace": "amazon.in",
  "review_id": "R123EXAMPLE",
  "reviewer_name": "Alex",
  "rating": 5,
  "rating_text": "5.0 out of 5 stars",
  "title": "Great product",
  "review_country": "Italy",
  "review_date": "18 June 2026",
  "verified_purchase": true,
  "verified_purchase_text": "Verified Purchase",
  "variant": "Colour Name: Black",
  "body": "The customer review is saved as clean text.",
  "helpful_votes": 3,
  "images": ["https://m.media-amazon.com/images/I/example.jpg"],
  "review_url": "https://www.amazon.in/gp/customer-reviews/R123EXAMPLE",
  "page_number": 1,
  "source_url": "https://www.amazon.in/product-reviews/B0DT17JXF2/?reviewerType=all_reviews&pageNumber=1"
}
You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.
Data table
| Field | Description |
|---|---|
| asin | Amazon product ASIN. |
| product_title | Product title detected on the reviews page. |
| marketplace | Marketplace domain from the user's input URL, such as amazon.in. |
| review_id | Unique Amazon review ID when available. |
| reviewer_name | Public reviewer display name. |
| rating | Numeric star rating. |
| title | Review title. |
| review_country | Country/marketplace text extracted from the review date line. |
| review_date | Review date text as shown by Amazon. |
| verified_purchase | Whether Amazon shows a verified purchase badge. |
| variant | Product variant or format strip shown on the review. |
| body | Cleaned review body text. |
| helpful_votes | Number of helpful votes when shown. |
| images | Review image URLs. |
| review_url | Direct URL to the individual review when available. |
| page_number | Review page number where the item was found. |
| source_url | Original user-provided URL or the equivalent input-marketplace URL. |
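Once the dataset is exported as JSON, the fields above can feed a quick summary. The following sketch assumes the items are already loaded as a list of dicts (for example with `json.load`); the function name and the choice of statistics are illustrative.

```python
from collections import Counter


def summarize_reviews(items):
    """Compute simple statistics from a list of review items."""
    # Skip items whose rating is missing or not numeric.
    ratings = [item["rating"] for item in items
               if isinstance(item.get("rating"), (int, float))]
    verified = sum(1 for item in items if item.get("verified_purchase"))
    return {
        "count": len(items),
        "average_rating": round(sum(ratings) / len(ratings), 2) if ratings else None,
        "verified_count": verified,
        "rating_histogram": dict(Counter(ratings)),
    }
```

The same pattern extends to grouping by `asin` or `variant` when a run covers several products.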
Pricing / Cost estimation
How much does it cost to scrape Amazon reviews? The main cost drivers are the number of review pages, proxy usage, retries, and run duration. A product with a small number of reviews may finish in seconds, while large ASINs with thousands of reviews will require more requests.
To control cost, set max_reviews_per_product or max_pages_per_product. New Apify users can usually test small runs within the free tier, but larger Amazon review exports may require paid platform usage and proxy traffic.
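As a rough budgeting aid, the listed starting price of $5.00 per 1,000 results can be turned into a per-run estimate. This is a back-of-the-envelope sketch: it assumes each review counts as one result, and it ignores any platform or proxy charges that may apply separately.

```python
def estimate_result_cost(review_count: int, price_per_1000: float = 5.00) -> float:
    """Estimate the per-result charge in USD for a given number of reviews."""
    if review_count < 0:
        raise ValueError("review_count must not be negative")
    return review_count / 1000 * price_per_1000
```

For example, capping a run at 2,500 reviews gives an estimate of $12.50 at the listed starting price.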
Tips or Advanced options
- Keep use_proxy enabled for production Amazon scraping.
- Links from Amazon marketplaces, such as amazon.co.uk or amazon.in, are converted to amazon.it before scraping. The proxy country is fixed to IT.
- If the ASIN is not available on Amazon Italy, the Actor will not try other marketplaces automatically.
- If a filtered /product-reviews/ page is blocked, the Actor can fall back to the product detail page and collect the reviews visible there. This fallback is partial because Amazon product pages usually expose fewer reviews than the dedicated review pages.
- Cookie handling is automatic and not exposed in the input form. If amazon_cookies.json is bundled with the Actor, the scraper loads those sessions internally, warms up the Amazon marketplace homepage, refreshes cookie values in memory for the current run from valid Amazon Set-Cookie responses, and skips sessions that Amazon redirects to login.
- Requests use marketplace-aware language headers. Requests to amazon.it prefer an English locale so badges, dates, labels, and product metadata are returned in English where Amazon supports it.
- Watch the live log for page-level progress: each page reports found reviews, newly pushed reviews, cumulative total, and whether a next page token was found.
- When no star filter is selected, the Actor automatically runs extra passes for five_star, four_star, three_star, two_star, and one_star to work around Amazon's 100-review cap on a single review view, then deduplicates review IDs.
- The Actor uses the Italian marketplace, Italian Amazon cookies, and English language settings for consistent output.
- Start with max_pages_per_product: 2 or max_reviews_per_product: 20 while testing selectors and output.
- Use filter_by_star when you only need positive, critical, or specific star reviews.
- Lower max_concurrency if Amazon starts returning CAPTCHA or blocked pages.
- Increase delays between pages for large runs to reduce block risk.
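The star-filter passes mentioned in the tips produce overlapping results, so they need to be merged by review ID. The sketch below shows one way such a merge could work; it is not the Actor's implementation, and keeping items that lack a `review_id` is a choice made here for illustration.

```python
def merge_review_passes(passes):
    """Merge reviews from several passes, keeping the first copy of each review_id."""
    seen_ids = set()
    merged = []
    for reviews in passes:
        for review in reviews:
            review_id = review.get("review_id")
            if review_id is None:
                # Without an ID there is nothing to deduplicate on, so keep the item.
                merged.append(review)
                continue
            if review_id in seen_ids:
                continue
            seen_ids.add(review_id)
            merged.append(review)
    return merged
```

Passing the unfiltered pass first, followed by the five per-star passes, preserves the original ordering for reviews that appear in both.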
FAQ, disclaimers, and support
Is it legal to scrape Amazon reviews?
Scraping publicly available web data can be legal depending on your jurisdiction, purpose, and how you use the data. You are responsible for making sure your use complies with applicable laws, Amazon terms, privacy rules, and platform policies. Do not collect or store sensitive personal data unless you have a lawful basis and explicit permission.
Why did the Actor stop before collecting every review?
The Actor stops when Amazon no longer exposes a next page token, when it reaches your configured limits, or when Amazon returns a blocked/CAPTCHA page. Check the key-value store summary for stop_reason and page diagnostics.
Amazon can cap each individual review view at 100 items, even when the product has more text reviews. When possible, this Actor expands the crawl through star filters and deduplicates the results. Some very large star buckets can still remain partially hidden if Amazon caps that filtered view too.
Does the Actor use nextPageToken?
Yes. For each page, it extracts Amazon's nextPageToken from hidden inputs, links, AJAX fragments, or embedded markup. The Actor requests tokenized review pages with nextPageToken, pageNumber, asin, and reftag, then falls back to the next review page URL when needed.
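To make the token flow concrete, the sketch below covers only the hidden-input case and then builds a follow-up URL with the four parameters named above. The HTML shape, the URL layout, and the `reftag` value are assumptions for illustration; Amazon's real markup and the Actor's actual requests may differ.

```python
from html.parser import HTMLParser
from typing import Optional
from urllib.parse import urlencode


class _TokenFinder(HTMLParser):
    """Collect the value of the first hidden input named nextPageToken."""

    def __init__(self):
        super().__init__()
        self.token = None

    def handle_starttag(self, tag, attrs):
        if tag != "input" or self.token is not None:
            return
        attributes = dict(attrs)
        if attributes.get("type") == "hidden" and attributes.get("name") == "nextPageToken":
            self.token = attributes.get("value") or None


def find_next_page_token(html: str) -> Optional[str]:
    """Return the next page token from a page fragment, or None if absent."""
    finder = _TokenFinder()
    finder.feed(html)
    return finder.token


def build_next_page_url(asin: str, page_number: int, token: str, reftag: str) -> str:
    """Build a tokenized review page URL (hypothetical layout)."""
    query = urlencode({
        "nextPageToken": token,
        "pageNumber": page_number,
        "asin": asin,
        "reftag": reftag,  # hypothetical value supplied by the caller
    })
    return f"https://www.amazon.it/product-reviews/{asin}/?{query}"
```

When `find_next_page_token` returns `None`, a crawler following this pattern would either try the other token sources or treat the current page as the last one.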
Where can I report issues?
Use the Actor's Issues tab on Apify for bugs, selector changes, or feature requests. Custom scraping solutions and extensions can be built for additional Amazon fields, review media, scheduling, or downstream analysis.