Amazon Reviews Scraper
Pricing
from $40.00 / 1,000 results
Amazon Reviews Scraper
Extract customer reviews from any Amazon product with filtering by star rating, verified purchases, and sorting options. Returns structured data including review text, ratings, helpful counts, dates, sentiment hints, images, and more across 19+ Amazon domains.
Pricing
from $40.00 / 1,000 results
Rating
5.0
(3)
Developer

Crawler Bros
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
9 hours ago
Last modified
Categories
Share
A production-grade Amazon Reviews Scraper built for the Apify platform. This scraper extracts high-quality, structured review data from Amazon product pages with advanced filtering, pagination, and GDPR compliance features.
๐ Features
- Multi-product support: Scrape reviews from multiple Amazon products in a single run
- Smart filtering: Filter by star rating, verified purchases only
- Flexible sorting: Sort by most helpful or most recent reviews
- GDPR compliant: Option to exclude sensitive reviewer data
- Multi-region: Support for 19+ Amazon domains worldwide
- Anti-detection: Browser fingerprinting, proxy rotation, and stealth mode
- Deduplication: Automatic removal of duplicate reviews
- Pagination: Handles all review pages automatically
- Image extraction: Optionally extract reviewer-uploaded images
- Robust error handling: Captcha detection, retry logic, rate limiting
๐ฆ Output Schema
Each review is returned with the following structure:
{"productAsin": "B0BDHWDR12","ratingScore": 5,"reviewTitle": "Amazing product!","reviewUrl": "https://www.amazon.com/gp/customer-reviews/R1234567890","reviewReaction": "123 people found this helpful","reviewedIn": "Reviewed in the United States on January 15, 2024","reviewDescription": "This is the full review text...","isVerified": true,"variant": "Size: Large, Color: Blue","reviewImages": ["https://..."],"position": 1,"reviewId": "R1234567890","helpfulCount": 123,"reviewDate": "2024-01-15T00:00:00","reviewLocation": "United States","sentimentHint": "positive","wordCount": 150,"hasImages": true,"imageCount": 2,"scrapedAt": "2024-01-20T10:30:00Z","sourceUrl": "https://www.amazon.com/dp/B0BDHWDR12"}
๐ง Input Configuration
| Field | Type | Description | Default |
|---|---|---|---|
productUrls | array | List of Amazon product URLs (required) | - |
maxReviews | integer | Maximum reviews per product (1-500) | All available |
sortBy | string | Sort order: helpful or recent | helpful |
filterByStar | integer | Filter by star rating (1-5) | All stars |
verifiedOnly | boolean | Only verified purchases | false |
includeImages | boolean | Extract reviewer images | true |
includeGdprSensitive | boolean | Include reviewer names/avatars | false |
proxyMode | string | automatic or residential | automatic |
country | string | Amazon domain | amazon.com |
maxConcurrency | integer | Concurrent pages (1-10) | 3 |
requestTimeout | integer | Request timeout in seconds | 60 |
retryCount | integer | Retries for failed requests | 3 |
Example Input
{"productUrls": ["https://www.amazon.com/dp/B0BDHWDR12","https://www.amazon.com/dp/B08N5WRWNW"],"maxReviews": 100,"sortBy": "helpful","filterByStar": null,"verifiedOnly": false,"includeImages": true,"includeGdprSensitive": false,"proxyMode": "automatic","country": "amazon.com"}
๐ Supported Amazon Domains
| Domain | Country |
|---|---|
| amazon.com | United States |
| amazon.co.uk | United Kingdom |
| amazon.de | Germany |
| amazon.fr | France |
| amazon.it | Italy |
| amazon.es | Spain |
| amazon.ca | Canada |
| amazon.com.au | Australia |
| amazon.co.jp | Japan |
| amazon.in | India |
| amazon.com.br | Brazil |
| amazon.com.mx | Mexico |
| amazon.nl | Netherlands |
| amazon.sg | Singapore |
| amazon.ae | UAE |
| amazon.sa | Saudi Arabia |
| amazon.pl | Poland |
| amazon.se | Sweden |
| amazon.com.tr | Turkey |
๐๏ธ Architecture
reviews-scraper/โโโ src/โ โโโ main.py # Apify actor entry pointโ โโโ scraper.py # Core scraping logicโ โโโ url_parser.py # URL parsing & ASIN extractionโ โโโ review_extractor.py # HTML parsing & extractionโ โโโ data_normalizer.py # Data validation & normalizationโ โโโ deduplication.py # Duplicate removalโ โโโ proxy_manager.py # Proxy & session managementโ โโโ cookie_manager.py # MongoDB cookie rotationโ โโโ cookie_admin.py # Cookie management CLIโ โโโ email_notifier.py # Failure email notificationsโ โโโ constants.py # Configuration constantsโ โโโ utils.py # Helper functionsโโโ .actor/โ โโโ actor.json # Apify actor configurationโโโ Dockerfile # Docker build configurationโโโ INPUT_SCHEMA.json # Input schema for Apifyโโโ requirements.txt # Python dependenciesโโโ README.md
๐ GDPR Compliance
When includeGdprSensitive is set to false (default), the scraper excludes:
- Reviewer names
- Reviewer profile links
- Reviewer avatars
This ensures compliance with privacy regulations while still providing valuable review data.
โ๏ธ Technical Details
Anti-Detection Measures
- Browser Fingerprinting: Randomized user agents, viewports, and locales
- Stealth Mode: Webdriver detection bypass, plugin spoofing
- Proxy Rotation: Support for datacenter and residential proxies
- Rate Limiting: Dynamic request throttling
- Session Management: Automatic session rotation on blocks
- Human Simulation: Mouse movements, scrolling, and realistic delays
โ ๏ธ Important: Proxies Required
Amazon has aggressive anti-bot detection. For reliable scraping, you MUST use proxies.
- Without proxies: Expect sign-in redirects and captchas
- With datacenter proxies: Works for moderate volume
- With residential proxies: Best success rate
On Apify, enable "Apify Proxy" with residential group for best results:
{"proxyMode": "residential"}
Error Handling
- Captcha Detection: Automatically detects and reports captcha challenges
- Retry Logic: Exponential backoff for failed requests
- Timeout Management: Configurable request timeouts
- Graceful Degradation: Continues with remaining products on failures
Limitations
- Maximum ~100 reviews per star rating (Amazon limitation)
- Maximum ~500 total reviews per product
- Only reviews with text content are extracted
Test Input File
Create storage/key_value_stores/default/INPUT.json:
{"productUrls": ["https://www.amazon.com/dp/B0BDHWDR12"],"maxReviews": 20,"sortBy": "helpful"}
๐ Example Output
[{"productAsin": "B0BDHWDR12","ratingScore": 5,"reviewTitle": "Best purchase ever!","reviewUrl": "https://www.amazon.com/gp/customer-reviews/R3EXAMPLE123","reviewReaction": "234 people found this helpful","reviewedIn": "Reviewed in the United States on December 15, 2023","reviewDescription": "I've been using this product for 3 months now and it has exceeded all my expectations. The quality is outstanding and the value for money is excellent. Highly recommend to anyone looking for a reliable solution.","isVerified": true,"variant": "Size: Medium","reviewImages": ["https://images-na.ssl-images-amazon.com/images/I/71example1.jpg"],"position": 1,"reviewId": "R3EXAMPLE123","helpfulCount": 234,"reviewDate": "2023-12-15T00:00:00","reviewLocation": "United States","sentimentHint": "positive","wordCount": 42,"hasImages": true,"imageCount": 1,"scrapedAt": "2024-01-20T10:30:00Z","sourceUrl": "https://www.amazon.com/dp/B0BDHWDR12"}]
๐ Integration
Export Formats
- JSON: Native output format
- CSV: Export via Apify dataset
- Excel: Export via Apify dataset
API Access
# Get results via APIcurl "https://api.apify.com/v2/datasets/{datasetId}/items?token={apiToken}"