
Amazon Reviews Stats Scraper
Pricing
$10.00 / 1,000 results

Amazon Reviews Stats Scraper
Extracts review stats from Amazon product pages using Playwright. Get overall rating, total number of ratings, and star distribution (1★–5★). Supports multiple URLs, all Amazon domains, and includes robust fallback selectors.
5.0 (1)
Pricing
$10.00 / 1,000 results
0
2
2
Last modified
6 days ago
Amazon Review Scraper
A lightweight and efficient Apify Actor that extracts basic review statistics from Amazon product pages using Playwright and Python.
Features
✅ Simple & Fast - Focuses only on essential review metrics
✅ Multi-URL Support - Process multiple products in a single run
✅ Cross-Region Support - Works with Amazon domains worldwide
✅ Reliable Extraction - Uses multiple fallback selectors for robust data extraction
What It Scrapes
This scraper extracts the following information from Amazon product pages:
- Overall Rating (e.g., 4.5 out of 5)
- Total Number of Ratings (e.g., 3,030 global ratings)
- Star Distribution:
- 5 star percentage
- 4 star percentage
- 3 star percentage
- 2 star percentage
- 1 star percentage
Input Configuration
The scraper accepts a simple JSON input with an array of Amazon product URLs:
{"urls": ["https://www.amazon.co.uk/dp/B081VRSGWF","https://www.amazon.com/dp/B08N5WRWNW"]}
Supported URL Formats
- Product pages:
https://www.amazon.com/dp/[ASIN]
- Alternative format:
https://www.amazon.com/gp/product/[ASIN]
- All Amazon domains supported (.com, .co.uk, .de, .fr, .es, .it, .ca, etc.)
Output Format
The scraper returns structured data for each product:
{"url": "https://www.amazon.de/dp/B0CB37RQ9Z","asin": "B0CB37RQ9Z","success": true,"error": null,"data": {"overall_rating": 4.5,"total_ratings": 148,"star_distribution": {"5_star": 73,"4_star": 17,"3_star": 5,"2_star": 0,"1_star": 5}}}
Output Fields
- url: The original Amazon product URL
- asin: The extracted Amazon Standard Identification Number
- success: Boolean indicating if scraping was successful
- error: Error message if scraping failed (null on success)
- data: The scraped review statistics
- overall_rating: Average rating (0-5 scale)
- total_ratings: Total number of customer ratings
- star_distribution: Percentage breakdown by star rating (values are integers representing percentages)
Local Development
Prerequisites
- Python 3.11+
- Apify CLI (
npm install -g apify-cli
)
Project Structure
amazon-review-scraper/├── .actor/│ └── actor.json # Actor configuration├── src/│ ├── __init__.py # Package initialization│ ├── __main__.py # Entry point│ └── main.py # Main scraper logic├── input_schema.json # Input validation schema├── requirements.txt # Python dependencies├── Dockerfile # Container configuration└── README.md # Documentation
Installation
- Clone or create the project structure
- Install dependencies:
pip install -r requirements.txtplaywright install chromium
Running Locally
-
Create an
INPUT.json
file:{"urls": ["https://www.amazon.co.uk/dp/B081VRSGWF"]} -
Run the actor:
$apify run -p
Deployment to Apify Platform
Method 1: Via Apify CLI
-
Login to Apify:
$apify login -
Initialize the actor:
$apify init -
Push to platform:
$apify push
Method 2: Via Web Console
- Go to Apify Console
- Create a new Actor
- Choose "Python + Playwright" template
- Upload the source files
- Build and run
Use Cases
- Price Monitoring - Track product ratings over time
- Market Research - Analyze customer satisfaction across products
- Competitor Analysis - Compare review metrics between similar products
- Quality Assurance - Monitor your own product reviews
- Data Analytics - Gather review data for analysis
Limitations
- Scrapes only publicly available review statistics
- Does not extract individual review text or reviewer information
- Requires valid Amazon product URLs with ASIN
- Rate limited to prevent overwhelming Amazon servers (2-second delay between requests)
Error Handling
The scraper includes robust error handling:
- Invalid URLs are reported with error messages
- Network timeouts are handled gracefully
- Missing elements use fallback selectors
- Each URL failure doesn't affect others in the batch
Performance
- Speed: ~5-10 seconds per product page
- Memory: ~256-1024 MB depending on batch size
- CPU: Minimal usage (0.25-1 CPU)
- Concurrency: Sequential processing with delays
Support
For issues, questions, or feature requests:
- Check the Apify Documentation
- Visit Apify Support
- Review the Actor logs for debugging
License
This actor is provided as-is for educational and commercial use. Please ensure compliance with Amazon's Terms of Service and robots.txt when using this scraper.
Changelog
Version 0.1.0
- Initial release
- Basic review statistics extraction
- Multi-URL support
- Playwright-based scraping
On this page
Share Actor: