Amazon Reviews Stats Scraper

Pricing

$10.00 / 1,000 results

Developed by

Lemuel Torrefiel

Maintained by Community

Extracts review stats from Amazon product pages using Playwright. Get overall rating, total number of ratings, and star distribution (1★–5★). Supports multiple URLs, all Amazon domains, and includes robust fallback selectors.

Last modified

6 days ago

Amazon Review Scraper

A lightweight and efficient Apify Actor that extracts basic review statistics from Amazon product pages using Playwright and Python.

Features

Simple & Fast - Focuses only on essential review metrics
Multi-URL Support - Process multiple products in a single run
Cross-Region Support - Works with Amazon domains worldwide
Reliable Extraction - Uses multiple fallback selectors for robust data extraction
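As a minimal sketch of the fallback-selector idea, the helper below tries a list of CSS selectors in order and returns the first non-empty text it finds. The selector strings and the name first_text are assumptions made for illustration; the Actor's real selectors are not documented here. The only Playwright calls used are the async API's page.query_selector() and ElementHandle.inner_text().

```python
import asyncio
from typing import Any, Optional, Sequence

# Hypothetical selectors, ordered from most to least specific.
# They are illustrative only and may not match the Actor's real list.
RATING_SELECTORS = (
    '[data-hook="rating-out-of-text"]',
    "#acrPopover",
    "span.a-icon-alt",
)


async def first_text(page: Any, selectors: Sequence[str]) -> Optional[str]:
    """Return the stripped text of the first selector that matches, else None."""
    for selector in selectors:
        # query_selector resolves to None when nothing matches.
        element = await page.query_selector(selector)
        if element is None:
            continue
        text = (await element.inner_text()).strip()
        if text:
            return text
    return None
```

Returning None rather than raising lets the caller decide whether a missing field is an error for that URL or just an absent value.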

What It Scrapes

This scraper extracts the following information from Amazon product pages:

  • Overall Rating (e.g., 4.5 out of 5)
  • Total Number of Ratings (e.g., 3,030 global ratings)
  • Star Distribution:
    • 5 star percentage
    • 4 star percentage
    • 3 star percentage
    • 2 star percentage
    • 1 star percentage
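The values above appear on the page as display text, so they have to be converted into numbers. The sketch below shows one way to do that with regular expressions. The function names are hypothetical, and the handling of decimal commas and thousands separators is an assumption about how regional Amazon sites format these strings.

```python
import re
from typing import Optional


def parse_rating(text: str) -> Optional[float]:
    """Extract the first number from text such as '4.5 out of 5'."""
    match = re.search(r"\d+(?:[.,]\d+)?", text)
    if match is None:
        return None
    # Assumption: some regional sites use a decimal comma ('4,5 von 5').
    return float(match.group().replace(",", "."))


def parse_count(text: str) -> Optional[int]:
    """Extract an integer from text such as '3,030 global ratings'."""
    match = re.search(r"\d[\d.,\s]*", text)
    if match is None:
        return None
    # Drop thousands separators of any kind and keep only the digits.
    return int(re.sub(r"\D", "", match.group()))


def parse_percent(text: str) -> Optional[int]:
    """Extract the integer percentage from text such as '73%'."""
    match = re.search(r"(\d{1,3})\s*%", text)
    return int(match.group(1)) if match else None


print(parse_rating("4.5 out of 5"))         # 4.5
print(parse_count("3,030 global ratings"))  # 3030
print(parse_percent("73%"))                 # 73
```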

Input Configuration

The scraper accepts a simple JSON input with an array of Amazon product URLs:

{
  "urls": [
    "https://www.amazon.co.uk/dp/B081VRSGWF",
    "https://www.amazon.com/dp/B08N5WRWNW"
  ]
}
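A minimal sketch of how input of this shape could be validated before any page is opened. The function name read_urls and the error messages are hypothetical, and dropping duplicate URLs is an assumption made here, not documented Actor behavior.

```python
import json
from typing import Any, List


def read_urls(actor_input: Any) -> List[str]:
    """Validate the input object and return its URLs, stripped and de-duplicated."""
    if not isinstance(actor_input, dict):
        raise ValueError("input must be a JSON object")
    urls = actor_input.get("urls")
    if not isinstance(urls, list) or not urls:
        raise ValueError('"urls" must be a non-empty array')
    cleaned = []
    for item in urls:
        if not isinstance(item, str) or not item.strip():
            raise ValueError('every entry in "urls" must be a non-empty string')
        cleaned.append(item.strip())
    # dict.fromkeys keeps the first occurrence of each URL in its original order.
    return list(dict.fromkeys(cleaned))


raw = '{"urls": ["https://www.amazon.co.uk/dp/B081VRSGWF", "https://www.amazon.com/dp/B08N5WRWNW"]}'
print(read_urls(json.loads(raw)))
```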

Supported URL Formats

  • Product pages: https://www.amazon.com/dp/[ASIN]
  • Alternative format: https://www.amazon.com/gp/product/[ASIN]
  • All Amazon domains supported (.com, .co.uk, .de, .fr, .es, .it, .ca, etc.)
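As an illustration of how both URL formats can be reduced to an ASIN, here is a small sketch. The function name extract_asin and the host check are assumptions rather than the Actor's actual code. It relies on ASINs being 10 alphanumeric characters.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Matches /dp/<ASIN> and /gp/product/<ASIN> anywhere in the URL path.
_ASIN_PATH = re.compile(r"/(?:dp|gp/product)/([A-Z0-9]{10})(?:/|$)")

# Accepts amazon.<tld> with optional subdomains, e.g. www.amazon.co.uk.
_AMAZON_HOST = re.compile(r"(^|\.)amazon\.[a-z.]{2,6}$")


def extract_asin(url: str) -> Optional[str]:
    """Return the ASIN from an Amazon product URL, or None if it is not one."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if not _AMAZON_HOST.search(host):
        return None
    # urlparse has already split off the query string and fragment.
    match = _ASIN_PATH.search(parsed.path)
    return match.group(1) if match else None


print(extract_asin("https://www.amazon.co.uk/dp/B081VRSGWF"))          # B081VRSGWF
print(extract_asin("https://www.amazon.com/gp/product/B08N5WRWNW"))    # B08N5WRWNW
print(extract_asin("https://www.amazon.de/s?k=kindle"))                # None
```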

Output Format

The scraper returns structured data for each product:

{
  "url": "https://www.amazon.de/dp/B0CB37RQ9Z",
  "asin": "B0CB37RQ9Z",
  "success": true,
  "error": null,
  "data": {
    "overall_rating": 4.5,
    "total_ratings": 148,
    "star_distribution": {
      "5_star": 73,
      "4_star": 17,
      "3_star": 5,
      "2_star": 0,
      "1_star": 5
    }
  }
}

Output Fields

  • url: The original Amazon product URL
  • asin: The extracted Amazon Standard Identification Number
  • success: Boolean indicating if scraping was successful
  • error: Error message if scraping failed (null on success)
  • data: The scraped review statistics
    • overall_rating: Average rating (0-5 scale)
    • total_ratings: Total number of customer ratings
    • star_distribution: Percentage breakdown by star rating (values are integers representing percentages)
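Because star_distribution holds integer percentages and not counts, a consumer who needs approximate counts has to derive them from total_ratings. The sketch below does that for the sample output above. The function names are hypothetical, and the 2-point tolerance is an assumption that allows for rounded percentages not summing to exactly 100.

```python
from typing import Dict


def estimate_star_counts(total_ratings: int, star_distribution: Dict[str, int]) -> Dict[str, int]:
    """Convert integer percentages into approximate rating counts."""
    return {
        star: round(total_ratings * percent / 100)
        for star, percent in star_distribution.items()
    }


def distribution_looks_complete(star_distribution: Dict[str, int], tolerance: int = 2) -> bool:
    """Check that the percentages add up to roughly 100."""
    return abs(sum(star_distribution.values()) - 100) <= tolerance


sample = {"5_star": 73, "4_star": 17, "3_star": 5, "2_star": 0, "1_star": 5}
print(estimate_star_counts(148, sample))
# {'5_star': 108, '4_star': 25, '3_star': 7, '2_star': 0, '1_star': 7}
print(distribution_looks_complete(sample))  # True
```

The estimated counts sum to 147, not 148, which is expected: the percentages are already rounded, so the counts are approximations.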

Local Development

Prerequisites

  • Python 3.11+
  • Apify CLI (npm install -g apify-cli)

Project Structure

amazon-review-scraper/
├── .actor/
│   └── actor.json        # Actor configuration
├── src/
│   ├── __init__.py       # Package initialization
│   ├── __main__.py       # Entry point
│   └── main.py           # Main scraper logic
├── input_schema.json     # Input validation schema
├── requirements.txt      # Python dependencies
├── Dockerfile            # Container configuration
└── README.md             # Documentation

Installation

  1. Clone or create the project structure
  2. Install dependencies:
    pip install -r requirements.txt
    playwright install chromium

Running Locally

  1. Create an INPUT.json file:

    {
      "urls": ["https://www.amazon.co.uk/dp/B081VRSGWF"]
    }
  2. Run the actor:

    $ apify run -p

Deployment to Apify Platform

Method 1: Via Apify CLI

  1. Login to Apify:

    $ apify login
  2. Initialize the actor:

    $ apify init
  3. Push to platform:

    $ apify push

Method 2: Via Web Console

  1. Go to Apify Console
  2. Create a new Actor
  3. Choose "Python + Playwright" template
  4. Upload the source files
  5. Build and run

Use Cases

  • Rating Monitoring - Track product ratings over time
  • Market Research - Analyze customer satisfaction across products
  • Competitor Analysis - Compare review metrics between similar products
  • Quality Assurance - Monitor your own product reviews
  • Data Analytics - Gather review data for analysis

Limitations

  • Scrapes only publicly available review statistics
  • Does not extract individual review text or reviewer information
  • Requires valid Amazon product URLs with ASIN
  • Rate limited to prevent overwhelming Amazon servers (2-second delay between requests)

Error Handling

The scraper includes robust error handling:

  • Invalid URLs are reported with error messages
  • Network timeouts are handled gracefully
  • Missing elements use fallback selectors
  • A failure on one URL does not affect the other URLs in the batch
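The per-URL isolation described above can be sketched as a sequential loop that wraps each page in its own try/except and pauses between requests. The names scrape_all and scrape_one are hypothetical, the asin field is omitted for brevity, and setting data to None on failure is an assumption about the record shape.

```python
import asyncio
from typing import Any, Awaitable, Callable, Dict, List, Sequence


async def scrape_all(
    urls: Sequence[str],
    scrape_one: Callable[[str], Awaitable[Dict[str, Any]]],
    delay_seconds: float = 2.0,
) -> List[Dict[str, Any]]:
    """Scrape URLs one at a time; an exception on one URL becomes an error record."""
    results: List[Dict[str, Any]] = []
    for index, url in enumerate(urls):
        if index > 0:
            # Pause between requests (the README states a 2-second delay).
            await asyncio.sleep(delay_seconds)
        try:
            data = await scrape_one(url)
            results.append({"url": url, "success": True, "error": None, "data": data})
        except Exception as exc:  # deliberately broad: one bad page must not stop the batch
            results.append({"url": url, "success": False, "error": str(exc), "data": None})
    return results
```

Passing scrape_one in as a parameter keeps the loop independent of Playwright, so it can be exercised with a stub coroutine before it is pointed at real pages.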

Performance

  • Speed: ~5-10 seconds per product page
  • Memory: ~256-1024 MB depending on batch size
  • CPU: Minimal usage (0.25-1 CPU)
  • Concurrency: Sequential processing with delays

Support

For issues, questions, or feature requests:

License

This actor is provided as-is for educational and commercial use. Please ensure compliance with Amazon's Terms of Service and robots.txt when using this scraper.

Changelog

Version 0.1.0

  • Initial release
  • Basic review statistics extraction
  • Multi-URL support
  • Playwright-based scraping