Awesome Amazon Product Scraper
Pricing
$5.00/month + usage
Awesome Amazon Product Scraper
Amazon Product Scraper Actor retrieves detailed product data from Amazon search and product pages. It supports configurable limits, proxy usage, and flexible image extraction modes for efficient product research and price tracking.
Pricing
$5.00/month + usage
Rating
0.0
(0)
Developer

Alam
Actor stats
0
Bookmarked
3
Total users
3
Monthly active users
7 months ago
Last modified
Categories
Share
Amazon Product Scraper
A powerful and efficient web scraper built on the Apify platform that extracts detailed product data from Amazon search results and product pages. Perfect for price monitoring, product research, and market analysis.
Features
-
Versatile URL Handling
- Process Amazon search result pages
- Extract data from individual product pages
- Support for both search queries and direct product URLs
-
Smart Image Extraction
- Basic Mode: High-quality main product image
- Advanced Mode: All available product images with best resolution
- Automatic image quality selection
-
Comprehensive Data Extraction
- Product title and detailed description
- Current price with currency information
- Real-time availability status
- ASIN identifier
- Customer ratings and review counts
- Product features and bullet points
- High-resolution product images
- Timestamps for data freshness
-
Robust Architecture
- Built with Playwright for maximum reliability
- Smart retry mechanism with multiple selectors
- Automatic handling of layout changes
- Built-in duplicate detection
- Configurable request limits
Input Configuration
Configure the scraper using these parameters:
{"startUrls": [{ "url": "https://www.amazon.com/s?k=gaming+laptop" }],"maxItems": 5,"maxPagesPerQuery": 1,"useProxy": true,"scrapeMultipleImages": false}
Input Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
startUrls | Array | Amazon search or product URLs | Required |
maxItems | Integer | Maximum products to scrape | 5 |
maxPagesPerQuery | Integer | Maximum search pages to process | 1 |
useProxy | Boolean | Enable/disable proxy usage | false |
scrapeMultipleImages | Boolean | Extract all available product images | false |
Output Format
The actor outputs detailed product information in JSON format. The structure varies based on the scrapeMultipleImages setting:
Basic Output (scrapeMultipleImages: false)
{"title": "Example Product","url": "https://www.amazon.com/dp/B0XXXXXXXX","asin": "B0XXXXXXXX","price": 999.99,"currency": "$","images": ["https://images-na.ssl-images-amazon.com/images/main.jpg"],"description": "Detailed product description...","features": ["Feature 1", "Feature 2"],"rating": 4.5,"reviewCount": 1250,"availability": "In Stock","timestamp": "2024-01-01T12:00:00.000Z"}
Advanced Output (scrapeMultipleImages: true)
{"title": "Example Product","url": "https://www.amazon.com/dp/B0XXXXXXXX","asin": "B0XXXXXXXX","price": 999.99,"currency": "$","images": ["https://images-na.ssl-images-amazon.com/images/main.jpg","https://images-na.ssl-images-amazon.com/images/angle1.jpg","https://images-na.ssl-images-amazon.com/images/angle2.jpg"],"description": "Detailed product description...","features": ["Feature 1", "Feature 2", "Feature 3"],"rating": 4.5,"reviewCount": 1250,"availability": "In Stock","timestamp": "2024-01-01T12:00:00.000Z"}
Performance Optimization
The actor includes several features to ensure reliable scraping:
-
Anti-Blocking Measures
- Random delays between requests
- Browser fingerprint randomization
- Automatic retry mechanism
- Smart handling of CAPTCHAs
-
Resource Management
- Concurrent scraping (2 parallel requests)
- Configurable request timeouts
- Memory-efficient processing
- Automatic cleanup
Usage Tips
- Start Small: Begin with a small
maxItemsvalue to test the setup - Use Proxies: Enable
useProxyfor production use to avoid IP blocks - Image Settings: Use
scrapeMultipleImages: trueonly when you need all product images - Pagination: Adjust
maxPagesPerQuerybased on your depth requirements - Monitor Logs: Check the actor's logs for detailed progress information
Limitations
- Some product data might be unavailable depending on the page layout
- Price and availability may vary based on location/currency
- Access to some products might be restricted
- Rate limiting may apply based on Amazon's policies
Error Handling
The actor implements comprehensive error handling:
- Automatic retries for failed requests
- Detailed error logging
- Screenshot capture for debugging
- Graceful failure recovery
Support
For issues, feature requests, or custom development needs:
- Check the actor's documentation on Apify
- Create an issue in the GitHub repository
- Contact us through Apify's support channels
License
MIT License - feel free to use this actor in your projects!