Amazon Search Products Scraper
Pricing
$19.99/month + usage
Amazon Search Products Scraper
Amazon Search Products Scraper collects product results from any Amazon search query. Extract titles, prices, ratings, images, sellers, availability, and key details. Ideal for market research, price tracking, product analysis, and workflows needing structured Amazon search data.
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer

API Empire
Actor stats
0
Bookmarked
3
Total users
2
Monthly active users
a day ago
Last modified
Categories
Share
A powerful and reliable Apify Actor designed to scrape Amazon search results by keywords or search URLs. This actor extracts comprehensive product data from Amazon search pages including prices, ratings, reviews, images, and detailed product information. Perfect for market research, price monitoring, competitor analysis, and product discovery.
Why Choose Us?
- ๐ Keyword-Based Scraping: Search Amazon using keywords or full search URLs - no need for individual product ASINs
- ๐ Comprehensive Data Extraction: Captures 30+ data points per product including prices, ratings, reviews, images, and metadata
- ๐ Bulk Processing: Process multiple keywords simultaneously with concurrent page scraping
- ๐ Intelligent Proxy Fallback: Automatic proxy fallback system (No Proxy โ Datacenter โ Residential) with 3 retries to avoid blocks
- โก Fast & Efficient: Concurrent page scraping for maximum speed
- ๐ Multi-Domain Support: Works with all Amazon domains (amazon.com, amazon.co.uk, amazon.de, etc.)
- ๐ Real-Time Logging: Detailed progress logs to keep you informed throughout the scraping process
- ๐ฏ Flexible Sorting: Sort results by relevance, price, reviews, or newest arrivals
Key Features
๐ Comprehensive Product Data
- Product title, description, and ASIN
- Current price and retail price
- Product rating and total review count
- Product images and URLs
- Prime eligibility and sponsored status
- Sales volume and delivery information
- Manufacturer, series, and variations
- Product details and specifications
๐ Search & Pagination
- Search by keywords or full Amazon search URLs
- Extract keywords automatically from URLs
- Scrape multiple pages per search (configurable)
- Extract result counts and categories
- Find similar keywords and related searches
- Track search position and page number
๐ Smart Proxy Management
- Default: Starts with no proxy (direct connection)
- Automatic Fallback: Falls back to datacenter proxy if blocked
- Residential Fallback: Falls back to residential proxy if datacenter fails
- Retry Logic: 3 retries with residential proxy before giving up
- Clear Logging: All proxy events are logged for transparency
๐ฏ Flexible Sorting Options
- Relevance (default): Most relevant results first
- Price Low to High: Sort by ascending price
- Price High to Low: Sort by descending price
- Customer Reviews: Sort by review rating
- Newest Arrivals: Sort by date (newest first)
๐ Rich Metadata
- Search keyword and domain code
- Page number and result position
- Total result count
- Categories and browse nodes
- Similar keywords and related searches
- Sort strategy used
๐ How to Use the Actor (via Apify Console)
- Log in at https://console.apify.com and go to Actors
- Find the actor (amazon-search-products-scraper) and click it
- Configure inputs:
- Enter keywords or search URLs in the "Enter Amazon Keywords or Search URLs" field
- (Optional) Set Amazon domain (default: amazon.com)
- (Optional) Choose sort order (default: relevance)
- (Optional) Set maximum pages per search (default: 1)
- (Optional) Configure proxy settings
- Run the actor by clicking the "Start" button
- Monitor logs in real time to see scraping progress
- Access results in the OUTPUT tab when the run completes
- Export results to JSON or CSV format
Input
The actor accepts the following input parameters:
Required Fields
startUrls(Array): List of keywords or Amazon search URLs- Can be simple keywords like
"Playstation"or"TV" - Can be full Amazon search URLs like
"https://www.amazon.com/s?k=Playstation" - Supports bulk input - add multiple keywords/URLs
- Can be simple keywords like
Optional Fields
-
amazonDomain(String): Amazon domain to use- Examples:
"amazon.com","amazon.co.uk","amazon.de","amazon.fr" - Default:
"amazon.com"
- Examples:
-
language(String): Language for Amazon pages- Options:
""(auto),"en-US","en-GB","de-DE","fr-FR","es-ES","it-IT","ja-JP","zh-CN" - Default:
""(automatic detection)
- Options:
-
proxyCountry(String): Country for proxy selection- Options:
"AUTO","US","GB","DE","FR","ES","IT","CA","AU","JP" - Default:
"AUTO"(automatic selection)
- Options:
-
useCaptchaSolver(Boolean): Enable automatic captcha solving- Default:
false - Warning: May incur additional costs and processing time
- Default:
-
sortOrder(String): How to sort search results- Options:
"relevanceblender"- Most relevant (default)"price-asc-rank"- Price: Low to High"price-desc-rank"- Price: High to Low"review-rank"- Customer Reviews"date-desc-rank"- Newest Arrivals
- Default:
"relevanceblender"
- Options:
-
maxPages(Integer): Maximum number of pages to scrape per search- Minimum: 1
- Maximum: 100
- Default: 1
-
proxyConfiguration(Object): Proxy settings- Default: No proxy (direct connection)
- Automatic fallback to datacenter and residential proxies if blocked
- See proxy configuration section below
Input Example
{"startUrls": [{ "url": "Playstation" },{ "url": "https://www.amazon.com/s?k=TV" },{ "url": "Shoes" }],"amazonDomain": "amazon.com","sortOrder": "relevanceblender","maxPages": 3,"proxyConfiguration": {"useApifyProxy": false}}
Output
The actor saves scraped product data to the Apify dataset. Each product record contains:
Core Product Information
| Field | Type | Description |
|---|---|---|
asin | String | Amazon Standard Identification Number (10 characters) |
productDescription | String | Product title/description |
imgUrl | String | Product image URL |
price | Number or null | Current product price |
retailPrice | Number or null | Original/retail price (if on sale) |
productRating | String or null | Overall product rating (e.g., "4.7 out of 5 stars") |
countReview | Number | Total number of reviews |
dpUrl | String | Product detail page URL (relative) |
prime | Boolean | Whether product is Prime eligible |
sponsored | Boolean | Whether product is a sponsored result |
salesVolume | String or null | Sales volume information (e.g., "7K+ bought in past month") |
deliveryMessage | String or null | Delivery time and shipping information |
manufacturer | String or null | Product manufacturer/brand |
series | String or null | Product series or model (if applicable) |
variations | Array | Product variations (size, color, etc.) |
productDetails | Array | Product details and features |
secondaryOffer | Number | Used/refurbished price (if available) |
Search Metadata
| Field | Type | Description |
|---|---|---|
statusCode | Number | HTTP status code (200 = success) |
statusMessage | String | Status message ("FOUND", "ERROR", "BLOCKED", etc.) |
keyword | String | Search keyword used |
domainCode | String | Amazon domain code (e.g., "com", "co.uk") |
page | Number | Page number of search results |
selectedCategory | String | Selected category (default: "aps") |
browseNode | Number or null | Amazon browse node ID |
nodeHierarchy | String or null | Browse node hierarchy |
resultCount | Number | Total number of search results |
categories | Array | Product categories found |
similarKeywords | Array | Similar/related keywords with URLs |
currentPage | Number | Current page number |
sortStrategy | String | Sort strategy used |
searchResultPosition | Number | Position of product in search results (0-indexed) |
Output Example
[{"asin": "B0CL5KNB9M","productDescription": "PlayStationยฎ5 Digital Edition (slim)","imgUrl": "https://m.media-amazon.com/images/I/51fM0CKG+HL._AC_UY218_.jpg","price": null,"retailPrice": null,"productRating": "4.7 out of 5 stars","countReview": 10500,"dpUrl": "/PlayStation%C2%AE5-Digital-slim-PlayStation-5/dp/B0CL5KNB9M/ref=sr_1_1?...","prime": false,"sponsored": false,"salesVolume": "7K+ bought in past month","deliveryMessage": null,"manufacturer": "Amazon's Choice: Overall Pick","series": null,"variations": [],"productDetails": ["Nov 24, 2023","No featured offers available"],"secondaryOffer": 0.0,"statusCode": 200,"statusMessage": "FOUND","keyword": "Playstation","domainCode": "com","page": 1,"selectedCategory": "aps","browseNode": null,"nodeHierarchy": null,"resultCount": 50000,"categories": [],"similarKeywords": [{"keyword": "playstation 5","url": "/s?k=playstation+5&ref=rsl_sug_0_0&..."},{"keyword": "playstation gift card","url": "/s?k=playstation+gift+card&ref=rsl_sug_0_3&..."}],"currentPage": 1,"sortStrategy": "relevanceblender","searchResultPosition": 0}]
Proxy Configuration
The actor includes intelligent proxy fallback logic to avoid blocks:
Default Behavior
- Starts with No Proxy: Direct connection to Amazon (fastest, lowest cost)
- Automatic Detection: Detects if Amazon blocks the request
- Fallback to Datacenter: If blocked, automatically switches to datacenter proxy
- Fallback to Residential: If datacenter fails, switches to residential proxy
- Retry Logic: 3 retries with residential proxy before giving up
- Sticky Proxy: Once fallback occurs, uses the same proxy type for remaining requests
Proxy Events Logging
All proxy events are clearly logged:
๐ Proxy: Starting with NO PROXY (direct connection)๐ซ Request blocked with no proxy. Falling back to DATACENTER proxy...โ Switched to DATACENTER proxy: ...๐ซ Request blocked with datacenter proxy. Falling back to RESIDENTIAL proxy...โ Switched to RESIDENTIAL proxy: ...๐ Residential proxy failed. Retry 1/3...
Manual Proxy Configuration
You can manually configure proxy settings in the input:
- No Proxy:
"useApifyProxy": false(default) - Datacenter Proxy:
"useApifyProxy": true(default Apify proxy) - Residential Proxy:
"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]
Best Use Cases
- ๐ Market Research: Discover products, analyze competition, and track market trends
- ๐ฐ Price Monitoring: Monitor product prices across search results and track price changes
- ๐ Product Discovery: Find products by keywords and explore related items
- ๐ Competitor Analysis: Analyze competitor products, pricing, and customer feedback
- ๐ Content Creation: Gather product information for blog posts, reviews, or comparisons
- ๐ E-commerce Integration: Import Amazon product data into your own platform
- ๐ Product Comparison: Compare multiple products side-by-side from search results
- ๐ฑ Inventory Research: Research product availability and variations
- ๐ฏ Keyword Research: Discover related keywords and search trends
- ๐ SEO Analysis: Analyze product rankings and search positions
Frequently Asked Questions
How does the keyword search work?
Simply enter keywords (e.g., "Playstation", "TV") or full Amazon search URLs. The actor automatically builds the correct Amazon search URL and scrapes the results. Keywords are extracted from URLs if you provide full search URLs.
Can I scrape multiple keywords at once?
Yes! Add multiple keywords or URLs to the startUrls array. The actor will process them sequentially, scraping all pages for each keyword.
What happens if Amazon blocks my requests?
The actor automatically detects blocks and falls back through the proxy chain: No Proxy โ Datacenter โ Residential. All proxy events are logged so you can see what's happening. If all proxies fail after retries, the actor will log an error and continue with other keywords.
How many pages can I scrape per search?
You can scrape up to 100 pages per search keyword. Set the maxPages parameter to control this. Note that scraping many pages may take longer and use more resources.
Can I scrape Amazon from different countries?
Yes! Set the amazonDomain parameter to any Amazon domain (amazon.com, amazon.co.uk, amazon.de, amazon.fr, etc.). The actor automatically adapts to the domain.
What's the difference between keywords and search URLs?
- Keywords: Simple text like
"Playstation"- the actor builds the search URL automatically - Search URLs: Full URLs like
"https://www.amazon.com/s?k=Playstation"- the actor extracts the keyword and uses it
Both work the same way - use whichever is more convenient for you.
How long does it take to scrape results?
Scraping speed depends on:
- Number of keywords
- Number of pages per keyword
- Amazon's response time
- Whether proxies are needed
Typically, 1 page with 1 keyword takes 5-10 seconds. Multiple pages are scraped concurrently for better speed.
What if a search returns no results?
The actor will return a record with statusCode: 404 and statusMessage: "NOT_FOUND" for that search. It will continue processing other keywords.
Can I sort results differently?
Yes! Use the sortOrder parameter:
"relevanceblender"- Most relevant (default)"price-asc-rank"- Price: Low to High"price-desc-rank"- Price: High to Low"review-rank"- Customer Reviews"date-desc-rank"- Newest Arrivals
How accurate is the data extraction?
The actor uses robust HTML parsing with multiple selectors and validation to ensure accurate data extraction. It handles various Amazon page layouts and extracts data reliably.
What's included in the output?
Each product includes:
- Basic info: ASIN, title, price, rating, reviews
- Images and URLs
- Prime/sponsored status
- Sales volume and delivery info
- Manufacturer and variations
- Search metadata: keyword, page, position, categories, similar keywords
Can I use this for commercial purposes?
Yes, as long as you comply with:
- Amazon's Terms of Service
- Local laws regarding web scraping
- Data protection and privacy regulations
- Rate limiting and respectful usage
Support and Feedback
If you encounter any issues or have suggestions for improvement:
- Check the actor logs for detailed error messages
- Verify your input parameters are correct
- Ensure keywords/URLs are valid
- Check that you're not exceeding rate limits
For technical support or feature requests, please contact the actor maintainer through the Apify platform.
Cautions
โ ๏ธ Important Legal and Ethical Considerations:
- This actor scrapes only publicly available data from Amazon search pages
- Respect Amazon's Terms of Service when using this actor
- Do not scrape private or password-protected content
- Comply with local laws regarding web scraping and data collection
- Use responsibly and avoid aggressive scraping that could impact Amazon's servers
- Respect rate limits - the actor includes delays between requests
- Data usage: You are responsible for how you use the scraped data and must ensure compliance with privacy laws, spam regulations, and data protection requirements
- Proxy usage: Using proxies may incur additional costs depending on your Apify plan
Note: This actor is designed for legitimate business and research purposes. Always use it ethically and in compliance with applicable laws and Amazon's terms of service.