Amazon Search Products Scraper

Pricing

$19.99/month + usage

Amazon Search Products Scraper collects product results from any Amazon search query. Extract titles, prices, ratings, images, sellers, availability, and key details. Ideal for market research, price tracking, product analysis, and workflows needing structured Amazon search data.

Rating: 0.0 (0)

Developer: API Empire (maintained by Community)

Actor stats: 0 bookmarked, 3 total users, 2 monthly active users, last modified a day ago

A powerful and reliable Apify Actor designed to scrape Amazon search results by keywords or search URLs. This actor extracts comprehensive product data from Amazon search pages including prices, ratings, reviews, images, and detailed product information. Perfect for market research, price monitoring, competitor analysis, and product discovery.

Why Choose Us?

  • 🔍 Keyword-Based Scraping: Search Amazon using keywords or full search URLs - no need for individual product ASINs
  • 📊 Comprehensive Data Extraction: Captures 30+ data points per product including prices, ratings, reviews, images, and metadata
  • 🚀 Bulk Processing: Process multiple keywords in a single run, with concurrent page scraping for each keyword
  • 🔄 Intelligent Proxy Fallback: Automatic proxy fallback system (No Proxy → Datacenter → Residential) with 3 retries to avoid blocks
  • ⚡ Fast & Efficient: Concurrent page scraping for maximum speed
  • 🌍 Multi-Domain Support: Works with all Amazon domains (amazon.com, amazon.co.uk, amazon.de, etc.)
  • 📈 Real-Time Logging: Detailed progress logs to keep you informed throughout the scraping process
  • 🎯 Flexible Sorting: Sort results by relevance, price, reviews, or newest arrivals

Key Features

🔍 Comprehensive Product Data

  • Product title, description, and ASIN
  • Current price and retail price
  • Product rating and total review count
  • Product images and URLs
  • Prime eligibility and sponsored status
  • Sales volume and delivery information
  • Manufacturer, series, and variations
  • Product details and specifications

📑 Search & Pagination

  • Search by keywords or full Amazon search URLs
  • Extract keywords automatically from URLs
  • Scrape multiple pages per search (configurable)
  • Extract result counts and categories
  • Find similar keywords and related searches
  • Track search position and page number

🔄 Smart Proxy Management

  • Default: Starts with no proxy (direct connection)
  • Automatic Fallback: Falls back to datacenter proxy if blocked
  • Residential Fallback: Falls back to residential proxy if datacenter fails
  • Retry Logic: 3 retries with residential proxy before giving up
  • Clear Logging: All proxy events are logged for transparency

🎯 Flexible Sorting Options

  • Relevance (default): Most relevant results first
  • Price Low to High: Sort by ascending price
  • Price High to Low: Sort by descending price
  • Customer Reviews: Sort by review rating
  • Newest Arrivals: Sort by date (newest first)

📊 Rich Metadata

  • Search keyword and domain code
  • Page number and result position
  • Total result count
  • Categories and browse nodes
  • Similar keywords and related searches
  • Sort strategy used

🚀 How to Use the Actor (via Apify Console)

  1. Log in at https://console.apify.com and go to Actors
  2. Find the actor (amazon-search-products-scraper) and click it
  3. Configure inputs:
    • Enter keywords or search URLs in the "Enter Amazon Keywords or Search URLs" field
    • (Optional) Set Amazon domain (default: amazon.com)
    • (Optional) Choose sort order (default: relevance)
    • (Optional) Set maximum pages per search (default: 1)
    • (Optional) Configure proxy settings
  4. Run the actor by clicking the "Start" button
  5. Monitor logs in real time to see scraping progress
  6. Access results in the OUTPUT tab when the run completes
  7. Export results to JSON or CSV format
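
The same run can also be started from code rather than from the Console. The sketch below only builds (it does not send) a request for Apify's `run-sync-get-dataset-items` API endpoint using the Python standard library. The actor ID `your-username~amazon-search-products-scraper` and the token are placeholders, not values taken from this page; replace them with the ID shown in your Console and your own API token.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"

def build_run_request(actor_id: str, token: str, actor_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that runs an actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )

# Placeholder actor ID and token: replace both with your own values.
request = build_run_request(
    "your-username~amazon-search-products-scraper",
    "YOUR_APIFY_TOKEN",
    {"startUrls": [{"url": "Playstation"}], "maxPages": 1},
)
# To actually start the run, pass the request to urllib.request.urlopen().
```

Keeping request construction separate from sending makes it easy to inspect the payload before spending any usage credit.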

Input

The actor accepts the following input parameters:

Required Fields

  • startUrls (Array): List of keywords or Amazon search URLs, each given as an object with a "url" key
    • Can be simple keywords like "Playstation" or "TV"
    • Can be full Amazon search URLs like "https://www.amazon.com/s?k=Playstation"
    • Supports bulk input - add multiple keywords/URLs

Optional Fields

  • amazonDomain (String): Amazon domain to use
    • Examples: "amazon.com", "amazon.co.uk", "amazon.de", "amazon.fr"
    • Default: "amazon.com"
  • language (String): Language for Amazon pages
    • Options: "" (auto), "en-US", "en-GB", "de-DE", "fr-FR", "es-ES", "it-IT", "ja-JP", "zh-CN"
    • Default: "" (automatic detection)
  • proxyCountry (String): Country for proxy selection
    • Options: "AUTO", "US", "GB", "DE", "FR", "ES", "IT", "CA", "AU", "JP"
    • Default: "AUTO" (automatic selection)
  • useCaptchaSolver (Boolean): Enable automatic captcha solving
    • Default: false
    • Warning: May incur additional costs and processing time
  • sortOrder (String): How to sort search results
    • Options:
      • "relevanceblender" - Most relevant (default)
      • "price-asc-rank" - Price: Low to High
      • "price-desc-rank" - Price: High to Low
      • "review-rank" - Customer Reviews
      • "date-desc-rank" - Newest Arrivals
    • Default: "relevanceblender"
  • maxPages (Integer): Maximum number of pages to scrape per search
    • Minimum: 1
    • Maximum: 100
    • Default: 1
  • proxyConfiguration (Object): Proxy settings
    • Default: No proxy (direct connection)
    • Automatic fallback to datacenter and residential proxies if blocked
    • See proxy configuration section below

Input Example

{
  "startUrls": [
    { "url": "Playstation" },
    { "url": "https://www.amazon.com/s?k=TV" },
    { "url": "Shoes" }
  ],
  "amazonDomain": "amazon.com",
  "sortOrder": "relevanceblender",
  "maxPages": 3,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
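
When the input is generated by a script, it can help to check it against the documented limits before starting a run. The following is a minimal sketch, not part of the actor: `build_input` and `SORT_ORDERS` are hypothetical helper names, and the checks simply mirror the ranges and options listed above.

```python
SORT_ORDERS = {
    "relevanceblender",
    "price-asc-rank",
    "price-desc-rank",
    "review-rank",
    "date-desc-rank",
}

def build_input(queries, amazon_domain="amazon.com", sort_order="relevanceblender", max_pages=1):
    """Return an input dict in the documented shape, rejecting out-of-range values."""
    if not queries:
        raise ValueError("startUrls requires at least one keyword or search URL")
    if sort_order not in SORT_ORDERS:
        raise ValueError(f"unknown sortOrder: {sort_order!r}")
    if not 1 <= max_pages <= 100:
        raise ValueError("maxPages must be between 1 and 100")
    return {
        # Each entry is an object with a "url" key, for keywords and URLs alike.
        "startUrls": [{"url": query} for query in queries],
        "amazonDomain": amazon_domain,
        "sortOrder": sort_order,
        "maxPages": max_pages,
        "proxyConfiguration": {"useApifyProxy": False},
    }

# Reproduces the input example above.
actor_input = build_input(
    ["Playstation", "https://www.amazon.com/s?k=TV", "Shoes"],
    max_pages=3,
)
```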

Output

The actor saves scraped product data to the Apify dataset. Each product record contains:

Core Product Information

| Field | Type | Description |
| --- | --- | --- |
| asin | String | Amazon Standard Identification Number (10 characters) |
| productDescription | String | Product title/description |
| imgUrl | String | Product image URL |
| price | Number or null | Current product price |
| retailPrice | Number or null | Original/retail price (if on sale) |
| productRating | String or null | Overall product rating (e.g., "4.7 out of 5 stars") |
| countReview | Number | Total number of reviews |
| dpUrl | String | Product detail page URL (relative) |
| prime | Boolean | Whether product is Prime eligible |
| sponsored | Boolean | Whether product is a sponsored result |
| salesVolume | String or null | Sales volume information (e.g., "7K+ bought in past month") |
| deliveryMessage | String or null | Delivery time and shipping information |
| manufacturer | String or null | Product manufacturer/brand |
| series | String or null | Product series or model (if applicable) |
| variations | Array | Product variations (size, color, etc.) |
| productDetails | Array | Product details and features |
| secondaryOffer | Number | Used/refurbished price (if available) |

Search Metadata

| Field | Type | Description |
| --- | --- | --- |
| statusCode | Number | HTTP status code (200 = success) |
| statusMessage | String | Status message ("FOUND", "ERROR", "BLOCKED", etc.) |
| keyword | String | Search keyword used |
| domainCode | String | Amazon domain code (e.g., "com", "co.uk") |
| page | Number | Page number of search results |
| selectedCategory | String | Selected category (default: "aps") |
| browseNode | Number or null | Amazon browse node ID |
| nodeHierarchy | String or null | Browse node hierarchy |
| resultCount | Number | Total number of search results |
| categories | Array | Product categories found |
| similarKeywords | Array | Similar/related keywords with URLs |
| currentPage | Number | Current page number |
| sortStrategy | String | Sort strategy used |
| searchResultPosition | Number | Position of product in search results (0-indexed) |

Output Example

[
  {
    "asin": "B0CL5KNB9M",
    "productDescription": "PlayStation®5 Digital Edition (slim)",
    "imgUrl": "https://m.media-amazon.com/images/I/51fM0CKG+HL._AC_UY218_.jpg",
    "price": null,
    "retailPrice": null,
    "productRating": "4.7 out of 5 stars",
    "countReview": 10500,
    "dpUrl": "/PlayStation%C2%AE5-Digital-slim-PlayStation-5/dp/B0CL5KNB9M/ref=sr_1_1?...",
    "prime": false,
    "sponsored": false,
    "salesVolume": "7K+ bought in past month",
    "deliveryMessage": null,
    "manufacturer": "Amazon's Choice: Overall Pick",
    "series": null,
    "variations": [],
    "productDetails": [
      "Nov 24, 2023",
      "No featured offers available"
    ],
    "secondaryOffer": 0.0,
    "statusCode": 200,
    "statusMessage": "FOUND",
    "keyword": "Playstation",
    "domainCode": "com",
    "page": 1,
    "selectedCategory": "aps",
    "browseNode": null,
    "nodeHierarchy": null,
    "resultCount": 50000,
    "categories": [],
    "similarKeywords": [
      {
        "keyword": "playstation 5",
        "url": "/s?k=playstation+5&ref=rsl_sug_0_0&..."
      },
      {
        "keyword": "playstation gift card",
        "url": "/s?k=playstation+gift+card&ref=rsl_sug_0_3&..."
      }
    ],
    "currentPage": 1,
    "sortStrategy": "relevanceblender",
    "searchResultPosition": 0
  }
]
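
Two fields in this record usually need post-processing: `productRating` is a string rather than a number, and `dpUrl` is relative. The sketch below shows one way to normalize both. The helper names are hypothetical, the comma handling assumes that non-English domains may format ratings like "4,7", and the sample `dpUrl` is a shortened stand-in rather than real output.

```python
import re

def parse_rating(text):
    """Turn '4.7 out of 5 stars' into 4.7; return None when the rating is missing or unparseable."""
    if not text:
        return None
    match = re.match(r"\s*(\d+(?:[.,]\d+)?)", text)
    if match is None:
        return None
    # Assumption: some locales use a decimal comma, so normalize it to a dot.
    return float(match.group(1).replace(",", "."))

def product_url(record):
    """Join the relative dpUrl with the domain code, e.g. 'com' becomes https://www.amazon.com/..."""
    return f"https://www.amazon.{record['domainCode']}{record['dpUrl']}"

# Shortened, illustrative record; a real dpUrl carries extra path segments and tracking parameters.
sample = {
    "asin": "B0CL5KNB9M",
    "productRating": "4.7 out of 5 stars",
    "dpUrl": "/dp/B0CL5KNB9M",
    "domainCode": "com",
}
rating = parse_rating(sample["productRating"])
url = product_url(sample)
```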

Proxy Configuration

The actor includes intelligent proxy fallback logic to avoid blocks:

Default Behavior

  1. Starts with No Proxy: Direct connection to Amazon (fastest, lowest cost)
  2. Automatic Detection: Detects if Amazon blocks the request
  3. Fallback to Datacenter: If blocked, automatically switches to datacenter proxy
  4. Fallback to Residential: If datacenter fails, switches to residential proxy
  5. Retry Logic: 3 retries with residential proxy before giving up
  6. Sticky Proxy: Once fallback occurs, uses the same proxy type for remaining requests
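
The steps above can be modeled as a small state machine. This is an illustrative sketch of the described behavior, not the actor's actual implementation: `ProxyFallback` and `try_fetch` are hypothetical names, and a blocked request is represented by `try_fetch` returning `None`.

```python
from typing import Callable, Optional

PROXY_TIERS = ("none", "datacenter", "residential")
RESIDENTIAL_RETRIES = 3

class ProxyFallback:
    """Tracks the current proxy tier. The tier only moves forward, which makes it sticky."""

    def __init__(self) -> None:
        self.tier_index = 0  # start with a direct connection

    @property
    def tier(self) -> str:
        return PROXY_TIERS[self.tier_index]

    def fetch(self, url: str, try_fetch: Callable[[str, str], Optional[str]]) -> Optional[str]:
        """Call try_fetch(url, tier), escalating one tier each time the request is blocked."""
        while True:
            result = try_fetch(url, self.tier)
            if result is not None:
                return result
            if self.tier_index == len(PROXY_TIERS) - 1:
                break  # already on residential: move on to the retries
            self.tier_index += 1
        for _ in range(RESIDENTIAL_RETRIES):
            result = try_fetch(url, self.tier)
            if result is not None:
                return result
        return None  # give up on this URL

attempts = []

def blocked_without_residential(url: str, tier: str) -> Optional[str]:
    """Fake fetcher for demonstration: only the residential tier gets through."""
    attempts.append(tier)
    return "<html>...</html>" if tier == "residential" else None

fallback = ProxyFallback()
page = fallback.fetch("https://www.amazon.com/s?k=TV", blocked_without_residential)
# attempts is now ["none", "datacenter", "residential"]; later calls start at "residential".
```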

Proxy Events Logging

All proxy events are clearly logged:

  • 🔓 Proxy: Starting with NO PROXY (direct connection)
  • 🚫 Request blocked with no proxy. Falling back to DATACENTER proxy...
  • ✅ Switched to DATACENTER proxy: ...
  • 🚫 Request blocked with datacenter proxy. Falling back to RESIDENTIAL proxy...
  • ✅ Switched to RESIDENTIAL proxy: ...
  • 🔄 Residential proxy failed. Retry 1/3...

Manual Proxy Configuration

You can manually configure proxy settings in the input:

  • No Proxy: "useApifyProxy": false (default)
  • Datacenter Proxy: "useApifyProxy": true (default Apify proxy)
  • Residential Proxy: "useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]
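
For scripted runs, the three settings above can be kept as named presets and merged into an input object. A small sketch; `PROXY_PRESETS` and `with_proxy` are hypothetical helper names, and the preset values are copied from the list above.

```python
PROXY_PRESETS = {
    "none": {"useApifyProxy": False},
    "datacenter": {"useApifyProxy": True},
    "residential": {"useApifyProxy": True, "apifyProxyGroups": ["RESIDENTIAL"]},
}

def with_proxy(actor_input: dict, preset: str) -> dict:
    """Return a copy of actor_input with proxyConfiguration set to the named preset."""
    if preset not in PROXY_PRESETS:
        raise ValueError(f"unknown proxy preset: {preset!r}")
    return {**actor_input, "proxyConfiguration": dict(PROXY_PRESETS[preset])}

base_input = {"startUrls": [{"url": "TV"}], "maxPages": 1}
residential_input = with_proxy(base_input, "residential")
```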

Best Use Cases

  • 📊 Market Research: Discover products, analyze competition, and track market trends
  • 💰 Price Monitoring: Monitor product prices across search results and track price changes
  • 🔍 Product Discovery: Find products by keywords and explore related items
  • 📈 Competitor Analysis: Analyze competitor products, pricing, and customer feedback
  • 📝 Content Creation: Gather product information for blog posts, reviews, or comparisons
  • 🛒 E-commerce Integration: Import Amazon product data into your own platform
  • 📋 Product Comparison: Compare multiple products side-by-side from search results
  • 📱 Inventory Research: Research product availability and variations
  • 🎯 Keyword Research: Discover related keywords and search trends
  • 📊 SEO Analysis: Analyze product rankings and search positions

Frequently Asked Questions

How does the keyword search work?

Simply enter keywords (e.g., "Playstation", "TV") or full Amazon search URLs. The actor automatically builds the correct Amazon search URL and scrapes the results. Keywords are extracted from URLs if you provide full search URLs.

Can I scrape multiple keywords at once?

Yes! Add multiple keywords or URLs to the startUrls array. The actor will process them sequentially, scraping all pages for each keyword.

What happens if Amazon blocks my requests?

The actor automatically detects blocks and falls back through the proxy chain: No Proxy → Datacenter → Residential. All proxy events are logged so you can see what's happening. If all proxies fail after retries, the actor will log an error and continue with other keywords.

How many pages can I scrape?

You can scrape up to 100 pages per search keyword. Set the maxPages parameter to control this. Note that scraping many pages may take longer and use more resources.

Can I scrape Amazon from different countries?

Yes! Set the amazonDomain parameter to any Amazon domain (amazon.com, amazon.co.uk, amazon.de, amazon.fr, etc.). The actor automatically adapts to the domain.

What's the difference between keywords and search URLs?

  • Keywords: Simple text like "Playstation" - the actor builds the search URL automatically
  • Search URLs: Full URLs like "https://www.amazon.com/s?k=Playstation" - the actor extracts the keyword and uses it

Both work the same way - use whichever is more convenient for you.
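
To make the equivalence concrete, here is a rough sketch of how a keyword or a search URL could be reduced to the same search request. It is not the actor's code: the function names are hypothetical, and the use of the `k`, `s`, and `page` query parameters is an assumption based on the URL and sort values shown in this document.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

def extract_keyword(entry: str) -> str:
    """Return the keyword from a plain keyword, or from the k= parameter of a search URL."""
    if entry.startswith(("http://", "https://")):
        query = parse_qs(urlsplit(entry).query)
        if "k" not in query:
            raise ValueError(f"no k= parameter in {entry!r}")
        return query["k"][0]
    return entry.strip()

def build_search_url(entry: str, domain: str = "amazon.com",
                     sort_order: str = "relevanceblender", page: int = 1) -> str:
    """Build a search URL for the given keyword or URL (parameter names are assumptions)."""
    params = {"k": extract_keyword(entry), "s": sort_order}
    if page > 1:
        params["page"] = page
    return f"https://www.{domain}/s?{urlencode(params)}"

from_keyword = build_search_url("Playstation")
from_url = build_search_url("https://www.amazon.com/s?k=Playstation")
# Both inputs produce the same search URL.
```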

How long does it take to scrape results?

Scraping speed depends on:

  • Number of keywords
  • Number of pages per keyword
  • Amazon's response time
  • Whether proxies are needed

Typically, one page for one keyword takes 5-10 seconds. When a search spans multiple pages, they are scraped concurrently for better speed.
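
The combination of sequential keywords and concurrent pages can be pictured with a thread pool. This is a sketch under stated assumptions, not the actor's implementation: `fetch_page` is a hypothetical callable supplied by the caller, and the worker count of 5 is arbitrary because the actor's real concurrency is not documented here.

```python
from concurrent.futures import ThreadPoolExecutor

def scrape_keyword(keyword, max_pages, fetch_page, workers=5):
    """Fetch pages 1..max_pages concurrently and return the results in page order."""
    pages = range(1, max_pages + 1)
    with ThreadPoolExecutor(max_workers=min(workers, max_pages)) as pool:
        # pool.map preserves input order even though the calls overlap in time.
        return list(pool.map(lambda page: fetch_page(keyword, page), pages))

def scrape_all(keywords, max_pages, fetch_page):
    """Process keywords one after another; only the pages within a keyword overlap."""
    return {keyword: scrape_keyword(keyword, max_pages, fetch_page) for keyword in keywords}

def fake_fetch_page(keyword, page):
    """Stand-in for a real page download, used only for demonstration."""
    return f"{keyword}:{page}"

results = scrape_all(["tv", "shoes"], 3, fake_fetch_page)
```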

What if a search returns no results?

The actor will return a record with statusCode: 404 and statusMessage: "NOT_FOUND" for that search. It will continue processing other keywords.
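
Because such status records sit in the same dataset as product records, downstream code may want to separate them first. A minimal sketch, assuming every record carries the `statusCode` and `statusMessage` fields described in the output section; the function name and the sample records are hypothetical.

```python
def split_by_status(records):
    """Separate successful product records from status-only records such as NOT_FOUND."""
    found, other = [], []
    for record in records:
        if record.get("statusCode") == 200 and record.get("statusMessage") == "FOUND":
            found.append(record)
        else:
            other.append(record)
    return found, other

# Illustrative records only; real records contain many more fields.
dataset = [
    {"keyword": "Playstation", "asin": "B0CL5KNB9M", "statusCode": 200, "statusMessage": "FOUND"},
    {"keyword": "qwertyzxcv", "statusCode": 404, "statusMessage": "NOT_FOUND"},
]
products, problems = split_by_status(dataset)
```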

Can I sort results differently?

Yes! Use the sortOrder parameter:

  • "relevanceblender" - Most relevant (default)
  • "price-asc-rank" - Price: Low to High
  • "price-desc-rank" - Price: High to Low
  • "review-rank" - Customer Reviews
  • "date-desc-rank" - Newest Arrivals

How accurate is the data extraction?

The actor uses robust HTML parsing with multiple selectors and validation to ensure accurate data extraction. It handles various Amazon page layouts and extracts data reliably.

What's included in the output?

Each product includes:

  • Basic info: ASIN, title, price, rating, reviews
  • Images and URLs
  • Prime/sponsored status
  • Sales volume and delivery info
  • Manufacturer and variations
  • Search metadata: keyword, page, position, categories, similar keywords
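
Since each record mixes scalar fields with arrays such as `variations` and `similarKeywords`, a custom CSV export needs to decide how to flatten them. The sketch below stores nested values as JSON strings. It is one possible convention with a hypothetical function name, and it is not how the Console's built-in export works.

```python
import csv
import io
import json

def records_to_csv(records, fields):
    """Serialize the selected fields to CSV text; lists and dicts are stored as JSON strings."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    for record in records:
        row = {}
        for name in fields:
            value = record.get(name)
            row[name] = json.dumps(value) if isinstance(value, (list, dict)) else value
        writer.writerow(row)
    return buffer.getvalue()

csv_text = records_to_csv(
    [{"asin": "B0CL5KNB9M", "price": None, "countReview": 10500, "variations": []}],
    ["asin", "price", "countReview", "variations"],
)
```

Null values such as a missing `price` come out as empty cells, which most spreadsheet tools read as blank.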

Can I use this for commercial purposes?

Yes, as long as you comply with:

  • Amazon's Terms of Service
  • Local laws regarding web scraping
  • Data protection and privacy regulations
  • Rate limiting and respectful usage

Support and Feedback

If you encounter any issues or have suggestions for improvement:

  1. Check the actor logs for detailed error messages
  2. Verify your input parameters are correct
  3. Ensure keywords/URLs are valid
  4. Check that you're not exceeding rate limits

For technical support or feature requests, please contact the actor maintainer through the Apify platform.

Cautions

⚠️ Important Legal and Ethical Considerations:

  • This actor scrapes only publicly available data from Amazon search pages
  • Respect Amazon's Terms of Service when using this actor
  • Do not scrape private or password-protected content
  • Comply with local laws regarding web scraping and data collection
  • Use responsibly and avoid aggressive scraping that could impact Amazon's servers
  • Respect rate limits - the actor includes delays between requests
  • Data usage: You are responsible for how you use the scraped data and must ensure compliance with privacy laws, spam regulations, and data protection requirements
  • Proxy usage: Using proxies may incur additional costs depending on your Apify plan

Note: This actor is designed for legitimate business and research purposes. Always use it ethically and in compliance with applicable laws and Amazon's terms of service.