Amazon Search Scrapper avatar

Amazon Search Scrapper

Under maintenance

Pricing

$7.29/month + usage

Go to Apify Store
Amazon Search Scrapper

Amazon Search Scrapper

Under maintenance

Collect detailed Amazon product data across all major Amazon domains worldwide. Extract titles, prices, ratings, features, images, and seller details using simple search keywords—ideal for market research, price tracking, and global catalog building.

Pricing

$7.29/month + usage

Rating

0.0

(0)

Developer

Aadyantha Solutions

Aadyantha Solutions

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a month ago

Last modified

Share

Amazon Search & Product Scraper

This actor extracts Amazon product data based on one or more search keywords.
It uses a two‑stage crawling approach to balance speed, reliability, and data completeness.

  • Search result pages are crawled to discover ASINs
  • Individual product pages are crawled to extract detailed product information
  • Built with Playwright, session pooling, and residential proxies for Amazon stability

Key Capabilities

  • Multiple search keywords in a single run
  • Automatic pagination of search results
  • Configurable maximum results per keyword
  • Full product‑level data extraction
  • Supports products with or without ratings/reviews
  • Safe seller extraction using verify logic (no waiting for optional elements)
  • Retry strategy:
    • Search pages: up to 5 retries
    • Product pages: 1 retry
  • Session pooling + residential proxies for reduced blocking

Input Parameters

ParameterTypeRequiredDefaultDescription
search_stringsArrayYesKeywords to search on Amazon
max_resultsNumberNo5Maximum number of products per keyword
domain_codeStringNoinAmazon domain (in, com, co.uk, etc.)
sort_byStringNorelevanceblenderAmazon sort option
proxyConfigurationObjectNoApify ResidentialProxy configuration

Input Example

{
"search_strings": ["luggage bag", "school backpack"],
"max_results": 10,
"domain_code": "in",
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"countryCode": "IN"
}
}

Notes

  • Each keyword is processed independently
  • Pagination continues automatically until enough ASINs are discovered
  • Product crawling stops once max_results is reached
  • Product pages are fetched individually for accuracy

Output Schema

Each dataset item represents one Amazon product.

Output Fields

FieldTypeDescription
search_keywordStringKeyword used for discovery
asinStringAmazon Standard Identification Number
titleStringProduct title
brandString | nullBrand name
priceString | nullSelling price
mrpString | nullMaximum retail price
ratingNumber | nullAverage customer rating
reviews_countNumber | nullNumber of reviews
featuresArrayBullet‑point product features
imagesArrayHigh‑resolution product image URLs
seller_nameString | nullSeller name
seller_typeStringAmazon / Third Party
seller_profile_urlString | nullSeller profile link
product_urlStringAmazon product page URL

Output Example

{
"search_keyword": "luggage bag",
"asin": "B0F1FPWF8N",
"title": "Skybags Brat Polycarbonate 65 cms Check‑in Luggage",
"brand": "Skybags",
"price": "₹5,499",
"mrp": "₹9,999",
"rating": 4.2,
"reviews_count": 1384,
"features": [
"Polycarbonate hard shell",
"4 smooth spinner wheels",
"TSA lock"
],
"images": [
"https://m.media-amazon.com/images/I/81abc123_SX1000_.jpg"
],
"seller_name": "Amazon Retail",
"seller_type": "Amazon",
"seller_profile_url": "https://www.amazon.in/sp?seller=A1XYZ",
"product_url": "https://www.amazon.in/dp/B0F1FPWF8N"
}

For Amazon scraping, Residential proxies are strongly recommended.

{
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"countryCode": "IN"
}
  • Match proxy country with Amazon domain
  • Keep Playwright concurrency ≤ 3 for stability

Limitations

  • Amazon DOM structure may change over time
  • Prices, sellers, and availability can vary by user/location
  • Some products may not have ratings or reviews
  • Excessive concurrency can trigger bot protection

Troubleshooting Tips

  1. Use residential proxies (avoid datacenter IPs)
  2. Reduce max_results if timeouts occur
  3. Avoid extremely generic keywords
  4. Check Actor logs for blocked pages

Support

For issues or feature requests:

📧 aadyanthasolutions@gmail.com