Amazon Product List Scraper

Pricing

$4.99/month + usage

Scrape Amazon product listings by keyword or URL. Supports concurrent searches, ASIN deduplication, auto-retry, and combined keyword + URL mode in a single run.

Rating

0.0 (0)

Developer

ZeroBreak

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 2 days ago

🛒 Amazon Product List Scraper

Scrape Amazon product listings at scale using keywords and/or direct search URLs. Supports concurrent searches, automatic deduplication by ASIN, and configurable retries — no browser required.


✨ Features

  • 🔍 Keyword search — search Amazon by any keyword or phrase
  • 🔗 URL search — scrape any Amazon search result URL directly
  • 🔀 Combined mode — run keywords + URLs together in one run
  • Concurrent requests — run multiple searches in parallel (up to 10)
  • 🔁 Auto retry — exponential back-off retry on failed requests (up to 3x)
  • 🧹 Deduplication — removes duplicate products by ASIN across all searches
  • ⏱️ Configurable timeout — set per-request timeout (10–120 seconds)
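
The auto-retry feature above can be pictured with a small sketch. The Actor's actual delay schedule is not documented here, so the one-second base delay, the doubling factor, and the helper names `backoff_delays` and `call_with_retry` are all assumptions made for illustration only.

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


def backoff_delays(max_retries: int = 3, base_secs: float = 1.0) -> List[float]:
    """Delay before each retry: base, 2*base, 4*base, ... (assumed schedule)."""
    return [base_secs * (2 ** attempt) for attempt in range(max_retries)]


def call_with_retry(
    fn: Callable[[], T],
    max_retries: int = 3,
    base_secs: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn once, then retry up to max_retries times on any exception."""
    delays = backoff_delays(max_retries, base_secs)
    attempt = 0
    while True:
        try:
            return fn()
        except Exception:
            if attempt == max_retries:
                raise  # retries exhausted: surface the last error
            sleep(delays[attempt])
            attempt += 1
```

Passing a custom `sleep` callable makes the helper easy to exercise without waiting, for example `call_with_retry(fetch, sleep=lambda secs: None)` where `fetch` is any zero-argument function of your own.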

📥 Input

| Field                  | Type     | Default | Description                                      |
|------------------------|----------|---------|--------------------------------------------------|
| keywords               | string[] | []      | Keywords to search (e.g. laptop, headphones)     |
| urls                   | string[] | []      | Amazon search URLs to scrape                     |
| max_results_per_search | integer  | 50      | Max products per keyword/URL (0 = unlimited)     |
| max_pages_per_search   | integer  | 1       | Max result pages per search (~20 products/page)  |
| concurrency            | integer  | 3       | Number of parallel searches (1–10)               |
| request_timeout_secs   | integer  | 30      | Per-request timeout in seconds (10–120)          |
| retry_on_error         | boolean  | true    | Retry failed requests up to 3 times              |
| deduplicate_results    | boolean  | true    | Remove duplicate ASINs across all results        |
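
When the input is assembled programmatically, it can help to apply the documented defaults and range limits before starting a run. The following is a minimal sketch under that assumption; `DEFAULTS`, `RANGES`, and `build_input` are hypothetical names, and the Actor itself may validate input differently.

```python
import copy

# Defaults and ranges copied from the input fields documented above.
DEFAULTS = {
    "keywords": [],
    "urls": [],
    "max_results_per_search": 50,
    "max_pages_per_search": 1,
    "concurrency": 3,
    "request_timeout_secs": 30,
    "retry_on_error": True,
    "deduplicate_results": True,
}
RANGES = {
    "concurrency": (1, 10),
    "request_timeout_secs": (10, 120),
}


def build_input(**overrides) -> dict:
    """Merge overrides into the defaults and check the documented ranges."""
    unknown = sorted(set(overrides) - set(DEFAULTS))
    if unknown:
        raise ValueError(f"unknown input fields: {unknown}")
    run_input = copy.deepcopy(DEFAULTS)  # avoid sharing the default lists
    run_input.update(overrides)
    for field, (low, high) in RANGES.items():
        if not low <= run_input[field] <= high:
            raise ValueError(f"{field} must be between {low} and {high}")
    return run_input
```

For example, `build_input(keywords=["laptop"], concurrency=5)` returns a complete input object, while `build_input(concurrency=11)` raises a `ValueError`.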

Example Input

{
  "keywords": ["wireless headphones", "gaming mouse"],
  "urls": ["https://www.amazon.com/s?k=laptop&rh=n%3A172282"],
  "max_results_per_search": 100,
  "max_pages_per_search": 3,
  "concurrency": 5,
  "retry_on_error": true,
  "deduplicate_results": true
}

Mode is auto-detected:

  • Only keywords filled → keyword mode
  • Only urls filled → URL mode
  • Both filled → combined mode (all run together)
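
The auto-detection rules above amount to a simple check on which lists are non-empty. A minimal sketch follows; the function name `detect_mode`, the returned mode strings, and the error raised when both lists are empty are assumptions, since the behavior for an empty input is not documented.

```python
def detect_mode(run_input: dict) -> str:
    """Return the run mode implied by which input lists are filled."""
    has_keywords = bool(run_input.get("keywords"))
    has_urls = bool(run_input.get("urls"))
    if has_keywords and has_urls:
        return "combined"
    if has_keywords:
        return "keyword"
    if has_urls:
        return "url"
    # Assumption: an input with neither list filled is treated as an error.
    raise ValueError("provide at least one keyword or URL")
```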

📤 Output

Each search result is pushed to the Apify dataset as one item. The fields of the raw API response appear at the top level of the item, alongside metadata:

{
  "label": "keyword:wireless headphones",
  "status": "success",
  "status_code": 200,
  "products": [
    {
      "asin": "B09XYZ1234",
      "title": "Sony WH-1000XM5 Wireless Headphones",
      "price": 279.99,
      "rating": 4.7,
      "reviews": 12483,
      "url": "https://www.amazon.com/dp/B09XYZ1234",
      "image": "https://m.media-amazon.com/images/..."
    }
  ],
  "total_products": 48
}
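
Once the dataset items are downloaded, the nested `products` arrays usually need to be flattened into one list. The sketch below does that and also drops repeated ASINs, which is useful if `deduplicate_results` was disabled. The helper name `unique_products` is hypothetical, and skipping non-success items and products without an ASIN is a choice made for this example, not documented Actor behavior.

```python
from typing import Iterable, List


def unique_products(items: Iterable[dict]) -> List[dict]:
    """Flatten products from successful items, keeping the first copy of each ASIN."""
    seen = set()
    products = []
    for item in items:
        if item.get("status") != "success":
            continue  # assumption: ignore items from failed searches
        for product in item.get("products", []):
            asin = product.get("asin")
            if not asin or asin in seen:
                continue  # skip entries without an ASIN and repeats
            seen.add(asin)
            products.append(product)
    return products
```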

A summary is also saved to the key-value store under the OUTPUT key:

{
  "total_searches": 3,
  "successful": 3,
  "errors": 0,
  "total_products": 214
}
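
A similar summary can be recomputed from the dataset items as a sanity check. This is only a sketch: it assumes that every item whose `status` is not "success" counts as an error and that `total_products` is the plain sum of the per-item counts, which may differ from how the Actor computes its own summary (for instance after deduplication). The name `summarize` is hypothetical.

```python
from typing import Iterable


def summarize(items: Iterable[dict]) -> dict:
    """Rebuild a run summary with the same keys as the OUTPUT record."""
    items = list(items)
    successful = sum(1 for item in items if item.get("status") == "success")
    return {
        "total_searches": len(items),
        "successful": successful,
        # Assumption: anything that is not "success" is counted as an error.
        "errors": len(items) - successful,
        "total_products": sum(item.get("total_products", 0) for item in items),
    }
```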

🚀 Usage Tips

  • Use max_pages_per_search: 3–5 with max_results_per_search: 100 for broad category scraping
  • Set concurrency: 5–10 for large keyword lists to reduce total run time
  • Enable deduplicate_results when combining keywords and URLs that may overlap
  • Use Amazon's filter parameters in URLs (e.g. &rh=n%3A172282 for category filters) for more targeted results
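
Filtered search URLs like the one in the last tip can be built with the standard library so that the encoding is handled for you. This is a minimal sketch: `build_search_url` is a hypothetical helper, and only the `k` and `rh` parameters that appear in this document's example URL are used.

```python
from typing import Optional
from urllib.parse import urlencode


def build_search_url(
    keyword: str,
    category_node: Optional[str] = None,
    base: str = "https://www.amazon.com/s",
) -> str:
    """Build an Amazon search URL, optionally restricted to a category node."""
    params = {"k": keyword}
    if category_node is not None:
        # urlencode percent-encodes the colon, giving rh=n%3A<node>
        params["rh"] = f"n:{category_node}"
    return f"{base}?{urlencode(params)}"


print(build_search_url("laptop", "172282"))
# https://www.amazon.com/s?k=laptop&rh=n%3A172282
```

The resulting strings can be placed directly in the `urls` input field.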

This actor is intended for personal research, price monitoring, and product analysis. Always review Amazon's Terms of Service before use. Do not use for mass data harvesting or commercial redistribution of Amazon data.