Amazon Search Scraper
Pricing
$9.00/month + usage
Amazon Search Scraper
β‘ Instantly discover Amazonβs best-selling products with one click! π This smart actor fetches real-time prices, ratings, and deals β giving you insights in seconds. Trusted by pros for accuracy, speed, and reliability. Run it now and find hidden gems before your competitors do! π
0.0 (0)
Pricing
$9.00/month + usage
0
1
1
Last modified
a day ago
π Amazon Search Scraper
One-line tagline: Instantly extract product listings and product-page metadata from Amazon search queries β fast, secure, and ready for business use.
π Summary
Amazon Search Scraper retrieves product data from Amazon search results and product pages and returns clean, business-ready JSON records for analysis. Designed for speed and reliability, it helps teams discover product details, prices, reviews, images, and availability for competitive research, analytics, and monitoring.
π‘ Use cases / When to use
- Competitive price monitoring and alerting
- Market/product research and sourcing
- Creating product catalogs and feeds
- Gathering product images and descriptions for analytics
- Quickly prototyping e-commerce dashboards
β‘ Quick Start β Console (one-click)
- Open this Actor in Apify Console.
- Fill the Queries input (single keyword or array of keywords).
- (Optional) Enable Proxy Configuration if scraping at scale.
- Click Run. Results appear in the default dataset/OUTPUT in seconds.
Friendly microcopy: βPlug in a search term, click Run, and get structured product data instantly.β
βοΈ Quick Start (CLI + API)
CLI
# Run an actor with JSON input via apify-cliapify run --actor <your-actor-id> --input input.example.json
Python (apify-client)
from apify_client import ApifyClientclient = ApifyClient('<APIFY_TOKEN>')run = client.actor('your-user/amazon-search-scraper').call(run_input={"queries": ["wireless earbuds"]})print('Started run:', run['id'])
π Inputs (fields & schema)
Console JSON input example (see input.example.json file):
{"queries": ["wireless earbuds", "gaming mouse"],"headless": true,"requestDelay": [1.0, 2.0]}
Fields
queriesβ string or array β required β Search keywords or Amazon product URLs. The actor accepts either a search keyword (will run site search) or a direct Amazon product URL for product-page scraping.headlessβ boolean β optional β Run browser in headless mode (default: true).requestDelayβ array [min, max] β optional β Delay range (seconds) between product-page requests to reduce rate.
βοΈ Configuration
| π Name | π Type | β Required | βοΈ Default | π Example | π§ Notes |
|---|---|---|---|---|---|
| queries | string/array | β Yes | β | ["wireless earbuds"] | Search terms or product URLs |
| headless | boolean | βοΈ Optional | true | false | Turn off to debug visually |
| requestDelay | array (min,max) | βοΈ Optional | [1.0, 2.0] | [0.5, 1.0] | Avoids aggressive scraping |
| proxyConfiguration | object | βοΈ Optional | {} | {"useApifyProxy": true} | Use residential proxies for scale |
Example Console setup: paste "wireless earbuds" into queries and press Run Actor.
π Outputs (Dataset / KV examples)
Each dataset item is a JSON object with attributes similar to:
{"asin": "B09EXAMPLE","title": "Wireless Earbuds XYZ","brand_name": "BrandCo","url": "https://www.amazon.com/...]","price": "$59.99","currency": "$","thumbnail": "https://...jpg","images": ["https://...jpg", "https://...jpg"],"stars": 4.3,"review_count": "1.2k","availability": "In Stock","description": "Key bullet points...","categories": "Electronics > Audio","search_keyword": "wireless earbuds"}
Note: The actor pushes results to the default dataset and also returns the full result array in the key-value store under the standard
OUTPUTkey.
π Environment Variables
<APIFY_TOKEN>β required to call the Actor programmatically via API.<PROXY_USER:PASS@HOST:PORT>β placeholder for custom proxy credentials.
Security note: Store secrets in Apify Console Secrets β do not paste them into input fields.
βΆοΈ How to Run
Console
- Go to the Actor page in Apify Console.
- Paste your queries (single string or array) into the Input field.
- (Optional) Configure proxies under the Proxy Configuration editor.
- Click Run.
CLI
$apify run --actor your-user/amazon-search-scraper --input input.example.json
API (Python)
See Quick Start (above) β use client.actor(...).call(run_input=...) and read the returned run ID.
β° Scheduling & Webhooks
- Use Apify Console scheduling to run this Actor at any interval (hourly, daily, weekly).
- Configure webhooks on run completion to forward JSON output to your endpoint for real-time processing.
πΎοΈ Logs & Troubleshooting
-
Check the Console logs for step-by-step run info and any per-item warnings.
-
Common issues:
- No results: verify that
queriesare valid and spelled correctly. - Request timeouts: increase
requestDelayor enable Proxy Configuration. - Selector changes on Amazon: refresh the run and adjust queries β the actor is resilient but web UIs change frequently.
- No results: verify that
π Permissions & Storage Notes
- This Actor collects publicly visible product information only. It does not perform account actions.
- Results are stored in Apify Datasets/Key-Value stores in your account and follow Apifyβs standard retention and access controls.
π Changelog / Versioning
- v0.1.0 β Initial public release: search + product-page scraping, structured dataset output.
π Notes / TODOs
- TODO: Consider adding a CLI flag / input for limiting the number of product pages per query (reason: some queries return many results).
- TODO: Add optional CSV export in output settings (reason: convenient for BI ingestion).
π Proxy Configuration
If you will run many searches or large-scale scraping, configure Apify Proxy or custom proxies.
Enable Apify Proxy (Console):
- In the Actor run form, open Proxy configuration and enable Use Apify Proxy (choose RESIDENTIAL for best results).
Custom proxy example (as secret):
- Use
<PROXY_USER:PASS@HOST:PORT>format and store as a Console Secret. Reference it in the Proxy Configuration editor.
Environment variables (examples)
HTTP_PROXY=<PROXY_USER:PASS@HOST:PORT>HTTPS_PROXY=<PROXY_USER:PASS@HOST:PORT>
Reminder: Store proxy credentials in Secrets and do not paste them into public inputs.
TODO: Consider proxy rotation for large-scale scraping.
π References
- Apify Actor README guidelines β https://docs.apify.com/console/actors/README
- Apify Input/Output schemas β https://docs.apify.com/platform/input-output
- Apify CLI & API usage β https://docs.apify.com/console/actors/run
π€ What I inferred from main.py
- The Actor accepts
queries(keywords or Amazon URLs) and uses an automated browser to fetch search results and product pages. - It extracts product metadata, images, prices, ratings, and availability and returns structured JSON items.
- It respects throttling delays and can be configured to use proxies for scale.
- Results are pushed to the default dataset and the key-value store under
OUTPUT.
input.example.json
{"queries": ["wireless earbuds","gaming mouse"],"headless": true,"requestDelay": [1.0, 2.0]}
CONFIG.md (optional)
Quick config notes
- Secrets: Add
<APIFY_TOKEN>and any proxy credentials to Console Secrets. - Scaling: For repeated large runs, enable Apify Proxy (RESIDENTIAL) and consider running with scheduling + webhooks to automate downstream processing.
Suggested settings in Console
- Proxy configuration: use Apify Proxy β RESIDENTIAL
- Dataset retention: enable automatic export to your storage of choice
Generated by: Neuro Scraper
On this page
-
- π Summary
- π‘ Use cases / When to use
- β‘ Quick Start β Console (one-click)
- βοΈ Quick Start (CLI + API)
- π Inputs (fields & schema)
- βοΈ Configuration
- π Outputs (Dataset / KV examples)
- π Environment Variables
- βΆοΈ How to Run
- β° Scheduling & Webhooks
- πΎοΈ Logs & Troubleshooting
- π Permissions & Storage Notes
- π Changelog / Versioning
- π Notes / TODOs
- π Proxy Configuration
- π References
- π€ What I inferred from
main.py
Share Actor:
