E-commerce Product Crawler avatar
E-commerce Product Crawler
Deprecated

Pricing

$10.00 / 1,000 results

Go to Apify Store
E-commerce Product Crawler

E-commerce Product Crawler

Deprecated

Crawl product information from Amazon, eBay, and Shopee. Get detailed product data including prices, descriptions, images, ratings, and sales data.

Pricing

$10.00 / 1,000 results

Rating

0.0

(0)

Developer

Hung Dinh

Hung Dinh

Maintained by Community

Actor stats

3

Bookmarked

24

Total users

7

Monthly active users

9 months ago

Last modified

Categories

Share

🎡️ E-commerce Product Crawler

An Apify actor for scraping product information from major e-commerce platforms (Amazon, eBay, Shopee). This actor allows you to collect detailed product data based on your search terms.

✨ Features

  • πŸ” Search products across multiple platforms
  • πŸ“Š Configurable number of results
  • πŸ’° Real-time pricing data
  • πŸ“ Detailed product descriptions
  • πŸ–ΌοΈ Product images
  • ⭐ Ratings and reviews
  • πŸ“ˆ Sales statistics
  • πŸ‘₯ Seller information
  • πŸ€– Powered by Gemini AI
  • πŸ”‘ Pre-configured API key

βš™οΈ Input Configuration

The actor accepts the following input parameters:

FieldTypeRequiredDescription
searchTermStringYesProduct name or keywords to search for
maxProductsIntegerNoMaximum number of products to collect (default: 10, max: 100)
platformStringYesE-commerce platform to search on (amazon, ebay, shopee)

Example input:

{
"searchTerm": "wireless earbuds",
"maxProducts": 20,
"platform": "amazon"
}

πŸ“€ Output

The actor stores results in the default dataset. Each item contains:

  • πŸ“ name: Product name
  • πŸ’° price: Current price
  • πŸ“„ description: Product description
  • πŸ–ΌοΈ images: List of product image URLs
  • ⭐ rating: Average rating (out of 5)
  • πŸ’¬ numReviews: Number of reviews
  • πŸ“Š numSales: Number of sales (if available)
  • πŸͺ seller: Seller name/store
  • πŸ”— url: Product URL
  • 🏒 platform: E-commerce platform name
  • ℹ️ additionalDetails: Additional product details
  • ⏰ collectedAt: Data collection timestamp

Example output:

{
"name": "Sony WF-1000XM4 Wireless Earbuds",
"price": 248.00,
"description": "Industry Leading Noise Canceling Bluetooth Earbuds...",
"images": [
"https://m.media-amazon.com/images/I/abc123.jpg",
"https://m.media-amazon.com/images/I/def456.jpg"
],
"rating": 4.5,
"numReviews": 1250,
"numSales": 5000,
"seller": "Sony Store",
"url": "https://www.amazon.com/product/abc123",
"platform": "amazon",
"additionalDetails": {
"color": "Black",
"brand": "Sony",
"warranty": "1 year"
},
"collectedAt": "2023-12-24T12:00:00"
}

πŸš€ Usage

  1. πŸ“Œ Create a new task for the E-commerce Product Crawler actor
  2. ⌨️ Input your search term, platform choice, and maximum number of products
  3. ▢️ Run the actor and wait for results
  4. πŸ“Š Access the collected data in the actor's dataset

⚠️ Limitations

  • πŸ•’ Rate limiting: The actor includes delays between requests
  • πŸ“ˆ Maximum results: Limited to 100 products per run
  • πŸ” Search scope: Limited to available products
  • 🌐 Platform support: Currently supports Amazon, eBay, and Shopee
  • πŸ€– AI accuracy: Results depend on Gemini API's accuracy

πŸ“¦ Dependencies

  • 🐍 Python 3.9+
  • πŸ“š Required packages:
    • apify>=1.1.0
    • requests>=2.31.0
    • python-dotenv>=1.0.0
    • aiohttp>=3.8.0
    • google-generativeai>=0.3.0
    • beautifulsoup4>=4.12.0
    • lxml>=4.9.0

πŸ“„ License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.


Made with Hung Dinh ❀️