Woocommerce Scraper

Pricing

$19.99/month + usage

Developer

ScrapePilot

Maintained by Community

WooCommerce Store Scraper

A robust Apify Actor for scraping product data from WooCommerce stores using the WooCommerce REST API.

Why Choose Us?

  • Comprehensive Data Extraction: Scrapes complete product information including prices, images, categories, reviews, and more
  • Smart Proxy Management: Automatic proxy fallback system ensures successful scraping even when stores block requests
  • Bulk Processing: Process multiple stores in a single run
  • Incremental Saving: Data is saved as it's scraped, so you don't lose progress if the actor stops
  • Production Ready: Built with error handling, retry logic, and detailed logging

Key Features

  • ✅ Scrape products from multiple WooCommerce stores
  • ✅ Automatic proxy fallback (no proxy → datacenter → residential)
  • ✅ Filter by featured or on-sale products
  • ✅ Detailed product information including prices, images, categories, reviews
  • ✅ Incremental data saving to prevent data loss
  • ✅ Comprehensive error handling and retry logic
  • ✅ Real-time progress logging
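
The fallback order listed above (no proxy → datacenter → residential) could be sketched roughly as follows. This is a minimal illustration, not the actor's actual code: `fetch_with_fallback`, `BlockedError`, the tier labels, and the per-tier retry count are all hypothetical names and assumptions introduced here.

```python
from typing import Callable, Optional, Sequence

# Tiers in the order the feature list gives them. The labels are illustrative
# only; they are not real Apify proxy settings.
PROXY_TIERS: Sequence[Optional[str]] = (None, "datacenter", "residential")


class BlockedError(Exception):
    """Hypothetical error a fetcher raises when the store refuses a request."""


def fetch_with_fallback(
    url: str,
    fetch: Callable[[str, Optional[str]], str],
    retries_per_tier: int = 2,
) -> str:
    """Try each proxy tier in order, retrying within a tier before escalating."""
    last_error: Optional[Exception] = None
    for tier in PROXY_TIERS:
        for _ in range(retries_per_tier):
            try:
                return fetch(url, tier)
            except BlockedError as exc:
                last_error = exc  # remember the failure and keep going
    raise RuntimeError(f"all proxy tiers failed for {url}") from last_error
```

The `fetch` callable is injected so the escalation logic stays independent of any particular HTTP library. Retrying in every tier is an assumption of this sketch.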

Input

JSON Example

{
  "startUrls": [
    { "url": "https://woocommerce.com" }
  ],
  "proxyConfiguration": {
    "useApifyProxy": false
  },
  "limit": 100,
  "featured": false,
  "sale": false,
  "devNoStrip": false
}

Input Fields

  • startUrls (required): Array of store URLs to scrape. Each item should be an object with a url property.
  • proxyConfiguration (optional): Proxy settings. By default, no proxy is used. The actor will automatically fall back to datacenter or residential proxies if needed.
  • limit (optional): Maximum number of products to scrape per store (default: 100, max: 1000).
  • featured (optional): If true, only scrape featured products (default: false).
  • sale (optional): If true, only scrape products that are on sale (default: false).
  • devNoStrip (optional): If true, product names and slugs will not be stripped of whitespace (default: false).
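
To catch mistakes before a run, the field rules above could be checked with a small helper like the one below. It is only a sketch: `build_input` is a hypothetical name, and the lower bound of 1 for `limit` is an assumption (the field list states only the default and the maximum).

```python
from urllib.parse import urlparse

MAX_LIMIT = 1000  # upper bound stated in the field list


def build_input(store_urls, limit=100, featured=False, sale=False,
                use_apify_proxy=False, dev_no_strip=False):
    """Return an input dict shaped like the JSON example, or raise ValueError."""
    if not store_urls:
        raise ValueError("startUrls is required and must not be empty")
    for url in store_urls:
        parts = urlparse(url)
        if parts.scheme not in ("http", "https") or not parts.netloc:
            raise ValueError(f"not an absolute http(s) URL: {url!r}")
    # The minimum of 1 is an assumption; only the maximum is documented.
    if not 1 <= limit <= MAX_LIMIT:
        raise ValueError(f"limit must be between 1 and {MAX_LIMIT}")
    return {
        "startUrls": [{"url": url} for url in store_urls],
        "proxyConfiguration": {"useApifyProxy": use_apify_proxy},
        "limit": limit,
        "featured": featured,
        "sale": sale,
        "devNoStrip": dev_no_strip,
    }
```

Calling `build_input(["https://woocommerce.com"])` produces the same structure as the JSON example, and `json.dumps` can serialize the result for pasting into the input editor.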

Output

JSON Example

{
  "url": "https://woocommerce.com/products/example/",
  "id": 12345,
  "name": "Example Product",
  "slug": "example-product",
  "type": "simple",
  "sku": "EX-001",
  "on_sale": false,
  "prices": {
    "price": "29.99",
    "regular_price": "29.99",
    "sale_price": "0",
    "currency_code": "USD",
    "currency_symbol": "$",
    "currency_prefix": "$"
  },
  "average_rating": "4.5",
  "review_count": 10,
  "images": [...],
  "categories": [...],
  "tags": [...],
  "is_in_stock": true,
  "store": "https://woocommerce.com"
}

Output Fields

  • url: Product page URL
  • id: Product ID
  • name: Product name
  • slug: Product slug
  • type: Product type (simple, variable, etc.)
  • sku: Product SKU
  • on_sale: Whether product is on sale
  • prices: Price information including currency details
  • average_rating: Average customer rating
  • review_count: Number of reviews
  • images: Array of product images
  • categories: Product categories
  • tags: Product tags
  • is_in_stock: Stock availability
  • store: Source store URL
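
As an illustration of working with these fields, the sketch below flattens one output item into a compact summary. It assumes price strings are plain decimals, as in the JSON example. If a store reports prices in another form, the parsing would need adjusting. `summarize` and the `discount_percent` key are hypothetical names, not part of the actor's output.

```python
from decimal import Decimal


def summarize(item):
    """Reduce one scraped product to the fields a price report typically needs."""
    prices = item.get("prices", {})
    regular = Decimal(prices.get("regular_price", "0"))
    current = Decimal(prices.get("price", "0"))
    discount = Decimal("0")
    # Compute a discount only when the item is flagged as on sale.
    if item.get("on_sale") and regular > 0:
        discount = ((regular - current) / regular * 100).quantize(Decimal("0.1"))
    return {
        "store": item.get("store"),
        "id": item.get("id"),
        "name": item.get("name"),
        "price": current,
        "currency": prices.get("currency_code"),
        "discount_percent": discount,
        "in_stock": bool(item.get("is_in_stock")),
    }
```

`Decimal` is used rather than `float` so that prices such as "29.99" compare and subtract exactly.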

🚀 How to Use the Actor (via Apify Console)

  1. Log in at https://console.apify.com and go to Actors.
  2. Find your actor (woocommerce-scraper) and click it.
  3. Configure inputs:
    • Add store URLs in the startUrls field
    • Optionally configure proxy settings
    • Set product limit and filters if needed
  4. Click Start to run the actor.
  5. Monitor logs in real time to see progress.
  6. Access results in the OUTPUT tab once scraping completes.
  7. Export results to JSON or CSV.
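
The same run can also be started from code instead of the Console. The sketch below assumes the official `apify-client` Python package and an API token in an `APIFY_TOKEN` environment variable. The actor ID shown is a placeholder, since the actor's full ID is not given on this page.

```python
import os


def run_scraper(client, actor_id, run_input):
    """Start the actor, wait for it to finish, and return its dataset items."""
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("the actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


def main():
    # Imported here so the helper above works without the package installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    items = run_scraper(
        client,
        "<username>/woocommerce-scraper",  # placeholder: replace with the real actor ID
        {"startUrls": [{"url": "https://woocommerce.com"}], "limit": 100},
    )
    print(f"fetched {len(items)} products")
```

Calling `main()` starts a real run and incurs platform usage, so it is left uncalled here.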

Best Use Cases

  • E-commerce Research: Analyze product catalogs from WooCommerce stores
  • Price Monitoring: Track product prices across multiple stores
  • Product Comparison: Compare products from different WooCommerce stores
  • Market Analysis: Gather product data for market research
  • Inventory Management: Export product data for inventory systems
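
For the price-monitoring case, one possible approach is to export the results of two runs and compare them. The sketch below is an assumption about how that might be done, not a feature of the actor. `price_changes` is a hypothetical helper that matches products on the `store` and `id` output fields and treats `prices.price` as a decimal string.

```python
from decimal import Decimal


def price_changes(previous, current):
    """Return (store, id, old_price, new_price) for products whose price changed."""
    old = {(p["store"], p["id"]): Decimal(p["prices"]["price"]) for p in previous}
    changes = []
    for item in current:
        key = (item["store"], item["id"])
        new_price = Decimal(item["prices"]["price"])
        # Products missing from the earlier run are skipped, not reported.
        if key in old and old[key] != new_price:
            changes.append((key[0], key[1], old[key], new_price))
    return changes
```

Keying on both `store` and `id` avoids collisions when several stores happen to reuse the same product ID.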

Frequently Asked Questions

Q: Does this work with all WooCommerce stores?
A: Yes, as long as the store has the WooCommerce REST API enabled (which is standard for WooCommerce stores).

Q: What happens if a store blocks my requests?
A: The actor starts without a proxy and falls back automatically if requests are blocked: first to a datacenter proxy, then to a residential proxy with retries.

Q: Can I scrape private or password-protected stores?
A: No, this actor only works with publicly accessible WooCommerce REST API endpoints.

Q: How many products can I scrape?
A: The limit is configurable up to 1000 products per store. For larger catalogs, you may need to run multiple times with different filters.

Q: Will the actor save data if it crashes?
A: Yes, data is saved incrementally as it's scraped, so you won't lose progress if the actor stops.
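
The idea behind incremental saving can be illustrated with a short sketch. This is not the actor's implementation: it simply appends each item to a local JSON Lines file as soon as it is produced, and `save_incrementally`, `load_saved`, and the file format are assumptions chosen for illustration.

```python
import json


def save_incrementally(items, path):
    """Append each item to a JSON Lines file as soon as it is produced."""
    saved = 0
    with open(path, "a", encoding="utf-8") as out:
        for item in items:
            out.write(json.dumps(item) + "\n")
            out.flush()  # push the line out before fetching the next item
            saved += 1
    return saved


def load_saved(path):
    """Read back whatever was written, even after an interrupted run."""
    with open(path, encoding="utf-8") as src:
        return [json.loads(line) for line in src if line.strip()]
```

Because every item is written and flushed before the next one is fetched, an exception partway through the scrape leaves all earlier items readable with `load_saved`.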

Support and Feedback

For issues, questions, or feedback, please contact support through the Apify platform.

Cautions

  • Data is collected only from publicly available sources.
  • No data is taken from private accounts or password-protected content.
  • The end user is responsible for ensuring legal compliance (spam laws, privacy, data protection, etc.).
  • Respect the target website's robots.txt and terms of service.