Ecommerce Scraper — Shopify, WooCommerce, Generic (MCP)
Pricing
from $3.00 / 1,000 product scrapeds
Ecommerce Scraper — Shopify, WooCommerce, Generic (MCP)
MCP-ready ecommerce scraper. Extract products, prices, discounts, variants, stock & images from any Shopify, WooCommerce, or generic store. No login, no API key. Works with Claude and ChatGPT via Apify MCP.
Pricing
from $3.00 / 1,000 product scrapeds
Rating
0.0
(0)
Developer
Khadin Akbar
Actor stats
0
Bookmarked
3
Total users
2
Monthly active users
6 days ago
Last modified
Categories
Share
Ecommerce Store Scraper — Shopify, WooCommerce, Any Store (MCP-Ready)
What does this actor do?
This is a universal ecommerce product scraper. Point it at any online store — Shopify, WooCommerce, or a generic ecommerce site — and it returns a clean JSON dataset of products with prices, discounts, stock, brand, ratings, images, and variants. No login, no API keys, no cookies.
It is designed as an MCP server tool so that Claude, ChatGPT, and other AI agents can call it through the Apify MCP integration for price monitoring, catalog auditing, and research.
Why use this actor?
- Zero setup — paste a store homepage, get products. No API keys, no login.
- Auto platform detection — Shopify, WooCommerce, and generic stores handled in one actor.
- Rich structured output — price, original price, discount %, stock, SKU, brand, category, tags, images, rating, review count, and variants (sizes / colors / styles).
- MCP-native — flat JSON fields, token-efficient payload, stable keys across runs. Works out of the box with Claude via
mcp.apify.com. - Fast — Shopify stores use the public
/products.jsonAPI (up to 250 products per call). WooCommerce stores use the public Store REST API (/wp-json/wc/store/v1/products).
What data does it extract?
| Field | Type | Description |
|---|---|---|
title | string | Full product name |
price | number | Current selling price |
original_price | number | null | Pre-sale price (if on discount) |
currency | string | null | ISO 4217 currency code (USD, EUR, GBP, ...) |
discount_percentage | number | null | % discount from original price |
in_stock | boolean | Availability flag |
sku | string | null | Stock keeping unit |
brand | string | null | Brand or vendor |
category | string | null | Product type or category |
tags | string[] | Tags assigned by the store |
images | string[] | Up to 10 product image URLs |
rating | number | null | Average star rating (0–5) |
review_count | integer | null | Total reviews |
variants | array | Size / color / style entries with their own price, SKU, and availability |
platform | string | Detected platform: shopify, woocommerce, or generic |
url | string | Canonical product URL |
source_url | string | URL passed in by the user |
scraped_at | string | ISO 8601 extraction timestamp |
Pricing (pay-per-event)
| Event | Price | When charged |
|---|---|---|
apify-actor-start | $0.005 | Once per run (covers discovery + platform detection) |
product-scraped | $0.003 | Per product returned |
Typical run cost:
| Products returned | Total cost |
|---|---|
| 50 | ~$0.16 |
| 500 | ~$1.51 |
| 5,000 | ~$15.01 |
Apify platform compute + proxy costs are billed separately and are typically cents per run.
Supported platforms
- Shopify — uses
/products.json(public endpoint). Returns up to 250 products per API call. - WooCommerce — uses the public Store REST API (
/wp-json/wc/store/v1/products). Works for any WooCommerce store where products have public visibility. - Generic / other — falls back to schema.org
ProductJSON-LD plus CSS selectors. Covers BigCommerce, Magento, custom builds, and most other platforms.
How to run
Step 1 — Provide a store URL
Paste one or more store homepage or product URLs:
https://allbirds.comhttps://gymshark.comhttps://www.examplewoostore.comhttps://brand.com/products/some-product
Step 2 — Set a product limit
maxProducts controls how many records the run returns. Defaults to 50. Set higher for full catalog extraction.
Step 3 — Run and export
Hit Start. Watch products stream into the Output tab. Export as JSON, CSV, Excel, or pipe directly into Google Sheets via the Apify integration.
Input parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
startUrls | array | [{url: "https://allbirds.com"}] | Store homepage or product URLs |
maxProducts | integer | 50 | Max products returned per run |
includeVariants | boolean | true | Include size / color / style variants |
includeDescription | boolean | true | Include full product description text |
proxyConfiguration | object | Apify Residential | Proxy settings (advanced) |
Output example
{"url": "https://allbirds.com/products/mens-wool-runners","title": "Men's Wool Runners","description": "The shoe that started it all. Made with ZQ-certified merino wool.","price": 110.0,"original_price": null,"currency": null,"discount_percentage": null,"in_stock": true,"sku": "WR-M-NVY-9","brand": "Allbirds","category": "Shoes","tags": ["wool", "sustainable", "running"],"images": ["https://cdn.shopify.com/.../wool-runner.jpg"],"rating": 4.7,"review_count": 3241,"variants": [{ "title": "Size 9 / Navy", "price": 110.0, "original_price": null, "sku": "WR-M-NVY-9", "available": true }],"platform": "shopify","source_url": "https://allbirds.com","scraped_at": "2026-04-24T02:52:13.000Z"}
Use cases
- Price monitoring — track competitor prices and discounts over time.
- Catalog audit — inventory your own or a competitor's product catalog.
- AI training data — feed structured product data into LLMs, recommendation systems, or embeddings.
- Market research — understand assortment, positioning, and pricing in a niche.
- Inventory intelligence — monitor which products go in and out of stock.
Using this actor with Claude / ChatGPT (MCP)
This actor is MCP-ready. Add it to Claude via mcp.apify.com?actors=khadinakbar/ecommerce-store-scraper or enable the Apify MCP server in your client and Claude will call it as apify--ecommerce-store-scraper. Typical prompts:
- "Scrape allbirds.com and give me the five cheapest products."
- "Compare prices for running shoes across allbirds.com and vessi.com."
- "Pull the last 200 products from this Shopify store and tell me which are on sale."
FAQ
Does it need a proxy? Residential proxy is on by default. Most Shopify and WooCommerce stores work without it, but it's required for some anti-bot-protected sites.
Does it need a login or API key? No. The actor only uses publicly available data and APIs.
How fast is it?
Shopify: a few seconds per 250 products via the /products.json API. WooCommerce: up to 100 products per API call. Generic stores: one HTTP request per product page.
What if the store blocks scraping? Try enabling / switching proxy groups. If the target site aggressively rate-limits, increase actor memory and reduce concurrency.
Why is my price null?
Some stores lazy-load prices via JavaScript. For those, use the generic path with a product-level URL instead of the store homepage, and the JSON-LD extractor will pick up the price.
Related actors
- shopify-all-in-one-scraper — Shopify-only deep scraper with reviews.
- amazon-product-intelligence — Amazon product data.
- etsy-all-in-one-scraper — Etsy products and reviews.
- google-shopping-scraper — cross-store product search via Google Shopping.
Legal disclaimer
This actor is intended for lawful data collection from publicly available sources. Users are responsible for compliance with applicable laws, each target website's terms of service, and data protection regulations (GDPR, CCPA, etc.). Do not use this actor to collect personal data or violate intellectual property rights.
Works great with
- Apify Google Sheets integration — export directly to Google Sheets.
- Make / Zapier — trigger workflows when new products are found.
- Claude AI via MCP — ask Claude to analyze products and find deals.