Ecommerce Scraper — Shopify, WooCommerce, Generic (MCP) avatar

Ecommerce Scraper — Shopify, WooCommerce, Generic (MCP)

Pricing

from $3.00 / 1,000 product scrapeds

Go to Apify Store
Ecommerce Scraper — Shopify, WooCommerce, Generic (MCP)

Ecommerce Scraper — Shopify, WooCommerce, Generic (MCP)

MCP-ready ecommerce scraper. Extract products, prices, discounts, variants, stock & images from any Shopify, WooCommerce, or generic store. No login, no API key. Works with Claude and ChatGPT via Apify MCP.

Pricing

from $3.00 / 1,000 product scrapeds

Rating

0.0

(0)

Developer

Khadin Akbar

Khadin Akbar

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

2

Monthly active users

6 days ago

Last modified

Share

Ecommerce Store Scraper — Shopify, WooCommerce, Any Store (MCP-Ready)

What does this actor do?

This is a universal ecommerce product scraper. Point it at any online store — Shopify, WooCommerce, or a generic ecommerce site — and it returns a clean JSON dataset of products with prices, discounts, stock, brand, ratings, images, and variants. No login, no API keys, no cookies.

It is designed as an MCP server tool so that Claude, ChatGPT, and other AI agents can call it through the Apify MCP integration for price monitoring, catalog auditing, and research.

Why use this actor?

  • Zero setup — paste a store homepage, get products. No API keys, no login.
  • Auto platform detection — Shopify, WooCommerce, and generic stores handled in one actor.
  • Rich structured output — price, original price, discount %, stock, SKU, brand, category, tags, images, rating, review count, and variants (sizes / colors / styles).
  • MCP-native — flat JSON fields, token-efficient payload, stable keys across runs. Works out of the box with Claude via mcp.apify.com.
  • Fast — Shopify stores use the public /products.json API (up to 250 products per call). WooCommerce stores use the public Store REST API (/wp-json/wc/store/v1/products).

What data does it extract?

FieldTypeDescription
titlestringFull product name
pricenumberCurrent selling price
original_pricenumber | nullPre-sale price (if on discount)
currencystring | nullISO 4217 currency code (USD, EUR, GBP, ...)
discount_percentagenumber | null% discount from original price
in_stockbooleanAvailability flag
skustring | nullStock keeping unit
brandstring | nullBrand or vendor
categorystring | nullProduct type or category
tagsstring[]Tags assigned by the store
imagesstring[]Up to 10 product image URLs
ratingnumber | nullAverage star rating (0–5)
review_countinteger | nullTotal reviews
variantsarraySize / color / style entries with their own price, SKU, and availability
platformstringDetected platform: shopify, woocommerce, or generic
urlstringCanonical product URL
source_urlstringURL passed in by the user
scraped_atstringISO 8601 extraction timestamp

Pricing (pay-per-event)

EventPriceWhen charged
apify-actor-start$0.005Once per run (covers discovery + platform detection)
product-scraped$0.003Per product returned

Typical run cost:

Products returnedTotal cost
50~$0.16
500~$1.51
5,000~$15.01

Apify platform compute + proxy costs are billed separately and are typically cents per run.

Supported platforms

  • Shopify — uses /products.json (public endpoint). Returns up to 250 products per API call.
  • WooCommerce — uses the public Store REST API (/wp-json/wc/store/v1/products). Works for any WooCommerce store where products have public visibility.
  • Generic / other — falls back to schema.org Product JSON-LD plus CSS selectors. Covers BigCommerce, Magento, custom builds, and most other platforms.

How to run

Step 1 — Provide a store URL

Paste one or more store homepage or product URLs:

https://allbirds.com
https://gymshark.com
https://www.examplewoostore.com
https://brand.com/products/some-product

Step 2 — Set a product limit

maxProducts controls how many records the run returns. Defaults to 50. Set higher for full catalog extraction.

Step 3 — Run and export

Hit Start. Watch products stream into the Output tab. Export as JSON, CSV, Excel, or pipe directly into Google Sheets via the Apify integration.

Input parameters

ParameterTypeDefaultDescription
startUrlsarray[{url: "https://allbirds.com"}]Store homepage or product URLs
maxProductsinteger50Max products returned per run
includeVariantsbooleantrueInclude size / color / style variants
includeDescriptionbooleantrueInclude full product description text
proxyConfigurationobjectApify ResidentialProxy settings (advanced)

Output example

{
"url": "https://allbirds.com/products/mens-wool-runners",
"title": "Men's Wool Runners",
"description": "The shoe that started it all. Made with ZQ-certified merino wool.",
"price": 110.0,
"original_price": null,
"currency": null,
"discount_percentage": null,
"in_stock": true,
"sku": "WR-M-NVY-9",
"brand": "Allbirds",
"category": "Shoes",
"tags": ["wool", "sustainable", "running"],
"images": ["https://cdn.shopify.com/.../wool-runner.jpg"],
"rating": 4.7,
"review_count": 3241,
"variants": [
{ "title": "Size 9 / Navy", "price": 110.0, "original_price": null, "sku": "WR-M-NVY-9", "available": true }
],
"platform": "shopify",
"source_url": "https://allbirds.com",
"scraped_at": "2026-04-24T02:52:13.000Z"
}

Use cases

  • Price monitoring — track competitor prices and discounts over time.
  • Catalog audit — inventory your own or a competitor's product catalog.
  • AI training data — feed structured product data into LLMs, recommendation systems, or embeddings.
  • Market research — understand assortment, positioning, and pricing in a niche.
  • Inventory intelligence — monitor which products go in and out of stock.

Using this actor with Claude / ChatGPT (MCP)

This actor is MCP-ready. Add it to Claude via mcp.apify.com?actors=khadinakbar/ecommerce-store-scraper or enable the Apify MCP server in your client and Claude will call it as apify--ecommerce-store-scraper. Typical prompts:

  • "Scrape allbirds.com and give me the five cheapest products."
  • "Compare prices for running shoes across allbirds.com and vessi.com."
  • "Pull the last 200 products from this Shopify store and tell me which are on sale."

FAQ

Does it need a proxy? Residential proxy is on by default. Most Shopify and WooCommerce stores work without it, but it's required for some anti-bot-protected sites.

Does it need a login or API key? No. The actor only uses publicly available data and APIs.

How fast is it? Shopify: a few seconds per 250 products via the /products.json API. WooCommerce: up to 100 products per API call. Generic stores: one HTTP request per product page.

What if the store blocks scraping? Try enabling / switching proxy groups. If the target site aggressively rate-limits, increase actor memory and reduce concurrency.

Why is my price null? Some stores lazy-load prices via JavaScript. For those, use the generic path with a product-level URL instead of the store homepage, and the JSON-LD extractor will pick up the price.

This actor is intended for lawful data collection from publicly available sources. Users are responsible for compliance with applicable laws, each target website's terms of service, and data protection regulations (GDPR, CCPA, etc.). Do not use this actor to collect personal data or violate intellectual property rights.

Works great with

  • Apify Google Sheets integration — export directly to Google Sheets.
  • Make / Zapier — trigger workflows when new products are found.
  • Claude AI via MCP — ask Claude to analyze products and find deals.