Shopify Store Scraper
Pricing
$19.99/month + usage
Shopify Store Scraper
๐๏ธ Shopify Store Scraper extracts product data from any Shopify store โ titles, prices, variants, SKUs, images, descriptions, inventory & collections. ๐ Ideal for competitor research, price tracking, SEO, and catalog builds. ๐ Fast, scalable, CSV/JSON exports.
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer
ScrapeMesh
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
5 days ago
Last modified
Categories
Share
Shopify Store Scraper
The Shopify Store Scraper is an Apify actor that extracts structured data from public Shopify storefronts โ products, collections, themes, installed apps, and optional Yotpo reviews โ in one run. It solves the pain of manual research by acting as a scalable Shopify product scraper and Shopify store data extractor for analysis, pricing, and catalog ops. Built for marketers, developers, data analysts, and researchers, this Shopify scraper tool lets you scrape Shopify products at scale and turn them into clean, exportable datasets.
What data / output can you get?
Below are real output fields the actor saves to the Apify dataset. Each item represents one store.
| Field | Description | Example value |
|---|---|---|
| store_domain | Store domain name without protocol | "www.allbirds.com" |
| store_url | Full store URL | "https://www.allbirds.com" |
| shop_name | Shopify shop name (from meta or page) | "shop11044168.myshopify.com" |
| theme_name | Active theme name parsed from HTML | "rc-cm-2025-12-26_20-35 Update" |
| apps_installed | Detected apps in page HTML | ["Elevar Conversion Tracking","JsEncrypt"] |
| apps_count | Total count of detected apps | 2 |
| total_products | Number of products fetched via /products.json | 681 |
| total_variants | Total count of product variants | 7276 |
| collections_count | Number of collections fetched via /collections.json | 250 |
| price_min | Minimum variant price across catalog | 2.0 |
| price_avg | Average variant price across catalog | 79.34 |
| total_variants_available | Count of variants with available = true | 5647 |
| inventory_availability_pct | Percent of variants in stock | 77.6 |
| top_products | Array of newest products with title, url, price, reviews, rating | [{ "title": "...", "url": "...", "price": 8.0, "reviews": 0, "rating": 0.0 }] |
| detailed_reviews | Present only if extractReviews is true (Yotpo) | [{ "product_url": "...", "reviews_count": 0, "avg_rating": 0.0, "reviews": [] }] |
| scraped_at | ISO timestamp of when scraping finished | "2025-12-29T07:34:23.715157Z" |
| scrape_duration_sec | Total scrape time in seconds | 15.71 |
| api_blocked | True if no products were retrieved | false |
Bonus fields include apps_by_category, collections (titles), categories, vendors, theme_id, theme_version, products_with_images, avg_images_per_product, products_with_description, and more. You can export results from the Apify dataset as JSON or CSV for easy analysis and pipeline integration.
Key features
-
๐ฆ Optional smart proxy fallback (when enabled) When Apify Proxy is enabled via proxyConfiguration, requests automatically escalate from no proxy โ datacenter โ residential with retries for better success on rate limits and blocks.
-
๐ฆ Full product & collection coverage Uses Shopifyโs public JSON endpoints (/products.json, /collections.json, /meta.json) to fetch catalogs, then enriches with theme and app signals from HTML.
-
๐งฎ Pricing & inventory analytics Computes price_min, price_avg, price_max and inventory_availability_pct at the store level for quick benchmarking and analytics.
-
โญ Yotpo review extraction for top products If extractReviews is true, attempts Yotpo review collection for the top 10 newest products using the public widget API (author, rating, title, content, created_at).
-
๐พ Live dataset saving Pushes each storeโs output to the Apify dataset as itโs scraped, so you donโt lose data mid-run.
-
๐ก๏ธ Robust error reporting If a store fails, an error object with store_url, error, and scraped_at is saved to the dataset for transparency and troubleshooting.
-
๐ฉโ๐ป Developer-friendly Built on the Apify SDK with async HTTP and clean JSON output, ideal for integrations, pipelines, and the Shopify store data scraper API pattern.
-
๐ Scales for batch inputs Add multiple store URLs in startUrls to run sequentially across many targets without manual effort โ perfect for a Shopify store crawler workflow.
How to use Shopify Store Scraper - step by step
- Sign in to Apify Console and go to Actors.
- Open the โshopify-store-scraperโ actor by scrapemesh.
- Add input:
- Provide one or more Shopify URLs in startUrls. The actor accepts:
- Plain strings: "https://www.allbirds.com"
- Or objects with a url key via API: { "url": "https://example-store.myshopify.com" }
- Provide one or more Shopify URLs in startUrls. The actor accepts:
- Configure settings:
- extractReviews: true/false to enable Yotpo review extraction for the top 10 newest products.
- proxyConfiguration: set useApifyProxy to true if you want proxy fallback (datacenter โ residential).
- Start the run. The actor will fetch meta, collections, and all products via public JSON endpoints and parse theme/app signals from the homepage.
- Monitor logs. Youโll see pagination progress, proxy fallback events (if enabled), and review extraction status.
- Get your data in the OUTPUT tab. Export to JSON or CSV or fetch via the Apify API for automation.
Pro Tip: For bigger batches and hands-off workflows, call the actor via the Apify API and stream results into your analytics stack as a Shopify product feed scraper.
Use cases
| Use case name | Description |
|---|---|
| Competitor price tracking | Monitor price_min, price_avg, and price_max across rivals using this Shopify price scraper to quantify changes over time. |
| Catalog & feed enrichment | Build a Shopify product import scraper to fill product titles, variants, images, and vendors for merchandising and feeds. |
| Inventory availability monitoring | Track inventory_availability_pct to spot stockouts and availability trends with a Shopify inventory scraper. |
| Market research & category analysis | Analyze categories, vendors, and collections at scale with a Shopify store crawler to map assortment and positioning. |
| Review benchmarking | Use extractReviews to collect Yotpo ratings for the newest products and benchmark review volume and sentiment. |
| API pipeline integration | Plug the Shopify store data scraper API into data warehouses or BI tools using Apifyโs dataset and API endpoints. |
| SEO & content ops | Audit products_with_description and avg_images_per_product to prioritize SEO and PDP content improvements. |
Why choose Shopify Store Scraper?
This production-ready Shopify store scraping software focuses on structured outputs, stability, and developer-friendly integration.
- ๐ฏ Accurate, structured outputs taken directly from Shopifyโs public JSON and storefront HTML
- ๐ Proxy fallback when enabled, escalating to residential if blocked for resilient scraping
- ๐งฐ Developer access via Apify datasets and API for easy integration into pipelines
- ๐ธ Cost-effective with a flat monthly price and trial minutes for testing before scaling
- ๐งฉ Works better than browser extensions โ no manual browsing, no fragile DOM workflows
- ๐ก๏ธ Ethical by design โ targets public endpoints only, leaving private data untouched
- ๐ Built for scale โ add many stores and export results in JSON or CSV with ease
In short: a reliable Shopify scraper tool vs. unstable alternatives โ optimized for data quality and automation.
Is it legal / ethical to use Shopify Store Scraper?
Yes, when used responsibly. The actor collects data from publicly available Shopify endpoints (/products.json, /collections.json, /meta.json) and storefront HTML. It does not access private accounts or authenticated content.
Guidelines for compliant use:
- Scrape only public data and respect each storeโs terms of service.
- Ensure adherence to data protection laws (e.g., GDPR, CCPA) and local regulations.
- Avoid collecting personal or sensitive information.
- Consult your legal team for edge cases and intended use.
Input parameters & output format
JSON input example
{"startUrls": ["https://www.allbirds.com","https://example-store.myshopify.com"],"extractReviews": true,"proxyConfiguration": {"useApifyProxy": true}}
Parameters
- startUrls (array, required): List one or more Shopify store URLs (e.g., https://www.allbirds.com).
- Default: none (required)
- extractReviews (boolean, optional): Enable to extract detailed product reviews from Yotpo for the top 10 newest products.
- Default: true
- proxyConfiguration (object, optional): Choose which proxies to use. If Shopify rejects the proxy, a residential proxy will be used as a fallback (when Apify Proxy is enabled).
- Default: none (prefill in UI has "useApifyProxy": false)
JSON output example (successful store)
{"store_domain": "www.allbirds.com","store_url": "https://www.allbirds.com","shop_name": "shop11044168.myshopify.com","country": "US","currency": "USD","locale": "en-US","theme_name": "rc-cm-2025-12-26_20-35 Update","theme_id": 128596836432,"theme_version": "1.131.2","apps_installed": ["Elevar Conversion Tracking", "JsEncrypt"],"apps_by_category": {"Analytics": ["Elevar Conversion Tracking"],"Other": ["JsEncrypt"]},"apps_count": 2,"total_products": 681,"total_variants": 7276,"collections": ["Accessories", "Shoes", "Apparel"],"collections_count": 250,"categories": ["Accessories", "Apparel", "Shoes", "Socks"],"vendors": ["Allbirds"],"price_min": 2.0,"price_avg": 79.34,"price_max": 500.0,"products_on_sale": 681,"discount_pct_avg": null,"products_with_reviews": 0,"review_coverage_pct": 0,"avg_reviews_per_product": null,"avg_rating": null,"total_estimated_sales": null,"avg_sales_per_product": null,"total_variants_available": 5647,"inventory_availability_pct": 77.6,"estimated_launch_date": "2018-11-12","oldest_product_date": "2018-11-12","newest_product_date": "2025-11-06","products_with_images": 680,"avg_images_per_product": 4.5,"products_with_description": 681,"top_products": [{"title": "Allbirds Laces - Warm Red - Round","url": "https://www.allbirds.com/products/allbirds-laces-warm-red-round","price": 8.0,"reviews": 0,"rating": 0.0}],"detailed_reviews": [{"product_url": "https://www.allbirds.com/products/allbirds-laces-warm-red-round","product_id": "7242343809104","reviews_count": 0,"avg_rating": 0.0,"reviews": []}],"scraped_at": "2025-12-29T07:34:23.715157Z","scrape_duration_sec": 15.71,"api_blocked": false}
Note:
- detailed_reviews appears only when extractReviews is true and Yotpo data is found.
- Several analytics keys (e.g., discount_pct_avg, avg_rating, total_estimated_sales) may be null by design if not computed from public data.
- On failures, the actor pushes error objects to the dataset for transparency.
Example error outputs:
- Missing/invalid input
{"error": "No valid URLs found in startUrls. Please provide at least one Shopify store URL in the input.","input_received": {},"scraped_at": "2025-12-29T07:34:23.715157Z"}
- Per-store failure
{"store_url": "https://example-store.myshopify.com","error": "Failed to fetch homepage from https://example-store.myshopify.com. All request attempts failed.","scraped_at": "2025-12-29T07:34:23.715157Z"}
FAQ
Is there a free trial or pricing?
Yes. The Apify listing provides 120 trial minutes and a flat monthly price of $19.99 for this actor. You can test within the trial minutes and upgrade when youโre ready to scale.
Do I need to enable a proxy?
No, a proxy is optional. The actor first tries direct requests. If you enable proxyConfiguration with useApifyProxy: true, it will automatically fall back from datacenter to residential proxies when blocked for more reliable scraping.
Can I scrape multiple Shopify stores in one run?
Yes. Provide multiple URLs in startUrls and the actor will process them sequentially, pushing each storeโs data to the dataset as it completes.
Does it extract product reviews?
Yes, optionally. If extractReviews is true, the actor attempts to fetch Yotpo reviews for the top 10 newest products, returning counts, average ratings, and individual reviews (author, rating, title, content, created_at) when available.
Is this a Chrome extension?
No. This is an Apify actor (server-side), not a Shopify store scraper Chrome extension. It runs in the cloud, uses public Shopify endpoints, and exports clean JSON/CSV without manual browsing.
What data formats can I export?
Results are saved to an Apify dataset. You can download data as JSON or CSV directly from the console, or fetch it programmatically via the Apify API for pipelines and reporting.
How long does a run take?
It depends on the size of the store and whether reviews are enabled. Each output includes scrape_duration_sec so you can monitor performance across runs.
Which Shopify endpoints does it use?
The actor relies on public Shopify endpoints: /products.json (with pagination), /collections.json, and /meta.json, plus storefront HTML for theme/app signals.
Closing CTA / Final thoughts
Shopify Store Scraper is built to turn public Shopify storefronts into structured, analysis-ready data. With optional smart proxy fallback, clean JSON/CSV exports, and Yotpo review support, itโs ideal for marketers, developers, analysts, and researchers who need a dependable Shopify competitor product scraper and Shopify store data extractor. Run it in the Apify Console, or integrate via the Apify API to automate a Shopify product feed scraper pipeline. Start extracting smarter product, pricing, and inventory insights at scale today.