Pricing

$19.99/month + usage

Shopify Products Scraper

🛍️ Shopify Products Scraper extracts titles, prices, variants, images, SKUs, tags & descriptions from any Shopify store. 📦 Supports collections & inventory. 📊 Export CSV/JSON for catalog builds, competitor analysis & SEO. ⚡ Fast, reliable, no coding required.

Developer

Scraply

Last modified

8 days ago

Shopify Products Scraper

Shopify Products Scraper is an Apify actor that discovers product pages on Shopify stores and fetches each product’s .json endpoint to extract structured data such as titles, vendors, product types, prices, and variants. It replaces manual cataloging: this Shopify product scraper and data extractor finds product URLs automatically and preserves the complete Shopify product JSON for analysis and export. Built for marketers, developers, data analysts, and researchers, it can process multiple stores in one run and falls back through proxies when requests are blocked.

What data / output can you get?

Below are the exact fields pushed to the Apify dataset during a run. Each record represents a single product and includes both summary fields and the full Shopify product JSON.

Data field	Description	Example value
store_url	The base Shopify store URL processed for this product	https://lootcrate.com
product_url	The public product page URL	https://lootcrate.com/products/loot-crate
json_url	The .json endpoint used for extraction	https://lootcrate.com/products/loot-crate.json
product_id	Shopify product ID from the JSON	5083963261059
title	Product title from the JSON	Loot Crate
vendor	Vendor/brand from the JSON	Loot Crate Core
product_type	Product type from the JSON	Subscription Box
price	Price from the first variant (if available)	29.99
compare_at_price	Compare-at price from the first variant (if available)	24.99
tags	Comma-separated product tags from the JSON	Subscription, Collectibles, Pop Culture
total_found	Count of product URLs discovered for the store in this run	5
successful	Number of products successfully extracted for the store at this point	5
full_data	Full Shopify product JSON object returned by the .json endpoint	{ "product": { ... } }

Notes:

The scraper preserves the entire Shopify product response in full_data, which typically includes variants, images, timestamps, and more. That makes it suitable for variant, product image, and price scraping use cases.
You can export results from the dataset as CSV, JSON, or Excel to support Shopify product CSV export workflows.
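
As an illustration of working with full_data, the sketch below flattens one dataset item into one row per variant and collects image URLs. The helper names flatten_variants and image_urls are hypothetical, and the field names inside full_data are taken from the example output on this page; a real store's JSON may contain more or fewer fields.

```python
from typing import Any, Dict, List


def _product(item: Dict[str, Any]) -> Dict[str, Any]:
    """Return the nested product object, or an empty dict if it is missing."""
    return (item.get("full_data") or {}).get("product") or {}


def flatten_variants(item: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Build one flat row per variant found in item['full_data']."""
    rows = []
    for variant in _product(item).get("variants") or []:
        rows.append({
            "product_id": item.get("product_id"),
            "title": item.get("title"),
            "variant_id": variant.get("id"),
            "variant_title": variant.get("title"),
            "sku": variant.get("sku"),
            "price": variant.get("price"),
            "compare_at_price": variant.get("compare_at_price"),
        })
    return rows


def image_urls(item: Dict[str, Any]) -> List[str]:
    """Collect image URLs, skipping image entries without a 'src'."""
    return [img["src"] for img in _product(item).get("images") or [] if img.get("src")]
```

Rows shaped like this can be written with csv.DictWriter when a per-variant CSV is more useful than the per-product dataset export.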

Key features

🔍 Automatic product discovery
- Scans store HTML for links containing “/products/” and normalizes absolute/relative URLs. If the site is detected as Shopify, it paginates “/products.json” to collect handles reliably.
🧩 Complete JSON preservation
- Fetches each product’s .json endpoint and stores the entire response in full_data for maximum flexibility (variants, images, tags, vendor, product_type, and more).
🔁 Intelligent proxy fallback
- Resilient pipeline with direct → datacenter → residential proxy escalation, including retries and sticky residential usage once activated. Designed to keep scraping through blocks (403/429).
⚡ Concurrent requests at scale
- Asynchronous fetching and controlled concurrency to scrape products across multiple Shopify stores quickly and reliably.
💾 Live dataset saving
- Pushes each product to the dataset in real time, so partial results are saved even if a run is interrupted.
🛡️ Robust error handling
- Network retries, structured proxy switching, and clear logging for monitoring and debugging.
🧪 Store-aware strategy
- Detects Shopify signatures to use “/products.json” pagination where possible; otherwise falls back to HTML link discovery for product URLs.
📈 Practical defaults
- Internally paginates “/products.json?limit=250&page=N” and uses an internal cap to limit total products processed per store in a run.
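
The HTML fallback described above (find links containing “/products/”, normalize relative and absolute URLs) can be sketched roughly as follows. This is not the actor's code: the function names are hypothetical, and dropping query strings and filtering to the store's own host are assumptions made for the example.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urljoin, urlsplit, urlunsplit


class _LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def discover_product_urls(html: str, store_url: str) -> List[str]:
    """Return unique absolute product URLs found in the page HTML."""
    parser = _LinkCollector()
    parser.feed(html)
    store_host = urlsplit(store_url).netloc
    seen, urls = set(), []
    for href in parser.hrefs:
        parts = urlsplit(urljoin(store_url, href))  # resolves relative links
        # Assumption: keep only same-host links whose path contains /products/.
        if parts.netloc != store_host or "/products/" not in parts.path:
            continue
        # Assumption: drop query string and fragment so variants collapse.
        clean = urlunsplit((parts.scheme, parts.netloc, parts.path.rstrip("/"), "", ""))
        if clean not in seen:
            seen.add(clean)
            urls.append(clean)
    return urls


def to_json_url(product_url: str) -> str:
    """Derive the product's .json endpoint from its public page URL."""
    return product_url + ".json"
```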

How to use Shopify Products Scraper - step by step

1. Sign in to Apify Console and go to Actors.
2. Find “shopify-products-scraper” by username “scraply” and open the actor.
3. Add input data:
- In startUrls, provide one or more Shopify store homepages (e.g., https://lootcrate.com).
- Optionally set proxyConfiguration if you want to start with Apify Proxy; otherwise the actor starts with no proxy and falls back automatically if blocked.
4. Start the run. The actor will detect Shopify stores and either paginate products.json or extract product links from HTML.
5. Monitor logs. You’ll see:
- Discovery status (found product URLs)
- Proxy switches (direct → datacenter → residential)
- Per-product success/failure messages
6. Review results:
- Dataset: Each product is saved with store_url, product_url, json_url, and more. Export as CSV, JSON, or Excel.
- Key-Value Store: A grouped “OUTPUT” object summarizing each store with total_found, successful, method, and products (url + json) is stored for programmatic consumption.
7. Export and integrate:
- Use Apify’s dataset exports for BI dashboards or pipelines.
- Automate downloads via the Apify API for scheduled reports or feeds.

Pro Tip: Add multiple store URLs to startUrls to run bulk extraction. If a store forces escalation to the residential proxy, the actor keeps using it for the remaining product requests, which improves throughput and stability for large catalogs.
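
The same steps can also be driven from a script. The sketch below uses only the Python standard library and Apify's run-sync-get-dataset-items endpoint. The actor ID "scraply~shopify-products-scraper" is inferred from the username and actor name above and should be checked in Apify Console; the helper names are hypothetical.

```python
import json
import urllib.request
from typing import Any, Dict, Iterable, List

API_BASE = "https://api.apify.com/v2"


def build_run_input(store_urls: Iterable[str], use_apify_proxy: bool = False) -> Dict[str, Any]:
    """Build the actor input shown in the 'Example JSON input' section."""
    return {
        "startUrls": list(store_urls),
        "proxyConfiguration": {"useApifyProxy": use_apify_proxy},
    }


def build_sync_run_request(actor_id: str, token: str, run_input: Dict[str, Any]) -> urllib.request.Request:
    """Prepare a POST that runs the actor and returns its dataset items."""
    return urllib.request.Request(
        f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items",
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_and_fetch(actor_id: str, token: str, store_urls: Iterable[str]) -> List[Dict[str, Any]]:
    """Run the actor synchronously (network call) and return dataset items."""
    request = build_sync_run_request(actor_id, token, build_run_input(store_urls))
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

A call would look like run_and_fetch("scraply~shopify-products-scraper", token, ["https://lootcrate.com"]). Synchronous endpoints are subject to a time limit, so runs over large catalogs may be better started asynchronously and read from the dataset afterwards.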

Use cases

Use case name	Description
Competitor pricing monitoring	Track current and compare-at pricing across stores to power a shopify price scraper for discount analysis and pricing intelligence.
Catalog building & enrichment	Build product feeds from full_data and export to CSV/JSON for ingestion in PIM/ERP — an end-to-end shopify product feed scraper.
Variant & inventory analysis	Analyze SKUs, variant titles, and inventory-related fields contained in the Shopify product JSON — ideal for a shopify inventory scraper workflow.
SEO & content audits	Use titles, tags, and metadata to study keyword strategy and product taxonomy for SEO research.
Image pipelines	Collect product images from the preserved JSON for creative workflows — a practical shopify product images scraper use case.
Market research	Compare vendors, product types, and tags across multiple Shopify stores for trend and assortment analysis.
API data pipelines	Feed the dataset output into internal APIs or data lakes via the Apify API for automation and reporting.

Why choose Shopify Products Scraper?

Built for precision, scale, and reliability, this Shopify product scraper uses Shopify-native endpoints when available and falls back to HTML discovery otherwise.

🎯 Accurate by design: Extracts directly from product .json endpoints and preserves the full response for flexible downstream use.
⚡ Scale-ready: Concurrent fetching and per-store batching make it fast to scrape shopify products across many domains.
🔐 Resilient networking: Automatic proxy fallback (direct → datacenter → residential) with retries and sticky residential behavior under blocks.
💾 Real-time saving: Writes each product to the dataset as it’s processed to prevent data loss and enable early exports.
🧰 Developer-friendly: Outputs clean JSON with stable field names and stores a grouped “OUTPUT” object in the key-value store.
🧭 Better than extensions: No brittle browser automation — this production-grade shopify product data extractor runs server-side with clear logs.
💸 Export-friendly: Download datasets as CSV, JSON, or Excel for BI, catalog syncs, and feeds without extra tooling.

Is it legal / ethical to use Shopify Products Scraper?

Yes — when used responsibly. This actor collects data from publicly available Shopify pages and does not access private accounts or authenticated areas.

Guidelines:

Scrape only public product pages and metadata.
Review and respect target websites’ terms of service and robots.txt.
Ensure compliance with applicable laws (e.g., GDPR, CCPA).
Avoid misuse; do not employ the tool for unlawful or unethical activities.
Consult your legal team for edge cases or jurisdiction-specific requirements.

Input parameters & output format

Example JSON input

{
  "startUrls": [
    "https://lootcrate.com",
    "https://www.decathlon.com"
  ],
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}

Input fields

startUrls (array, required)
- Description: List one or more Shopify store URLs (e.g., https://lootcrate.com, https://www.decathlon.com). Supports bulk input.
- Default: ["https://lootcrate.com"] (prefill)
proxyConfiguration (object, optional)
- Description: Choose which proxies to use. By default, no proxy is used. If the target store rejects or blocks a request, the actor automatically falls back to a datacenter proxy, then to a residential proxy with 3 retries.
- Default: { "useApifyProxy": false } (prefill)

Example JSON output (single dataset item)

{
  "store_url": "https://lootcrate.com",
  "product_url": "https://lootcrate.com/products/loot-crate",
  "json_url": "https://lootcrate.com/products/loot-crate.json",
  "product_id": 5083963261059,
  "title": "Loot Crate",
  "vendor": "Loot Crate Core",
  "product_type": "Subscription Box",
  "price": "29.99",
  "compare_at_price": "24.99",
  "tags": "Subscription, Collectibles, Pop Culture",
  "total_found": 5,
  "successful": 5,
  "full_data": {
    "product": {
      "id": 5083963261059,
      "title": "Loot Crate",
      "vendor": "Loot Crate Core",
      "product_type": "Subscription Box",
      "handle": "loot-crate",
      "tags": "Subscription, Collectibles, Pop Culture",
      "variants": [
        {
          "id": 34197535719555,
          "title": "S / XS",
          "price": "29.99",
          "compare_at_price": "24.99",
          "sku": "1010126US",
          "inventory_management": "shopify",
          "requires_shipping": true
        }
      ],
      "images": [
        {
          "id": 123456789,
          "src": "https://cdn.shopify.com/..."
        }
      ]
    }
  }
}

Notes:

compare_at_price may be null if not set on the first variant.
tags may be an empty string if the store doesn’t use tags.
full_data contains the raw Shopify product JSON, which can include additional fields like body_html, created_at, updated_at, published_at, and more depending on the store.
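
Downstream code has to tolerate the null and empty values noted above. The following is a minimal sketch with hypothetical helper names; it assumes prices arrive as strings, as in the example output, and treats a compare-at price that is not higher than the price as "no discount".

```python
from decimal import Decimal, InvalidOperation
from typing import Any, Dict, List, Optional


def parse_tags(tags: Optional[str]) -> List[str]:
    """Split the comma-separated tags string; empty or None gives []."""
    if not tags:
        return []
    return [tag.strip() for tag in tags.split(",") if tag.strip()]


def to_decimal(value: Any) -> Optional[Decimal]:
    """Convert a price string to Decimal, returning None if unusable."""
    if value is None or value == "":
        return None
    try:
        return Decimal(str(value))
    except InvalidOperation:
        return None


def discount_percent(item: Dict[str, Any]) -> Optional[Decimal]:
    """Percent off relative to compare_at_price, or None if not applicable."""
    price = to_decimal(item.get("price"))
    compare = to_decimal(item.get("compare_at_price"))
    if price is None or compare is None or compare <= price:
        return None
    return ((compare - price) / compare * 100).quantize(Decimal("0.1"))
```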

FAQ

Do I need to log in or provide cookies to scrape a store?

No. The actor fetches public pages and product .json endpoints without login or cookies. It runs server-side, with retries and proxy fallback handling network failures.

Can I scrape multiple Shopify stores in one run?

Yes. Add several stores to the startUrls array. The actor processes each store in turn, discovers or enumerates its products, and saves one dataset item per product, so a single run can cover many Shopify stores.

What product data is included in the output?

Each dataset item contains store_url, product_url, json_url, product_id, title, vendor, product_type, price, compare_at_price, tags, total_found, successful, and full_data. The full_data field preserves the complete Shopify product JSON (including variants and images), making it a reliable shopify variant scraper and shopify product images scraper.

How does proxy fallback work if a store blocks requests?

The actor starts with no proxy. On 403/429 blocks, it falls back to a datacenter proxy, and if blocking persists, it escalates to a residential proxy with retries. Once residential is enabled, it remains sticky for remaining requests to maintain reliability.
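
The escalation described above can be modeled as a small state machine. The sketch below is an illustration, not the actor's implementation: ProxyLadder and the injected fetch callable are hypothetical, and interpreting "3 retries" as three residential attempts per request is an assumption.

```python
from typing import Callable, Optional, Tuple

TIERS = ("direct", "datacenter", "residential")
BLOCK_STATUSES = {403, 429}

# fetch(url, tier) -> (status_code, body); supplied by the caller.
Fetch = Callable[[str, str], Tuple[int, Optional[str]]]


class ProxyLadder:
    """Escalates direct -> datacenter -> residential on 403/429 responses."""

    def __init__(self, fetch: Fetch, residential_attempts: int = 3) -> None:
        self.fetch = fetch
        self.residential_attempts = residential_attempts
        self.sticky_residential = False

    def get(self, url: str) -> Tuple[Optional[int], Optional[str], str]:
        """Return (status, body, tier used for the last attempt)."""
        # Once residential has been reached, later requests start there.
        tiers = ("residential",) if self.sticky_residential else TIERS
        status: Optional[int] = None
        for tier in tiers:
            if tier == "residential":
                self.sticky_residential = True
            attempts = self.residential_attempts if tier == "residential" else 1
            for _ in range(attempts):
                status, body = self.fetch(url, tier)
                if status not in BLOCK_STATUSES:
                    return status, body, tier
        # Every tier was blocked; report the last status seen.
        return status, None, tiers[-1]
```

Injecting fetch keeps the escalation logic testable without network access; in practice it would wrap an HTTP client configured with the proxy for the given tier.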

Can I export results to CSV for a shopify product csv export workflow?

Yes. Open the run’s Dataset and export to CSV, JSON, or Excel. This makes it easy to build a shopify product feed scraper pipeline for catalogs, analytics, or SEO.

Is there an API to access results programmatically?

Yes. You can access the run’s Dataset and Key-Value Store (including the grouped “OUTPUT” summary) via the Apify API to integrate with internal systems or automation workflows.
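
For reference, the two reads mentioned above map onto the Apify API's dataset-items and key-value-store record endpoints. The helpers below only build the URLs; their names are hypothetical, and the dataset and store IDs come from the run you want to read. Authentication (for example a token) still has to be added to the actual request.

```python
from typing import Iterable, Optional
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(dataset_id: str, fmt: str = "json",
                      fields: Optional[Iterable[str]] = None) -> str:
    """URL for downloading dataset items as json, csv, xlsx, and so on."""
    params = {"format": fmt}
    if fields:
        # Restrict the export to selected columns, e.g. for a slim CSV.
        params["fields"] = ",".join(fields)
    return f"{API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"


def output_record_url(store_id: str, key: str = "OUTPUT") -> str:
    """URL of the grouped per-store summary in the run's key-value store."""
    return f"{API_BASE}/key-value-stores/{store_id}/records/{key}"
```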

Does it capture variant pricing and images?

Yes. Variant and image data are included within full_data from the Shopify product JSON, enabling shopify price scraper and asset-processing workflows.

Are there any limits during a run?

The actor paginates “/products.json” where available and applies internal limits to the total number of products processed per store in a run. You can monitor progress and totals in the logs and in the total_found and successful fields.
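
To make the limit behavior concrete, here is a rough sketch of paginating “/products.json?limit=250&page=N” under a per-store cap. The actual cap value is not documented here, so max_products=1000 is a placeholder; collect_handles and the injected fetch_json callable are hypothetical names.

```python
from typing import Any, Callable, Dict, List

PAGE_SIZE = 250


def products_page_url(store_url: str, page: int, limit: int = PAGE_SIZE) -> str:
    """Build the paginated products.json URL for a store."""
    return f"{store_url.rstrip('/')}/products.json?limit={limit}&page={page}"


def collect_handles(store_url: str,
                    fetch_json: Callable[[str], Dict[str, Any]],
                    max_products: int = 1000) -> List[str]:
    """Collect product handles page by page until the cap or the last page."""
    handles: List[str] = []
    page = 1
    while len(handles) < max_products:
        data = fetch_json(products_page_url(store_url, page))
        products = data.get("products") or []
        handles.extend(p["handle"] for p in products if p.get("handle"))
        if len(products) < PAGE_SIZE:
            break  # a short or empty page means there is nothing further
        page += 1
    return handles[:max_products]
```

Each handle corresponds to a product page at /products/<handle>, whose .json endpoint is then fetched individually.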

Closing thoughts

Shopify Products Scraper is built for automated, reliable product data extraction from Shopify stores. It discovers product URLs, fetches each product’s .json, and preserves the complete response for downstream use.

Whether you’re a marketer, developer, data analyst, or researcher, you can export clean CSV/JSON feeds, track pricing and variants, or power analytics with the full_data payload. Developers can integrate via the Apify API and automate pipelines end-to-end. Start extracting smarter product insights at scale with a resilient shopify product information scraper that’s production-ready.
