π Shopify Product Scraper β Full Catalogs
Pricing
from $2.00 / 1,000 products
π Shopify Product Scraper β Full Catalogs
Extract product catalogs from any Shopify store β names, prices, descriptions, images, variants. Competitive pricing analysis.
Pricing
from $2.00 / 1,000 products
Rating
0.0
(0)
Developer
Stephan Corbeil
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 hours ago
Last modified
Categories
Share
Shopify Store Product Scraper
Extract product data from any Shopify store with ease. This actor detects Shopify stores and scrapes all products using the public /products.json API endpointβno browser required, no login needed.
Features
- Fast & Lightweight: Uses httpx for direct API accessβno browser overhead
- Shopify Detection: Automatically detects if a domain is a Shopify store
- Complete Product Data: Extracts titles, descriptions, prices, variants, images, inventory, and more
- Bulk Scraping: Process multiple stores in a single run
- Smart Pagination: Handles stores with hundreds or thousands of products
- Flexible Output: Include/exclude variants and images as needed
- Error Handling: Gracefully handles blocked endpoints and non-Shopify sites
- Pricing Extraction: Captures regular prices, compare-at prices, and price ranges
Use Cases
- Competitor Price Monitoring: Track pricing changes across competing Shopify stores
- Market Research: Analyze product catalogs, categories, and vendor distribution
- Product Catalog Analysis: Map product offerings, variants, and inventory
- Drop-shipping Research: Find suppliers and analyze product availability
- SEO & Content Analysis: Extract product descriptions and metadata
- Inventory Tracking: Monitor product availability across multiple stores
How to Use
Input Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
storeUrl | string | Single Shopify store URL (e.g., https://www.allbirds.com) | - |
storeUrls | array | Multiple store URLs for bulk scraping | - |
includeVariants | boolean | Include product variants (sizes, colors, etc.) | true |
includeImages | boolean | Include product image URLs | true |
maxProducts | integer | Maximum products per store (0 = all) | 0 |
Example Input
{"storeUrl": "https://www.allbirds.com","includeVariants": true,"includeImages": true,"maxProducts": 0}
Or for bulk scraping:
{"storeUrls": ["https://www.allbirds.com","https://beardbrand.com","https://shop.gymshark.com"],"includeVariants": true,"includeImages": true}
Output Example
Each product is saved as a dataset item:
{"id": 7234567890,"title": "Classic Runner","handle": "classic-runner","description": "Lightweight and sustainable running shoes made from sugarcane-based foam.","vendor": "Allbirds","product_type": "Shoes","tags": ["womens", "running", "eco-friendly"],"created_at": "2023-01-15T10:30:00Z","updated_at": "2024-03-20T14:22:00Z","published_at": "2023-01-15T10:30:00Z","price_range": {"min": 79.95,"max": 89.95,"currency": "USD"},"compare_at_price_range": {"min": 95.00,"max": 105.00},"variants": [{"id": 112233445566,"title": "Women's Size 5","sku": "CLSRUN-W5","price": 79.95,"compare_at_price": 95.00,"available": true,"inventory_quantity": 42},{"id": 112233445567,"title": "Women's Size 6","sku": "CLSRUN-W6","price": 79.95,"compare_at_price": 95.00,"available": true,"inventory_quantity": 38}],"images": [{"src": "https://cdn.shopify.com/s/files/1/0001/2345/products/runner-1.jpg","alt": "Classic Runner - Front View"},{"src": "https://cdn.shopify.com/s/files/1/0001/2345/products/runner-2.jpg","alt": "Classic Runner - Side View"}],"store_url": "https://www.allbirds.com"}
How It Works
- Shopify Detection: The actor checks for Shopify headers (
x-shopify-stage,powered-by) and attempts to fetch/products.json - Product Fetching: Uses the public API endpoint
/products.json?limit=250&page={n}with pagination - Data Extraction: Processes each product to extract pricing, variants, images, and metadata
- Smart Fallbacks: If
/products.jsonis blocked, tries/collections/all/products.json - Rate Limiting: Includes 200ms delays between requests to avoid overwhelming servers
Pricing
Based on PPE (Pay-Per-Event) model:
- Actor Start: $0.01 per run
- Product Scraped: $0.002 per product
Example Cost Calculation
Scraping 1,000 products from a Shopify store:
- Actor start: $0.01
- Products: 1,000 Γ $0.002 = $2.00
- Total: $2.01
Limitations
- Works only with Shopify stores using the public product API
- Some Shopify stores may block the
/products.jsonendpoint (typically redirected to login) - Product data is limited to what the public API exposes (some stores hide certain fields)
- Private/password-protected stores are not supported
- Image URLs are CDN-hosted and may change over time
Supported Shopify Store Types
- Standard Shopify Plus stores
- Shopify Basic, Professional, Advanced plans
- Custom Shopify implementations
- Stores using custom domains
Troubleshooting
"Not a Shopify store" error
- Verify the domain is actually a Shopify store (check page source for
powered-by: Shopify) - Ensure the URL is correct and accessible
No products found
- The store may block public API access
- Try accessing
/products.jsondirectly in your browser - Some stores require authentication for product data
Timeout errors
- The store may be slow to respond
- Try reducing
maxProductsto fetch fewer items - Retry the run
API Reference
Shopify Public Product API
Endpoint: GET https://{domain}/products.json
Query parameters:
limit: Number of products per page (max 250)page: Page number (starts at 1)fields: Specific fields to return (optional)
Response: JSON object with products array
Data Quality
All product data comes directly from Shopify's public APIβno parsing or inference. Data accuracy depends on how shop owners maintain their product information.
Support & Questions
For issues, questions, or feature requests, contact the NexGenData team.
Built with httpx. No browser required. Lightning-fast Shopify scraping.
Related Shopify Actors
NexGenData offers a complete Shopify intelligence suite. Each actor targets a different use case:
| Actor | What It Does | Best For |
|---|---|---|
| Shopify Revenue Estimator | Revenue estimates, theme & app detection, traffic analysis | Competitive due diligence, investor research |
| Shopify Product Extractor | Full product catalogs with prices, variants, inventory | Dropshipping, price monitoring |
| Shopify Product Scraper | Lightweight product data via /products.json | Quick product lookups, data pipelines |
| Shopify Store Detector | Theme, 55+ app fingerprints, tech stack, Shopify plan | Tech stack analysis, market research |
| Shopify App Store Scraper | App listings, reviews, pricing, ratings from apps.shopify.com | App market intelligence, competitor tracking |
Sign up for Apify to get started with $5 in free credits every month.
Related tools
- Shopify Store Detector β Theme, Apps & Tech Stack Spy
- Shopify Product Extractor β Catalog & Prices
- Shopify Revenue Estimator β Store Intel
- βΏ Crypto Price Tracker β Real-Time & Alerts
π» Code Example β Python
from apify_client import ApifyClientclient = ApifyClient("YOUR_APIFY_TOKEN")run = client.actor("nexgendata/shopify-product-scraper").call(run_input={# Fill in the input shape from the actor's input_schema})for item in client.dataset(run["defaultDatasetId"]).iterate_items():print(item)
π Code Example β cURL
curl -X POST "https://api.apify.com/v2/acts/nexgendata~shopify-product-scraper/run-sync-get-dataset-items?token=YOUR_TOKEN" \-H "Content-Type: application/json" \-d '{ /* input schema */ }'
β FAQ
Q: How do I get started? Sign up at apify.com, grab your API token from Settings β Integrations, and run the actor via the Apify console, API, Python SDK, or any integration (Zapier, Make.com, n8n).
Q: What's the typical cost per run? See the pricing section below. Most runs finish under $0.10 for typical batches.
Q: Is this actor maintained? Yes. NexGenData maintains 165+ Apify actors and ships updates regularly. Bug reports via the Apify console issues tab get responses within 24 hours.
Q: Can I use the output commercially? Yes β you own the output data. Check the target site's Terms of Service for any usage restrictions on the scraped content itself.
Q: How do I handle rate limits? Apify manages concurrency and retries automatically. For very large batches (10K+ items), run multiple smaller jobs in parallel instead of one mega-job for better reliability.
π° Pricing
Pay-per-event pricing β you only pay for what you actually extract.
- Actor Start: $0.0001
- result: $0.0050
π Related NexGenData Actors
π Apify Affiliate Program
New to Apify? Sign up with our referral link β you get free platform credits on signup, and you help fund the maintenance of this actor fleet.
π More From NexGenData
Explore the full catalog, tutorials, Gumroad data packs, and newsletter at thenextgennexus.com β the brand home for everything we ship.
- π Tutorials & how-to guides
- ποΈ Full actor catalog with usage examples
- π¦ Gumroad data packs (one-time purchases)
- π¬ Newsletter β monthly drops of new actors and revenue experiments
Built and maintained by NexGenData β 165+ actors covering scraping, enrichment, MCP servers, and automation. π Home: thenextgennexus.com
Why Shopify Product Scraper Beats SimilarWeb, BuiltWith, Diffbot, Import.io & Manual Crawls
| Feature | NexGenData Shopify Product Scraper | SimilarWeb | BuiltWith Pro | Diffbot | Import.io |
|---|---|---|---|---|---|
| Cost | $5 per 1K products, pay-per-event | $199-499 / month | $295-995 / month | $299+ / month | $299+ / month |
| Native Shopify endpoint | Yes β /products.json, no browser | No | Tech detection only | Generic | Generic |
| Variants + images | Yes | No | No | Limited | Limited |
| Bulk store crawl | Yes β many stores per run | No | Plan-gated | Yes | Yes |
| Auth required | Apify token | Account + plan | Account + plan | Account + plan | Account + plan |
| Speed | Seconds-to-minutes | N/A | N/A | Minutes | Minutes |
| Monthly minimum | None | $199+ | $295+ | $299+ | $299+ |
Most DTC analysts pick this scraper instead of building a custom /products.json crawler because the bot-block + rate-limit handling alone takes weeks; the actor absorbs that for $5 per 1K products flat. Cheaper than Diffbot or Import.io for the same e-commerce use case, and drop-in compatible with downstream tools that expect Shopify-style JSON.
Related NexGenData Commerce Actors
| Use case | Actor |
|---|---|
| Shopify analyzer (revenue + traffic) | shopify-analyzer |
| Shopify store analyzer (revenue + tech) | shopify-store-analyzer |
| Shopify store detector (BuiltWith alt) | shopify-store-detector |
| Amazon product scraper | amazon-product-scraper |
| Google Shopping price tracker | google-shopping-scraper |
| SaaS / DTC pricing tracker | saas-pricing-tracker |
| JD.com product scraper (China commerce) | jd-com-product-scraper |
| Indie Hackers product launches | indie-hackers-products-tracker |
| Product Hunt daily launches | product-hunt-launches-scraper |
| E-commerce intelligence MCP for AI | ecommerce-intelligence-mcp-server |
Browse the full NexGenData catalog of 260+ actors at https://apify.com/nexgendata?fpr=2ayu9b