Shopify Product Catalog Scraper
Pricing
from $1.70 / 1,000 products
Advanced Shopify product catalog scraper that pulls pricing, variants, stock status, and images from multiple stores with full pagination control. Set limits to 0 to scrape entire catalogs. Fast, lightweight, and built for automation.
Developer: Techionik
Last modified: 2 days ago
Scrape structured product data from one or multiple Shopify-based e-commerce stores.
This Actor is optimized specifically for Shopify websites and extracts product data using Shopify’s built-in product API.
✅ What This Actor Extracts
For each product:
- Product Name
- Price
- Availability
- Currency
- Images
- Variants (with ID, title, price, availability)
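As a rough illustration of the fields listed above, the sketch below maps one product object from a `/products.json` response to a flat record. The output field names, the `toRecord` helper, and the `currency` argument are assumptions made for this example; the Actor's real output keys may differ.

```javascript
// Sketch only: field names on the returned record are hypothetical.
// `product` is one entry of the `products` array in a /products.json response.
function toRecord(product, currency) {
  const variants = (product.variants || []).map((v) => ({
    id: v.id,
    title: v.title,
    price: v.price, // Shopify typically returns prices as strings, e.g. "12.00"
    available: Boolean(v.available),
  }));

  return {
    name: product.title,
    // Assumption: use the first variant's price as the product price.
    price: variants.length > 0 ? variants[0].price : null,
    // Assumption: a product counts as available if any variant is available.
    available: variants.some((v) => v.available),
    currency, // supplied by the caller; not read from the product object here
    images: (product.images || []).map((img) => img.src),
    variants,
  };
}
```

Treating the first variant's price as the product price is one possible convention; a minimum across variants would be an equally reasonable choice.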
🚀 How It Works
Shopify stores expose product data through:
/products.json
Instead of scraping HTML pages, this Actor fetches structured JSON data directly from Shopify. This makes it:
- Faster
- More stable
- More reliable
- Less prone to breaking when a store's layout changes
The Actor automatically converts a store URL into:
https://store.com/products.json?page=1
and paginates through all available product pages.
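The URL conversion described above could look something like the following. This is a minimal sketch using the standard `URL` class; the helper name `toProductsJsonUrl` is hypothetical and not taken from the Actor's source.

```javascript
// Sketch: turn any URL on a store into its /products.json URL for a given page.
function toProductsJsonUrl(storeUrl, page = 1) {
  // An absolute path resolves against the store's origin, so any existing
  // path or query string on the input URL is dropped.
  const url = new URL('/products.json', storeUrl);
  url.searchParams.set('page', String(page));
  return url.toString();
}

console.log(toProductsJsonUrl('https://store.com'));
// prints "https://store.com/products.json?page=1"
```

Pagination would then amount to calling this helper with increasing `page` values until a page comes back with no products.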
🔢 Input Options
startUrls
Enter one or more Shopify store URLs.
Example: https://www.nativecos.com
maxPages
Maximum number of pages to fetch per URL.
- Set to 0 → Unlimited pages
- Set to any positive number → Limits page count
maxProductsPerPage
Maximum number of products extracted from each page per URL.
- Set to 0 → Unlimited products per page
- Set to any positive number → Limits products per page
📌 Important Behavior
If you enter two store URLs and set:
maxPages = 2
maxProductsPerPage = 10
The Actor will:
- Extract 2 pages from URL 1
- Extract 2 pages from URL 2
- Extract 10 products from each page
If both values are 0 → Unlimited scraping.
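The limit semantics above (0 means unlimited, limits apply per URL) can be sketched as follows. The helper names are hypothetical, and the `startUrls` shape (objects with a `url` key) is an assumption based on common Apify input conventions, not something this README confirms.

```javascript
// Hypothetical example input matching the scenario above.
const exampleInput = {
  startUrls: [{ url: 'https://www.nativecos.com' }, { url: 'https://store.com' }],
  maxPages: 2,
  maxProductsPerPage: 10,
};

// 0 means "no limit"; any positive number is used as the cap.
function normalizeLimit(value) {
  return value > 0 ? value : Infinity;
}

// Whether a given 1-based page number is still within the page limit.
function shouldFetchPage(page, maxPages) {
  return page <= normalizeLimit(maxPages);
}

// Keep at most maxProductsPerPage products from one page.
function takeFromPage(products, maxProductsPerPage) {
  return products.slice(0, normalizeLimit(maxProductsPerPage));
}

// Upper bound on products collected from a single store URL.
function maxProductsPerStore(maxPages, maxProductsPerPage) {
  return normalizeLimit(maxPages) * normalizeLimit(maxProductsPerPage);
}

console.log(maxProductsPerStore(exampleInput.maxPages, exampleInput.maxProductsPerPage));
// prints 20
```

With two store URLs, this input would yield at most 20 products per store, so at most 40 in total.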
🧠 HTML vs JSON
You only need to enter the store homepage URL.
Example: https://www.nativecos.com
You do NOT need to enter /products.json.
The Actor automatically handles everything internally.
⚠️ Limitations
Some Shopify stores use:
- Cloudflare protection
- Bot blocking
- Geo-restrictions
If a store blocks access to products.json, the Actor will skip it and log a warning.
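The skip-and-warn behavior might be approximated as below. This is a sketch under assumptions: the README does not say how the Actor detects a blocked store, so the check used here (non-2xx status or a non-JSON response) and the names `isBlocked` and `fetchPageOrSkip` are illustrative only. It relies on the global `fetch` available in Node.js 18 and later.

```javascript
// Assumption: treat any non-2xx status or non-JSON response as "blocked".
function isBlocked(status, contentType) {
  const isJson = (contentType || '').includes('application/json');
  return status < 200 || status >= 300 || !isJson;
}

// Fetch one products.json page; return its products, or null if the store
// should be skipped. Warnings go to `log.warn` (console by default).
async function fetchPageOrSkip(url, log = console) {
  try {
    const res = await fetch(url);
    if (isBlocked(res.status, res.headers.get('content-type'))) {
      log.warn(`Skipping ${url}: HTTP ${res.status}`);
      return null;
    }
    const body = await res.json();
    return body.products || [];
  } catch (err) {
    // Network errors and invalid JSON are treated like a block: skip and warn.
    log.warn(`Skipping ${url}: ${err.message}`);
    return null;
  }
}
```

A caller would stop paginating a store as soon as `fetchPageOrSkip` returns `null` and move on to the next URL.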
🏗 Built With
- Apify SDK
- Crawlee
- Node.js
- Shopify JSON API
- cheerio
🎯 Best For
- Product data collection
- Market research
- Competitor analysis
- E-commerce monitoring
- Bulk Shopify scraping
💡 Example Use Cases
- Scrape multiple Shopify stores at once
- Extract full product catalogs
- Monitor availability changes
- Collect variant-level pricing
📦 Output
All extracted products are stored in the default Apify Dataset in structured JSON format.