Shopify Store Intelligence Scraper - Products, Leads & Apps
Pricing
from $2.00 / 1,000 results
Developer
Elliot Padfield
Maintained by Community
Scrape Shopify stores for product catalogs, prices, variants, collections, contact details, social links, theme/app signals, useful pages, and store-level ecommerce intelligence. This Actor uses Shopify's public storefront endpoints plus homepage and sitemap parsing, so it is fast, structured, and does not require a Shopify API key.
Use it for ecommerce competitor research, DTC lead generation, price monitoring, product catalog exports, brand enrichment, app-stack research, supplier discovery, and automated Shopify store tracking.
What can this Shopify scraper do?
- Scrape one or many Shopify stores by URL or domain
- Extract product catalogs from `/products.json`
- Extract product prices, compare-at prices, variants, options, availability, SKUs, tags, vendors, types, images, and descriptions
- Extract collections from `/collections.json`
- Save a store-level summary row with catalog statistics
- Detect common Shopify apps and marketing tools from storefront HTML
- Detect theme name, theme ID, locale, country, currency, and myshopify.com domain when available
- Extract social links, emails, and phone numbers from the homepage and useful pages
- Discover contact, about, wholesale, affiliate, privacy, and policy pages from the Shopify sitemap
- Export structured ecommerce data to Apify datasets
- Run on schedules for store monitoring
- Use Apify residential proxies on every run for production reliability
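As an illustration of how the product pricing and availability fields could be derived from a `/products.json` payload, here is a minimal sketch. It assumes the payload shape commonly returned by Shopify's public storefront endpoint (each product has a `variants` list whose entries carry a string `price`, an optional `compare_at_price`, and a boolean `available`). The function name `summarize_product` and the exact derivation rules are illustrative assumptions, not the Actor's actual implementation.

```python
from typing import Any, Optional


def _to_float(value: Any) -> Optional[float]:
    """Convert a price string such as "16.00" to a float; None if missing."""
    if value in (None, ""):
        return None
    return float(value)


def summarize_product(product: dict) -> dict:
    """Derive pricing and availability fields from one product object.

    Hypothetical helper: the field names mirror the output table in this
    README, but the rules below are assumptions.
    """
    variants = product.get("variants", [])
    prices, compare_prices = [], []
    on_sale = False
    available_variants = 0

    for variant in variants:
        price = _to_float(variant.get("price"))
        compare_at = _to_float(variant.get("compare_at_price"))
        if price is not None:
            prices.append(price)
        if compare_at is not None:
            compare_prices.append(compare_at)
        # Assumption: a product is "on sale" if any variant's compare-at
        # price is higher than its current price.
        if price is not None and compare_at is not None and compare_at > price:
            on_sale = True
        if variant.get("available"):
            available_variants += 1

    return {
        "productId": product.get("id"),
        "handle": product.get("handle"),
        "minPrice": min(prices) if prices else None,
        "maxPrice": max(prices) if prices else None,
        "compareAtMinPrice": min(compare_prices) if compare_prices else None,
        "onSale": on_sale,
        "available": available_variants > 0,
        "availableVariants": available_variants,
        "variantsCount": len(variants),
    }


# Made-up sample product in the assumed payload shape.
sample_product = {
    "id": 1,
    "handle": "example-socks",
    "variants": [
        {"price": "16.00", "compare_at_price": "20.00", "available": True},
        {"price": "18.00", "compare_at_price": None, "available": False},
    ],
}
summary = summarize_product(sample_product)
```

Working on the parsed payload rather than on a live request keeps the sketch independent of any particular store and of proxy configuration.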
What data can you extract?
| Field | Description |
|---|---|
| `itemType` | `store`, `product`, `collection`, or `page` |
| `storeUrl` | Normalized Shopify store URL |
| `storeDomain` | Store domain |
| `shopifyDomain` | myshopify.com domain when visible |
| `storeName` | Store or site name |
| `currency`, `country`, `locale` | Storefront metadata when available |
| `themeName`, `themeId`, `themeRole` | Shopify theme signals from storefront HTML |
| `detectedApps` | Detected tools such as Klaviyo, Gorgias, Yotpo, Loox, Okendo, Recharge, Attentive, Postscript, Judge.me, Elevar, and more |
| `emails`, `phones`, `socialUrls` | Public contact and social links |
| `warnings`, `failedSections` | Optional sections that failed while the Actor still saved available data |
| `productCount`, `collectionCount`, `totalVariants` | Store-level catalog statistics |
| `productId`, `productUrl`, `title`, `handle` | Product identity fields |
| `vendor`, `productType`, `tags` | Product classification fields |
| `descriptionText`, `descriptionHtml` | Clean or raw product description |
| `minPrice`, `maxPrice`, `compareAtMinPrice`, `onSale` | Product pricing fields |
| `available`, `availableVariants`, `variantsCount` | Product availability fields |
| `imageUrls`, `featuredImageUrl` | Product images |
| `options`, `variants` | Product option and variant data |
| `collectionUrl`, `productsCount` | Collection fields |
| `pageUrl`, `text` | Useful page extraction |
| `scrapedAt` | Timestamp when the row was saved |
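Because a single dataset mixes store, product, collection, and page rows, downstream code usually needs to split rows by `itemType` first. The following is a minimal sketch of that step; the helper name `group_by_item_type` and the sample rows are hypothetical.

```python
from collections import defaultdict
from typing import Iterable


def group_by_item_type(rows: Iterable[dict]) -> dict:
    """Bucket dataset rows by their itemType value.

    Rows without an itemType go into an "unknown" bucket so nothing is
    silently dropped.
    """
    grouped = defaultdict(list)
    for row in rows:
        grouped[row.get("itemType", "unknown")].append(row)
    return dict(grouped)


# Made-up rows using field names from the table above.
rows = [
    {"itemType": "store", "storeDomain": "www.example-store.com"},
    {"itemType": "product", "storeDomain": "www.example-store.com", "productId": 1},
    {"itemType": "product", "storeDomain": "www.example-store.com", "productId": 2},
    {"itemType": "page", "pageUrl": "https://www.example-store.com/pages/contact"},
]
grouped = group_by_item_type(rows)
```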
How to scrape Shopify stores
- Add one or more Shopify store URLs or domains.
- Choose whether to save store summaries, products, collections, and pages.
- Set `maxProductsPerStore`, `maxCollectionsPerStore`, and `maxPagesPerStore` for test or production runs.
- Keep `includeVariants` enabled if you need SKU, option, price, and stock-level detail.
- Run the Actor and export the dataset in JSON, CSV, Excel, XML, RSS, or HTML from Apify.
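The steps above can also be scripted. The sketch below builds a run input and, optionally, starts the Actor through the `apify-client` Python package. Several things are assumptions here: the helper names `build_run_input` and `run_actor`, the small test limits for collections and pages, the `APIFY_TOKEN` environment variable, and the Actor ID, which this page does not state and which you would need to copy from the Actor's Apify page.

```python
import os
from typing import Iterable, Iterator


def build_run_input(store_urls: Iterable[str], test_run: bool = True) -> dict:
    """Build an input object using the field names from this README.

    Assumption: the test limits of 10 collections and 5 pages are
    illustrative; the production limits copy the first input example.
    """
    return {
        "storeUrls": list(store_urls),
        "includeStoreSummary": True,
        "includeProducts": True,
        "includeCollections": True,
        "includePages": True,
        "includeVariants": True,
        "maxProductsPerStore": 10 if test_run else 250,
        "maxCollectionsPerStore": 10 if test_run else 250,
        "maxPagesPerStore": 5 if test_run else 10,
    }


def run_actor(actor_id: str, run_input: dict) -> Iterator[dict]:
    """Start the Actor, wait for it to finish, and yield dataset rows.

    Requires `pip install apify-client` and an API token in APIFY_TOKEN.
    The import is local so build_run_input works without the package.
    """
    from apify_client import ApifyClient

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(actor_id).call(run_input=run_input)
    yield from client.dataset(run["defaultDatasetId"]).iterate_items()


test_input = build_run_input(["https://www.allbirds.com"], test_run=True)

# Example usage (not executed here; replace the placeholder Actor ID):
# for row in run_actor("<username>/<actor-name>", test_input):
#     print(row["itemType"], row.get("title"))
```

Starting with `test_run=True` keeps the first run small before the limits are raised for production.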
Input examples
Scrape a Shopify store with products and lead signals
{"storeUrls": ["https://www.allbirds.com"],"includeStoreSummary": true,"includeProducts": true,"includeCollections": true,"includePages": true,"maxProductsPerStore": 250,"maxCollectionsPerStore": 250,"maxPagesPerStore": 10}
Fast store enrichment without product rows
{"storeUrls": ["https://www.allbirds.com","https://www.gymshark.com"],"includeStoreSummary": true,"includeProducts": false,"includeCollections": false,"includePages": true}
Full catalog export with variants
{"storeUrls": ["https://www.example-store.com"],"includeProducts": true,"includeVariants": true,"includeHtmlDescription": false,"maxProductsPerStore": 5000}
Output example
{"itemType": "product","storeUrl": "https://www.allbirds.com","storeDomain": "www.allbirds.com","productId": 6616124981328,"productUrl": "https://www.allbirds.com/products/trino-cozy-crew-heathered-onyx","title": "Trino Cozy Crew - Heathered Onyx","handle": "trino-cozy-crew-heathered-onyx","vendor": "Allbirds","productType": "Socks","tags": ["collection:apr26", "shoprunner"],"minPrice": 16,"maxPrice": 16,"currency": "USD","available": true,"availableVariants": 4,"variantsCount": 4,"featuredImageUrl": "https://cdn.shopify.com/s/files/...","scrapedAt": "2026-05-28T11:20:00.000Z"}
Search methods and filters
| Capability | Supported |
|---|---|
| Store URL/domain input | Yes |
| Multi-store runs | Yes |
| Product catalog extraction | Yes |
| Collection extraction | Yes |
| Store summary row | Yes |
| Contact page discovery | Yes |
| Emails and phone numbers | Yes |
| Social links | Yes |
| Product variants | Yes |
| Product availability | Yes |
| Compare-at pricing and sale flag | Yes |
| Product images | Yes |
| Product descriptions | Yes |
| Theme detection | Yes |
| App/tool detection | Yes |
| myshopify.com domain detection | Yes |
| Forced Apify Residential Proxy | Yes |
Pricing
This Actor is designed for pay-per-result pricing. Each saved dataset item is one billable result.
| Result type | What counts as one result |
|---|---|
| Store summary | One saved store-level intelligence row |
| Product | One saved Shopify product row |
| Collection | One saved collection row |
| Page | One saved useful page row |
For small tests, set maxProductsPerStore to 10 or disable product rows and run store-summary enrichment only. The Actor stops saving new rows when the Apify pay-per-result charge limit is reached.
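To budget a run before starting it, the row limits can be turned into an upper bound on billable results. This is a rough sketch: it assumes the listed starting price of $2.00 per 1,000 results applies (the actual rate may differ), and the function name `estimate_max_cost` is hypothetical.

```python
def estimate_max_cost(
    stores: int,
    max_products: int,
    max_collections: int,
    max_pages: int,
    include_summary: bool = True,
    usd_per_1000: float = 2.00,
) -> tuple:
    """Return (max_rows, max_cost_usd) for a run.

    This is an upper bound: stores with fewer products, collections, or
    pages than the limits produce fewer billable rows.
    """
    rows_per_store = (1 if include_summary else 0) + max_products + max_collections + max_pages
    max_rows = stores * rows_per_store
    return max_rows, max_rows * usd_per_1000 / 1000


# Limits taken from the first input example in this README.
rows, cost = estimate_max_cost(stores=1, max_products=250, max_collections=250, max_pages=10)
```

For the first input example this gives at most 511 rows, or roughly $1.02 at the assumed rate.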
The Actor always uses Apify residential proxies. For scheduled monitoring, run daily or weekly with the same store inputs and deduplicate product rows by storeDomain plus productId or productUrl in your downstream workflow.
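The deduplication step described above might look like the following in a downstream script. It is a sketch under stated assumptions: rows from several scheduled runs are merged into one list, `scrapedAt` values are ISO 8601 UTC strings in the same format as the output example (so they sort correctly as plain strings), and the helper names are hypothetical.

```python
from typing import Iterable


def product_key(row: dict) -> tuple:
    """Identify a product by storeDomain plus productId, else productUrl."""
    identifier = row.get("productId") or row.get("productUrl")
    return (row.get("storeDomain"), identifier)


def dedupe_products(rows: Iterable[dict]) -> list:
    """Keep only the most recently scraped row for each product."""
    latest = {}
    for row in rows:
        if row.get("itemType") != "product":
            continue  # store, collection, and page rows are ignored here
        key = product_key(row)
        current = latest.get(key)
        if current is None or row.get("scrapedAt", "") >= current.get("scrapedAt", ""):
            latest[key] = row
    return list(latest.values())


# Made-up rows from two scheduled runs of the same store.
merged_rows = [
    {"itemType": "product", "storeDomain": "www.example-store.com", "productId": 1,
     "minPrice": 16, "scrapedAt": "2026-05-27T11:20:00.000Z"},
    {"itemType": "product", "storeDomain": "www.example-store.com", "productId": 1,
     "minPrice": 14, "scrapedAt": "2026-05-28T11:20:00.000Z"},
    {"itemType": "product", "storeDomain": "www.example-store.com", "productId": 2,
     "minPrice": 30, "scrapedAt": "2026-05-28T11:20:00.000Z"},
]
unique_products = dedupe_products(merged_rows)
```

For price monitoring you would typically compare the older and newer rows before discarding the older one, since the difference in `minPrice` is the signal of interest.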
Reliability notes
The Actor saves best-effort results. If an optional section such as /products.json, /collections.json, or sitemap page discovery fails for a store, the store summary is still saved as long as the homepage was reachable, and the failure is recorded in failedSections and warnings. This keeps lead-enrichment and monitoring runs useful even when one Shopify endpoint is disabled, blocked, or temporarily unavailable.
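A downstream job can use these fields to flag stores whose data is only partial. The sketch below assumes `failedSections` and `warnings` are lists on store rows; the section names and warning text in the sample are made up, since this page does not list the actual values, and `partial_stores` is a hypothetical helper.

```python
from typing import Iterable


def partial_stores(rows: Iterable[dict]) -> dict:
    """Map storeDomain to its failed sections and warnings.

    Only store rows with a non-empty failedSections value are reported.
    """
    report = {}
    for row in rows:
        if row.get("itemType") != "store":
            continue
        failed = row.get("failedSections") or []
        if failed:
            report[row.get("storeDomain")] = {
                "failedSections": list(failed),
                "warnings": list(row.get("warnings") or []),
            }
    return report


# Made-up store rows; the section name "collections" is an assumption.
store_rows = [
    {"itemType": "store", "storeDomain": "a.example.com", "failedSections": [], "warnings": []},
    {"itemType": "store", "storeDomain": "b.example.com",
     "failedSections": ["collections"], "warnings": ["collections endpoint unavailable"]},
]
needs_review = partial_stores(store_rows)
```

Stores in the report can be queued for a later retry while their summary rows remain usable.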
Why use this Actor?
Most Shopify scrapers focus only on product exports. This Actor is designed for ecommerce intelligence: it combines catalog extraction with lead enrichment, contact discovery, social links, theme/app hints, and store-level metrics.
It helps answer questions like:
- What products does a Shopify store sell?
- Which variants are available and which products are on sale?
- What apps and marketing tools does a store appear to use?
- Which social profiles, emails, and useful pages are publicly linked?
- How large is the catalog, and what is the pricing range?
- Which Shopify stores are good leads for ecommerce SaaS, agencies, affiliates, or suppliers?
Because it runs on Apify, you also get scheduling, API access, datasets, webhooks, proxy rotation, and integrations without maintaining your own server.