Shopify Store Intelligence Scraper - Products, Leads & Apps avatar

Shopify Store Intelligence Scraper - Products, Leads & Apps

Pricing

from $2.00 / 1,000 results

Go to Apify Store
Shopify Store Intelligence Scraper - Products, Leads & Apps

Shopify Store Intelligence Scraper - Products, Leads & Apps

Scrape Shopify stores for product catalogs, prices, variants, collections, contact details, social links, theme/app signals, pages, and store-level ecommerce intelligence.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Elliot Padfield

Elliot Padfield

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

a day ago

Last modified

Share

Scrape Shopify stores for product catalogs, prices, variants, collections, contact details, social links, theme/app signals, useful pages, and store-level ecommerce intelligence. This Actor uses Shopify's public storefront endpoints plus homepage and sitemap parsing, so it is fast, structured, and does not require a Shopify API key.

Use it for ecommerce competitor research, DTC lead generation, price monitoring, product catalog exports, brand enrichment, app-stack research, supplier discovery, and automated Shopify store tracking.

What can this Shopify scraper do?

  • Scrape one or many Shopify stores by URL or domain
  • Extract product catalogs from /products.json
  • Extract product prices, compare-at prices, variants, options, availability, SKUs, tags, vendors, types, images, and descriptions
  • Extract collections from /collections.json
  • Save a store-level summary row with catalog statistics
  • Detect common Shopify apps and marketing tools from storefront HTML
  • Detect theme name, theme ID, locale, country, currency, and myshopify.com domain when available
  • Extract social links, emails, and phone numbers from the homepage and useful pages
  • Discover contact, about, wholesale, affiliate, privacy, and policy pages from the Shopify sitemap
  • Export structured ecommerce data to Apify datasets
  • Run on schedules for store monitoring
  • Use Apify residential proxies on every run for production reliability

What data can you extract?

FieldDescription
itemTypestore, product, collection, or page
storeUrlNormalized Shopify store URL
storeDomainStore domain
shopifyDomainmyshopify.com domain when visible
storeNameStore or site name
currency, country, localeStorefront metadata when available
themeName, themeId, themeRoleShopify theme signals from storefront HTML
detectedAppsDetected tools such as Klaviyo, Gorgias, Yotpo, Loox, Okendo, Recharge, Attentive, Postscript, Judge.me, Elevar, and more
emails, phones, socialUrlsPublic contact and social links
warnings, failedSectionsOptional sections that failed while the Actor still saved available data
productCount, collectionCount, totalVariantsStore-level catalog statistics
productId, productUrl, title, handleProduct identity fields
vendor, productType, tagsProduct classification fields
descriptionText, descriptionHtmlClean or raw product description
minPrice, maxPrice, compareAtMinPrice, onSaleProduct pricing fields
available, availableVariants, variantsCountProduct availability fields
imageUrls, featuredImageUrlProduct images
options, variantsProduct option and variant data
collectionUrl, productsCountCollection fields
pageUrl, textUseful page extraction
scrapedAtTimestamp when the row was saved

How to scrape Shopify stores

  1. Add one or more Shopify store URLs or domains.
  2. Choose whether to save store summaries, products, collections, and pages.
  3. Set maxProductsPerStore, maxCollectionsPerStore, and maxPagesPerStore for test or production runs.
  4. Keep includeVariants enabled if you need SKU, option, price, and stock-level detail.
  5. Run the Actor and export the dataset in JSON, CSV, Excel, XML, RSS, or HTML from Apify.

Input examples

Scrape a Shopify store with products and lead signals

{
"storeUrls": ["https://www.allbirds.com"],
"includeStoreSummary": true,
"includeProducts": true,
"includeCollections": true,
"includePages": true,
"maxProductsPerStore": 250,
"maxCollectionsPerStore": 250,
"maxPagesPerStore": 10
}

Fast store enrichment without product rows

{
"storeUrls": [
"https://www.allbirds.com",
"https://www.gymshark.com"
],
"includeStoreSummary": true,
"includeProducts": false,
"includeCollections": false,
"includePages": true
}

Full catalog export with variants

{
"storeUrls": ["https://www.example-store.com"],
"includeProducts": true,
"includeVariants": true,
"includeHtmlDescription": false,
"maxProductsPerStore": 5000
}

Output example

{
"itemType": "product",
"storeUrl": "https://www.allbirds.com",
"storeDomain": "www.allbirds.com",
"productId": 6616124981328,
"productUrl": "https://www.allbirds.com/products/trino-cozy-crew-heathered-onyx",
"title": "Trino Cozy Crew - Heathered Onyx",
"handle": "trino-cozy-crew-heathered-onyx",
"vendor": "Allbirds",
"productType": "Socks",
"tags": ["collection:apr26", "shoprunner"],
"minPrice": 16,
"maxPrice": 16,
"currency": "USD",
"available": true,
"availableVariants": 4,
"variantsCount": 4,
"featuredImageUrl": "https://cdn.shopify.com/s/files/...",
"scrapedAt": "2026-05-28T11:20:00.000Z"
}

Search methods and filters

CapabilitySupported
Store URL/domain inputYes
Multi-store runsYes
Product catalog extractionYes
Collection extractionYes
Store summary rowYes
Contact page discoveryYes
Emails and phone numbersYes
Social linksYes
Product variantsYes
Product availabilityYes
Compare-at pricing and sale flagYes
Product imagesYes
Product descriptionsYes
Theme detectionYes
App/tool detectionYes
myshopify.com domain detectionYes
Forced Apify Residential ProxyYes

Pricing

This Actor is designed for pay-per-result pricing. Each saved dataset item is one billable result.

Result typeWhat counts as one result
Store summaryOne saved store-level intelligence row
ProductOne saved Shopify product row
CollectionOne saved collection row
PageOne saved useful page row

For small tests, set maxProductsPerStore to 10 or disable product rows and run store-summary enrichment only. The Actor stops saving new rows when the Apify pay-per-result charge limit is reached.

The Actor always uses Apify residential proxies. For scheduled monitoring, run daily or weekly with the same store inputs and deduplicate product rows by storeDomain plus productId or productUrl in your downstream workflow.

Reliability notes

The Actor saves best-effort results. If optional sections such as /products.json, /collections.json, or sitemap page discovery fail for a store, the store summary is still saved with failedSections and warnings when the homepage was reachable. This keeps lead-enrichment and monitoring runs useful even when one Shopify endpoint is disabled, blocked, or temporarily unavailable.

Why use this Actor?

Most Shopify scrapers focus only on product exports. This Actor is designed for ecommerce intelligence: it combines catalog extraction with lead enrichment, contact discovery, social links, theme/app hints, and store-level metrics.

It helps answer questions like:

  • What products does a Shopify store sell?
  • Which variants are available and which products are on sale?
  • What apps and marketing tools does a store appear to use?
  • Which social profiles, emails, and useful pages are publicly linked?
  • How large is the catalog, and what is the pricing range?
  • Which Shopify stores are good leads for ecommerce SaaS, agencies, affiliates, or suppliers?

Because it runs on Apify, you also get scheduling, API access, datasets, webhooks, proxy rotation, and integrations without maintaining your own server.