Shopify Store Leads Scraper, Email & Agency Audit Finder avatar

Shopify Store Leads Scraper, Email & Agency Audit Finder

Pricing

$10.00 / 1,000 saved shopify leads

Go to Apify Store
Shopify Store Leads Scraper, Email & Agency Audit Finder

Shopify Store Leads Scraper, Email & Agency Audit Finder

Find Shopify store leads and extract public business emails, socials, product samples, marketing stack gaps, agency audit scores, and outreach offers.

Pricing

$10.00 / 1,000 saved shopify leads

Rating

0.0

(0)

Developer

sanfeng zhang

sanfeng zhang

Maintained by Community

Actor stats

0

Bookmarked

9

Total users

5

Monthly active users

9 days ago

Last modified

Share

Find Shopify stores and extract public business emails, social profiles, product samples, marketing stack gaps, agency audit scores, and outreach offers for ecommerce agency prospecting.

Use seed URLs or Common Crawl discovery to turn Shopify domains into contactable leads. By default, this Actor saves only higher-quality stores with public business role emails, so pay-per-event users are not charged for weak raw store URLs. Each saved row now includes an agency audit layer for retention, paid ads tracking, CRO, subscriptions, support tooling, and wholesale/B2B fit.

Fast start

For the quickest test, paste one or more Shopify store URLs into startUrls and keep discovery disabled:

{
"startUrls": ["https://www.allbirds.com"],
"includeSearchDiscovery": false,
"includeCommonCrawlDiscovery": false,
"requireBusinessEmail": true,
"includePhones": false,
"includePersonalEmails": false,
"minOpportunityScore": 50,
"maxStores": 1,
"maxPagesPerStore": 4
}

For discovery, enable Common Crawl and keep the first run small:

{
"includeCommonCrawlDiscovery": true,
"commonCrawlLimit": 50,
"maxStores": 25,
"requireBusinessEmail": true,
"minOpportunityScore": 50
}

Example output

Each saved row is a contactable Shopify lead, not just a raw URL:

{
"domain": "example-store.com",
"isShopify": true,
"shopifyConfidence": 95,
"emails": ["support@example-store.com"],
"socials": { "instagram": ["https://www.instagram.com/example-store"] },
"sampleProductTitles": ["Product sample 1", "Product sample 2"],
"techSignals": { "metaPixel": true, "klaviyo": false },
"agencyAuditScore": 78,
"agencyFit": "high",
"missingStackSignals": ["No visible email/SMS retention tool"],
"recommendedOffers": ["Klaviyo or email/SMS retention audit"],
"opportunityScore": 80,
"suggestedAngles": ["Klaviyo or email/SMS retention audit", "Paid ads tracking audit"]
}

The run also writes RUN_SUMMARY to the key-value store, so you can see how many candidates were analyzed, confirmed, filtered, and saved.

Why use it

  • Build Shopify email finder and agency prospecting lists without manually opening every store.
  • Extract public business emails such as info@, support@, sales@, and hello@.
  • Find public social profiles, contact/about links, currency hints, product samples, and marketing tech signals.
  • Surface agency-specific gaps such as missing retention stack, missing ad pixels, missing review apps, slow homepages, and subscription opportunities.
  • Score each store from 0 to 100 using practical B2B outreach signals.
  • Filter low-quality stores before they reach the paid dataset.
  • Keep privacy exposure low with conservative defaults.

Best for

  • Shopify agency prospecting
  • Email/SMS retention service lead lists
  • Klaviyo, Postscript, and lifecycle marketing audits
  • Paid ads tracking and pixel audit prospecting
  • Conversion rate optimization prospecting
  • Product reviews, subscriptions, support tooling, and wholesale/B2B audit prospecting
  • Ecommerce SaaS market research
  • DTC competitor and niche store mapping

What it does

  • Discovers candidate stores from seed URLs, best-effort HTML search, or public Common Crawl indexes.
  • Confirms Shopify signals such as cdn.shopify.com, myshopify.com, Shopify globals, cart routes, and public product JSON endpoints.
  • Extracts public business emails, optional public phone/WhatsApp numbers, public social links, currency hints, product title samples, app/pixel signals, contact links, and reachable pages.
  • Adds an agency audit score, buyer personas, missing stack signals, detected tools, audit flags, and concrete recommended offers.
  • Suggests outreach angles such as retention/email setup, ad tracking audit, conversion cleanup, or storefront trust cleanup.
  • Writes a run summary with candidate counts, filtering decisions, charge attempts, and cost-control information.

Quality defaults

The default settings are intentionally conservative:

  • requireBusinessEmail=true: only save leads with a public business role email.
  • includePersonalEmails=false: filter individual-looking emails by default.
  • includePhones=false: avoid phone/WhatsApp extraction unless explicitly enabled.
  • minOpportunityScore=50: keep weak leads out of the default dataset.
  • respectRobotsTxt=true: skip disallowed paths when enabled.
  • productSampleLimit=5: include only a small public product sample.

Pricing

This Actor is configured for pay-per-event pricing. The launch test price is USD 0.01 per saved Shopify store lead in the default dataset.

Confirmed Shopify stores that do not pass the business email and opportunity score filters are counted in RUN_SUMMARY, but are not saved as paid dataset leads by default.

Input

  • queries: niche search terms such as organic skincare shopify.
  • startUrls: known store URLs to analyze directly.
  • includeSearchDiscovery: enable or disable search-based discovery.
  • includeCommonCrawlDiscovery: discover public myshopify.com candidates from Common Crawl.
  • commonCrawlLimit: maximum Common Crawl candidates before analysis.
  • includePhones: include public phone/WhatsApp numbers. Disabled by default.
  • includePersonalEmails: include individual-looking emails. Disabled by default.
  • requireBusinessEmail: only save leads with at least one public business role email.
  • includeProductSamples: include small public product-title samples.
  • productSampleLimit: maximum product titles included per store.
  • minOpportunityScore: only save leads with this score or higher.
  • maxStores: maximum unique domains to analyze.
  • maxSearchResultsPerQuery: candidate URLs collected per search query.
  • maxPagesPerStore: public pages fetched per store.
  • requestDelayMs: delay between requests.
  • respectRobotsTxt: skip disallowed paths when enabled.

Output fields

  • domain
  • url
  • isShopify
  • shopifyConfidence
  • shopifySignals
  • shopifyDomain
  • title
  • description
  • emails
  • phones
  • socials
  • currencyHints
  • sampleProductTitles
  • productCountSample
  • techSignals
  • agencyAuditScore
  • agencyFit
  • agencyBuyerPersonas
  • recommendedOffers
  • missingStackSignals
  • detectedStackSignals
  • auditFlags
  • agencyAudit
  • contactLinks
  • analysisStats
  • pagesFetched
  • opportunityScore
  • scoreReasons
  • suggestedAngles
  • painPoints

Example use cases

Shopify agency prospecting

Use Common Crawl discovery or seed URLs to find Shopify stores with public business emails, social links, agency audit scores, missing stack signals, and recommended offers. Export the dataset to CSV, Google Sheets, Clay, Airtable, or your CRM.

Shopify agency audit prospecting

Use agencyAuditScore, agencyFit, agencyBuyerPersonas, missingStackSignals, and recommendedOffers to build lists for Klaviyo setup, paid ads tracking, CRO, product review apps, subscriptions, support tooling, or wholesale/B2B audits.

Seed URL enrichment

Upload a list of known ecommerce domains and enrich each one with Shopify confirmation, public contacts, product context, and marketing stack signals.

Niche research

Search for categories such as skincare, supplements, pet products, jewelry, coffee, baby products, and home decor to map active Shopify brands in a niche.

Local run

This repository can run locally without installing npm packages. If the Apify SDK is unavailable, it writes output to local storage.

node .\src\main.js --input .\sample-input.json

Output:

storage/datasets/default/items.json
storage/datasets/default/items.jsonl

If PowerShell blocks scripts, run the Node command directly instead of run-local.ps1.

Apify deployment

This project includes a deployment script that reads the Apify token from:

C:\Users\10635\Documents\New project 3\.secrets\apify-token.txt

Deploy and run a smoke test:

node .\scripts\deploy-apify.mjs

Build only:

node .\scripts\deploy-apify.mjs --skip-run

Compliance posture

This Actor only analyzes public web pages. It does not log in, bypass paywalls, collect private account data, or scrape personal social profiles beyond public links already shown on store pages.

Default mode keeps privacy exposure low: public phone extraction is off, individual-looking emails are filtered out, and product samples are capped. See COMPLIANCE_AND_COST.md.

Search discovery can be throttled by search engines. For the most reliable production use, provide seed URLs, enable Common Crawl discovery, or pair this Actor with a compliant SERP provider.

Keywords

Shopify store leads, Shopify store leads scraper, Shopify email finder, Shopify leads scraper, Shopify store scraper, Shopify contact extractor, Shopify agency audit, ecommerce leads, store leads scraper, DTC lead generation, Shopify agency prospecting, ecommerce agency leads, business email finder, ecommerce tech stack detector, marketing pixel audit, Klaviyo audit, retention audit, CRO audit, Shopify app stack audit.