Pricing

Pay per usage

Try for free

Go to Apify Store

Website Contact Crawler

Try for free

Crawls websites to extract emails, phones, and social links.

Pricing

Pay per usage

Rating

5.0

(1)

Developer

Man Mohit verma

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Output

Default dataset — one row per unique contact (standard Apify export as JSON/CSV).
Key-Value Store
- contacts.json — full aggregated array of all contacts from the run.
- pages-scraped.json — per seed URL, all HTML pages that were successfully scraped (startingUrl + pagesScraped array).

Input

startUrls: list of seed URLs (JSON array; supports large lists such as ~1,000 sites)
depthOfPages: crawl depth from each seed URL
defaultPhoneRegion: default region for phonenumbers
maxConcurrencyPerIp: concurrent fetches per worker band (default 50)
proxyPoolSize: number of worker bands (default 10); total workers = maxConcurrencyPerIp × proxyPoolSize
maxConcurrencyPerHost: cap simultaneous requests per website host (default 5; set 0 to disable)
dedupeScope: global (one row per value) or perStartingUrl (same value allowed under different seeds)
proxyConfiguration: Apify Proxy or custom proxy settings (RESIDENTIAL recommended on Apify)
additionalPaths / excludeKeywords: add depth-1 paths and filter URLs

Concurrency and proxy

Worker bands: proxyPoolSize × maxConcurrencyPerIp async workers (default 500) share a global crawl queue.
Per-request IP rotation: when Apify Proxy is enabled, every HTTP request uses a new residential proxy session (session_id is unique per fetch). Worker bands organize parallelism; they do not pin 10 fixed IPs.
Per-host limit: maxConcurrencyPerHost reduces hammering a single domain when many seeds or pages target the same host.
Cost: high concurrency with residential proxies can be expensive; lower maxConcurrencyPerIp or proxyPoolSize if you hit rate limits or budget limits.

Notes

The crawler stays on the same host or subdomain family as the seed URL, and also follows links on other hosts seen in that crawl branch (common for Shopify: *.myshopify.com seed redirecting to a custom domain while HTML still links to myshopify.com pages).
Static assets, mailto:, tel:, javascript:, and fragment-only links are ignored for crawling.
additionalPaths are applied when the seed page at depth 0 is fetched, so they become depth-1 pages alongside links discovered from that page. excludeKeywords blocks matching URLs at every depth.
429 responses trigger host-specific cooldowns and respect Retry-After. Lower concurrency if a site still rate limits heavily.
Local runs work without Apify Proxy credentials; on Apify, the actor uses the residential proxy pool when available.
Default run options: 2-hour timeout, 8 GB memory (see .actor/actor.json). Increase timeout for very large seed lists and depth.

Local run

python -m pip install -r requirements.txt
python -m src

For local testing, put an INPUT.json file under storage/key_value_stores/default/ or set APIFY_LOCAL_STORAGE_DIR to a folder with that structure.

After a run, check storage/datasets/default/ for dataset rows and storage/key_value_stores/default/contacts.json and pages-scraped.json for aggregated JSON files.

Publish to Apify

apify login
apify push

Smoke-test with a few startUrls and depthOfPages=1, then scale up gradually before running ~1,000 seeds at full concurrency.

Website Contact + Social Link Extractor

isotonic/website-contact-social-link-extractor

Extracts public emails, phones, contact pages, and social profile links from websites.

Brian Keefe

Website Contact Extractor - Emails, Phones & Social Links

santhej/website-contact-extractor

Bulk-extract contact details from any list of websites: email addresses, phone numbers, and social profiles (LinkedIn, X, Facebook, Instagram, YouTube). Crawls homepage + contact/about pages. Clean JSON/CSV for lead lists & enrichment.

Santhej Kallada

5.0

Contact Info Scraper — Emails, Phones & Social Links

junipr/contact-info-scraper

Extract public emails, phone numbers, addresses, contact pages, and social links from websites with lead-quality scoring and deduped exports.

junipr

Website Contact Scraper – Email, Phone & Social

logiover/website-contact-scraper

Bulk email and contact extractor for any website. Scrape emails, phones and social links with no API and export leads to CSV or JSON.

Logiover

538

Contact Info Scraper — Extract Emails & Phones from Websites

lanky_quantifier/contact-info-scraper

Extract emails, phone numbers, and social profiles (LinkedIn, Twitter, Facebook, Instagram, YouTube, TikTok, GitHub) from any website. Crawls contact pages, footers, and team pages. B2B lead gen and CRM enrichment.

Vhub Systems

Domain Contact Enrichment

toronto_777/domain-contact-enrichment

Extract public contact emails, phone links, contact pages, and social links from company websites.

Steven Feng

Website Contact Information Extractor

gio21/website-contact-extractor

Extract contact info (emails, phones, addresses, social links) from any website. Crawls homepage plus /contact, /about, /impressum pages, deduplicates results, and returns one row per website. Pay per website processed.

Gio

5.0

Contact Details Extractor — Emails, Phones & Social

reflective_plagioclase/contact-details-extractor

Extract emails, phone numbers and social media links (15+ platforms) from any website. Includes lead enrichment.

Matt

Website Contact & Social Finder

glowing_glove/website-contact-finder

Extract public emails, phone numbers, contact pages, about pages, and social profile links from company websites.

Ushba Khan

Website Contact Scraper — Emails, Phones & Socials

hipersoft/website-contact-scraper

Extract business emails, phone numbers and social media profiles from any list of websites. De-obfuscates emails, reads Organization schema, and optionally crawls contact/about pages. Bulk-ready, no login or API key.