Pricing

Pay per usage

Website Contact & Social Discovery Crawler

High-throughput crawler that extracts emails, phone numbers, and social media profiles from websites using HTTP-first Crawlee crawling with Selectolax parsing and Playwright SPA fallback.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Man Mohit verma

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Features

Discover emails, phone numbers, and social profiles (LinkedIn, X/Twitter, Facebook, Instagram, YouTube, TikTok, Threads, GitHub, and more)
Crawl multiple websites in one run
Sitemap discovery — finds contact-related pages faster via common sitemap locations
Multi-site friendly — balances load across domains for stable multi-site runs
Proxy fallback — uses proxy support when configured and needed
Event-based output — one row per discovered email, phone, or social URL

Input

Configure the Actor in the Input tab. Main fields:

Field	Description
`websites`	Required. One or more website URLs to crawl. Each entry may be a URL string or `{ "url": "…", "countryCode": "IN" }` for phone parsing.
`defaultCountryCode`	Default ISO country code for phone parsing when a website entry omits `countryCode` (default: `US`).
`maxPagesPerSite`	Maximum pages to crawl per website (default: `25`).
`maxDepthPerSite`	Maximum link hops from the seed URL (default: `10`; `0` = seed pages only).
`terminationStrategy`	`early` stops when email, phone, and social are found; `lazy` crawls until page/depth limits (default: `early`).
`maxConcurrency`	Max parallel requests across all sites (default: `10`).
`maxConcurrencyPerDomain`	Max in-flight requests per host (default: `2`).
`maxRequestsPerDomainPerSecond`	Per-domain request rate limit (default: `2`). Lower if you see HTTP 429 errors.
`minEnqueueScore`	How selective the crawler is when following links (default: `0.333`). Higher = fewer, more contact-focused pages.
`useSemanticScoring`	Improves link selection on sites with generic URLs and descriptive link text (default: `false`).
`useSitemapDiscovery`	Resolve redirects and import URLs from `robots.txt` / `sitemap.xml` before crawling (default: `true`).
`maxSitemapUrls`	Cap on sitemap URLs imported per site (default: `50`).
`treatSubdomainsAsSameSite`	Follow links on subdomains of the same brand domain (default: `false`).
`additionalPaths`	Extra path suffixes probed per site (e.g. contact and policy pages).
`proxyConfiguration`	Optional. Direct first; proxy after HTTP 403/429 when set. Sites without proxy are skipped on 403/429. Sessions rotate on 403/429.
`maxProxySessions`	Max active proxy sessions at once (default: `10`).

Website examples

[
  "https://www.apify.com",
  "https://example.com"
]

With per-site phone region (recommended for non-US sites):

[
  { "url": "https://www.kalyansilks.com/", "countryCode": "IN" },
  { "url": "https://example.co.uk/", "countryCode": "GB" }
]

URLs with optional object form (uses defaultCountryCode when countryCode is omitted):

[
  { "url": "https://www.apify.com" }
]

Output

Each discovered entity is saved as one dataset record. Download results as JSON, CSV, Excel, HTML, XML, or RSS from the run's Storage tab.

Output fields

Field	Description
`startingUrl`	The seed URL you provided for this website
`currentPage`	The page where the entity was found
`pageFetched`	The actual URL that was fetched (may differ after redirects)
`type`	Entity type: `email`, `phone`, `twitter`, `linkedin`, `facebook`, `instagram`, `youtube`, `tiktok`, `threads`, `github`, `whatsapp`, `telegram`, `discord`, or `contact_form`
`value`	The extracted email, phone number, social profile URL, or contact page URL

Output example

{
  "startingUrl": "https://www.example.com/",
  "currentPage": "https://www.example.com/contact-us",
  "pageFetched": "https://www.example.com/contact-us",
  "type": "email",
  "value": "hello@example.com"
}

{
  "startingUrl": "https://www.example.com/",
  "currentPage": "https://www.example.com/about",
  "pageFetched": "https://www.example.com/about",
  "type": "linkedin",
  "value": "https://www.linkedin.com/company/example"
}

Tips

Start with a low maxPagesPerSite when testing new domains.
Set countryCode (or defaultCountryCode) to match each site's market so local phone numbers parse correctly.
Use terminationStrategy: "lazy" to collect more contacts within your page and depth limits.
Use terminationStrategy: "early" (default) for faster runs when one email, phone, and social per site is enough.
Set proxyConfiguration if sites return HTTP 403 or 429 without proxy.
Lower maxRequestsPerDomainPerSecond or maxConcurrencyPerDomain if you encounter rate limiting (HTTP 429).
Set useSitemapDiscovery to false if you only want to crawl pages discovered via links from the homepage.

Limitations

Extracts only publicly visible contact information on crawled pages.
Phone numbers without a country code need the correct countryCode or defaultCountryCode for your target market.
Some sites block automated access; proxy may be required.
Respects maxPagesPerSite, maxDepthPerSite, and termination strategy; lazy mode still does not guarantee every contact on large sites.

Website Contact Crawler

competent_clarinet/website-contact-crawler

Crawls websites to extract emails, phones, and social links.

Man Mohit verma

5.0

Website Email, Phone & Social Data Extract

smart-digital/website-contact-scraper-extract-email-phone-social

Extract emails, phone numbers, and social media profiles from websites. Automatic normalization (E.164), deduplication, and smart filtering. Intelligent crawling with adaptive depth (8-15 pages). Fast and efficient with Cheerio/HTTP and Playwright fallback.

My Smart Digital

Website Content Crawler

rupom888/website-content-crawler

Syed Rupom

TechCognita Website Contact Extractor v1

atharvshinde2004/techcognita-contact-extractor

Extracts emails, phone numbers, social media links, page metadata, and tech stack from starting URLs using Crawlee, Playwright, and headless Chrome.

Atharv Shinde

Website Email, Phone & Social Extractor

toolsnmoreapi/Website-Lead-Scraper

Extract business emails, phone numbers, and social profiles from websites — clean, structured, and ready for lead generation.

ToolsAPI

Contact Details Scraper – Emails, Phone Numbers & Social Media

davidsharadbhatt/socialprofilescrapper

Extract verified emails, phone numbers, and social media profiles from any website using this Contact Details Scraper. Perfect for lead generation, sales outreach, and business data collection. Automatically find contact info, LinkedIn, Twitter, and company profiles from multiple domains with ease.

David Bhatt

1.0

Website Email & Phone Finder

jacksu/website-contact-intelligence-agent

Find public business emails, phone numbers, social profiles, and source URLs from company websites with bounded crawling.

jack su

Extract Emails, Phone & Social Media from Website

contacts-api/extract-emails-phone-social-media-from-website

Easily extract emails, phone numbers, and social media links from websites. Perfect for lead generation, prospecting, and outreach with fast and accurate results.