Domain Contact Scraper – Intelligent Email & Phone Extractor avatar

Domain Contact Scraper – Intelligent Email & Phone Extractor

Pricing

from $2.00 / 1,000 domains

Go to Apify Store
Domain Contact Scraper – Intelligent Email & Phone Extractor

Domain Contact Scraper – Intelligent Email & Phone Extractor

Intelligent website contact scraper that extracts emails, phone numbers, WhatsApp links, and social profiles from any domain. Uses priority-first crawling and returns one structured, consolidated result per start URL with transparent crawl metadata.

Pricing

from $2.00 / 1,000 domains

Rating

0.0

(0)

Developer

Emiliano Mastragostino

Emiliano Mastragostino

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

2

Monthly active users

6 days ago

Last modified

Share

Domain Contact Scraper

Looking for a reliable website contact details scraper?

This Apify Actor intelligently crawls websites to extract:

  • 📧 Email addresses
  • 📞 Phone numbers
  • 💬 WhatsApp links
  • 🔗 Social media profiles (LinkedIn, Facebook, Instagram, Twitter/X, and more)

Unlike generic email extractors that dump raw page data, this Actor performs priority-first crawling and returns one clean, consolidated result per domain, including crawl metadata for full transparency.

Built for developers, agencies, and automation pipelines that need predictable and structured contact discovery.


🚀 Why Try This Actor?

✔ Intelligent, signal-driven crawl strategy ✔ Multilingual contact-page detection (contact, kontakt, contacto, contato, impressum, about, legal, etc.) ✔ One structured result per start URL ✔ Transparent crawl metadata (pages visited, stop reason, timestamp) ✔ Configurable crawl depth and scope ✔ Optional early-stop modes for adaptive crawling

Whether you're building a lead generation workflow, enriching business directories, or automating contact discovery, this Actor provides structured, reliable output ready for integration.


🔍 What This Contact Scraper Does

For each start URL:

  1. Prioritizes high-value pages (contact, about, legal, multilingual equivalents)
  2. Crawls within a controlled scope (hostname, domain, origin, or all links)
  3. Extracts contact details
  4. Aggregates all results in memory
  5. Outputs exactly one structured result per domain

No fragmented per-page records. No messy datasets.


📦 Output Structure

Each start URL produces one result:

{
"schemaVersion": 1,
"startUrl": "https://example.com",
"finalUrl": "https://www.example.com",
"contacts": {
"emails": ["info@example.com"],
"phones": ["+1 555 123 4567"],
"socials": [
{
"type": "linkedin",
"url": "https://linkedin.com/company/example"
}
],
"whatsapps": ["+15551234567"]
},
"crawl": {
"pagesVisited": 4,
"visitedSample": ["https://example.com", "https://example.com/contact"],
"stoppedReason": "early-stop",
"updatedAt": "2026-01-01T12:00:00.000Z"
}
}

Key Benefits of This Structure

  • One consolidated result per domain
  • Crawl transparency via pagesVisited and stoppedReason
  • Structured contacts object for easy automation
  • Versioned schema (schemaVersion: 1) for stability

🧩 Input Examples

Basic Usage (Quick Start)

{
"startUrls": [{ "url": "https://apify.com" }]
}

That’s it. The Actor will:

  • Crawl the site
  • Prioritize contact-related pages
  • Return structured contact data

Advanced Configuration

{
"maxPagesPerStartUrl": 8,
"maxDepth": 1,
"enqueueStrategy": "same-hostname",
"earlyStopMode": "balanced",
"proxyConfigurationOptions": {
"useApifyProxy": true
},
"startUrls": [{ "url": "https://apify.com" }]
}

Key Options Explained

  • maxPagesPerStartUrl Limits pages visited per domain.

  • maxDepth Controls how deep links are followed.

  • enqueueStrategy Defines crawl scope:

    • same-hostname (default)
    • same-domain (includes subdomains)
    • same-origin
    • all
  • earlyStopMode (optional)

    • off (default)
    • balanced
    • aggressive

Balanced and aggressive modes allow the crawl to stop once sufficient direct contact signals are discovered or when exploration becomes stale.

  • proxyConfigurationOptions Optional proxy settings for improved reach.

🧠 Intelligent Priority-First Crawling

This is not a simple depth-first or breadth-first scraper.

The Actor:

  • Scores links before enqueuing
  • Prioritizes high-value contact pages
  • Supports multilingual detection (contact, kontakt, contacto, contato, impressum, about, etc.)
  • Uses adaptive early-stop logic (optional)

This results in faster, more disciplined contact discovery.


🌍 SEO & Lead Generation Use Cases

This website contact scraper is ideal for:

  • Lead generation workflows
  • Business directory building
  • Sales prospecting automation
  • Agency outreach campaigns
  • CRM enrichment
  • Data collection pipelines
  • Email extraction from company websites

⚙️ Designed for Developers & Automation

  • Clean JSON output with multi-format export (CSV, Excel, XML, HTML)
  • Domain-level aggregation
  • Deterministic crawl behavior
  • No per-page dataset fragmentation
  • Works seamlessly with Apify API, webhooks, and SDKs

⚠️ Limitations

  • Uses HTML parsing (Cheerio-based). Heavy client-side-rendered sites may require a browser-based crawler.
  • Does not perform email verification (while it does perform basic email and phone validation)
  • Does not identify specific individuals or job titles.

🚧 Roadmap & Upcoming Improvements

This Actor is continuously evolving. Key upcoming features include:

  • Browser-Based Crawling: Introducing Playwright Crawler to handle dynamic SPAs (React, Vue, Angular) while retaining lightweight HTML crawling as the high-speed default.

  • Email Verification: Optional features for MX record validation, SMTP checks, and deliverability indicators to enhance lead quality.


🏁 Summary

If you're looking for a reliable email extractor, phone number scraper, or website contact details scraper, this Actor provides:

  • Structured, domain-level output
  • Intelligent crawl prioritization
  • Transparent crawl metadata
  • Configurable exploration control

Built for predictable, disciplined website contact discovery — not blind scraping.


Support

This Actor is actively maintained. Need help or want to customize it?