Website Contact Data Extractor

Pricing

from $0.28 / result

Extract public business contact data from websites, including validated emails, phone numbers, contact/about pages, and social profiles. Delivers clean, deduplicated JSON output for CRM enrichment, lead generation, prospecting, research, and automation workflows.

Rating

0.0

(0)

Developer

Techionik

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 13
Monthly active users: 0
Last modified: 2 days ago

Extract publicly available business contact data from company websites in a clean, structured, and automation-ready format.

Website Contact Data Extractor is built for CRM enrichment, lead generation, prospecting, business research, data collection, and automation workflows. It extracts validated emails, phone numbers, contact pages, about pages, and major social media links from business websites.

What This Actor Does

Website Contact Data Extractor helps you collect public contact and social information from websites without manually checking each page.

For each website, it can extract:

  • Business email addresses
  • Public phone numbers
  • Contact page URL
  • About page URL
  • Facebook profile link
  • Instagram profile link
  • LinkedIn profile or company page
  • Twitter / X profile link
  • YouTube channel link
  • Website domain

The output is clean, deduplicated, and returned in a consistent JSON structure.

Best For

  • Lead generation
  • Sales prospecting
  • CRM enrichment
  • Business contact discovery
  • Company research
  • Market research
  • Competitive analysis
  • Website intelligence collection
  • Contact database building
  • Automation workflows using Apify, Make, n8n, Zapier, Google Sheets, or custom APIs

Data Extracted

Each processed website returns the following fields:

Field          Description
domain         Root domain of the website
emails         Valid public email addresses found on the website
phones         Public phone numbers found on the website
contactPage    Detected contact page URL
aboutPage      Detected about page URL
facebook       Facebook profile/page URL
instagram      Instagram profile URL
linkedin       LinkedIn profile/company URL
twitter        Twitter / X profile URL
youtube        YouTube channel URL

How It Works

  1. Website Contact Data Extractor starts from the website URLs you provide.
  2. It loads each website using Crawlee and CheerioCrawler.
  3. It scans the page for public email addresses and phone numbers.
  4. It checks mailto links, visible page text, tel links, footer content, and JSON-LD structured data.
  5. It discovers contact and about pages from internal links.
  6. It follows selected same-domain pages to improve contact discovery.
  7. It detects major social media links.
  8. It removes duplicate emails and phone numbers.
  9. It filters out invalid emails, phone-like strings that are not valid numbers, and social share, policy, help, and support links.
  10. It saves clean structured results to the Apify dataset.
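
The source-scanning steps above (mailto links, tel links, and JSON-LD) can be illustrated with a minimal sketch. This is not the Actor's code: the Actor is built on Crawlee and CheerioCrawler, while this sketch uses only the Python standard library. The names ContactSignalParser and extract_candidates are hypothetical, and the JSON-LD keys "email" and "telephone" are assumed to follow schema.org conventions.

```python
import json
from html.parser import HTMLParser


class ContactSignalParser(HTMLParser):
    """Collects raw email/phone candidates from mailto:/tel: links and JSON-LD."""

    def __init__(self):
        super().__init__()
        self.emails = []
        self.phones = []
        self._in_jsonld = False
        self._jsonld_chunks = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        href = attrs.get("href") or ""
        if tag == "a" and href.lower().startswith("mailto:"):
            # Drop the scheme and any "?subject=..." query part.
            self.emails.append(href[7:].split("?")[0].strip())
        elif tag == "a" and href.lower().startswith("tel:"):
            self.phones.append(href[4:].strip())
        elif tag == "script" and attrs.get("type") == "application/ld+json":
            self._in_jsonld = True
            self._jsonld_chunks = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._jsonld_chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self._walk(json.loads("".join(self._jsonld_chunks)))
            except json.JSONDecodeError:
                pass  # Malformed JSON-LD is skipped instead of failing the page.

    def _walk(self, node):
        # JSON-LD can nest objects and lists; look for "email" and "telephone" keys.
        if isinstance(node, dict):
            for key, value in node.items():
                if key == "email" and isinstance(value, str):
                    value = value.strip()
                    if value.lower().startswith("mailto:"):
                        value = value[7:]
                    self.emails.append(value)
                elif key == "telephone" and isinstance(value, str):
                    self.phones.append(value.strip())
                else:
                    self._walk(value)
        elif isinstance(node, list):
            for item in node:
                self._walk(item)


def extract_candidates(html):
    """Return unvalidated email and phone candidates in document order."""
    parser = ContactSignalParser()
    parser.feed(html)
    parser.close()
    return {"emails": parser.emails, "phones": parser.phones}
```

The candidates returned here are raw. Validation, normalization, and deduplication would still need to run afterwards.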

Input Options

Website URLs

Add one or more website URLs to extract contact information from.

Example input:

{
  "startUrls": [
    {
      "url": "https://www.example.com"
    }
  ]
}

You can process one website or multiple websites in the same run.

Output Example

Example output item:

{
  "domain": "example.com",
  "emails": ["info@example.com"],
  "phones": ["+1-800-123-4567"],
  "contactPage": "https://example.com/contact",
  "aboutPage": "https://example.com/about",
  "facebook": "https://facebook.com/example",
  "instagram": "https://instagram.com/example",
  "linkedin": "https://linkedin.com/company/example",
  "twitter": "https://x.com/example",
  "youtube": "https://youtube.com/@example"
}

If a specific field is not found, it may return null or an empty array depending on the field type.
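
Because missing fields may arrive as either null or an empty array, downstream code is safer when it treats both the same way. The sketch below flattens one output item into a row suitable for a spreadsheet or CRM import. The helper name to_flat_row is hypothetical, and the split into list fields and single-value fields is inferred from the example item above, not from a published schema.

```python
# Field grouping inferred from the example output item (an assumption).
LIST_FIELDS = ["emails", "phones"]
SINGLE_VALUE_FIELDS = [
    "contactPage", "aboutPage",
    "facebook", "instagram", "linkedin", "twitter", "youtube",
]


def to_flat_row(item):
    """Flatten one dataset item; None, [] and missing keys all become ''."""
    row = {"domain": item.get("domain") or ""}
    for field in LIST_FIELDS:
        values = item.get(field) or []  # None and [] both mean "nothing found"
        row[field] = "; ".join(values)
    for field in SINGLE_VALUE_FIELDS:
        row[field] = item.get(field) or ""
    return row
```

For example, an item with "emails": [] and "youtube": null yields empty strings in both columns, so the row always has the same set of keys.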

Key Features

  • Extracts public business emails
  • Extracts public phone numbers
  • Detects contact pages
  • Detects about pages
  • Extracts major social media links
  • Parses JSON-LD structured data
  • Reads mailto and tel links
  • Uses footer-based phone detection
  • Uses same-domain page discovery
  • Cleans and validates extracted emails
  • Filters invalid phone-like values
  • Removes duplicate results
  • Avoids social share and policy links
  • Returns structured JSON output
  • Simple input configuration
  • Easy integration with automation tools
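
As an illustration of contact/about page detection combined with same-domain discovery, here is a minimal sketch. It is not the Actor's implementation: the function name discover_pages and the keyword lists are assumptions chosen for the example, and a real detector would likely use more signals than the URL path.

```python
from urllib.parse import urljoin, urlparse

# Assumed path keywords; the Actor's real heuristics are not documented here.
CONTACT_HINTS = ("contact",)
ABOUT_HINTS = ("about",)


def _host(url):
    """Lower-cased hostname without a leading 'www.'."""
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def discover_pages(base_url, hrefs):
    """Pick the first same-domain contact and about links from a list of hrefs."""
    result = {"contactPage": None, "aboutPage": None}
    base_host = _host(base_url)
    for href in hrefs:
        url = urljoin(base_url, href)  # resolves relative links like "/contact"
        parsed = urlparse(url)
        if parsed.scheme not in ("http", "https") or _host(url) != base_host:
            continue  # skip mailto:, tel:, and links to other domains
        path = parsed.path.lower()
        if result["contactPage"] is None and any(h in path for h in CONTACT_HINTS):
            result["contactPage"] = url
        elif result["aboutPage"] is None and any(h in path for h in ABOUT_HINTS):
            result["aboutPage"] = url
    return result
```

Restricting candidates to the same host keeps the crawl from wandering onto partner or directory sites that happen to have a "contact" link.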

Supported Social Platforms

Website Contact Data Extractor can detect links from:

  • Facebook
  • Instagram
  • LinkedIn
  • Twitter / X
  • YouTube
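
One way to map a link to one of these platforms while skipping share widgets and policy pages is sketched below. The host table and the list of noise markers are assumptions made for illustration, and classify_social_link is a hypothetical name, not part of the Actor.

```python
from urllib.parse import urlparse

SOCIAL_HOSTS = {
    "facebook.com": "facebook",
    "instagram.com": "instagram",
    "linkedin.com": "linkedin",
    "twitter.com": "twitter",
    "x.com": "twitter",
    "youtube.com": "youtube",
}

# Path fragments that suggest a share button or a policy/help page (assumed list).
# Substring matching is deliberately crude: a profile whose path starts with
# "/share..." would also be rejected.
NOISE_MARKERS = (
    "/share", "/intent/", "/privacy", "/policy", "/policies",
    "/help", "/legal", "/terms",
)


def classify_social_link(url):
    """Return the platform key for a profile-like URL, or None otherwise."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    platform = SOCIAL_HOSTS.get(host)  # other subdomains (e.g. mobile) are not handled
    if platform is None:
        return None
    path = parsed.path.lower()
    if path in ("", "/") or any(marker in path for marker in NOISE_MARKERS):
        return None  # bare homepage, share widget, or policy/help link
    return platform
```

Both twitter.com and x.com map to the twitter key, matching the single twitter field in the output.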

Typical Use Cases

CRM Enrichment

Find public emails, phone numbers, and social profiles to complete company records in your CRM.

Lead Generation

Collect publicly available business contact data from company websites for prospecting workflows.

Sales Prospecting

Build structured contact datasets that can be used for outreach preparation, research, and qualification.

Business Research

Gather contact pages, about pages, and social links to better understand companies and their online presence.

Automation Pipelines

Send extracted data to Apify integrations, Google Sheets, Make, n8n, Zapier, databases, or custom APIs.

Market Research

Analyze company websites and collect public contact signals at scale.

For One Website

Use one start URL when you only need contact data from a single company website.

Example:

https://www.example.com

For Multiple Websites

Add multiple URLs in the input to process many company domains in one run.

Example:

{
  "startUrls": [
    { "url": "https://www.example1.com" },
    { "url": "https://www.example2.com" },
    { "url": "https://www.example3.com" }
  ]
}
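
When the list of sites comes from a spreadsheet or CRM export, the input object can be generated instead of written by hand. The following sketch builds the startUrls structure shown above from a list of domains or URLs. The helper name build_input is hypothetical, and defaulting bare domains to https:// is an assumption of this example.

```python
import json


def build_input(sites):
    """Build the Actor input from domains or URLs, skipping blanks and repeats."""
    seen = set()
    start_urls = []
    for site in sites:
        site = site.strip()
        if not site:
            continue
        # Bare domains get an https:// prefix (an assumption of this sketch).
        url = site if site.startswith(("http://", "https://")) else "https://" + site
        if url not in seen:
            seen.add(url)
            start_urls.append({"url": url})
    return {"startUrls": start_urls}


if __name__ == "__main__":
    print(json.dumps(build_input(["example1.com", "https://www.example2.com"]), indent=2))
```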

Output Access

After the run finishes, you can access results from:

  • Apify Dataset
  • Dataset API
  • Overview table
  • Raw JSON output
  • CSV export
  • Excel export
  • JSON export
  • XML export
  • Apify integrations
  • Webhooks
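
For programmatic access, results can be read from the Apify Dataset API. The sketch below builds the dataset items URL and downloads the items as JSON using only the standard library. The dataset ID and token are placeholders you must supply, the function names are hypothetical, and the endpoint and its format and clean parameters should be checked against the current Apify API documentation.

```python
import json
import urllib.request
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(dataset_id, fmt="json", clean=True):
    """URL for a dataset's items; fmt can be e.g. 'json', 'csv', 'xlsx' or 'xml'."""
    query = urlencode({"format": fmt, "clean": "true" if clean else "false"})
    return f"{API_BASE}/datasets/{dataset_id}/items?{query}"


def fetch_items(dataset_id, token):
    """Download all items of a dataset as a list of dicts (performs a network call)."""
    request = urllib.request.Request(
        dataset_items_url(dataset_id),
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Passing the token in the Authorization header keeps it out of URLs that may end up in logs.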

Data Quality Approach

Website Contact Data Extractor is designed to prioritize clean and reliable results.

It uses:

  • Strict email format validation
  • Phone number length checks
  • Phone normalization
  • Duplicate removal
  • Same-domain crawling
  • Contact/about page discovery
  • Social URL sanitization
  • Filtering for share, help, privacy, policy, and support links

This helps reduce noisy results and keeps the output useful for professional workflows.
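
A minimal sketch of the email validation, phone normalization, and duplicate-removal ideas listed above follows. The regular expression, the 7 to 15 digit range, and the function names are assumptions made for illustration. The Actor's actual rules are not published here and are likely stricter.

```python
import re

# Simplified email pattern (assumed); real-world validation is usually stricter.
EMAIL_RE = re.compile(r"^[a-z0-9._%+-]+@[a-z0-9-]+(\.[a-z0-9-]+)*\.[a-z]{2,}$")
# Asset names such as "logo@2x.png" look like emails and are rejected.
ASSET_SUFFIXES = (".png", ".jpg", ".jpeg", ".gif", ".svg", ".webp")


def clean_emails(candidates):
    """Lower-case, validate, and deduplicate emails, keeping first-seen order."""
    seen, result = set(), []
    for raw in candidates:
        email = raw.strip().lower()
        if not EMAIL_RE.match(email) or email.endswith(ASSET_SUFFIXES):
            continue
        if email not in seen:
            seen.add(email)
            result.append(email)
    return result


def normalize_phone(raw, min_digits=7, max_digits=15):
    """Keep digits (and a leading '+'); return None if the length is implausible."""
    digits = re.sub(r"\D", "", raw)
    if not (min_digits <= len(digits) <= max_digits):
        return None
    return ("+" if raw.strip().startswith("+") else "") + digits


def clean_phones(candidates):
    """Normalize and deduplicate phone numbers, keeping first-seen order."""
    seen, result = set(), []
    for raw in candidates:
        phone = normalize_phone(raw)
        if phone is not None and phone not in seen:
            seen.add(phone)
            result.append(phone)
    return result
```

Normalizing before deduplicating matters: "+1 (800) 123-4567" and "+1-800-123-4567" only collapse into one entry once formatting characters are stripped. A length check alone still accepts some non-phone digit runs, so additional filtering would be needed in practice.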

Notes and Limitations

  • Only publicly available website information is collected.
  • Websites that hide contact details behind forms may return limited data.
  • Websites that require login are not supported.
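  • Heavily JavaScript-rendered websites may return fewer results, because CheerioCrawler parses the HTML returned by the server and does not execute client-side JavaScript.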
  • Some companies may not publish emails or phone numbers directly on their website.
  • The Actor focuses on quality and stability rather than aggressive deep crawling.
  • Results depend on how the target website structures its public information.

Why Use Website Contact Data Extractor

Website Contact Data Extractor saves time by automatically collecting public contact and social information from business websites.

It is useful for teams and professionals who need structured contact data for CRM enrichment, prospecting, sales research, market research, reporting, and automation workflows.

Technology

Built with:

  • Apify SDK
  • Crawlee
  • CheerioCrawler
  • Cheerio

Status

Production-ready for public website contact and social information extraction.