Website Contact & Email Scraper avatar

Website Contact & Email Scraper

Pricing

from $15.00 / 1,000 company profile with contacts

Go to Apify Store
Website Contact & Email Scraper

Website Contact & Email Scraper

Turn company domains into structured B2B company profiles. Collect emails, phone numbers, social links, offices, leadership, funding, and company overview from public websites. One dataset row per domain—built for lead generation, sales outreach, CRM enrichment, and market research.

Pricing

from $15.00 / 1,000 company profile with contacts

Rating

0.0

(0)

Developer

Coder

Coder

Maintained by Community

Actor stats

1

Bookmarked

16

Total users

11

Monthly active users

7 days ago

Last modified

Share

Website Contact & Company Data Scraper

Turn company website domains into structured business profiles using public website data. For each domain you provide, this Actor visits the company site, reads the most relevant pages, and returns a single JSON record in your Apify dataset.

Use it for lead research, account enrichment, competitive intelligence, and CRM or outreach workflows—without maintaining your own scraping stack.


What you get

ModeInput settingBest for
Contact profile (default)deepScraping: falseEmails, phones, offices, official social links, company name & short description
Company intelligence reportdeepScraping: trueEverything in the contact profile plus leadership, funding, products, market positioning, technical footprint, workforce signals, and SEO/community metrics

Each successful domain produces one dataset item (one JSON object per row).


Quick start on Apify

  1. Open the Actor in the Apify Console.
  2. Under Domains, add one or more company domains (e.g. apify.com). Use the domain only—no https://, paths, or query strings.
  3. Set Max pages per domain (default 5). Higher values may find more contact pages but increase run time and cost.
  4. Leave Detailed company research off for a fast contact-focused run, or turn it on for the full report.
  5. Click Start and open the Dataset tab when the run finishes.

Example input (contact profile):

{
"domains": ["apify.com"],
"maxPages": 10,
"deepScraping": false
}

Example input (full intelligence report):

{
"domains": ["apify.com", "amplitude.com"],
"maxPages": 10,
"deepScraping": true
}

Input parameters

Domains (required)

TypeList of strings
FormatRoot domain only: apify.com, www.example.co.uk → use example.co.uk
Avoidhttps://apify.com/contact, /about, or email addresses

The Actor processes each domain independently. Multiple domains can run in parallel (see Multiple domains).

Max pages per domain (required)

TypeInteger
Range1100
Default5

This controls how many additional pages are scanned on each company site (beyond the homepage). The Actor prioritizes high-value paths such as contact, about, team, support, pricing, and company information.

maxPagesTypical effect
5Fast, cost-efficient; good for quick lead checks
10Balanced default for production enrichment
20+Deeper coverage; longer runs; useful for large or complex sites

Detailed company research (deepScraping)

TypeBoolean
Defaultfalse

Off (contact profile)
Focused output: company vitals (name, tagline, short description), emails, phone numbers, office locations, and social profile URLs grouped by platform.

On (company intelligence report)
Includes the contact profile and expands into leadership, financial/funding signals, products and innovation, market positioning, technical infrastructure, workforce insights, and digital presence (SEO and community ratings where available).

Read the Pricing section on the Actor store before enabling detailed mode—billing includes page reads and one or two company-profile charges per domain depending on depth and data found.


What you will see during a run

The run log is written for end users: progress messages describe phases such as scanning a domain, choosing pages to read, preparing the profile, and saving results. You do not need technical logs to use the Actor.

Typical messages:

  • Run started — processing N domains.
  • Scanning apify.com for contact information...
  • Choosing the best pages to scan...
  • Reading 10 pages on apify.com...
  • Preparing contact profile... (or Building company intelligence report... in detailed mode)
  • Run finished — N company profiles saved to the dataset.

If a domain cannot be completed (unreachable site, blocking, or invalid domain), you will see a warning for that domain; other domains in the same run can still succeed.


Output overview

Dataset structure

  • One JSON object per domain
  • Field presence varies by site: not every company publishes phones, all social networks, or leadership pages
  • Empty arrays ([]) or null mean no confident public signal was found—not an error

Top-level field

FieldDescription
domainThe input domain for this record

Contact profile output (deepScraping: false)

Returned sections:

SectionContents
company_vitalsofficial_name, tagline, description_short
contact_and_locationsdiscovery_emails, phone_numbers, global_offices
social_ecosystemOfficial profile URLs per platform (arrays of strings)

Social platforms include: LinkedIn, X (Twitter), Facebook, Instagram, GitHub, YouTube, Discord, TikTok, Crunchbase, Wellfound/AngelList, Glassdoor, Trustpilot, Reddit, Medium, Slack community, and other_social_links.

Example: apify.com (contact profile)

Input:

{
"domains": ["apify.com"],
"maxPages": 10,
"deepScraping": false
}

Excerpt of dataset output:

{
"domain": "apify.com",
"company_vitals": {
"official_name": "Apify",
"tagline": "Apify: Full-stack web scraping and data extraction platform",
"description_short": "Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 34,000+ ready-made tools, code templates, or order a custom solution."
},
"contact_and_locations": {
"discovery_emails": [
"hello@apify.com",
"legal@apify.com",
"privacy@apify.com",
"support@apify.com",
"security@apify.com"
],
"phone_numbers": [],
"global_offices": [
{
"type": "Headquarters",
"address": "Vodičkova 704/36, Nové Město, 110 00 Praha-Praha 1, Czechia",
"map_link": "https://maps.google.com/?cid=4018967042655020156"
}
]
},
"social_ecosystem": {
"linkedin": ["https://www.linkedin.com/company/apify"],
"twitter_x": ["https://x.com/apify"],
"facebook": [],
"instagram": [],
"github": [
"https://github.com/apify",
"https://github.com/apify-projects"
],
"youtube": ["https://www.youtube.com/apify"],
"discord": ["https://discord.com/invite/crawlee-apify-801163717915574323"],
"tiktok": [],
"crunchbase": ["https://www.crunchbase.com/organization/apify"],
"wellfound_angellist": [],
"glassdoor": ["https://www.glassdoor.com/Overview/Working-at-Apify-EI_IE3100324.11,16.htm"],
"trustpilot": ["https://www.trustpilot.com/review/apify.com"],
"reddit": ["https://www.reddit.com/r/apify"],
"medium": [],
"slack_community": [],
"other_social_links": []
}
}

Note: Phone numbers and social links depend on what the company publishes publicly. Some runs may include a phone number when it is listed on trusted public business listings; others may return an empty phone_numbers array.


Company intelligence report (deepScraping: true)

Includes all contact profile fields, plus:

SectionContents
company_vitalsAdds description_long, year_founded, employee_count_range, industry_sector, estimated_revenue, legal_entity_type, stock_symbol, exchange
leadership_and_governanceexecutive_team, board_of_directors (name, title, LinkedIn where found)
financial_intelligenceFunding rounds, investors, totals; public market fields when applicable
product_and_innovationPlatform overview, core offerings, AI/automation initiatives
market_positioningPricing strategy summary, target audience, competitors
technical_infrastructureSecurity headers, SSL, DNS/MX, technology stack groupings
workforce_insightsDepartment headcount estimates, languages
digital_presenceSEO meta signals, community/review scores where available

Example: apify.com (detailed report)

Input:

{
"domains": ["apify.com"],
"maxPages": 10,
"deepScraping": true
}

Excerpt — company vitals & contacts (same contact block as above, with extended vitals):

{
"domain": "apify.com",
"company_vitals": {
"official_name": "Apify",
"tagline": "Apify: Full-stack web scraping and data extraction platform",
"description_long": "Apify is a cloud platform specializing in full-stack web scraping, browser automation, and data extraction, particularly for AI applications. It provides a robust ecosystem for developers and businesses to collect real-time web data at scale. The platform features the Apify Store, a marketplace offering over 34,000 ready-made automation tools, known as Actors…",
"description_short": "Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 34,000+ ready-made tools, code templates, or order a custom solution.",
"year_founded": 2016,
"employee_count_range": "51-200",
"industry_sector": "information technology & services",
"estimated_revenue": 13300000,
"legal_entity_type": "Private Company",
"stock_symbol": null,
"exchange": null
},
"contact_and_locations": {
"discovery_emails": [
"hello@apify.com",
"privacy@apify.com",
"support@apify.com",
"legal@apify.com",
"security@apify.com"
],
"phone_numbers": ["+1 925 399 8290"],
"global_offices": [
{
"type": "Headquarters",
"address": "Vodičkova 704/36, Nové Město, 110 00 Praha-Praha 1, Czechia",
"map_link": "https://maps.google.com/?cid=4018967042655020156"
}
]
}
}

Excerpt — leadership:

{
"leadership_and_governance": {
"executive_team": [
{
"name": "Jan Čurn",
"title": "CEO",
"linkedin": "https://www.linkedin.com/in/jancurn",
"other_profiles": []
},
{
"name": "Ondra Urban",
"title": "COO & GTM",
"linkedin": "https://www.linkedin.com/in/ondra-urban",
"other_profiles": []
},
{
"name": "Simona Baxa",
"title": "Head of People",
"linkedin": "https://www.linkedin.com/in/simonabaxa",
"other_profiles": []
}
],
"board_of_directors": [
{
"name": "Jan Čurn",
"title": "Board Member",
"details": "Board Member"
}
]
}
}

Excerpt — financial intelligence:

{
"financial_intelligence": {
"public_company_info": {
"ipo_date": "",
"market_cap": null,
"valuation_history": ""
},
"funding_rounds": [
{
"round": "Venture (Round not Specified)",
"amount": 2800000,
"date": "2024-04-01",
"lead_investors": ["J&T Ventures"]
},
{
"round": "Seed",
"amount": 850000,
"date": "2019-10-01",
"lead_investors": ["Reflex Capital"]
}
],
"total_funding_raised": 4480000,
"key_investors": [
"J&T Ventures",
"Reflex Capital",
"Incomming Ventures",
"Y Combinator"
]
}
}

Excerpt — products & market:

{
"product_and_innovation": {
"platform_overview": "Apify is a cloud platform for web scraping, browser automation, AI agents, and data extraction for AI applications…",
"core_offerings": [
{
"name": "Web Scraping",
"description": "Tools and services for extracting structured data from any website."
},
{
"name": "Browser Automation",
"description": "Automating repetitive tasks and interactions within web browsers."
},
{
"name": "AI Agents",
"description": "AI-powered tools for intelligent web browsing and data extraction."
},
{
"name": "Apify Store (Actors Marketplace)",
"description": "A marketplace with over 34,000 pre-built automation tools and scrapers."
}
],
"ai_and_automation": {
"strategy_overview": "Apify leverages AI and automation to enable efficient data extraction…",
"specific_tools": [
{
"name": "Apify Actors",
"description": "Customizable automation tools for web scraping and data processing.",
"key_features": []
}
],
"ai_partnerships": []
},
"mergers_and_acquisitions": []
},
"market_positioning": {
"pricing_strategy": "Apify employs a subscription-based pricing model that includes a monthly platform fee and prepaid platform usage credits…",
"target_audience": [
"Individual developers",
"Startups",
"Mid-market to enterprise customers",
"AI builders"
],
"competitive_landscape": [
"Bright Data",
"ScraperAPI",
"Octoparse",
"Phantombuster"
],
"subscription_plans": []
}
}

Excerpt — technical & digital presence:

{
"technical_infrastructure": {
"network_audit": {
"server": "nginx",
"security_headers": {
"HSTS": true,
"XSS_Protection": "1; mode=block",
"Frame_Options": "SAMEORIGIN",
"CSP": true
}
},
"trust_and_compliance": {
"ssl_issuer": "Amazon",
"compliance_standards": ["SOC 2 Type II", "GDPR", "CCPA"],
"security_txt_url": "https://apify.com/.well-known/security.txt"
},
"technology_stack": {
"frontend": ["Next.js", "React", "TypeScript"],
"analytics": ["Google Analytics", "Mixpanel", "Segment"],
"infrastructure": ["Amazon AWS", "Kubernetes", "Nginx", "Redis"]
}
},
"workforce_insights": {
"department_headcounts": {
"engineering": 91,
"marketing": 20,
"sales": 4,
"operations": 6
},
"multilingual_support": ["English"]
},
"digital_presence": {
"seo_signals": {
"meta_title": "Apify: Full-stack web scraping and data extraction platform",
"meta_description": "Cloud platform for web scraping, browser automation, AI agents, and data for AI…",
"canonical_url": "https://apify.com",
"keywords": ["web scraping", "browser automation", "ai agents", "data extraction"]
},
"community_sentiment": [
{ "platform": "Trustpilot", "score": 4.8, "count": 525 }
]
}
}

Social links in detailed mode use the same social_ecosystem structure as the contact profile (URL string arrays per platform).


Field reference (contact profile)

company_vitals

FieldTypeDescription
official_namestringCompany name as presented publicly
taglinestringShort marketing line from the site or metadata
description_shortstringBrief company summary

contact_and_locations

FieldTypeDescription
discovery_emailsstring[]Email addresses found on the company site (brand and role addresses)
phone_numbersstring[]Phone numbers in international format when publicly available
global_officesobject[]type, address, optional map_link

social_ecosystem

Each key is a platform. Values are arrays of profile URLs (empty array if none found). Up to a few links per platform may appear when multiple official profiles exist.


Field reference (additional sections in detailed mode)

SectionKey fields
company_vitalsdescription_long, year_founded, employee_count_range, industry_sector, estimated_revenue, legal_entity_type, stock_symbol, exchange
leadership_and_governanceexecutive_team[], board_of_directors[]
financial_intelligencefunding_rounds[], total_funding_raised, key_investors, public_company_info
product_and_innovationplatform_overview, core_offerings[], ai_and_automation, mergers_and_acquisitions
market_positioningpricing_strategy, target_audience[], competitive_landscape[]
technical_infrastructurenetwork_audit, trust_and_compliance, dns_intelligence, technology_stack
workforce_insightsdepartment_headcounts, multilingual_support
digital_presenceseo_signals, community_sentiment[]

Multiple domains

  • Add multiple entries under Domains to process a list in one run.
  • Each domain is handled separately; results appear as separate dataset rows.
  • If one domain fails, others can still complete successfully.
  • For large lists, split into batches if you need predictable scheduling or rate limits on downstream systems.

Example — three companies, detailed mode:

{
"domains": ["apify.com", "amplitude.com", "openai.com"],
"maxPages": 10,
"deepScraping": true
}

You receive three dataset items—one per domain—with the same schema per row.


Free plan limits

On the Apify free plan, the Actor may automatically apply:

LimitValue
Domains per run1 (only the first domain in your list)
Max pages per domain5
Detailed company researchDisabled (contact profile only)

Paid Apify plans can use the full input: multiple domains, higher maxPages, and detailed mode. Upgrade on Apify if you see free-plan warnings in the log.


Pricing

Pricing is pay-per-event on Apify. See the Pricing tab on the Actor store for current rates.

Contact profile (deepScraping: false)

EventWhen it applies
Website page read (standard)Each company page successfully loaded (homepage plus up to maxPages additional pages)
Company profile with contactsOnce per domain when the result includes at least one email, phone number, or social profile link

Detailed company research (deepScraping: true)

EventWhen it applies
Website page read (standard)Each billable page successfully loaded (homepage, contact pages, and additional company information pages)
Company profile with contactsWhen contact or social data is present (same rule as contact profile)
Company profile with contacts (second charge)When the detailed report reaches sufficient depth across major sections (e.g. leadership, financial, product, market)—independent of the first profile charge

Higher maxPages and detailed mode increase page reads and run time. Start with maxPages: 5 and contact profile mode to estimate cost per domain.


Data quality and limitations

Public data only
The Actor reads publicly reachable web pages. It does not bypass logins, paywalls, or private databases.

Accuracy
Company websites and public listings change. Emails, leadership titles, funding figures, and competitor lists should be verified before high-stakes decisions (contracts, compliance, outbound at scale).

Coverage varies by site
Minimal sites may return mostly vitals and one email. Rich corporate sites yield more contacts, social links, and detailed sections.

Phones
Not every company publishes a phone number on their website or on public business listings. An empty phone_numbers array is normal.

Detailed mode depth
Intelligence sections depend on available public content. Sparse sites may return shorter description_long text or empty funding/leadership arrays.

No guarantee of deliverability
Email addresses are discovered on the site; the Actor does not verify that inboxes accept mail.

Compliance
You are responsible for using collected data in line with applicable laws (GDPR, CAN-SPAM, etc.) and the target website’s terms.


Tips for best results

  1. Use the root domainapify.com, not https://apify.com/pricing.
  2. Start with maxPages: 5–10 — increase only if contacts look incomplete.
  3. Use contact profile first — enable detailed mode when you need leadership, funding, or tech stack fields.
  4. B2B corporate sites work best — very small landing pages or single-page apps may return thinner profiles.
  5. Check the dataset, not only the log — the full JSON is in dataset storage.
  6. Re-run periodically — refresh quarterly for outreach lists or account research.

Frequently asked questions

Why are some social arrays empty?
The company may not link that platform, or the profile may not be detectable from public pages. Empty arrays are expected.

Why did one domain in my list fail?
The site may be down, block automated access, or the domain may be invalid. Other domains in the same run are unaffected.

Does detailed mode always return every section?
No. Sections appear when enough public signals exist. The report is structured consistently, but fields may be empty.

Can I scrape subpages or paths in the domain field?
No. Pass only the root domain; the Actor discovers relevant paths automatically.

How is this different from a simple email finder?
You get a multi-field company record—contacts, social graph, and optional firmographics—not only a single email column.


Support

For Actor-specific issues, use the Issues tab on the Apify store listing or contact the publisher through Apify.

For platform questions (runs, billing, API), see Apify documentation and support.


Changelog

VersionNotes
1.0.0Initial release: contact profiles and detailed company intelligence reports