Website Contact & Email Scraper
Pricing
from $15.00 / 1,000 company profiles with contacts
Turn company domains into structured B2B company profiles. Collect emails, phone numbers, social links, offices, leadership, funding, and company overview from public websites. One dataset row per domain—built for lead generation, sales outreach, CRM enrichment, and market research.
Developer: Coder (maintained by Community)
Website Contact & Company Data Scraper
Turn company website domains into structured business profiles using public website data. For each domain you provide, this Actor visits the company site, reads the most relevant pages, and returns a single JSON record in your Apify dataset.
Use it for lead research, account enrichment, competitive intelligence, and CRM or outreach workflows—without maintaining your own scraping stack.
What you get
| Mode | Input setting | Best for |
|---|---|---|
| Contact profile (default) | deepScraping: false | Emails, phones, offices, official social links, company name & short description |
| Company intelligence report | deepScraping: true | Everything in the contact profile plus leadership, funding, products, market positioning, technical footprint, workforce signals, and SEO/community metrics |
Each successful domain produces one dataset item (one JSON object per row).
Quick start on Apify
- Open the Actor in the Apify Console.
- Under Domains, add one or more company domains (e.g. apify.com). Use the domain only: no https://, paths, or query strings.
- Set Max pages per domain (default 5). Higher values may find more contact pages but increase run time and cost.
- Leave Detailed company research off for a fast contact-focused run, or turn it on for the full report.
- Click Start and open the Dataset tab when the run finishes.
Example input (contact profile):
{"domains": ["apify.com"],"maxPages": 10,"deepScraping": false}
Example input (full intelligence report):
{"domains": ["apify.com", "amplitude.com"],"maxPages": 10,"deepScraping": true}
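The same input can also be sent through the Apify API instead of the Console. The following is a minimal sketch using only the Python standard library, assuming the Actor is started via Apify's `run-sync-get-dataset-items` endpoint. `ACTOR_ID` is a placeholder (this README does not state the Actor's ID), and `build_run_request` and `run_actor` are hypothetical helper names.

```python
import json
import urllib.request

# Placeholder, not the real ID: copy the Actor ID (or "username~actor-name")
# from the Actor's page in the Apify Console.
ACTOR_ID = "PUBLISHER~ACTOR-NAME"


def build_run_request(run_input: dict, token: str, actor_id: str = ACTOR_ID) -> urllib.request.Request:
    """Build (but do not send) a POST request that runs the Actor synchronously."""
    url = f"https://api.apify.com/v2/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(run_input: dict, token: str, actor_id: str = ACTOR_ID) -> list:
    """Send the request and parse the returned JSON array of dataset items."""
    request = build_run_request(run_input, token, actor_id)
    with urllib.request.urlopen(request) as response:
        return json.load(response)


contact_input = {"domains": ["apify.com"], "maxPages": 10, "deepScraping": False}
request = build_run_request(contact_input, token="<YOUR_APIFY_TOKEN>")
```

The synchronous endpoint only waits a limited time for the run to finish, so long runs (many domains, detailed mode) may be better served by starting the run asynchronously and reading the dataset afterwards.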
Input parameters
Domains (required)
| Property | Value |
|---|---|
| Type | List of strings |
| Format | Root domain only, e.g. apify.com; for www.example.co.uk use example.co.uk |
| Avoid | https://apify.com/contact, /about, or email addresses |
The Actor processes each domain independently. Multiple domains can run in parallel (see Multiple domains).
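If your source list contains full URLs, a small clean-up step can bring it into the expected format before the run. This is a sketch under stated assumptions: `normalize_domain` and `normalize_domains` are hypothetical helpers, not part of the Actor, and they only strip the scheme, path, query, port, and a leading `www.`; other subdomains (for example `blog.example.com`) are left as they are.

```python
from urllib.parse import urlsplit


def normalize_domain(raw: str) -> str:
    """Reduce a URL-like string to a bare domain suitable for the Domains input."""
    value = raw.strip().lower()
    if "@" in value and "/" not in value:
        raise ValueError(f"looks like an email address, not a domain: {raw!r}")
    # urlsplit only recognises the host when a scheme or a "//" prefix is present.
    if "://" not in value:
        value = "//" + value
    host = urlsplit(value).hostname or ""
    if host.startswith("www."):
        host = host[4:]
    if "." not in host:
        raise ValueError(f"not a usable domain: {raw!r}")
    return host


def normalize_domains(raw_values: list) -> list:
    """Normalize every entry and drop duplicates while keeping the original order."""
    return list(dict.fromkeys(normalize_domain(value) for value in raw_values))


domains = normalize_domains(["https://apify.com/contact", "www.example.co.uk", "apify.com"])
```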
Max pages per domain (required)
| Property | Value |
|---|---|
| Type | Integer |
| Range | 1 – 100 |
| Default | 5 |
This controls how many additional pages are scanned on each company site (beyond the homepage). The Actor prioritizes high-value paths such as contact, about, team, support, pricing, and company information.
| maxPages | Typical effect |
|---|---|
| 5 | Fast, cost-efficient; good for quick lead checks |
| 10 | Balanced setting for production enrichment |
| 20+ | Deeper coverage; longer runs; useful for large or complex sites |
Detailed company research (deepScraping)
| Property | Value |
|---|---|
| Type | Boolean |
| Default | false |
Off (contact profile)
Focused output: company vitals (name, tagline, short description), emails, phone numbers, office locations, and social profile URLs grouped by platform.
On (company intelligence report)
Includes the contact profile and expands into leadership, financial/funding signals, products and innovation, market positioning, technical infrastructure, workforce insights, and digital presence (SEO and community ratings where available).
Read the Pricing section on the Actor store before enabling detailed mode—billing includes page reads and one or two company-profile charges per domain depending on depth and data found.
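When inputs are generated by a script, it can help to check them against the documented types and ranges before starting a run. The sketch below assumes only what the parameter tables above state (a non-empty domain list, `maxPages` between 1 and 100 with a default of 5, a boolean `deepScraping`); `build_input` is a hypothetical helper name.

```python
def build_input(domains, max_pages=5, deep_scraping=False):
    """Return an input object for the Actor, or raise ValueError if a value is out of range."""
    if not domains:
        raise ValueError("domains must contain at least one entry")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_pages, bool) or not isinstance(max_pages, int):
        raise ValueError("maxPages must be an integer")
    if not 1 <= max_pages <= 100:
        raise ValueError("maxPages must be between 1 and 100")
    if not isinstance(deep_scraping, bool):
        raise ValueError("deepScraping must be true or false")
    return {
        "domains": list(domains),
        "maxPages": max_pages,
        "deepScraping": deep_scraping,
    }


run_input = build_input(["apify.com", "amplitude.com"], max_pages=10, deep_scraping=True)
```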
What you will see during a run
The run log is written for end users: progress messages describe phases such as scanning a domain, choosing pages to read, preparing the profile, and saving results. You do not need technical logs to use the Actor.
Typical messages:
- Run started — processing N domains.
- Scanning apify.com for contact information...
- Choosing the best pages to scan...
- Reading 10 pages on apify.com...
- Preparing contact profile... (or Building company intelligence report... in detailed mode)
- Run finished — N company profiles saved to the dataset.
If a domain cannot be completed (unreachable site, blocking, or invalid domain), you will see a warning for that domain; other domains in the same run can still succeed.
Output overview
Dataset structure
- One JSON object per domain
- Field presence varies by site: not every company publishes phones, all social networks, or leadership pages
- Empty arrays ([]) or null mean no confident public signal was found, not an error
Top-level field
| Field | Description |
|---|---|
| domain | The input domain for this record |
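Because every record carries its input domain, results can be matched back to the submitted list to see which domains did not complete. This is a minimal sketch, assuming that a domain which could not be completed produces no dataset item (the README only states that each successful domain produces one); `missing_domains` is a hypothetical helper.

```python
def missing_domains(requested: list, items: list) -> list:
    """Return the requested domains that have no matching record in the dataset items."""
    returned = {item.get("domain") for item in items}
    return [domain for domain in requested if domain not in returned]


requested = ["apify.com", "amplitude.com", "openai.com"]
items = [{"domain": "apify.com"}, {"domain": "openai.com"}]
to_retry = missing_domains(requested, items)
```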
Contact profile output (deepScraping: false)
Returned sections:
| Section | Contents |
|---|---|
| company_vitals | official_name, tagline, description_short |
| contact_and_locations | discovery_emails, phone_numbers, global_offices |
| social_ecosystem | Official profile URLs per platform (arrays of strings) |
Social platforms include: LinkedIn, X (Twitter), Facebook, Instagram, GitHub, YouTube, Discord, TikTok, Crunchbase, Wellfound/AngelList, Glassdoor, Trustpilot, Reddit, Medium, Slack community, and other_social_links.
Example: apify.com (contact profile)
Input:
{"domains": ["apify.com"],"maxPages": 10,"deepScraping": false}
Excerpt of dataset output:
{"domain": "apify.com","company_vitals": {"official_name": "Apify","tagline": "Apify: Full-stack web scraping and data extraction platform","description_short": "Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 34,000+ ready-made tools, code templates, or order a custom solution."},"contact_and_locations": {"discovery_emails": ["hello@apify.com","legal@apify.com","privacy@apify.com","support@apify.com","security@apify.com"],"phone_numbers": [],"global_offices": [{"type": "Headquarters","address": "Vodičkova 704/36, Nové Město, 110 00 Praha-Praha 1, Czechia","map_link": "https://maps.google.com/?cid=4018967042655020156"}]},"social_ecosystem": {"linkedin": ["https://www.linkedin.com/company/apify"],"twitter_x": ["https://x.com/apify"],"facebook": [],"instagram": [],"github": ["https://github.com/apify","https://github.com/apify-projects"],"youtube": ["https://www.youtube.com/apify"],"discord": ["https://discord.com/invite/crawlee-apify-801163717915574323"],"tiktok": [],"crunchbase": ["https://www.crunchbase.com/organization/apify"],"wellfound_angellist": [],"glassdoor": ["https://www.glassdoor.com/Overview/Working-at-Apify-EI_IE3100324.11,16.htm"],"trustpilot": ["https://www.trustpilot.com/review/apify.com"],"reddit": ["https://www.reddit.com/r/apify"],"medium": [],"slack_community": [],"other_social_links": []}}
Note: Phone numbers and social links depend on what the company publishes publicly. Some runs may include a phone number when it is listed on trusted public business listings; others may return an empty phone_numbers array.
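For CRM or outreach imports, the nested contact profile usually needs to be flattened. The sketch below turns dataset items into one CSV row per email address, using only field names shown in the example above; the column layout and the helper names `contact_rows` and `to_csv` are illustrative choices, not something the Actor produces.

```python
import csv
import io

COLUMNS = ["domain", "company", "email", "phones"]


def contact_rows(items: list) -> list:
    """Flatten dataset items into rows; a domain without emails still gets one row."""
    rows = []
    for item in items:
        vitals = item.get("company_vitals") or {}
        contacts = item.get("contact_and_locations") or {}
        # Empty arrays or null mean "nothing found", so fall back to a blank email.
        emails = contacts.get("discovery_emails") or [""]
        phones = "; ".join(contacts.get("phone_numbers") or [])
        for email in emails:
            rows.append({
                "domain": item.get("domain", ""),
                "company": vitals.get("official_name") or "",
                "email": email,
                "phones": phones,
            })
    return rows


def to_csv(rows: list) -> str:
    """Serialize the flattened rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```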
Company intelligence report (deepScraping: true)
Includes all contact profile fields, plus:
| Section | Contents |
|---|---|
| company_vitals | Adds description_long, year_founded, employee_count_range, industry_sector, estimated_revenue, legal_entity_type, stock_symbol, exchange |
| leadership_and_governance | executive_team, board_of_directors (name, title, LinkedIn where found) |
| financial_intelligence | Funding rounds, investors, totals; public market fields when applicable |
| product_and_innovation | Platform overview, core offerings, AI/automation initiatives |
| market_positioning | Pricing strategy summary, target audience, competitors |
| technical_infrastructure | Security headers, SSL, DNS/MX, technology stack groupings |
| workforce_insights | Department headcount estimates, languages |
| digital_presence | SEO meta signals, community/review scores where available |
Example: apify.com (detailed report)
Input:
{"domains": ["apify.com"],"maxPages": 10,"deepScraping": true}
Excerpt — company vitals & contacts (same contact block as above, with extended vitals):
{"domain": "apify.com","company_vitals": {"official_name": "Apify","tagline": "Apify: Full-stack web scraping and data extraction platform","description_long": "Apify is a cloud platform specializing in full-stack web scraping, browser automation, and data extraction, particularly for AI applications. It provides a robust ecosystem for developers and businesses to collect real-time web data at scale. The platform features the Apify Store, a marketplace offering over 34,000 ready-made automation tools, known as Actors…","description_short": "Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 34,000+ ready-made tools, code templates, or order a custom solution.","year_founded": 2016,"employee_count_range": "51-200","industry_sector": "information technology & services","estimated_revenue": 13300000,"legal_entity_type": "Private Company","stock_symbol": null,"exchange": null},"contact_and_locations": {"discovery_emails": ["hello@apify.com","privacy@apify.com","support@apify.com","legal@apify.com","security@apify.com"],"phone_numbers": ["+1 925 399 8290"],"global_offices": [{"type": "Headquarters","address": "Vodičkova 704/36, Nové Město, 110 00 Praha-Praha 1, Czechia","map_link": "https://maps.google.com/?cid=4018967042655020156"}]}}
Excerpt — leadership:
{"leadership_and_governance": {"executive_team": [{"name": "Jan Čurn","title": "CEO","linkedin": "https://www.linkedin.com/in/jancurn","other_profiles": []},{"name": "Ondra Urban","title": "COO & GTM","linkedin": "https://www.linkedin.com/in/ondra-urban","other_profiles": []},{"name": "Simona Baxa","title": "Head of People","linkedin": "https://www.linkedin.com/in/simonabaxa","other_profiles": []}],"board_of_directors": [{"name": "Jan Čurn","title": "Board Member","details": "Board Member"}]}}
Excerpt — financial intelligence:
{"financial_intelligence": {"public_company_info": {"ipo_date": "","market_cap": null,"valuation_history": ""},"funding_rounds": [{"round": "Venture (Round not Specified)","amount": 2800000,"date": "2024-04-01","lead_investors": ["J&T Ventures"]},{"round": "Seed","amount": 850000,"date": "2019-10-01","lead_investors": ["Reflex Capital"]}],"total_funding_raised": 4480000,"key_investors": ["J&T Ventures","Reflex Capital","Incomming Ventures","Y Combinator"]}}
Excerpt — products & market:
{"product_and_innovation": {"platform_overview": "Apify is a cloud platform for web scraping, browser automation, AI agents, and data extraction for AI applications…","core_offerings": [{"name": "Web Scraping","description": "Tools and services for extracting structured data from any website."},{"name": "Browser Automation","description": "Automating repetitive tasks and interactions within web browsers."},{"name": "AI Agents","description": "AI-powered tools for intelligent web browsing and data extraction."},{"name": "Apify Store (Actors Marketplace)","description": "A marketplace with over 34,000 pre-built automation tools and scrapers."}],"ai_and_automation": {"strategy_overview": "Apify leverages AI and automation to enable efficient data extraction…","specific_tools": [{"name": "Apify Actors","description": "Customizable automation tools for web scraping and data processing.","key_features": []}],"ai_partnerships": []},"mergers_and_acquisitions": []},"market_positioning": {"pricing_strategy": "Apify employs a subscription-based pricing model that includes a monthly platform fee and prepaid platform usage credits…","target_audience": ["Individual developers","Startups","Mid-market to enterprise customers","AI builders"],"competitive_landscape": ["Bright Data","ScraperAPI","Octoparse","Phantombuster"],"subscription_plans": []}}
Excerpt — technical & digital presence:
{"technical_infrastructure": {"network_audit": {"server": "nginx","security_headers": {"HSTS": true,"XSS_Protection": "1; mode=block","Frame_Options": "SAMEORIGIN","CSP": true}},"trust_and_compliance": {"ssl_issuer": "Amazon","compliance_standards": ["SOC 2 Type II", "GDPR", "CCPA"],"security_txt_url": "https://apify.com/.well-known/security.txt"},"technology_stack": {"frontend": ["Next.js", "React", "TypeScript"],"analytics": ["Google Analytics", "Mixpanel", "Segment"],"infrastructure": ["Amazon AWS", "Kubernetes", "Nginx", "Redis"]}},"workforce_insights": {"department_headcounts": {"engineering": 91,"marketing": 20,"sales": 4,"operations": 6},"multilingual_support": ["English"]},"digital_presence": {"seo_signals": {"meta_title": "Apify: Full-stack web scraping and data extraction platform","meta_description": "Cloud platform for web scraping, browser automation, AI agents, and data for AI…","canonical_url": "https://apify.com","keywords": ["web scraping", "browser automation", "ai agents", "data extraction"]},"community_sentiment": [{ "platform": "Trustpilot", "score": 4.8, "count": 525 }]}}
Social links in detailed mode use the same social_ecosystem structure as the contact profile (URL string arrays per platform).
Field reference (contact profile)
company_vitals
| Field | Type | Description |
|---|---|---|
| official_name | string | Company name as presented publicly |
| tagline | string | Short marketing line from the site or metadata |
| description_short | string | Brief company summary |
contact_and_locations
| Field | Type | Description |
|---|---|---|
| discovery_emails | string[] | Email addresses found on the company site (brand and role addresses) |
| phone_numbers | string[] | Phone numbers in international format when publicly available |
| global_offices | object[] | type, address, optional map_link |
social_ecosystem
Each key is a platform. Values are arrays of profile URLs (an empty array if none were found). A platform may list more than one link when the company has multiple official profiles.
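Since every platform maps to an array, downstream code can treat the whole block uniformly. A short sketch, assuming the structure described above; `social_links` is a hypothetical helper that tolerates a missing block or null values.

```python
def social_links(item: dict) -> list:
    """Return (platform, url) pairs for every profile link found in a record."""
    ecosystem = item.get("social_ecosystem") or {}
    return [
        (platform, url)
        for platform, urls in ecosystem.items()
        for url in (urls or [])
    ]


record = {
    "domain": "apify.com",
    "social_ecosystem": {
        "linkedin": ["https://www.linkedin.com/company/apify"],
        "facebook": [],
        "github": ["https://github.com/apify", "https://github.com/apify-projects"],
    },
}
links = social_links(record)
```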
Field reference (additional sections in detailed mode)
| Section | Key fields |
|---|---|
| company_vitals | description_long, year_founded, employee_count_range, industry_sector, estimated_revenue, legal_entity_type, stock_symbol, exchange |
| leadership_and_governance | executive_team[], board_of_directors[] |
| financial_intelligence | funding_rounds[], total_funding_raised, key_investors, public_company_info |
| product_and_innovation | platform_overview, core_offerings[], ai_and_automation, mergers_and_acquisitions |
| market_positioning | pricing_strategy, target_audience[], competitive_landscape[] |
| technical_infrastructure | network_audit, trust_and_compliance, dns_intelligence, technology_stack |
| workforce_insights | department_headcounts, multilingual_support |
| digital_presence | seo_signals, community_sentiment[] |
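Detailed reports keep a consistent structure even when a section holds no data, so it can be useful to check which sections actually carry values before relying on them. The sketch below treats null, empty strings, and empty or all-empty containers as "no data"; that rule, and the helpers `is_empty` and `populated_sections`, are assumptions made for illustration rather than behaviour defined by the Actor.

```python
DETAILED_SECTIONS = [
    "company_vitals",
    "leadership_and_governance",
    "financial_intelligence",
    "product_and_innovation",
    "market_positioning",
    "technical_infrastructure",
    "workforce_insights",
    "digital_presence",
]


def is_empty(value) -> bool:
    """True for None, "", and for lists or dicts whose contents are all empty."""
    if value is None or value == "":
        return True
    if isinstance(value, list):
        return all(is_empty(entry) for entry in value)
    if isinstance(value, dict):
        return all(is_empty(entry) for entry in value.values())
    return False


def populated_sections(item: dict) -> list:
    """List the report sections of a record that contain at least one real value."""
    return [name for name in DETAILED_SECTIONS if not is_empty(item.get(name))]
```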
Multiple domains
- Add multiple entries under Domains to process a list in one run.
- Each domain is handled separately; results appear as separate dataset rows.
- If one domain fails, others can still complete successfully.
- For large lists, split into batches if you need predictable scheduling or rate limits on downstream systems.
Example — three companies, detailed mode:
{"domains": ["apify.com", "amplitude.com", "openai.com"],"maxPages": 10,"deepScraping": true}
You receive three dataset items—one per domain—with the same schema per row.
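Splitting a large list into batches, as suggested above, can be done with a short generator. This is a sketch; the `batches` helper and the batch size are illustrative, and the README does not recommend a specific size.

```python
def batches(domains: list, size: int):
    """Yield consecutive slices of at most `size` domains, preserving order."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(domains), size):
        yield domains[start:start + size]


all_domains = ["apify.com", "amplitude.com", "openai.com", "example.com", "example.org"]
run_inputs = [
    {"domains": chunk, "maxPages": 10, "deepScraping": True}
    for chunk in batches(all_domains, 2)
]
```

Each entry in `run_inputs` is a complete input object for one run, so batches can be scheduled independently.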
Free plan limits
On the Apify free plan, the Actor may automatically apply:
| Limit | Value |
|---|---|
| Domains per run | 1 (only the first domain in your list) |
| Max pages per domain | 5 |
| Detailed company research | Disabled (contact profile only) |
Paid Apify plans can use the full input: multiple domains, higher maxPages, and detailed mode. Upgrade on Apify if you see free-plan warnings in the log.
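To preview what a free-plan run would actually process, the limits in the table can be applied to an input locally. This only mirrors the documented values (first domain only, at most 5 pages, contact profile mode); `apply_free_plan_limits` is a hypothetical helper, and the Actor's own enforcement may differ in detail.

```python
def apply_free_plan_limits(run_input: dict) -> dict:
    """Return the input as the free-plan limits in the table above would reduce it."""
    return {
        "domains": list(run_input.get("domains", []))[:1],  # only the first domain
        "maxPages": min(run_input.get("maxPages", 5), 5),   # capped at 5 pages
        "deepScraping": False,                              # contact profile only
    }


requested = {"domains": ["apify.com", "amplitude.com"], "maxPages": 10, "deepScraping": True}
effective = apply_free_plan_limits(requested)
```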
Pricing
Pricing is pay-per-event on Apify. See the Pricing tab on the Actor store for current rates.
Contact profile (deepScraping: false)
| Event | When it applies |
|---|---|
| Website page read (standard) | Each company page successfully loaded (homepage plus up to maxPages additional pages) |
| Company profile with contacts | Once per domain when the result includes at least one email, phone number, or social profile link |
Detailed company research (deepScraping: true)
| Event | When it applies |
|---|---|
| Website page read (standard) | Each billable page successfully loaded (homepage, contact pages, and additional company information pages) |
| Company profile with contacts | When contact or social data is present (same rule as contact profile) |
| Company profile with contacts (second charge) | When the detailed report reaches sufficient depth across major sections (e.g. leadership, financial, product, market)—independent of the first profile charge |
Higher maxPages and detailed mode increase page reads and run time. Start with maxPages: 5 and contact profile mode to estimate cost per domain.
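The event tables above translate into a simple upper-bound estimate. The sketch below is an assumption-laden illustration: `page_read_rate` is not stated anywhere in this README and must be taken from the Pricing tab, `profile_rate` defaults to the listed "from $15.00 / 1,000" figure (which may vary by plan), and detailed mode is assumed to read the same maximum number of pages as contact mode.

```python
def estimate_max_cost(
    domain_count: int,
    max_pages: int,
    page_read_rate: float,
    profile_rate: float = 15.00 / 1000,
    deep_scraping: bool = False,
) -> float:
    """Upper bound in USD: every page loads and every profile charge applies."""
    # Homepage plus up to max_pages additional pages per domain.
    page_reads = domain_count * (1 + max_pages)
    # One profile charge per domain, or two when the detailed report is deep enough.
    profile_charges = domain_count * (2 if deep_scraping else 1)
    return page_reads * page_read_rate + profile_charges * profile_rate


# The page-read rate here is a made-up example value, not the Actor's actual price.
budget = estimate_max_cost(domain_count=100, max_pages=5, page_read_rate=0.001)
```

Actual charges are typically lower, because pages that fail to load and domains without any contact data do not trigger the corresponding events.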
Data quality and limitations
Public data only
The Actor reads publicly reachable web pages. It does not bypass logins, paywalls, or private databases.
Accuracy
Company websites and public listings change. Emails, leadership titles, funding figures, and competitor lists should be verified before high-stakes decisions (contracts, compliance, outbound at scale).
Coverage varies by site
Minimal sites may return mostly vitals and one email. Rich corporate sites yield more contacts, social links, and detailed sections.
Phones
Not every company publishes a phone number on their website or on public business listings. An empty phone_numbers array is normal.
Detailed mode depth
Intelligence sections depend on available public content. Sparse sites may return shorter description_long text or empty funding/leadership arrays.
No guarantee of deliverability
Email addresses are discovered on the site; the Actor does not verify that inboxes accept mail.
Compliance
You are responsible for using collected data in line with applicable laws (GDPR, CAN-SPAM, etc.) and the target website’s terms.
Tips for best results
- Use the root domain: apify.com, not https://apify.com/pricing.
- Start with maxPages: 5–10, and increase only if contacts look incomplete.
- Use contact profile first: enable detailed mode when you need leadership, funding, or tech stack fields.
- B2B corporate sites work best: very small landing pages or single-page apps may return thinner profiles.
- Check the dataset, not only the log: the full JSON is in dataset storage.
- Re-run periodically: refresh quarterly for outreach lists or account research.
Frequently asked questions
Why are some social arrays empty?
The company may not link that platform, or the profile may not be detectable from public pages. Empty arrays are expected.
Why did one domain in my list fail?
The site may be down, block automated access, or the domain may be invalid. Other domains in the same run are unaffected.
Does detailed mode always return every section?
No. Sections appear when enough public signals exist. The report is structured consistently, but fields may be empty.
Can I scrape subpages or paths in the domain field?
No. Pass only the root domain; the Actor discovers relevant paths automatically.
How is this different from a simple email finder?
You get a multi-field company record—contacts, social graph, and optional firmographics—not only a single email column.
Support
For Actor-specific issues, use the Issues tab on the Apify store listing or contact the publisher through Apify.
For platform questions (runs, billing, API), see Apify documentation and support.
Changelog
| Version | Notes |
|---|---|
| 1.0.0 | Initial release: contact profiles and detailed company intelligence reports |