Website Contact Extractor
Pricing
from $0.01 / 1,000 results
Website Contact Extractor
Extract emails, phone numbers, and social links from thousands of websites. Automatically scans contact pages and returns clean, export-ready contact data.
Pricing
from $0.01 / 1,000 results
Rating
5.0
(1)
Developer

Artashes Arakelyan
Actor stats
1
Bookmarked
21
Total users
8
Monthly active users
4 days ago
Last modified
Categories
Share
Website Contact Extractor & Lead Generation Extract Emails, Phones & Company Intelligence from up to 10,000 Websites Automatically The Website Contact Extractor is a scalable web intelligence Actor that automatically scans company websites and extracts contact information and business signals. Instead of manually visiting thousands of websites, the Actor automatically: • crawls company websites • discovers contact and about pages • extracts business emails • extracts phone numbers • detects company names • identifies LinkedIn company pages • detects Google Maps locations • estimates company size • classifies industry • detects country from domain The result is a clean structured dataset ready for lead generation, CRM enrichment, and market research.
Typical Use Cases Lead Generation Build prospect lists by extracting company emails and phone numbers directly from websites. CRM Data Enrichment Enhance company records with missing contact information and company intelligence signals. Market Research Analyze industry signals, company presence, and contact channels across thousands of websites. Business Development Discover new companies and identify potential outreach channels. Startup Intelligence Track emerging companies and their online presence.
How It Works The Actor uses targeted website crawling to extract valuable information quickly. Website List │ ▼ Homepage Crawl │ ▼ Page Discovery (Contact / About / Support) │ ▼ Contact Extraction (Emails / Phones) │ ▼ Company Intelligence Detection (Industry / LinkedIn / Maps) │ ▼ Structured Dataset Instead of crawling entire websites, the Actor scans only the most relevant pages, keeping runs fast and cost-efficient.
Quick Start (60-Second Test Run) Test the Actor in less than a minute using the following configuration. { "startUrls": [ {"url": "https://stripe.com"}, {"url": "https://shopify.com"}, {"url": "https://notion.so"} ], "maxSites": 3, "concurrency": 3, "maxPagesPerSite": 1, "extractSocialLinks": true, "outputCsv": true, "outputXlsx": true } After the run completes you can download: OUTPUT.csv OUTPUT.xlsx
Results Preview Example structured output produced by the Actor. { "organization_name": "Example Company", "industry": "Technology", "country": "United States", "linkedin_company_url": "https://linkedin.com/company/example", "company_size_estimate": "120", "contact_form_url": "https://example.com/contact", "google_maps_url": "https://maps.google.com/...", "input_url": "https://example.com", "final_url": "https://example.com", "best_email": "info@example.com", "best_phone": "+1 555 123 456", "emails": ["info@example.com"], "phones": ["+1 555 123 456"], "address_text": "123 Main Street, New York", "contact_page": "https://example.com/contact", "pages_scanned_count": 2, "status": "OK" } From a single website the Actor can automatically detect: • company name • emails • phone numbers • LinkedIn company page • company size estimate • contact page • physical address • industry classification • country detection
What Data Is Extracted Contact Information • business emails • phone numbers • best ranked email • best ranked phone • contact page URL • contact form URL Company Intelligence • company name • industry classification • country detection • company size estimate • physical address Online Presence • LinkedIn company page • Google Maps location • social media profiles Technical Metadata • pages scanned • final redirected URL • extraction evidence
Who This Actor Is For This Actor is designed for professionals working with large website datasets. Sales & Lead Generation Teams Automatically collect company contact information from thousands of websites. Market Research Analysts Analyze industry signals and company presence across many websites. Business Development Teams Discover new companies and contact channels. Data Enrichment & CRM Teams Enhance company records with missing data fields. Recruiters & Talent Platforms Find company contact information and recruitment signals. Data Scientists Build structured datasets of company intelligence.
Capacity Designed for large-scale website processing. Parameter Value Websites per run 10,000 Concurrent workers up to 200 Pages per website up to 10 Typical pages scanned 3–4
Recommended Configuration Typical production configuration: maxSites = 10000 concurrency = 35 maxPagesPerSite = 2 Estimated workload: 10,000 websites × ~3 pages each ≈ 30,000 page requests Typical runtime: 20–40 minutes depending on website speed.
Pricing This Actor uses event-based pricing based on both processing workload and extracted value. Event Description Price Website analyzed Charged for each website scanned $0.04 Lead found Charged when a valid email or phone is discovered $0.06 Actor start Container startup cost $0.00005
Example Cost Scenarios Small Run 100 websites analyzed 40 contacts discovered Cost: 100 × $0.04 = $4 40 × $0.06 = $2.40 Total ≈ $6.40
Medium Run 1,000 websites analyzed 300 contacts discovered Cost: 1000 × $0.04 = $40 300 × $0.06 = $18 Total ≈ $58
Large Run 10,000 websites analyzed 3,000 contacts discovered Cost: 10,000 × $0.04 = $400 3,000 × $0.06 = $180 Total ≈ $580
Input Parameters startUrls List of websites to scan. { "startUrls": [ {"url": "https://example.com"}, {"url": "https://company.org"} ] }
maxSites Maximum number of websites processed. Range: 1 – 10,000 Example "maxSites": 5000
concurrency Number of websites processed simultaneously. Recommended values: Websites Concurrency 100 10 1,000 20 5,000 30 10,000 35–40 Example "concurrency": 30
maxPagesPerSite Number of additional pages scanned beyond the homepage. Common discovered pages: • /contact • /about • /support • /locations • /impressum Allowed values: 0 – 10 Recommended: maxPagesPerSite = 2
extractSocialLinks Enable extraction of social media profiles. Extracts: • LinkedIn • Facebook • Twitter • Instagram Example "extractSocialLinks": true
useConcurrent Enable high-performance concurrent scraping. Example "useConcurrent": true
outputCsv Export results as CSV. "outputCsv": true
outputXlsx Export results as Excel. "outputXlsx": true
Output Files The Actor automatically generates export files: OUTPUT.csv OUTPUT.xlsx Files are available in the Key-Value Store.
Best Practices Start Small Test with 10–50 websites first. Increase Scale Gradually Run in stages: 100 websites 1,000 websites 5,000 websites 10,000 websites Limit Page Depth Most websites expose contact information on homepage or contact page. Recommended: maxPagesPerSite = 2
How This Actor Is Different Feature Typical Scrapers This Actor Basic HTML scraping ✓ ✓ Contact page discovery ✗ ✓ Email extraction ✓ ✓ Phone extraction ✓ ✓ Best contact ranking ✗ ✓ Company name detection ✗ ✓ Industry classification ✗ ✓ LinkedIn company detection ✗ ✓ Google Maps extraction ✗ ✓ Company size estimation ✗ ✓ High-scale concurrency limited ✓
Limitations Some websites may: • block automated requests • hide contact information • load contact data using JavaScript In these cases the Actor may return partial results.
FAQ Can this Actor crawl entire websites? No. The Actor scans only the most relevant pages to keep runs fast.
How many websites can be processed? Up to 10,000 websites per run.
Can the Actor extract emails from PDFs? No. The Actor processes HTML pages only.
Summary The Website Contact Extractor is a scalable contact intelligence engine capable of scanning thousands of websites and extracting structured company information. It combines: • intelligent crawling • contact extraction • company data enrichment • scalable concurrent processing to produce datasets suitable for: • lead generation • market research • CRM enrichment • business intelligence