Website Emails And Socials Scraper
Pricing
$3.99/month + usage
Website Emails And Socials Scraper
Only $3.99/month — Extract verified contact emails and social media links from any website. Scans up to 20 contact-related pages per url with smart detection. Returns structured data: URL info, prioritized emails, social profiles, source pages, counts, and timestamps perfect for B2B lead generation
Pricing
$3.99/month + usage
Rating
5.0
(2)
Developer

CodeScraper
Actor stats
1
Bookmarked
3
Total users
2
Monthly active users
6 days ago
Last modified
Categories
Share
⭐ Website Contact Email & Social Scraper – B2B Lead Generation
This Apify actor scrapes contact email addresses and up to 8 official social media profile links from business websites with high accuracy and intelligent filtering.
It utilizes network request interception, smart contact page detection, and advanced social profile filtering to extract emails and official social accounts from homepage DOMs, API responses, and specific "Contact Us" pages — all packaged into a structured dataset ideal for lead generation and outreach.
🚀 What It Does
For every website URL provided, the actor extracts:
🏢 Site Overview
- 🌐 Input URL (As entered)
- 🔗 Normalized URL (Standardized format)
- 📧 Total Emails Found
- 🌍 Social Media Profiles (Up to 8 platforms)
- ⏱️ Processing Duration
- 📅 Scrape Timestamp
📩 Email Data
For each email found:
✉️ Email Address
🚦 Priority Classification
- High: info@, contact@, support@ (or matches domain name)
- Medium: sales@, jobs@, press@
- Low: Personal/Specific names
🔗 Source URL (Exact page where the email was found)
🌐 Social Media Data (Up to 8 Platforms)
The actor intelligently extracts one best official profile link per platform, filtering out share links, login pages, and post URLs.
Supported Platforms
- 🐦 Twitter / X
- ▶️ YouTube
- 🎵 TikTok
- 💬 Discord
Smart Filtering Rules
- Ignores share links (e.g.,
intent/tweet,share.php) - Ignores login pages
- Ignores post/video URLs (e.g., Instagram reels, YouTube watch links)
- Selects the cleanest and shortest valid profile URL
- Deduplicates automatically
- Returns only one best candidate per platform
If a platform is not found, it simply will not appear in the socials object.
⚡ It Handles
- ✅ Multiple Website URLs (Batch processing)
- 🕵️ Smart Contact Page Detection (Finds /contact, /about, /support etc.)
- 🕸️ Network Interception (Catches emails hidden in API JSON responses)
- 🌐 Social Profile Extraction (Up to 8 platforms)
- 🛡️ Domain Scoping (Ignores 3rd party emails like gmail.com unless matched)
- 🧹 Deduplication (Ensures unique emails & social links per domain)
- 🚦 Intelligent Prioritization (Sorts emails by relevance)
⚠️ The actor crawls the Homepage and up to 20 candidate contact pages per domain to ensure deep coverage without getting stuck.
🧠 How It Works
- Normalizes input URLs to ensure valid formats.
- Crawls the homepage to extract visible emails.
- Intercepts background network requests (XHR/Fetch) to find hidden emails in JSON data.
- Detects & navigates to "Contact Us" or "About Us" pages automatically.
- Extracts and filters official social media profile links.
- Classifies emails by priority (High/Medium/Low).
- Deduplicates results to provide a clean structured dataset.
- Saves structured data to Apify Dataset.
⚙️ Input Configuration
| Field | Type | Description | Example |
|---|---|---|---|
| startUrls | Array | List of target websites | ["example.com"] |
| maxConcurrency | Integer | Max pages processed in parallel (Default: 10) | 10 |
| maxRequestRetries | Integer | Retries per page on failure (Default: 2) | 2 |
🧩 Example Input
{"startUrls": ["https://www.iana.org", "apify.com", "https://inoleds.com"],"maxConcurrency": 10,"maxRequestRetries": 2}
📊 Example Output
{"inputUrl": "https://friendlyglitter.de/","normalizedUrl": "https://friendlyglitter.de","emails": [{"email": "info@glitzer-tattoo.net","priority": "high","source": "https://friendlyglitter.de/pages/impressum"}],"socials": {"facebook": "https://www.facebook.com/YbodyGlitzerTattoo/","instagram": "https://www.instagram.com/friendly.glitter","youtube": "https://www.youtube.com/channel/UC8S3IUHg1xozhVe84rK-Xaw"},"emailCount": 1,"processTimeMs": 35013,"scrapedAt": "2026-02-16T08:14:26.284Z"}
🧠 Features
- ⭐ Strict Domain Matching (Only extracts emails belonging to the target website)
- 📊 Priority Sorting (Get the most relevant contact info first)
- 🌐 Extracts Up To 8 Official Social Profiles
- 🧹 Automatic Deduplication (Emails & Social Links)
- 🕒 Fast Execution (Optimized navigation limits)
- 🌍 Universal Support (Works on .com, .de, .org, and all TLDs)
- 📦 Structured JSON Output
💡 Use Cases
- B2B Lead Generation
- Sales Outreach Automation
- Influencer Outreach
- Brand Research
- Market Research
- Recruitment & HR Sourcing
- Directory Building
❓ FAQs
1. Does this scrape generic emails like Gmail or Yahoo?
No. The scraper enforces Strict Domain Matching. If you scrape example.com, it will only return emails ending in @example.com to ensure lead quality.
2. How does it prioritize emails?
It uses a prefix classification system:
- High: Generic contacts (info, hello) or matches the domain name.
- Medium: Department specific (sales, hr).
- Low: Everything else.
3. Does it return social share links?
No. The actor filters out share links, login pages, and post URLs. It returns only official profile URLs.
4. Why didn't it find an email?
Some sites protect emails using:
- Images instead of text
- Complex JavaScript obfuscation (though network interception catches many of these)
- Contact forms without visible email addresses
🧑💻 Developer Info
Author: codescraper Email: codescraper011@gmail.com
🏷️ Tags
website-emails-scraper · social-media-scraper · email-scraper · contact-extractor · lead-generation · b2b-leads · crawler · web-scraping · marketing-automation · sales-tools