Automae Email Extractor

Developer: Theo Jim (Maintained by Community)
Pricing: Pay per event
Last modified: 18 days ago
🕵️‍♂️ Intelligent Email Extractor

An advanced Apify crawler to automatically extract email addresses from any website, with anti-detection protection and Cloudflare decoding.

✨ Features

🔍 Multi-Source Extraction

  • Mailto links: Direct extraction from mailto: links
  • Cloudflare protection: Automatic decoding of data-cfemail-encoded emails
  • Smart regex: Email detection in HTML content
  • Metadata: Extraction from <meta> tags
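For context on the Cloudflare case: the protection stores a hex string in the `data-cfemail` attribute, where the first byte is an XOR key applied to every following byte. A minimal standalone decoder, as a sketch (the name `decodeCfEmail` is illustrative, not taken from the actor's source), could look like:

```javascript
// Decode a Cloudflare data-cfemail value: the first hex byte is an XOR key;
// XORing each subsequent hex byte with the key yields one character.
function decodeCfEmail(encoded) {
  const key = parseInt(encoded.slice(0, 2), 16);
  let email = "";
  for (let i = 2; i < encoded.length; i += 2) {
    email += String.fromCharCode(parseInt(encoded.slice(i, i + 2), 16) ^ key);
  }
  return email;
}

// Example: "422302206c21" decodes to "a@b.c" (key 0x42).
```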

🎯 Intelligent Navigation

  • Contact pages: Automatic detection of contact pages via keywords
  • Multilingual keywords: French, English, and German support
  • Smart scoring: Prioritization of the most relevant pages
  • Configurable limits: Control the number of pages to analyze
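One way to picture the scoring step, as an illustrative sketch (the keyword list here is truncated, and `scoreLink` is a placeholder name assuming that earlier keywords are considered more relevant):

```javascript
// Illustrative scoring: a link whose path matches an earlier keyword in the
// list scores higher, so /contact-us outranks /faq.
const contactKeywords = ["contact", "kontakt", "mentions", "about", "faq"];

function scoreLink(href) {
  const path = new URL(href).pathname.toLowerCase();
  const idx = contactKeywords.findIndex((kw) => path.includes(kw));
  return idx === -1 ? -1 : contactKeywords.length - idx; // higher = more relevant
}

const links = ["https://example.com/faq", "https://example.com/contact-us"];
links.sort((a, b) => scoreLink(b) - scoreLink(a));
// links[0] → "https://example.com/contact-us"
```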

🛡️ Anti-Detection Protection

  • Fingerprinting: Realistic browser fingerprint
  • Random delays: Avoids detectable navigation patterns
  • Human-like headers: Realistic User-Agent and headers
  • Session management: Session pool to avoid bans

📧 Filtering and Prioritization

  • Whitelist: Priority emails (contact@, hello@, info@, etc.)
  • Blacklist: Filters out unwanted emails (no-reply@, etc.)
  • Validation: Email validity verification
  • Deduplication: Automatic duplicate removal
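The combined behavior can be sketched in a few lines (a simplified illustration, not the actor's exact code):

```javascript
// Deduplicate, drop blacklisted patterns, then sort whitelisted
// role addresses (contact@, hello@, ...) to the front.
function filterAndPrioritize(emails, blacklist, whitelist) {
  const unique = [...new Set(emails.map((e) => e.toLowerCase()))]; // deduplicate
  const kept = unique.filter((e) => !blacklist.some((p) => e.includes(p)));
  return kept.sort(
    (a, b) =>
      Number(whitelist.some((p) => b.startsWith(p))) -
      Number(whitelist.some((p) => a.startsWith(p)))
  );
}

// filterAndPrioritize(
//   ["no-reply@x.com", "Contact@x.com", "bob@x.com", "contact@x.com"],
//   ["no-reply@"], ["contact@"]
// ) → ["contact@x.com", "bob@x.com"]
```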

🚀 Installation

# Clone the project
git clone <repository-url>
cd apify-get-emails-from-site
# Install dependencies
npm install
# Install Playwright (automatic via postinstall)
npx playwright install --with-deps chromium

📖 Usage

Input Configuration

{
  "baseUrl": "https://example.com",
  "maxContactPages": 2,
  "navigationTimeoutMs": 30000,
  "blacklist": ["spam@", "test@", "@example.org"]
}

Parameters:

  • baseUrl (required): Base URL to analyze
  • maxContactPages (optional, default: 2): Maximum number of contact pages to analyze
  • navigationTimeoutMs (optional, default: 30000): Navigation timeout in ms
  • blacklist (optional, default: []): Array of email patterns to filter out. These patterns are added to the default blacklist (no-reply@, noreply@, donotreply@, @mail.com); any email containing one of them is excluded from the results.

Execution

# Local execution
npm start
# Or directly
node main.js

Output

{
  "hit": true,
  "primaryEmail": "contact@example.com",
  "domain": "example.com",
  "emails": ["contact@example.com", "info@example.com"],
  "sourceUrl": "https://example.com/contact",
  "scanned": ["https://example.com", "https://example.com/contact"],
  "baseUrl": "https://example.com"
}

🔧 Advanced Configuration

Contact Keywords

The crawler automatically detects contact pages via these keywords:

const contactKeywords = [
  "contact", "contact-us", "contactez", "kontakt", "mentions",
  "mentions-legales", "legal", "imprint", "impressum", "privacy",
  "privacy-policy", "confidentialite", "support", "help", "aide",
  "about", "a-propos", "team", "equipe", "cgu", "cgv", "faq"
];

Priority Emails

const roleWhitelist = [
  "contact@", "hello@", "info@", "support@", "sales@",
  "partners@", "partnership@", "team@", "hi@", "help@"
];

Filtered Emails

The crawler uses a default blacklist to filter unwanted emails:

const roleBlacklist = [
  "no-reply@", "noreply@", "donotreply@", "@mail.com"
];

You can extend this blacklist by providing a custom blacklist parameter in the input:

{
  "baseUrl": "https://example.com",
  "blacklist": ["spam@", "test@", "@example.org"]
}

Any email containing a pattern from either the default blacklist or your custom blacklist will be filtered out.

๐Ÿ—๏ธ Architecture

Execution Flow

  1. Initialization: Crawler and queue configuration
  2. Navigation: Main page loading
  3. Extraction: Multi-source email analysis
  4. Decision: If emails are found → stop; otherwise → visit contact pages
  5. Result: Return prioritized emails
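The flow above can be sketched as follows (the function names and the `extract`/`findContactLinks` callbacks are placeholders, not the actor's actual API):

```javascript
// Simplified sketch of the execution flow: scan the base page first,
// stop as soon as emails are found, otherwise visit contact pages.
function crawl(baseUrl, maxContactPages, extract, findContactLinks) {
  const queue = [baseUrl];
  const scanned = [];
  while (queue.length > 0 && scanned.length < 1 + maxContactPages) {
    const url = queue.shift();
    scanned.push(url);
    const emails = extract(url); // step 3: multi-source extraction
    if (emails.length > 0) {
      return { hit: true, emails, sourceUrl: url, scanned }; // step 4: stop early
    }
    if (url === baseUrl) {
      // step 4, no hit: enqueue the highest-scoring contact pages
      queue.push(...findContactLinks(url).slice(0, maxContactPages));
    }
  }
  return { hit: false, emails: [], scanned };
}
```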

Anti-Detection Protection

  • Limited concurrency: 1 tab at a time
  • Random delays: 300-800 ms before navigation, 200-600 ms after
  • Fingerprinting: Unique browser fingerprint
  • Realistic headers: Recent Chrome User-Agent
  • Error handling: Automatic retry on failures
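The random-delay pattern can be as simple as the following sketch (`randomDelayMs` and `sleep` are illustrative helper names, not part of the actor's code):

```javascript
// Pick a random delay within an inclusive range, e.g. 300-800 ms.
const randomDelayMs = (min, max) =>
  Math.floor(min + Math.random() * (max - min + 1));
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Usage inside a crawler handler:
// await sleep(randomDelayMs(300, 800)); // before navigation
// await page.goto(url);
// await sleep(randomDelayMs(200, 600)); // after navigation
```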

📊 Performance

  • Configurable timeout: Avoids blocking
  • Request limiting: Server load control
  • Smart stop: Stops as soon as a valid email is found
  • Deduplication: Avoids redundant analysis

🛠️ Dependencies

  • apify (^3.5.0): Scraping framework
  • crawlee (^3.15.1): Crawling library
  • playwright (^1.46.0): Browser automation

📝 Technical Notes

  • ES6 modules: Modern import/export usage
  • Error handling: Try/catch for robustness
  • Email validation: Regex and domain verification
  • Relative URLs: Automatic link resolution
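A validation check of the kind mentioned above might look like this simplified sketch (the actor's actual regex may differ):

```javascript
// Simplified email-shape check: local part, "@", domain with a TLD.
const EMAIL_RE = /^[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}$/;
const isValidEmail = (e) => EMAIL_RE.test(e);

// isValidEmail("contact@example.com") → true
// isValidEmail("not-an-email")        → false
```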

🤝 Contributing

Contributions are welcome! Feel free to:

  • Report bugs
  • Suggest improvements
  • Add new contact keywords
  • Optimize performance

📄 License

This project is licensed under the MIT License.