Website Enrichment Scraper
Pricing
from $6.00 / 1,000 results
Website Enrichment Scraper extracts structured business data from company websites, including business name, inferred business type, and public email addresses. It is designed for lead enrichment, sales intelligence, and data validation workflows at scale.
Developer
Gyanendra Thakur
Website Enrichment Scraper extracts structured business intelligence directly from company websites. It is designed for lead enrichment, contact discovery, and scalable data workflows.
Overview
This actor processes provided website URLs and retrieves publicly available business information in a clean, structured format. It focuses on extracting core identity and contact details while maintaining controlled crawl limits for performance and cost efficiency.
The scraper supports limited internal page scanning (such as contact or about pages) to improve data completeness without unnecessary crawling.
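The limited internal scan can be pictured as a small link-selection step. The following is a minimal Python sketch, not the actor's actual code: the function name pick_pages, the keyword list, and the exact matching rule are assumptions made for illustration.

```python
from urllib.parse import urljoin, urlparse

# Hypothetical keywords; the actor's real heuristics are not documented.
PRIORITY_KEYWORDS = ("contact", "about")


def pick_pages(homepage: str, hrefs: list[str], max_pages: int) -> list[str]:
    """Return the homepage plus same-host contact/about links, capped at max_pages."""
    host = urlparse(homepage).netloc
    selected = [homepage]
    for href in hrefs:
        if len(selected) >= max_pages:
            break  # page budget for this website is used up
        url = urljoin(homepage, href)  # resolve relative links
        parsed = urlparse(url)
        if parsed.netloc != host:
            continue  # never leave the original host
        path = parsed.path.lower()
        if any(word in path for word in PRIORITY_KEYWORDS) and url not in selected:
            selected.append(url)
    return selected
```

With max_pages set to 1, only the homepage is returned, which matches the cheapest configuration described here.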
Extracted Data
For each website, the actor collects:
- Business Name
- Business Type (inferred from metadata and page content)
- Public Email Addresses
- Number of Pages Scanned
- Scrape Status
All results are returned in structured JSON format and can be exported to CSV or integrated into CRM systems, outreach platforms, and automation pipelines.
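If the JSON results are post-processed outside the platform, a CSV conversion could look roughly like the sketch below. It assumes each record carries the six fields listed under Output Structure and that emails is a list of strings; the function name records_to_csv and the semicolon separator are illustrative choices, not part of the actor.

```python
import csv
import io

# Field names taken from the Output Structure section.
FIELDS = ["website", "businessName", "businessType", "emails", "pagesScanned", "scrapeStatus"]


def records_to_csv(records: list[dict]) -> str:
    """Serialize dataset records to CSV text, joining emails with ';'."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for record in records:
        row = dict(record)
        # Assumption: emails is a list; flatten it into a single cell.
        row["emails"] = ";".join(record.get("emails") or [])
        writer.writerow(row)
    return buffer.getvalue()
```

Joining emails into one cell keeps one row per website, which tends to import more cleanly into CRM tools than one row per address.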
Use Cases
- Lead enrichment and verification
- Sales prospecting workflows
- Agency outreach campaigns
- Market research
- Contact database building
How It Works
The actor:
- Scans the homepage of each website.
- Extracts metadata and structured signals to determine business identity.
- Detects public email addresses using pattern recognition.
- Optionally scans limited internal pages (e.g., contact or about).
- Returns deduplicated, structured results.
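The email detection and deduplication steps can be approximated with a regular expression. This is a minimal Python sketch under stated assumptions: the pattern and the function name extract_emails are illustrative, and the actor's real pattern is not documented.

```python
import re

# Simplified pattern (an assumption): it is not a full RFC 5322 matcher
# and can produce false positives such as asset names like "logo@2x.png".
EMAIL_RE = re.compile(
    r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*\.[A-Za-z]{2,}"
)


def extract_emails(html: str) -> list[str]:
    """Return unique, lowercased email addresses in order of first appearance."""
    seen = set()
    emails = []
    for match in EMAIL_RE.findall(html):
        email = match.lower()
        if email not in seen:
            seen.add(email)
            emails.append(email)
    return emails
```

Lowercasing before deduplication treats Info@Example.com and info@example.com as the same address, which is usually what a contact list needs.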
Input
- List of website URLs
- Maximum pages to scan per website (recommended: 1–3)
Output Structure
Each dataset record includes:
- website
- businessName
- businessType
- emails
- pagesScanned
- scrapeStatus
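For consumers that want a typed view of a record, the fields above could be modeled as follows. The field names come from this section; the types, defaults, and the class name EnrichmentRecord are assumptions, since the listing does not specify them.

```python
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass
class EnrichmentRecord:
    """One dataset record; types and defaults are assumed, not documented."""

    website: str
    businessName: Optional[str] = None
    businessType: Optional[str] = None
    emails: List[str] = field(default_factory=list)
    pagesScanned: int = 0
    scrapeStatus: str = "success"  # hypothetical status value


# Illustrative values only; not real actor output.
example = EnrichmentRecord(
    website="https://example.com",
    businessName="Example Co",
    emails=["info@example.com"],
    pagesScanned=2,
)
example_dict = asdict(example)  # plain dict, ready for json.dumps
```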
Performance & Architecture
- Built with Crawlee + Cheerio for lightweight crawling
- Controlled concurrency and retry handling
- Same-domain restriction to prevent unnecessary crawling
- Optimized for scalable batch enrichment
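Controlled concurrency and retry handling are provided by the crawling framework in the actor itself. The general idea can be shown with the Python standard library alone, as in the sketch below. The names fetch_with_retry and run_batch, the "success" and "failed" status values, and the default limits are all hypothetical.

```python
import asyncio
from typing import Awaitable, Callable, List


async def fetch_with_retry(
    fetch: Callable[[str], Awaitable[str]],
    url: str,
    semaphore: asyncio.Semaphore,
    max_retries: int = 2,
) -> dict:
    """Call fetch(url) under a concurrency cap, retrying on any exception."""
    for attempt in range(max_retries + 1):
        try:
            async with semaphore:  # at most N fetches run at once
                body = await fetch(url)
            return {"website": url, "scrapeStatus": "success", "body": body}
        except Exception:
            if attempt < max_retries:
                # Placeholder yield; a real crawler would wait with backoff.
                await asyncio.sleep(0)
    return {"website": url, "scrapeStatus": "failed", "body": None}


async def run_batch(
    fetch: Callable[[str], Awaitable[str]],
    urls: List[str],
    max_concurrency: int = 5,
    max_retries: int = 2,
) -> List[dict]:
    """Process all URLs, returning results in input order."""
    semaphore = asyncio.Semaphore(max_concurrency)
    tasks = [fetch_with_retry(fetch, u, semaphore, max_retries) for u in urls]
    return await asyncio.gather(*tasks)
```

A failed website yields a record with a failure status instead of aborting the batch, so one unreachable site does not block the rest of the enrichment run.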
Notes
This actor extracts only publicly available information from websites. Data availability depends on the structure and transparency of each site.