Pricing

from $5.00 / 1,000 results

Structured Business Data Extractor

Extracts structured business information from company websites for research and data enrichment. Converts public website content into clean, machine-readable data such as company name, contact details, and metadata. Intended for research, CRM enrichment, and company profiling. No outreach.

Pricing

from $5.00 / 1,000 results

Rating

0.0

(0)

Developer

Leoncio Jr Coronado

Actor stats

Bookmarked

Total users

Monthly active users

a day ago

Last modified

Crawls homepage and relevant internal pages (contact, about, support)

Extracts validated email addresses and phone numbers

Detects the company or organization name

Tracks pages that were scanned

Outputs clean, structured records to an Apify dataset

To ensure reliability and Store safety, the Actor always produces at least one dataset item, even when no contact data is found.

This guarantees stable automation and auto-test compatibility.

👥 Who This Actor Is For

This Actor is designed for:

CRM & RevOps teams enriching company records

Researchers and analysts building structured datasets

Founders and operators profiling businesses

Developers building lead generation and automation pipelines

✅ Key Features

🟢 Python-based (stable, maintainable, production-ready)

🟢 Smart internal page discovery (contact, about, support pages)

🟢 Strict email and phone validation

🟢 Placeholder and low-quality data filtering

🟢 Honest status reporting when data is unavailable

🟢 Dataset is never empty (auto-test safe)

🟢 Apify Store compliant and automation-ready

📥 Input Required

Start URLs List of public website URLs to scan.

Optional

Max pages – Maximum number of internal pages to scan per site (default: 5)

Example Input { "start_urls": [ { "url": "https://www.iana.org/contact" } ], "max_pages": 5 }

📤 Output

The Actor writes results to the default dataset with the following structure:

Example Output { "company_name": "IANA", "website": "https://www.iana.org", "emails": ["iana@iana.org"], "email_status": "found", "phones": ["+14242542545"], "phone_status": "found", "pages_checked": [ "https://www.iana.org", "https://www.iana.org/contact" ], "industry": null, "linkedin_company": null, "scrape_status": "completed", "scraped_at": "2026-02-22T03:53:38Z" }

Output Fields

Field	Description
company_name	Detected company name
website	Website processed
emails	Extracted email addresses
email_status	found / not_found
phones	Normalized phone numbers
phone_status	found / not_found
pages_checked	Pages scanned during extraction
industry	Reserved for enrichment (optional)
linkedin_company	Reserved for enrichment (optional)
scrape_status	Run status
scraped_at	UTC timestamp

🧠 How It Works

Loads the homepage
Discovers relevant internal pages
Scans visible content
Extracts emails and phones using robust patterns
Applies validation and filtering
Normalizes formats
Saves structured output to dataset

The workflow is designed for transparency, data quality, and long-term reliability.

⚠️ Limitations & Notes

Works only on publicly accessible websites

No CAPTCHA bypassing

Accuracy depends on website structure

This Actor prioritizes stability over aggressive scraping.

🛡️ Legal & Ethical Use

You are responsible for complying with:

Website terms of service

Ethical data usage standards

Use this Actor only for legitimate business purposes.

⭐ Recommended Use Cases

CRM enrichment

Business research

Lead validation

Market research

Company profiling

Data pipeline enrichment

🔧 Customization

This Actor can be extended for:

Deeper multi-page crawling

Clay / API enrichment

CSV / Excel pipelines

Custom validation rules

Monitoring and alerting

You may fork or customize it for advanced workflows.

✅ Status

Production-ready · Store-safe · Auto-test compliant · Reliability-focused

👤 Author

Leoncio U. Coronado Jr Data Automation & Web Scraping Engineer Apify Verified Actor Developer

website-business-data-extractor

keratogenous_surgeon/website-business-data-extractor

Extract structured business information from company websites into clean JSON. Automatically finds company name, description, contact details, address, and social links from user-provided websites. Built for automation, data enrichment, and Apify pipelines.

King Shepherd

LinkedIn Company Information Scrapper

zerobreak/linkedin-company-information-scrapper

LinkedIn Company Information Scraper that extracts detailed company data, employees, posts, and business insights from any LinkedIn company page. Ideal for market research, competitor analysis, sales intelligence, and business profiling.

ZeroBreak

Company Enrichment

vivid_astronaut/company-enrichment

Enrich company data from domain names. Get company details including name, industry, employee count, location, and social profiles. Perfect for lead enrichment, sales intelligence, and B2B data analysis.

Fabio Suizu

Website Enrichment Scraper

gtgyani206/website-enrichment-scraper

Website Enrichment Scraper extracts structured business intelligence from any website, including business name, category, and verified email addresses. Designed for lead enrichment, sales intelligence, and data validation workflows at scale.

Gyanendra Thakur

Linkedin Company About Scraper

scrapio/linkedin-company-about-scraper

Scrapes the About section of any public LinkedIn company page, capturing company description, industry, size, specialties, headquarters, founding year, and website. Ideal for market research, lead enrichment, competitive analysis, and automated company profiling

Scrapio

Company Domain

meetwithyash/company-domain

Find any company's official website and social media profiles instantly. AI-powered search and verification for accurate results. Perfect for lead generation, market research, and CRM data enrichment.

Yash Khokhaneshiya

Lead Enrichment Tool

ghewaretech/lead-enrichment-tool

Transform company URLs into actionable sales intelligence. Extract contact info, tech stack, social profiles, and business signals from any company website.

Unisuraksha Tracking Systems Pvt Ltd

LinkedIn Company Search Scraper

powerai/linkedin-company-search-scraper

Extract company information from LinkedIn with detailed metadata including company profiles, size, industry, and more. Perfect for market research, competitor analysis, and business development.

PowerAI

383

1.8

Company Detail Scraper for LinkedIn (No Cookies)

apimaestro/linkedin-company-detail

Extract detailed LinkedIn company data instantly. Get company overview, employee count, locations, funding info, and more. Perfect for market research, lead generation, and competitor analysis. Clean, structured data ready for your business needs.

API Maestro

3.6K

4.7

Trustpilot Business & Review Data Extractor

dataraptor/trustpilot-business-review-data-extractor

Powerful and flexible Trustpilot scraper to extract business information, company details, and customer reviews. Supports scraping reviews by business or category, customizable limits, and advanced filtering. Ideal for market research, sentiment analysis, lead generation, and data enrichment.

DataRaptor

5.0