Website Contact Scraper - AI-Powered Lead Finder

Pricing

Pay per event

Developed by Timo Sieber

Maintained by Community

AI-powered website scraper that extracts real contact data from company sites! Finds people, positions, emails & phone numbers using LLM technology. Scans team pages, contact sections & company info. Perfect for B2B lead generation and sales research.

Total users: 5
Monthly users: 5
Runs succeeded: >99%
Last modified: 21 days ago

Advanced Contact and Company Data Scraper with LLM (JavaScript)

This advanced template scrapes company websites using AI-powered analysis to extract structured contact information and company data. The actor uses OpenAI's GPT models to intelligently identify real people, their positions, and contact details while filtering out marketing content and placeholder text.

Unlike basic scrapers that rely on pattern matching, this actor leverages Large Language Models (LLMs) to understand context and extract only genuine contact information from Swiss and German company websites.

Key Features

  • Apify SDK – toolkit for building Actors
  • Input schema – validates input parameters including OpenAI API key
  • Dataset – stores structured company and contact data
  • Axios client – HTTP client with retry logic and proper headers
  • Cheerio – HTML parsing and DOM manipulation
  • OpenAI API – GPT models for intelligent content analysis
  • Multi-page crawling – discovers and analyzes multiple pages from the same domain
  • Smart URL evaluation – AI-powered scoring of page relevance for contact data
  • Swiss/German phone formatting – proper formatting for +41 and +49 numbers
  • Aggressive content filtering – removes navigation, ads, and irrelevant content
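
The sketch below shows how these building blocks typically fit together in the Actor's main script. It is illustrative only: the Apify SDK calls are real, but the phase functions (collectAllUrls, evaluateUrlsWithLLM, scrapeRelevantPages) are the ones described under "How it works" below, and the exact wiring may differ from the published code.

import { Actor } from 'apify';

await Actor.init();

const {
    url,
    openaiApiKey,
    maxPages = 50,
    llmModel = 'gpt-3.5-turbo',
    minRelevanceScore = 7,
} = await Actor.getInput();

try {
    // Phase 1: discover candidate URLs on the target domain
    const pages = await collectAllUrls(url, maxPages);

    // Phase 2: let the LLM score each page for contact-data relevance
    const scored = await evaluateUrlsWithLLM(pages, openaiApiKey, llmModel);

    // Phase 3 + 4: scrape high-scoring pages, then validate and deduplicate
    const relevant = scored.filter((p) => p.score >= minRelevanceScore);
    const result = await scrapeRelevantPages(relevant, openaiApiKey, llmModel);

    await Actor.pushData(result);
} catch (err) {
    // Error output mirrors the error-handling example further down
    await Actor.pushData({
        title: 'Fehler beim Scraping',
        companyName: 'Fehler beim Scraping',
        website: url,
        generalEmail: null,
        generalPhone: null,
        contacts: [],
        error: err.message,
    });
}

await Actor.exit();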

How it works

Phase 1: URL Discovery

  1. collectAllUrls() crawls the website starting from the main URL
  2. Discovers internal links up to specified depth (default: 2 levels)
  3. Filters out non-content URLs (images, PDFs, etc.)
  4. Collects page titles and H1 tags for context
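
A minimal sketch of this discovery phase, using the Axios and Cheerio dependencies listed above. The queue handling and the file-extension filter are assumptions based on the description, not the Actor's exact implementation:

import axios from 'axios';
import * as cheerio from 'cheerio';

// Breadth-first crawl of internal links up to maxDepth, skipping non-content URLs.
async function collectAllUrls(startUrl, maxPages = 50, maxDepth = 2) {
    const origin = new URL(startUrl).origin;
    const seen = new Set([startUrl]);
    const pages = [];
    const frontier = [{ url: startUrl, depth: 0 }];

    while (frontier.length > 0 && pages.length < maxPages) {
        const { url, depth } = frontier.shift();

        let html;
        try {
            ({ data: html } = await axios.get(url, {
                timeout: 15000,
                headers: { 'User-Agent': 'Mozilla/5.0 (compatible; ContactScraper)' },
            }));
        } catch (err) {
            continue; // skip pages that fail to load
        }

        const $ = cheerio.load(html);
        // Collect title and H1 so the LLM has context for scoring (Phase 2)
        pages.push({ url, title: $('title').text().trim(), h1: $('h1').first().text().trim() });

        if (depth >= maxDepth) continue;

        $('a[href]').each((_, a) => {
            try {
                const link = new URL($(a).attr('href'), url);
                const sameDomain = link.origin === origin;
                const isContent = !/\.(jpe?g|png|gif|svg|pdf|zip|docx?)$/i.test(link.pathname);
                if (sameDomain && isContent && !seen.has(link.href)) {
                    seen.add(link.href);
                    frontier.push({ url: link.href, depth: depth + 1 });
                }
            } catch (e) {
                // ignore malformed hrefs
            }
        });
    }

    return pages;
}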

Phase 2: Intelligent Page Evaluation

  1. evaluateUrlsWithLLM() uses GPT to score each discovered URL
  2. Evaluates likelihood of containing contact information (0-10 scale)
  3. Prioritizes pages like "Team", "Kontakt", "About Us", "Impressum"
  4. Homepage always receives high priority score
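
A hedged sketch of this scoring step using the official openai Node.js client. The prompt wording, the JSON reply format, and the homepage handling are illustrative assumptions; only the general approach (GPT rates each URL on a 0-10 scale) comes from the description above:

import OpenAI from 'openai';

async function evaluateUrlsWithLLM(pages, apiKey, model = 'gpt-3.5-turbo') {
    const openai = new OpenAI({ apiKey });

    const list = pages
        .map((p, i) => `${i}: ${p.url} | ${p.title} | ${p.h1}`)
        .join('\n');

    const response = await openai.chat.completions.create({
        model,
        temperature: 0,
        messages: [{
            role: 'user',
            content:
                'Rate each page from 0 to 10 for how likely it is to contain contact ' +
                'information (pages like "Team", "Kontakt", "About Us", "Impressum" score high). ' +
                'Answer only with JSON: [{"index": 0, "score": 9}, ...]\n\n' + list,
        }],
    });

    // A production version would guard against non-JSON replies
    const scores = JSON.parse(response.choices[0].message.content);

    return pages.map((p, i) => ({
        ...p,
        // The homepage (index 0) always receives a high priority score
        score: i === 0 ? 10 : (scores.find((s) => s.index === i)?.score ?? 0),
    }));
}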

Phase 3: AI-Powered Content Extraction

  1. scrapeRelevantPages() processes only high-scoring pages
  2. extractContactsWithLLM() uses GPT to identify real people:
    • Filters out marketing slogans and placeholder text
    • Recognizes proper name structures (first and last name)
    • Identifies positions and contact details near names
    • Removes fake names like "Max Mustermann"
  3. extractCompanyInfo() finds general company contact data:
    • mailto: and tel: links
    • JSON-LD structured data
    • Footer and header contact information
  4. extractCompanyNameWithLLM() determines official company name
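
Of these steps, the company-data extraction is the most mechanical, so here is an illustrative Cheerio sketch of it: mailto:/tel: links plus schema.org JSON-LD. Field names and fallback order are assumptions; the LLM-based steps (extractContactsWithLLM, extractCompanyNameWithLLM) follow the same chat-completion pattern as Phase 2 and are omitted here.

import * as cheerio from 'cheerio';

// Illustrative sketch of extractCompanyInfo(): general contact data from a page.
function extractCompanyInfo(html) {
    const $ = cheerio.load(html);
    const info = { companyName: null, generalEmail: null, generalPhone: null };

    // mailto: and tel: links (footer and header included)
    const mailto = $('a[href^="mailto:"]').first().attr('href');
    const tel = $('a[href^="tel:"]').first().attr('href');
    if (mailto) info.generalEmail = mailto.replace(/^mailto:/i, '').split('?')[0].trim();
    if (tel) info.generalPhone = tel.replace(/^tel:/i, '').trim();

    // JSON-LD structured data (schema.org Organization)
    $('script[type="application/ld+json"]').each((_, el) => {
        try {
            const data = JSON.parse($(el).contents().text());
            const org = Array.isArray(data)
                ? data.find((d) => d['@type'] === 'Organization')
                : data;
            if (org && org['@type'] === 'Organization') {
                info.companyName = info.companyName || org.name || null;
                info.generalEmail = info.generalEmail || org.email || null;
                info.generalPhone = info.generalPhone || org.telephone || null;
            }
        } catch (e) {
            // ignore invalid JSON-LD blocks
        }
    });

    return info;
}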

Phase 4: Data Validation and Deduplication

  1. Validates email addresses and phone numbers
  2. Formats Swiss phone numbers (+41 xx xxx xx xx)
  3. Deduplicates contacts by name (allows multiple people with same email)
  4. Filters out invalid or placeholder data
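
A sketch of this validation step under the same caveat: the exact normalization and deduplication rules in the Actor may differ. It shows one plausible way to format a Swiss number into the +41 xx xxx xx xx grouping and to deduplicate contacts by name while keeping people who share an email address:

// Normalize a Swiss phone number to the +41 xx xxx xx xx grouping.
function formatSwissPhone(raw) {
    const digits = raw.replace(/[^\d+]/g, '');
    // '062 849 49 49' -> '+41628494949'; already-international numbers pass through
    const intl = digits.startsWith('0') ? `+41${digits.slice(1)}` : digits;
    const match = intl.match(/^\+41(\d{2})(\d{3})(\d{2})(\d{2})$/);
    return match ? `+41 ${match[1]} ${match[2]} ${match[3]} ${match[4]}` : raw;
}

// Deduplicate by name only, so two people sharing info@ stay separate contacts.
function dedupeContacts(contacts) {
    const byName = new Map();
    for (const contact of contacts) {
        const key = contact.name.trim().toLowerCase();
        if (!byName.has(key)) byName.set(key, contact);
    }
    return [...byName.values()];
}

// formatSwissPhone('062 849 49 49') -> '+41 62 849 49 49'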

Input schema

{
    "type": "object",
    "properties": {
        "url": {
            "type": "string",
            "description": "The target website to scrape (must be a non-empty string)."
        },
        "maxPages": {
            "type": "integer",
            "description": "Maximum number of pages to process.",
            "default": 50
        },
        "openaiApiKey": {
            "type": "string",
            "description": "OpenAI API key (must start with 'sk-'). Required for LLM analysis."
        },
        "llmModel": {
            "type": "string",
            "description": "OpenAI model to use for analysis.",
            "default": "gpt-3.5-turbo"
        },
        "minRelevanceScore": {
            "type": "integer",
            "description": "Minimum relevance score for pages to be scraped (0-10).",
            "default": 7
        },
        "useJavaScript": {
            "type": "boolean",
            "description": "Enable JavaScript rendering (currently not implemented).",
            "default": true
        },
        "title": {
            "type": "string",
            "description": "Optional custom title for the output.",
            "default": null
        }
    },
    "required": ["url", "openaiApiKey"]
}

Required Parameters:

  • url (string): The website to scrape
  • openaiApiKey (string): Valid OpenAI API key starting with "sk-"

Optional Parameters:

  • maxPages (integer): Maximum pages to analyze (default: 50)
  • llmModel (string): OpenAI model (default: "gpt-3.5-turbo")
  • minRelevanceScore (integer): Minimum page score to scrape (default: 7)
  • useJavaScript (boolean): Enable JavaScript rendering (default: true; currently not implemented)
  • title (string): Custom title for the output

Example output

The scraper produces one comprehensive dataset item per run:

{
    "title": "IBS Haustechnik AG",
    "companyName": "IBS Haustechnik AG",
    "website": "https://ibs-haustechnik.ch",
    "generalEmail": "info@ibs-haustechnik.ch",
    "generalPhone": "062 849 49 49",
    "contacts": [
        {
            "name": "Gabriel Ziegler",
            "position": "Geschäftsführer",
            "email": "gabriel.ziegler@ibs-haustechnik.ch",
            "phone": "078 966 88 41"
        },
        {
            "name": "Anna Meier",
            "position": "Marketing Manager",
            "email": "anna.meier@ibs-haustechnik.ch",
            "phone": null
        }
    ]
}

Error handling:

If errors occur, the output includes error information:

{
    "title": "Fehler beim Scraping",
    "companyName": "Fehler beim Scraping",
    "website": "https://invalid-url.test",
    "generalEmail": null,
    "generalPhone": null,
    "contacts": [],
    "error": "getaddrinfo ENOTFOUND invalid-url.test"
}

Key Differences from Basic Scraper

Advanced AI-Powered Features:

  • LLM Content Analysis: Uses GPT to understand context and identify real people
  • Intelligent Filtering: Removes marketing content, slogans, and placeholder text
  • Multi-page Discovery: Automatically finds and evaluates relevant pages
  • Company Data Extraction: Finds official company names and general contact info
  • Swiss/German Optimization: Specialized for DACH region websites and phone formats

Validation and Quality:

  • Strict Name Validation: Filters out fake names and marketing terms
  • Email Validation: Removes placeholder and invalid email addresses
  • Phone Formatting: Proper Swiss (+41) and German (+49) number formatting
  • Content Cleaning: Aggressive removal of navigation, ads, and irrelevant content

Development and local testing

  1. Clone the Actor

    apify pull <ActorId>
    cd <ActorDirectory>
  2. Install dependencies

    npm install
  3. Set up OpenAI API Key

    • Get an API key from OpenAI
    • Add it to your INPUT.json file
  4. Configure input

    {
        "url": "https://example-company.ch",
        "openaiApiKey": "sk-your-api-key-here",
        "maxPages": 20,
        "minRelevanceScore": 7
    }
  5. Run locally

    npx apify run
  6. Inspect output

     Check apify_storage/datasets/default/*.json for the results.

Performance and Costs

Processing Time:

  • Single page: ~30-60 seconds
  • Full website (10-20 pages): ~5-15 minutes
  • Large websites (50+ pages): ~20-45 minutes

OpenAI API Costs:

  • GPT-3.5-turbo: ~$0.10-0.50 per website
  • GPT-4: ~$2.00-10.00 per website
  • Cost depends on website size and content complexity

Rate Limiting:

  • Built-in delays between requests (1-2 seconds)
  • Respecting the target website's robots.txt is recommended
  • OpenAI API rate limits handled automatically
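
The 1-2 second delay can be as simple as a randomized sleep before each request; this is a sketch of the idea, not the Actor's exact implementation:

import axios from 'axios';

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Wait a random 1-2 seconds before each page request to avoid hammering the site.
async function politeGet(url) {
    await sleep(1000 + Math.random() * 1000);
    return axios.get(url, { timeout: 15000 });
}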

Best Practices

For optimal results:

  1. Use specific URLs: Point to company websites with clear contact sections
  2. Adjust relevance score: Lower for smaller sites, higher for large corporate sites
  3. Monitor API usage: Track OpenAI costs, especially with GPT-4
  4. Test with sample sites: Verify output quality before bulk processing

Common use cases:

  • Lead generation: Extract contacts from prospect company websites
  • Data enrichment: Enhance existing company databases
  • Market research: Analyze competitor team structures
  • Sales automation: Build contact lists for outreach campaigns