Pricing

from $3.00 / 1,000 successful data extractions

German Imprint Scraper + Email Validation

An Actor for German websites that detects Impressum (imprint) pages, extracts company details and contact data, and optionally verifies the email addresses it finds. Results are returned as structured JSON, suitable for lead generation at scale.

Rating

5.0 (3)

Developer

Winning Solutions

Last modified

21 days ago

Features

🤖 AI-Powered Link Detection: Uses AI to find imprint page links when traditional parsing fails

🧠 Smart Contact Extraction: Extracts comprehensive contact and company information

🇩🇪 German Website Optimization: Specifically optimized for German website structures and imprint pages

🛡️ Robust Error Handling: Built-in retry mechanisms and comprehensive error handling

📊 Detailed Output: Structured data with contact person, company details, and legal information

🌐 Proxy Support: Optional proxy configuration for enhanced reliability

🔍 Advanced Email Validation

When enabled, the Actor performs comprehensive email validation using multiple verification methods:

Syntax validation - Ensures proper email format
Domain existence check - Verifies the domain is active and reachable
MX record validation - Confirms the domain can receive emails
Disposable email detection - Filters out temporary/disposable email addresses
Role-based email detection - Identifies generic addresses (info@, admin@, etc.)
Email alias detection - Detects aliases for major providers (Gmail, Yahoo, Outlook/Hotmail)
Typo suggestions - Provides corrections for common email typos
Real-time monitoring - Live validation status tracking

Together, these checks help filter out invalid, disposable, and low-value addresses, so your lead generation campaigns start from emails that are more likely to be deliverable.
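To make a few of these checks concrete, here is a minimal offline sketch of syntax validation, role-based detection, and Gmail alias detection. It is not the Actor's implementation: the function name `check_email`, the simplified regular expression, and the `ROLE_LOCAL_PARTS` list are assumptions made for illustration. Domain, MX, and disposable-address checks need network lookups or external lists and are left out.

```python
import re

# Simplified syntax pattern (assumption); real validators cover far more of RFC 5322.
EMAIL_RE = re.compile(r"^[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}$")

# Hypothetical list of role-based local parts; the Actor's own list is not documented.
ROLE_LOCAL_PARTS = {"info", "admin", "kontakt", "office", "support", "sales"}


def check_email(address: str) -> dict:
    """Run three offline checks: syntax, role-based address, Gmail alias."""
    address = address.strip().lower()
    if not EMAIL_RE.match(address):
        return {"valid_syntax": False, "role_based": False, "canonical": None}

    local, domain = address.rsplit("@", 1)
    canonical = address
    if domain in {"gmail.com", "googlemail.com"}:
        # Gmail ignores dots and anything after "+" in the local part.
        base = local.split("+", 1)[0].replace(".", "")
        canonical = f"{base}@gmail.com"

    return {
        "valid_syntax": True,
        "role_based": local in ROLE_LOCAL_PARTS,
        "canonical": canonical,
    }


if __name__ == "__main__":
    print(check_email("info@example.de"))
    print(check_email("Max.Mustermann+news@gmail.com"))
```

An address whose `canonical` form differs from the input is an alias of another mailbox, which is useful for deduplicating leads.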

Input

The Actor accepts the following input parameters:

targetUrls (required): Array of objects, each with a url field, listing the websites to scrape for contact information
skipResultsWithoutEmail (optional): When enabled, results without an email address will not be saved to the dataset (default: false)
maxRetries (optional): Maximum number of retry attempts for failed requests (default: 3)
timeout (optional): Request timeout in seconds (default: 5)
proxyConfiguration (optional): Configure proxy settings for your crawler
validateEmail (optional): Enable email validation to ensure deliverability (default: false)

Input Example

{
  "targetUrls": [
    {"url": "https://example.de"},
    {"url": "https://another-site.de"}
  ],
  "skipResultsWithoutEmail": false,
  "maxRetries": 3,
  "timeout": 5,
  "validateEmail": true,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
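If you build the input programmatically, a small helper can apply the documented defaults and catch malformed URLs before a run starts. The sketch below only mirrors the shape of the example above; the helper name `build_run_input` and its keyword arguments are hypothetical, and the URL check is an assumption about what counts as a usable target, not a rule stated by the Actor.

```python
from urllib.parse import urlparse


def build_run_input(urls, *, skip_results_without_email=False, max_retries=3,
                    timeout=5, validate_email=False, use_apify_proxy=False):
    """Build an input dict in the shape of the documented input example."""
    if not urls:
        raise ValueError("targetUrls is required and must not be empty")

    target_urls = []
    for url in urls:
        parsed = urlparse(url)
        # Assumption: targets should be absolute http(s) URLs.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"not an absolute http(s) URL: {url!r}")
        target_urls.append({"url": url})

    return {
        "targetUrls": target_urls,
        "skipResultsWithoutEmail": skip_results_without_email,
        "maxRetries": max_retries,
        "timeout": timeout,  # seconds
        "validateEmail": validate_email,
        "proxyConfiguration": {"useApifyProxy": use_apify_proxy},
    }


if __name__ == "__main__":
    import json
    print(json.dumps(build_run_input(["https://example.de"], validate_email=True), indent=2))
```

The resulting dictionary can be serialized to JSON and pasted into the Actor's input editor or passed as the run input through the Apify API.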

Output Structure

The Actor returns structured data for each processed URL. Here's the complete output schema:

Field	Type	Description	Example Value
`imprint_url`	string	URL of the detected Impressum page	`"https://example.de/impressum"`
`contact_person.first_name`	string	First name of the contact person	`"Max"`
`contact_person.last_name`	string	Last name of the contact person	`"Mustermann"`
`contact_person.salutation`	string	Salutation (Herr/Frau/Dr.)	`"Herr"`
`company_name`	string	Full official company name	`"Example GmbH"`
`company_address.street`	string	Street name	`"Musterstraße"`
`company_address.house_number`	string	House number	`"123"`
`company_address.postalcode`	string	Postal code	`"12345"`
`company_address.city`	string	City name	`"Musterstadt"`
`phone_number`	string	Contact phone number	`"+49 123 456789"`
`email`	string	Contact email address	`"info@example.de"`
`email_status`	string	Email validation status (only if validation enabled)	`"DELIVERABLE"`
`register_number`	string	Commercial register number / Handelsregisternummer	`"HRB 12345"`
`vat_id`	string	VAT ID number / Umsatzsteuer-Id	`"DE123456789"`
`retryTriggered`	boolean	Whether retry was triggered during scraping	`false`
`retryReasons`	array	Array of reasons why retry was triggered (optional)	`["email is empty"]`
`_metadata.websiteProcessed`	boolean	Whether the website was successfully processed	`true`
`_metadata.resultCharged`	boolean	Whether this result was charged	`true`
`_metadata.emailValidated`	boolean	Whether email validation was performed	`true`
`_metadata.limitReached`	boolean	Whether usage limit was reached	`false`
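The dotted field names in the table can be read as paths into the nested JSON result. For CSV or spreadsheet export, a record can be flattened into exactly those names. This is a minimal sketch; the `flatten` helper is hypothetical and not part of the Actor, and the sample record is abbreviated.

```python
def flatten(record: dict, prefix: str = "") -> dict:
    """Flatten nested dicts into dotted keys, e.g. company_address.city."""
    flat = {}
    for key, value in record.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            # Recurse into nested objects such as contact_person or _metadata.
            flat.update(flatten(value, path))
        else:
            # Scalars and lists (e.g. retryReasons) are kept as-is.
            flat[path] = value
    return flat


if __name__ == "__main__":
    sample = {
        "company_name": "Example GmbH",
        "company_address": {"city": "Musterstadt", "postalcode": "12345"},
        "retryReasons": [],
        "_metadata": {"resultCharged": True},
    }
    for name, value in flatten(sample).items():
        print(name, "=", value)
```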

Output Example

{
  "imprint_url": "https://example.de/impressum",
  "contact_person": {
    "first_name": "Max",
    "last_name": "Mustermann",
    "salutation": "Herr"
  },
  "company_name": "Example GmbH",
  "company_address": {
    "street": "Musterstraße",
    "house_number": "123",
    "postalcode": "12345",
    "city": "Musterstadt"
  },
  "phone_number": "+49 123 456789",
  "email": "info@example.de",
  "email_status": "DELIVERABLE",
  "register_number": "HRB 12345",
  "vat_id": "DE123456789",
  "retryTriggered": false,
  "retryReasons": [],
  "_metadata": {
    "websiteProcessed": true,
    "resultCharged": true,
    "emailValidated": true,
    "limitReached": false
  }
}
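Once a run has finished, the `_metadata` and `email_status` fields make it easy to summarize a batch of results. The sketch below counts charged results, estimates the cost, and collects the emails marked as deliverable. It rests on several assumptions: that each result with `resultCharged` set to true is one billed extraction, that the listed starting price of $3.00 per 1,000 applies (the actual rate may be higher), and that `"DELIVERABLE"` is the status of interest, since it is the only value shown here. The `summarize` function is hypothetical.

```python
# Assumption: the "from $3.00 / 1,000" starting price from the listing.
PRICE_PER_1000_USD = 3.00


def summarize(items: list) -> dict:
    """Count charged results, estimate cost, and collect deliverable emails."""
    charged = sum(
        1 for item in items
        if item.get("_metadata", {}).get("resultCharged")
    )
    deliverable = [
        item["email"] for item in items
        if item.get("email") and item.get("email_status") == "DELIVERABLE"
    ]
    return {
        "charged_results": charged,
        "estimated_cost_usd": round(charged * PRICE_PER_1000_USD / 1000, 4),
        "deliverable_emails": deliverable,
    }


if __name__ == "__main__":
    results = [
        {"email": "info@example.de", "email_status": "DELIVERABLE",
         "_metadata": {"resultCharged": True}},
        {"email": "", "_metadata": {"resultCharged": False}},
    ]
    print(summarize(results))
```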
