Pricing

Pay per event

German Imprint Leads Scraper

Extract German Impressum legal contacts, company details, VAT IDs, HRB records, emails, and decision-makers from domains.

Developer

Stas Persiianenko

Last modified

25 days ago

What does German Imprint Leads Scraper do?

It visits each submitted domain, checks common German legal-contact pages such as /impressum, /service/impressum, /imprint, and /kontakt, follows likely footer links, and saves one structured lead record per domain.
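
The path check described above can be sketched roughly as follows. This is a minimal illustration, not the actor's actual code: the function name `candidate_imprint_urls` is hypothetical, and the real path list, ordering, and footer-link scoring may differ.

```python
from urllib.parse import urljoin, urlsplit

# Paths named in the description above; the actor's real list and order may differ.
COMMON_IMPRINT_PATHS = ["/impressum", "/service/impressum", "/imprint", "/kontakt"]


def candidate_imprint_urls(start_url: str) -> list[str]:
    """Return likely legal-contact URLs for the site that hosts start_url."""
    # Accept bare domains such as "dm.de" by assuming https.
    full_url = start_url if "://" in start_url else f"https://{start_url}"
    parts = urlsplit(full_url)
    root = f"{parts.scheme}://{parts.netloc}"
    return [urljoin(root, path) for path in COMMON_IMPRINT_PATHS]


for url in candidate_imprint_urls("https://www.rewe.de"):
    print(url)
```

Footer-link discovery is not shown here, since it depends on parsing each page's HTML.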

Who is it for?

🧑‍💼 Sales teams enriching German B2B account lists
🧾 Compliance teams checking public company disclosures
🧲 Lead-generation agencies building Germany-specific datasets
🧑‍💻 Recruiters finding company decision makers
🧹 CRM operations teams normalizing German legal contacts

Why use it?

German websites often place high-value company data in the Impressum instead of on a marketing contact page. This actor targets that legal-contact workflow directly instead of returning generic page text.

What data can it extract?

Field	Description
`inputUrl`	Submitted domain or URL
`imprintUrl`	Best Impressum/contact page found
`companyName`	Legal company name when detected
`legalForm`	GmbH, AG, KG, UG, e.K., and similar forms
`address`	Registered or legal address snippet
`emails`	Public email addresses
`phoneNumbers`	Public phone numbers
`vatId`	German VAT ID / USt-IdNr
`registrationCourt`	Amtsgericht / register court
`registrationNumber`	HRB/HRA registration number
`managingDirectors`	Geschäftsführer, Vorstand, or similar names
`responsiblePerson`	Responsible person when disclosed
`confidenceFlags`	Flags showing which important fields were found
`sourceSnippets`	Text snippets for verification
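
Before loading records into a CRM, it can help to sanity-check the identifier fields. The sketch below is a downstream helper, not part of the actor. It assumes `vatId` and `registrationNumber` arrive as plain strings, and it checks format only (German VAT IDs are "DE" plus nine digits), not whether the ID actually exists. Both function names are hypothetical.

```python
import re

# Format-only pattern: German VAT IDs are "DE" followed by nine digits.
VAT_ID_RE = re.compile(r"DE\d{9}")
# Register numbers as described in the table: HRB or HRA followed by digits.
REGISTER_RE = re.compile(r"HR[AB]\s*\d+")


def normalize_vat_id(raw):
    """Strip spaces and dots; return the VAT ID, or None if the format is off."""
    compact = re.sub(r"[\s.]", "", raw or "").upper()
    return compact if VAT_ID_RE.fullmatch(compact) else None


def looks_like_register_number(raw):
    """True when the value starts with HRB or HRA followed by a number."""
    return bool(REGISTER_RE.match((raw or "").strip().upper()))


print(normalize_vat_id("DE 812 706 034"))
print(looks_like_register_number("HRB 66773"))
```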

How much does it cost to extract German Impressum leads?

The actor uses pay-per-event pricing with a small start fee and a per-result fee. The currently configured prices are:

Event	Free	Bronze	Silver	Gold	Platinum	Diamond
Run start	$0.005	$0.005	$0.005	$0.005	$0.005	$0.005
Result extracted	$0.0006508	$0.00056591	$0.00044141	$0.00033955	$0.00022636	$0.00015845

Example estimates before Apify platform fees: 100 extracted domains cost about $0.070 on Free, $0.062 on Bronze, and $0.039 on Gold, including the start event. The default two-domain prefill costs about $0.0063 on Free, so it stays suitable for a quick first test.
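
The estimates above follow from one start event plus one result event per extracted domain. A small calculator, using the prices from the table and assuming every submitted domain is billed as one result (platform fees excluded), could look like this:

```python
# Prices copied from the pricing table above (USD per event).
RUN_START_FEE = 0.005
RESULT_FEE = {
    "Free": 0.0006508,
    "Bronze": 0.00056591,
    "Silver": 0.00044141,
    "Gold": 0.00033955,
    "Platinum": 0.00022636,
    "Diamond": 0.00015845,
}


def estimate_cost(domains: int, tier: str = "Free") -> float:
    """One run-start event plus one result event per extracted domain."""
    return RUN_START_FEE + domains * RESULT_FEE[tier]


for tier in ("Free", "Bronze", "Gold"):
    print(f"{tier}: ${estimate_cost(100, tier):.3f}")
```

`estimate_cost` is a hypothetical helper name; update the constants if the configured pricing changes.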

Input

Provide domains or URLs in startUrls.

{
  "startUrls": [
    { "url": "https://www.rewe.de" },
    { "url": "https://www.dm.de" }
  ],
  "maxPagesPerDomain": 8,
  "includeSubpages": true,
  "proxyConfiguration": { "useApifyProxy": false }
}
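
If your list lives in a spreadsheet or text file, the input object can be generated instead of written by hand. The sketch below is one way to do that. `build_input` is a hypothetical helper, and prefixing bare domains with `https://` is an assumption made here for consistency, not a documented requirement.

```python
import json


def build_input(domains, max_pages=8):
    """Turn a list of domains or URLs into the input shape shown above."""
    start_urls = []
    for domain in domains:
        domain = domain.strip()
        if not domain:
            continue  # skip blank rows from a pasted list
        has_scheme = domain.startswith(("http://", "https://"))
        start_urls.append({"url": domain if has_scheme else f"https://{domain}"})
    return {
        "startUrls": start_urls,
        "maxPagesPerDomain": max_pages,
        "includeSubpages": True,
        "proxyConfiguration": {"useApifyProxy": False},
    }


print(json.dumps(build_input(["rewe.de", "https://www.dm.de"]), indent=2))
```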

Output

Each dataset item represents one submitted domain or URL.

{
  "inputUrl": "https://www.rewe.de",
  "inputDomain": "rewe.de",
  "imprintUrl": "https://www.rewe.de/service/impressum/",
  "status": "found",
  "companyName": "REWE Markt GmbH",
  "legalForm": "GmbH",
  "emails": ["impressum@rewe.de"],
  "vatId": "DE812706034",
  "registrationNumber": "HRB 66773",
  "confidenceFlags": ["company_name_found", "email_found"]
}

How to use it

Prepare a list of German domains or websites.
Paste them into the Start URLs field.
Keep maxPagesPerDomain low for quick enrichment.
Run the actor.
Export the dataset as JSON, CSV, Excel, or via API.

Tips for better results

Submit homepages, not random blog posts.
Keep includeSubpages enabled so footer Impressum links are followed.
Start without a proxy; most public legal pages are accessible directly.
Increase maxPagesPerDomain only for sites with unusual navigation.

Status values

found means an Impressum/contact page was located and parsed.
not_found means pages were checked but no legal-contact page scored high enough.
error means the domain could not be processed due to a network or parsing error.
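
After a run, grouping the submitted URLs by status gives a quick picture of what needs follow-up. This is a minimal sketch over items shaped like the output example. The sample rows are made up for illustration and `group_by_status` is a hypothetical helper.

```python
from collections import defaultdict


def group_by_status(items):
    """Map each status value to the list of submitted URLs that produced it."""
    groups = defaultdict(list)
    for item in items:
        groups[item.get("status", "unknown")].append(item["inputUrl"])
    return dict(groups)


# Made-up sample rows; real items carry many more fields.
sample_items = [
    {"inputUrl": "https://www.rewe.de", "status": "found"},
    {"inputUrl": "https://example.com", "status": "not_found"},
    {"inputUrl": "https://example.org", "status": "error"},
]

for status, urls in group_by_status(sample_items).items():
    print(status, urls)
```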

Confidence flags

Confidence flags help filter records:

company_name_found
address_found
email_found
phone_found
vat_id_found
registration_found
decision_maker_found

Integrations

Use the output with:

HubSpot or Salesforce enrichment workflows
Clay tables and lead-routing systems
Google Sheets lead lists
Compliance review queues
Internal data-quality checks

API usage: Node.js

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: process.env.APIFY_TOKEN });
const run = await client.actor('automation-lab/german-imprint-leads-scraper').call({
  startUrls: [{ url: 'https://www.rewe.de' }],
  maxPagesPerDomain: 8,
});
console.log(run.defaultDatasetId);

API usage: Python

from apify_client import ApifyClient

client = ApifyClient('YOUR_APIFY_TOKEN')
run = client.actor('automation-lab/german-imprint-leads-scraper').call(run_input={
    'startUrls': [{'url': 'https://www.rewe.de'}],
    'maxPagesPerDomain': 8,
})
print(run['defaultDatasetId'])

API usage: cURL

curl -X POST 'https://api.apify.com/v2/acts/automation-lab~german-imprint-leads-scraper/runs?token=YOUR_APIFY_TOKEN' \
  -H 'Content-Type: application/json' \
  -d '{"startUrls":[{"url":"https://www.rewe.de"}],"maxPagesPerDomain":8}'
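
The call above starts a run and returns a run object. Once the run has finished, the results can be read from the run's default dataset. The sketch below uses only the Python standard library against the Apify dataset items endpoint. The helper names are hypothetical, and the dataset ID is assumed to come from the `defaultDatasetId` field of the run object.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(dataset_id: str, fmt: str = "json") -> str:
    """Build the items URL for a dataset; fmt can be json, csv, xlsx, etc."""
    query = urllib.parse.urlencode({"format": fmt})
    return f"{API_BASE}/datasets/{dataset_id}/items?{query}"


def fetch_items(dataset_id: str, token: str) -> list:
    """Download all items as parsed JSON (performs a network request)."""
    request = urllib.request.Request(
        dataset_items_url(dataset_id),
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


# Only builds the URL; call fetch_items(...) with a real ID and token to download.
print(dataset_items_url("YOUR_DATASET_ID", "csv"))
```

Sending the token in the `Authorization` header keeps it out of URLs and logs.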

MCP usage

Connect Apify MCP with this actor enabled:

https://mcp.apify.com/?tools=automation-lab/german-imprint-leads-scraper

Claude Code setup:

claude mcp add --transport http apify-german-imprint "https://mcp.apify.com/?tools=automation-lab/german-imprint-leads-scraper"

Claude Desktop JSON config:

{
  "mcpServers": {
    "apify-german-imprint": {
      "url": "https://mcp.apify.com/?tools=automation-lab/german-imprint-leads-scraper"
    }
  }
}

Example prompts:

"Extract Impressum contacts for these 20 German domains."
"Find VAT IDs and managing directors for this German prospect list."
"Check which domains have no public legal contact details."

Legality

This actor extracts publicly available business information from websites you provide. You are responsible for using the data lawfully, respecting website terms, and complying with GDPR, ePrivacy, and other applicable rules.

FAQ

Why did one domain return `not_found`?

The site may use a non-standard legal page URL, block automated HTTP clients, or render legal data only in JavaScript. Try submitting the exact Impressum URL or increasing maxPagesPerDomain.

Does this actor validate email deliverability?

No. It extracts public emails from pages. Use a dedicated email validation service if you need deliverability checks.

Troubleshooting

If a site returns no data, try raising maxPagesPerDomain or submitting the exact Impressum URL.

If many requests fail, enable Apify Proxy or retry later. Some sites block automated traffic intermittently.
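
Both suggestions can be turned into follow-up runs. The sketch below builds one retry input for `not_found` domains with a higher page limit and one for `error` domains with Apify Proxy enabled. The helper name and the limit of 20 pages are assumptions chosen for illustration, not recommended values from the actor.

```python
def build_retry_inputs(items, retry_max_pages=20, default_max_pages=8):
    """Split unsuccessful items into two follow-up inputs."""

    def make_input(urls, max_pages, use_proxy):
        return {
            "startUrls": [{"url": url} for url in urls],
            "maxPagesPerDomain": max_pages,
            "includeSubpages": True,
            "proxyConfiguration": {"useApifyProxy": use_proxy},
        }

    not_found = [i["inputUrl"] for i in items if i.get("status") == "not_found"]
    errored = [i["inputUrl"] for i in items if i.get("status") == "error"]
    return {
        # Unusual navigation: allow more pages per domain, still without a proxy.
        "not_found": make_input(not_found, retry_max_pages, False),
        # Blocked or failed requests: keep the page limit, enable the proxy.
        "error": make_input(errored, default_max_pages, True),
    }


# Made-up sample rows for illustration.
retry = build_retry_inputs([
    {"inputUrl": "https://example.com", "status": "not_found"},
    {"inputUrl": "https://example.org", "status": "error"},
    {"inputUrl": "https://www.rewe.de", "status": "found"},
])
print(retry["not_found"]["startUrls"], retry["error"]["proxyConfiguration"])
```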

Limitations

The actor uses plain HTTP requests and Cheerio for speed and low cost, so it does not execute JavaScript. Pages that render legal data only in the browser may yield fewer fields than they would with a browser-based scraper.

Privacy notes

The actor does not log in, bypass paywalls, or access private systems. It only reads public pages reachable from submitted domains.

Changelog

Initial version extracts German Impressum legal-contact fields from submitted domains and URLs.

Support

If you need fields tuned for a specific German industry or CMS pattern, open an Apify issue with sample URLs and expected output.

Field reference

pagesChecked lists every URL requested for the domain. sourceSnippets contains nearby text around key legal labels so users can audit extraction quality.

Performance

HTTP-only crawling keeps runs lightweight. The default platform memory is 512 MB, and maxPagesPerDomain caps how many pages are requested per domain.

Data quality workflow

Use confidenceFlags to route complete leads into your CRM and send lower-confidence rows to manual review.
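
One way to implement this routing is to require a set of flags before a record goes to the CRM. In the sketch below, the required set and the destination names `crm` and `review` are assumptions. Adjust them to whatever your team considers a complete lead.

```python
# Assumption: a lead counts as complete when these three flags are present.
REQUIRED_FLAGS = frozenset({"company_name_found", "email_found", "vat_id_found"})


def route(item, required=REQUIRED_FLAGS):
    """Return "crm" for complete leads and "review" for everything else."""
    if item.get("status") != "found":
        return "review"
    flags = set(item.get("confidenceFlags", []))
    return "crm" if set(required) <= flags else "review"


lead = {
    "inputUrl": "https://www.rewe.de",
    "status": "found",
    "confidenceFlags": ["company_name_found", "email_found"],
}
print(route(lead))
print(route(lead, required={"company_name_found", "email_found"}))
```

With the default set, the sample record goes to review because it lacks `vat_id_found`. With the looser set passed in the second call, it goes to the CRM.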
