Pricing

from $2.52 / 1,000 results

Impressum Standby Scraper (Playwright Version)

Scrape German imprint pages instantly. Using a headless-browser for dynamic modern sites. This Apify Actor finds and extracts structured contact & legal data from any German website — company name, address, phone, fax, email, VAT ID, register number, social media & decision makers.

Pricing

from $2.52 / 1,000 results

Rating

0.0

(0)

Developer

Dominic M. Quaiser

Actor stats

Bookmarked

Total users

Monthly active users

24 days ago

Last modified

German Imprint Scraper (Standby API)

Find and extract structured contact and legal information from German imprint pages ("Impressum") — in real time, one URL per request. Send a homepage URL to the actor's HTTP endpoint and it automatically discovers the site's imprint page and returns clean, structured data: company name, address, phone/fax, email, commercial register number, VAT ID, social media links, and decision-makers.

This actor runs in Apify Standby mode as a long-lived HTTP server. That makes it ideal for on-demand enrichment: low per-request latency, no run start-up overhead per URL, and a simple GET/POST API you can call directly from your application, a workflow tool, or another actor.

ℹ️ Which version is this?

This scraper is published in two variants, optimised for different kinds of websites:

🎭 Playwright version (this actor)

A headless-browser scraper that renders pages with a real Chromium engine. Use it for modern, JavaScript-heavy websites whose imprint links or content only appear after the page renders (e.g. Next.js / React apps). It is more robust but slower, and adds a small headless-browser charge per processed URL.

👉 Most imprint pages are plain server-rendered HTML and don't need a browser. For those, the HTTP version is faster and cheaper.

💡 Features

Automatic imprint-page discovery: point the actor at a homepage; it finds the correct "Impressum" page for you.
Selective data extraction: request only the fields you need, from basic contact info to ML-extracted decision-makers.
Real-time Standby API: GET or POST a single URL and get structured JSON back immediately. One request is processed at a time per container.
Proxy support: integrates with Apify Proxy for IP rotation and to reduce blocking.
Structured JSON output: clean, predictable records ready for your CRM, database, or downstream pipeline.

🔌 Standby API

In Standby mode the actor exposes an HTTP server. Apify gives every Standby actor a base URL; append the query parameters below and authenticate with your Apify API token (e.g. as a ?token= query parameter or Authorization: Bearer <token> header).

`GET /` — scrape one URL (query string)

Parameter	Required	Description
`startUrl`	Yes	Homepage URL to scrape. The actor discovers the imprint page automatically. `https://` is prepended if the scheme is missing.
`fieldsToExtract`	No	Comma-separated list of fields to extract. Defaults to all fields.
`metaData`	No	`true`/`false` — include extra technical details in the response. Default `false`.

$curl 'https://dominic-quaiser--impressum-standby-scraper.apify.actor/?startUrl=https://www.renault.de/&fieldsToExtract=company_name,emails,phone_number&token=<APIFY_TOKEN>'

`POST /` — scrape one URL (JSON body)

curl -X POST 'https://dominic-quaiser--impressum-standby-scraper.apify.actor/?token=<APIFY_TOKEN>' \
  -H 'Content-Type: application/json' \
  -d '{
        "startUrl": "https://www.renault.de/",
        "fieldsToExtract": ["company_name", "emails", "phone_number"],
        "metaData": false
      }'

`GET /health` — health check & stats

Returns 200 with a snapshot of running counters (total requests, successful scrapes, errors, etc.). Useful for uptime checks.

$curl 'https://dominic-quaiser--impressum-standby-scraper.apify.actor/health'

Responses

Status	Meaning
`200`	Scrape completed. Body is `{ "url": ..., "result": { ... } }`, or `{ "url": ..., "result": null, "message": "No data extracted" }` when nothing could be extracted.
`400`	Missing or invalid `startUrl`, or an invalid JSON body.
`500`	Unhandled scraper error.
`504`	Processing timed out.

Each successful result is also pushed to the actor's default dataset, so you can browse or export your scrape history from the Apify Console even when calling the API directly.

📊 Extractable data

Select any combination of the following fields via fieldsToExtract:

Field	Description	Type
`company_name`	The official company name, with a `confidence` score for the match.	`Object`
`business_address`	Full address parsed into `full_address`, `street`, `house_number`, `postal_code`, `city`.	`Object`
`phone_number`	One or more phone numbers, keyed `phone_1`, `phone_2`, …	`Object`
`fax_number`	One or more fax numbers, keyed `fax_1`, `fax_2`, …	`Object`
`emails`	One or more email addresses; emails matching the site's domain are prioritised.	`Object`
`register_number`	Commercial register number ("Handelsregisternummer") and the registration `court` ("Registergericht").	`Object`
`vat_id`	German VAT ID ("Umsatzsteuer-ID") with checksum validation, e.g. `DE123456788`.	`Object`
`social_media`	Links to platforms like LinkedIn, Xing, Facebook, Instagram, etc.	`Object`
`decision_makers`	(Premium) Names of key decision-makers ("Entscheidungsträger") extracted via an external NER (Named Entity Recognition) model.	`Array`

Numbered outputs (emails, phone numbers, …) are ordered by how likely each value is the company's main contact.

📤 Output structure

The exact fields depend on your fieldsToExtract selection.

{
  "start_url": "https://muster-firma.de/",
  "imprint_url": "https://muster-firma.de/impressum",
  "company_name": {
    "name": "Muster GmbH",
    "confidence": 1
  },
  "business_address": {
    "full_address": "Musterstraße 123, 12345 Berlin",
    "street": "Musterstraße",
    "house_number": "123",
    "postal_code": "12345",
    "city": "Berlin"
  },
  "phone_number": { "phone_1": "+493012345678" },
  "fax_number": { "fax_1": "+493012345679" },
  "emails": { "email_1": "kontakt@muster-firma.de" },
  "register_number": {
    "number": "HRB 12345 B",
    "court": "Amtsgericht Charlottenburg"
  },
  "vat_id": { "vat_id": "DE123456788" },
  "social_media": {
    "linkedin": "https://www.linkedin.com/company/muster-firma"
  },
  "decision_makers": ["Max Mustermann"],
  "metadata": {
    "domain": "muster-firma.de",
    "fetch_method": "http",
    "fallback_attempted": false,
    "scraped_at": "2026-06-22T12:04:48.003780"
  }
}

The metadata block is only included when metaData is enabled.

⚖️ Legal disclaimer

You are solely responsible for determining the legality of your use of this actor and the data it generates. Scraping and handling data — particularly personal information — is subject to legal frameworks such as the GDPR (DSGVO), copyright law, and the terms of service of the sites you scrape. Ensure your use case is compliant with all applicable laws. This text is not legal advice.

The decision_makers feature uses an external API hosted on a private server in Europe (Germany) to process data.

What is processed: the text of the imprint page is sent to the API to identify personal names.
Why: the NER model needs the page text to accurately extract decision-makers.
Data controller: you, the user, are the data controller; the actor's developer acts as data processor for this task.
Location & compliance: all processing occurs within the EU and is subject to the GDPR (DSGVO).
Data storage: the text is processed in-memory and is not stored or logged on the external server.
Important: this processing is external to the Apify platform and not covered by Apify's DPA. By using this feature you acknowledge this separate processing activity.

🤖 Other actors

Gelbe Seiten (German Yellow Pages) Scraper: extract business listings from Germany's Yellow Pages with three detail levels.
Das Telefonbuch Scraper: extract business listings from Das Telefonbuch, Germany's official telephone directory.
Das Örtliche Scraper: extract business listings from Das Örtliche, Germany's nationwide telephone directory.

🎯 Use cases

Lead generation — build targeted contact lists for sales and marketing.
Real-time enrichment — call the Standby API to enrich a record the moment a lead enters your CRM.
Compliance & verification — check for legally compliant imprint information.
Market research — aggregate company data for a specific industry or region.

🛠️ Maintainer

Author: Dominic M. Quaiser
Contact: dev@krake.run
Website: krake.run

Impressum Standby Scraper (HTTP Version)

dominic-quaiser/impressum-standby-scraper-http

Scrape German imprint pages instantly. Using a HTTP for fast scraping of common simple imprint pages. This Apify Actor finds and extracts structured contact & legal data from any German website — company name, address, phone, fax, email, VAT ID, register number, social media & decision makers.

Dominic M. Quaiser

German Imprint Leads Scraper

automation-lab/german-imprint-leads-scraper

Extract German Impressum legal contacts, company details, VAT IDs, HRB records, emails, and decision-makers from domains.

Stas Persiianenko

German Imprint Scraper

codescraper/german-imprint-scraper

A powerful Actor scraper to find and extract legal "Impressum" data from German websites. Get company names, addresses, decision-makers, legal IDs, and more, all automatically.

CodeScraper

108

5.0

German Imprint Leads

studio-amba/german-imprint-leads

Extracts contact and legal data from the mandatory Impressum page of German company websites: company name, legal form, managing directors, address, phone, fax, email, VAT ID, and Handelsregister entry. Give it company websites or domains, get back structured B2B lead data.

Studio Amba

German Imprint Scraper

dominic-quaiser/imprint-contact-scraper

An Actor that automatically locates and scrapes key contact details from German website imprint pages (Impressum). It extracts information such as company name, address, phone numbers, emails, and decision-makers (Entscheider, Entscheidungsträger)

Dominic M. Quaiser

529

3.9

German Impressum Scraper (Bulk)

luca-artur/german-impressum-scraper-bulk

Scrape german website imprints for: Company data, decision maker, phone, mail, social profiles, register number, meta description, and more.

Luca S.

✨ German Imprint Scraper & Leads Finder (Google Search)

winningsolutions/german-imprint-scraper

AI-powered Apify Actor that scrapes Impressum pages on German websites and extracts decision-makers (Geschäftsführer, Vorstand), validated B2B emails, company addresses, VAT IDs, and Handelsregister numbers. Structured JSON output for B2B lead generation, sales prospecting, and CRM enrichment.

Winning Solutions

186

5.0

German Imprint Scraper (Contact+Social Links)

codescraper/german-impressum-scraper-fast

Very fast actor, Get Impressum data for just $1.5/1000 Results. This powerful scraper finds any German impressum page and extracts key company data: companyName, address, registerNumber, taxId, emails, phones, socialLinks, and page metadata. Get clean, reliable B2B data in seconds.

CodeScraper

5.0

German Trade Register Scraper - Company Data & KYC

actorpilot/german-trade-register-scraper

Extract structured German Handelsregister company data by company name or register number: legal form, court, register number, office, address, purpose, capital, representatives, prokura, timestamps, stable IDs, and run summary.

S. Klein

Impressum & EU Legal Notice Extractor — Company & VAT Data

haketa/impressum-legal-notice-extractor

Turn a list of domains into structured company data from Impressum, Mentions légales, Aviso legal and other EU legal-notice pages: legal company name, VAT / USt-IdNr, managing director, registration number, court, address, email and phone. Fast, keyless B2B lead & KYB enrichment.