German Impressum Scraper (Bulk) avatar

German Impressum Scraper (Bulk)

Pricing

from $8.00 / 1,000 results

Go to Apify Store
German Impressum Scraper (Bulk)

German Impressum Scraper (Bulk)

Scrape german website imprints for: Company data, decision maker, phone, mail, social profiles, register number, meta description, and more.

Pricing

from $8.00 / 1,000 results

Rating

0.0

(0)

Developer

Luca S.

Luca S.

Maintained by Community

Actor stats

1

Bookmarked

29

Total users

7

Monthly active users

a month ago

Last modified

Share

Apify German Impressum Scraper

An Apify Actor that automatically finds and extracts Impressum details from German websites—ideal for compliance checks, lead research, or data maintenance.

✅ What this Actor delivers

  • automatically discovers Impressum pages (including a /impressum fallback)
  • extracts relevant company and contact data
  • normalizes fields for easy export (CSV/Excel/JSON)
  • enriches results with metadata like title, description, source URLs, and timestamps

🧩 Input (Apify UI)

FieldTypeDescriptionExample
startUrlsArrayList of websites to check (scheme optional)[{ "url": "https://example.de" }]
maxPagesNumberMax number of pages per domain3
includeEmailsBooleanEnable email extractiontrue
maxConcurrencyNumberParallelism for faster crawling4
userAgentStringOptional user agentMozilla/5.0 ...

Example input

{
"startUrls": [
{ "url": "https://www.intercon-spacetec.de/impressum" },
{ "url": "www.dieakw.de/impressum" }
],
"maxPages": 3,
"includeEmails": true,
"maxConcurrency": 4
}

📦 Output (Dataset)

Each Dataset row represents a single website. Example:

{
"mainUrl": "https://example.de",
"impressumUrl": "https://example.de/impressum",
"companyName": "Example GmbH",
"address": "Musterstraße 1, 12345 Berlin",
"emails": "info@example.de",
"phones": "+49 30 123456",
"vatId": "DE123456789",
"registerInfo": "HRB 12345",
"metaTitle": "Example – Impressum",
"metaDescription": "…",
"country": "DE",
"fetchTime": "2024-01-01T12:00:00.000Z"
}

Note: emails and phones are stored as comma-separated strings for easy Excel export.

🎯 Common use cases

  • compliance checks (Impressum requirements)
  • lead generation and enrichment
  • monitoring changes in Impressum details
  • export to CRM or spreadsheets (CSV/Excel)

🧪 Test locally

npm install
node src/main.js

Local input is read from input.json. Results are written to Crawlee's local Dataset directory.

⚙️ Notes & tips

  • For larger lists, use proxies (useApifyProxy) and increase concurrency.
  • If emails are obfuscated, OCR can be added optionally.
  • A German Accept-Language header is already applied by default.

If you need additional fields (e.g., managing directors, legal form, or registry court), this can be extended.