German Impressum Scraper (Bulk) avatar
German Impressum Scraper (Bulk)

Pricing

from $8.00 / 1,000 results

Go to Apify Store
German Impressum Scraper (Bulk)

German Impressum Scraper (Bulk)

Scrape german website imprints for: Company data, decision maker, phone, mail, social profiles, register number, meta description, and more.

Pricing

from $8.00 / 1,000 results

Rating

0.0

(0)

Developer

Luca S.

Luca S.

Maintained by Community

Actor stats

0

Bookmarked

13

Total users

2

Monthly active users

16 hours ago

Last modified

Share

Apify German Impressum Scraper

An Apify Actor that automatically finds and extracts Impressum details from German websites—ideal for compliance checks, lead research, or data maintenance.

✅ What this Actor delivers

  • automatically discovers Impressum pages (including a /impressum fallback)
  • extracts relevant company and contact data
  • normalizes fields for easy export (CSV/Excel/JSON)
  • enriches results with metadata like title, description, source URLs, and timestamps

🧩 Input (Apify UI)

FieldTypeDescriptionExample
startUrlsArrayList of websites to check (scheme optional)[{ "url": "https://example.de" }]
maxPagesNumberMax number of pages per domain3
includeEmailsBooleanEnable email extractiontrue
maxConcurrencyNumberParallelism for faster crawling4
userAgentStringOptional user agentMozilla/5.0 ...

Example input

{
"startUrls": [
{ "url": "https://www.intercon-spacetec.de/impressum" },
{ "url": "www.dieakw.de/impressum" }
],
"maxPages": 3,
"includeEmails": true,
"maxConcurrency": 4
}

📦 Output (Dataset)

Each Dataset row represents a single website. Example:

{
"mainUrl": "https://example.de",
"impressumUrl": "https://example.de/impressum",
"companyName": "Example GmbH",
"address": "Musterstraße 1, 12345 Berlin",
"emails": "info@example.de",
"phones": "+49 30 123456",
"vatId": "DE123456789",
"registerInfo": "HRB 12345",
"metaTitle": "Example – Impressum",
"metaDescription": "…",
"country": "DE",
"fetchTime": "2024-01-01T12:00:00.000Z"
}

Note: emails and phones are stored as comma-separated strings for easy Excel export.

🎯 Common use cases

  • compliance checks (Impressum requirements)
  • lead generation and enrichment
  • monitoring changes in Impressum details
  • export to CRM or spreadsheets (CSV/Excel)

🧪 Test locally

npm install
node src/main.js

Local input is read from input.json. Results are written to Crawlee's local Dataset directory.

⚙️ Notes & tips

  • For larger lists, use proxies (useApifyProxy) and increase concurrency.
  • If emails are obfuscated, OCR can be added optionally.
  • A German Accept-Language header is already applied by default.

If you need additional fields (e.g., managing directors, legal form, or registry court), this can be extended.