German Imprint Scraper (Contact+Social Links)
Pricing
$1.50 / 1,000 results
German Imprint Scraper (Contact+Social Links)
Very fast actor, Get Impressum data for just $1.5/1000 Results. This powerful scraper finds any German impressum page and extracts key company data: companyName, address, registerNumber, taxId, emails, phones, socialLinks, and page metadata. Get clean, reliable B2B data in seconds.
Pricing
$1.50 / 1,000 results
Rating
5.0
(1)
Developer

CodeScraper
Actor stats
0
Bookmarked
25
Total users
17
Monthly active users
18 days ago
Last modified
Categories
Share
🇩🇪 German Impressum Scraper – Extract Legal Website Information
This Apify actor automatically discovers and scrapes Impressum (Legal Notice) data from German websites. It intelligently locates the Impressum page, extracts business details, and returns structured JSON output ready for analysis.
🚀 What It Does
The actor processes each website URL and returns detailed Impressum information, including:
- 🏢 Company Details: Name, Address, Register Numbers, Tax IDs
- 📧 Contacts: Email Addresses and Phone Numbers
- 🌐 Links: Detected Social Media Profiles (Facebook, Instagram, etc.)
- 🧠 Metadata: Page Title and H1 Header Text
- 🔗 Impressum URL: Automatically detected Impressum or Legal Notice page
💡 It handles:
- ✅ Automatic Impressum link discovery
- 🔍 Smart pattern recognition for addresses, emails, and tax IDs
- ⚙️ Parallel scraping with adjustable concurrency
- 🧩 Works for any German (.de) or localized domain
⚙️ Input Configuration
| Field | Type | Description | Default Example |
|---|---|---|---|
startUrls | Array | List of website URLs to scrape. The scraper finds and extracts Impressum data from each site. | ["https://www.decathlon.de", "https://kooduu.de"] |
maxConcurrency | Number | How many pages to process at the same time. Default is 10. | 10 |
🧩 Example Input
{"startUrls": ["https://www.dr-johanna-budwig.de/", "https://kooduu.de"],"maxConcurrency": 10}
📊 Example Output
{"originalUrl": "https://www.dr-johanna-budwig.de/","impressumUrl": "https://www.dr-johanna-budwig.de/service/impressum/","companyName": "Dr. Johanna Budwig GmbH","address": "An den Kolonaten 2-4, 26160 Bad Zwischenahn, Deutschland","emails": ["kontakt@dr-johanna-budwig.de"],"phones": ["+494413906300"],"registerNumber": "HRB 209987","taxId": "DE300469959","socialLinks": {"instagram": "https://www.instagram.com/p/CztvWVtt8qM/?utm_source=ig_web_copy_link&igsh=MzRlODBiNWFlZA==","facebook": "https://www.facebook.com/Dr.Johanna.Budwig","youtube": "https://www.youtube.com/user/DrJohannaBudwig"},"metadata": {"title": "Dr. Johanna Budwig | Öle, Rezepte & mehr","description": "Dein Shop für eine gesunde Ernährung ➜ Entdecke Öle, Rezepte, Ernährungstipps & mehr ➜ Wissen, was stärkt!","keywords": "Home Start"}}
🧠 Features
- 🔍 Automatically finds the Impressum page on each domain
- 🧾 Extracts company names, addresses, register numbers, and tax IDs
- 📧 Detects all emails and phone numbers on the Impressum page
- 🌍 Finds social media links (Facebook, Instagram, X, LinkedIn)
- ⚙️ Adjustable concurrency for faster scraping
- 🧱 Structured JSON output – ready for automation or database import
💡 Use Cases
- 🏢 Business Data Collection: Enrich datasets with verified German company contact data
- ⚖️ Compliance Checks: Validate if a website meets German Impressum law requirements
- 📊 Market Research: Gather contact and legal details from target company sites
- 🔍 Lead Generation: Identify verified company emails and tax IDs
🧑💻 Developer Info
Author: codescraper Platform: Apify Language: TypeScript (Cheerio Crawler)
🏷️ Tags
German Impressum Detector . impressum · germany · legal-scraper · company-data · contact-scraper · cheerio · apify · automation