German Imprint Scraper (Contact+Social Links) avatar
German Imprint Scraper (Contact+Social Links)

Pricing

$1.50 / 1,000 results

Go to Apify Store
German Imprint Scraper (Contact+Social Links)

German Imprint Scraper (Contact+Social Links)

Very fast actor, Get Impressum data for just $1.5/1000 Results. This powerful scraper finds any German impressum page and extracts key company data: companyName, address, registerNumber, taxId, emails, phones, socialLinks, and page metadata. Get clean, reliable B2B data in seconds.

Pricing

$1.50 / 1,000 results

Rating

5.0

(1)

Developer

CodeScraper

CodeScraper

Maintained by Community

Actor stats

0

Bookmarked

25

Total users

17

Monthly active users

18 days ago

Last modified

Share

🇩🇪 German Impressum Scraper – Extract Legal Website Information

This Apify actor automatically discovers and scrapes Impressum (Legal Notice) data from German websites. It intelligently locates the Impressum page, extracts business details, and returns structured JSON output ready for analysis.


🚀 What It Does

The actor processes each website URL and returns detailed Impressum information, including:

  • 🏢 Company Details: Name, Address, Register Numbers, Tax IDs
  • 📧 Contacts: Email Addresses and Phone Numbers
  • 🌐 Links: Detected Social Media Profiles (Facebook, Instagram, etc.)
  • 🧠 Metadata: Page Title and H1 Header Text
  • 🔗 Impressum URL: Automatically detected Impressum or Legal Notice page

💡 It handles:

  • ✅ Automatic Impressum link discovery
  • 🔍 Smart pattern recognition for addresses, emails, and tax IDs
  • ⚙️ Parallel scraping with adjustable concurrency
  • 🧩 Works for any German (.de) or localized domain

⚙️ Input Configuration

FieldTypeDescriptionDefault Example
startUrlsArrayList of website URLs to scrape. The scraper finds and extracts Impressum data from each site.["https://www.decathlon.de", "https://kooduu.de"]
maxConcurrencyNumberHow many pages to process at the same time. Default is 10.10

🧩 Example Input

{
"startUrls": ["https://www.dr-johanna-budwig.de/", "https://kooduu.de"],
"maxConcurrency": 10
}

📊 Example Output

{
"originalUrl": "https://www.dr-johanna-budwig.de/",
"impressumUrl": "https://www.dr-johanna-budwig.de/service/impressum/",
"companyName": "Dr. Johanna Budwig GmbH",
"address": "An den Kolonaten 2-4, 26160 Bad Zwischenahn, Deutschland",
"emails": ["kontakt@dr-johanna-budwig.de"],
"phones": ["+494413906300"],
"registerNumber": "HRB 209987",
"taxId": "DE300469959",
"socialLinks": {
"instagram": "https://www.instagram.com/p/CztvWVtt8qM/?utm_source=ig_web_copy_link&igsh=MzRlODBiNWFlZA==",
"facebook": "https://www.facebook.com/Dr.Johanna.Budwig",
"youtube": "https://www.youtube.com/user/DrJohannaBudwig"
},
"metadata": {
"title": "Dr. Johanna Budwig | Öle, Rezepte & mehr",
"description": "Dein Shop für eine gesunde Ernährung ➜ Entdecke Öle, Rezepte, Ernährungstipps & mehr ➜ Wissen, was stärkt!",
"keywords": "Home Start"
}
}

🧠 Features

  • 🔍 Automatically finds the Impressum page on each domain
  • 🧾 Extracts company names, addresses, register numbers, and tax IDs
  • 📧 Detects all emails and phone numbers on the Impressum page
  • 🌍 Finds social media links (Facebook, Instagram, X, LinkedIn)
  • ⚙️ Adjustable concurrency for faster scraping
  • 🧱 Structured JSON output – ready for automation or database import

💡 Use Cases

  • 🏢 Business Data Collection: Enrich datasets with verified German company contact data
  • ⚖️ Compliance Checks: Validate if a website meets German Impressum law requirements
  • 📊 Market Research: Gather contact and legal details from target company sites
  • 🔍 Lead Generation: Identify verified company emails and tax IDs

🧑‍💻 Developer Info

Author: codescraper Platform: Apify Language: TypeScript (Cheerio Crawler)


🏷️ Tags

German Impressum Detector . impressum · germany · legal-scraper · company-data · contact-scraper · cheerio · apify · automation