
Imprint Contact Scraper

Notice: This Actor is currently under development and may not be fully stable or feature-complete. Use with caution.

This Apify Actor automatically scrapes German company information from Imprint (Impressum) pages. You provide a list of website homepages; the Actor attempts to find each site's imprint link, visit it, and extract key details such as the company name, address, phone numbers, email addresses, and, optionally, decision-makers. It is designed for typical German website structures where an imprint link is present on the homepage.

What can this Actor do?

  • Find Imprint Pages: Automatically locates links containing "Impressum" or "Imprint" on the provided website homepages (see the sketch after this list).
  • HTML Cleaning: Removes common clutter like cookie banners, scripts, styles, and overly long text paragraphs before extraction to improve accuracy.
  • Extract Company Details: Parses the imprint page to find:
    • Company Name
    • Full Address (Street, Postal Code, City)
    • Phone Number(s)
    • Email Address(es)
  • Extract Decision Makers ("Entscheidungsträger") (Optional): If enabled, it identifies and extracts names associated with roles like "Geschäftsführer", "Vorstand", "Inhaber", etc.
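For orientation, the link-discovery and HTML-cleaning steps could be approximated roughly as in the sketch below. This is a simplified illustration using httpx and BeautifulSoup (both listed under Dependencies further down), not the Actor's actual implementation; the function names are invented for this example.

# Simplified sketch of imprint-link discovery and HTML cleaning;
# an approximation of the idea, not the Actor's actual code.
from urllib.parse import urljoin

import httpx
from bs4 import BeautifulSoup


def find_imprint_url(homepage_url):
    """Return the first link whose text or href mentions 'Impressum'/'Imprint'."""
    html = httpx.get(homepage_url, follow_redirects=True, timeout=30).text
    soup = BeautifulSoup(html, "lxml")
    for link in soup.find_all("a", href=True):
        label = f"{link.get_text()} {link['href']}".lower()
        if "impressum" in label or "imprint" in label:
            return urljoin(homepage_url, link["href"])
    return None


def clean_html(html):
    """Strip scripts, styles and similar clutter before extraction."""
    soup = BeautifulSoup(html, "lxml")
    for tag in soup(["script", "style", "noscript", "iframe"]):
        tag.decompose()
    return soup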

Use Cases

This Actor is useful for various tasks, including:

  • Lead Generation: Quickly gather contact information for German businesses.
  • Market Research: Collect data on companies within a specific sector or region.
  • B2B Data Enrichment: Augment existing company records with imprint data.
  • Compliance Checks: Verify if websites have accessible and complete imprint information.

Input

The Actor requires the following input:

  1. Start URLs (start_urls): A list of homepage URLs for the websites you want to scrape. Each entry should be an object containing a url key.
  2. Search Decision Makers (search_decision_makers): A checkbox (true/false). Check this box if you want the Actor to attempt extracting decision-maker names and roles. This is disabled by default (false).

Example Input JSON

{
  "start_urls": [
    { "url": "https://www.example.de" },
    { "url": "https://www.another-company.com" }
  ],
  "search_decision_makers": true
}

Output

The Actor outputs a dataset item for each successfully processed start URL. Each item is a JSON object containing the extracted data.

Example Output JSON

{
  "source_url": "https://www.example.de",
  "imprint_url": "https://www.example.de/impressum",
  "homepage_title": "Example Company - Homepage",
  "company_name": "Example GmbH",
  "address": "Musterstraße 1, 12345 Musterstadt",
  "phone_number_1": "+49 (0) 123 456789",
  "email_1": "info@example.de",
  "Geschäftsführer": [                          // Indicator found becomes the key
    ["Max Mustermann", 0],                      // Name and rank (order found)
    ["Erika Beispiel", 1]
  ],
  "primary_decision_maker": "Max Mustermann"    // Highest priority role found
}
  • phone_number_X and email_X fields are numbered sequentially if multiple are found.
  • Decision maker fields (like Geschäftsführer, Vorstand, etc.) contain a list of [name, rank] tuples, where rank indicates the order of appearance. The primary_decision_maker field contains the name of the highest-priority contact found (a flattening sketch follows below).
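Because the phone_number_X / email_X fields are numbered dynamically and each role indicator becomes its own key, downstream code usually needs to flatten the items. The following post-processing sketch is written against the example structure above; the ROLE_KEYS list only contains the roles mentioned in this README and is not exhaustive.

# Flatten one dataset item into a fixed structure. Field names follow the
# example output above; ROLE_KEYS is illustrative, not exhaustive.
ROLE_KEYS = ["Geschäftsführer", "Vorstand", "Inhaber"]


def flatten_item(item):
    phones = [v for k, v in sorted(item.items()) if k.startswith("phone_number_")]
    emails = [v for k, v in sorted(item.items()) if k.startswith("email_")]
    decision_makers = [
        {"role": role, "name": name, "rank": rank}
        for role in ROLE_KEYS
        for name, rank in item.get(role, [])
    ]
    return {
        "company": item.get("company_name"),
        "address": item.get("address"),
        "phones": phones,
        "emails": emails,
        "decision_makers": sorted(decision_makers, key=lambda d: d["rank"]),
        "primary_decision_maker": item.get("primary_decision_maker"),
    }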

How to Use

  1. Add your desired homepage URLs to the Start URLs input field.
  2. Optionally, check the Search Decision Makers box.
  3. Click Start to run the Actor.
  4. Once the run finishes, preview or download the extracted data from the Dataset tab.
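Runs can also be started programmatically with the official apify-client package for Python, roughly as shown below. The token and Actor ID are placeholders; use your own API token and the Actor ID shown on this Actor's page.

# Start a run via the Apify API and read the resulting dataset.
# <YOUR_APIFY_TOKEN> and <ACTOR_ID> are placeholders.
from apify_client import ApifyClient

client = ApifyClient("<YOUR_APIFY_TOKEN>")

run = client.actor("<ACTOR_ID>").call(
    run_input={
        "start_urls": [{"url": "https://www.example.de"}],
        "search_decision_makers": True,
    }
)

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item.get("company_name"), item.get("email_1"))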

Limitations

  • The Actor relies on finding standard "Impressum" or "Imprint" links. It may fail if the link text or URL is unconventional.
  • Extraction accuracy depends on the HTML structure of the imprint page. Complex or unusual layouts might lead to incomplete or incorrect data.
  • Address validation uses an internal German postal code lookup table. While extensive, it might not cover every single postal code or city variant.
  • Decision maker extraction uses heuristics based on common German role titles. It may not capture all possible roles or names, especially if formatted unusually (the sketch below illustrates the general approach).
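To give a sense of the pattern-matching approach these heuristics rely on, the sketch below shows what such rules can look like. These are illustrative examples only, not the Actor's actual rules.

# Illustrative heuristics only; the Actor's real rules may differ.
import re

# German postal code (5 digits) followed by a city name.
POSTAL_CODE_CITY = re.compile(r"\b(\d{5})\s+([A-ZÄÖÜ][\wäöüß.-]+)")

# A few common role titles; real imprints use many more variants.
ROLE_TITLES = ["Geschäftsführer", "Geschäftsführerin", "Vorstand", "Inhaber", "Inhaberin"]


def find_decision_makers(imprint_text):
    """Return (role, name) pairs for patterns like 'Geschäftsführer: Max Mustermann'."""
    results = []
    for role in ROLE_TITLES:
        pattern = rf"{role}\s*:?\s+([A-ZÄÖÜ][\w.-]+(?:\s+[A-ZÄÖÜ][\w.-]+)+)"
        for match in re.finditer(pattern, imprint_text):
            results.append((role, match.group(1)))
    return results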

Dependencies

  • Apify SDK for Python (apify)
  • Beautiful Soup 4 (beautifulsoup4[lxml])
  • HTTPX (httpx)
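A matching requirements.txt therefore contains roughly the following (left unpinned here; the repository may pin specific versions):

apify
beautifulsoup4[lxml]
httpx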

Local Development

This Actor is primarily designed for the Apify platform. Running it locally requires Python 3.x and setting up the Apify SDK environment.

Clone the repository:

git clone <repository_url>
cd <repository_directory>

Set up a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Install dependencies:

pip install -r requirements.txt

Running Locally: You can attempt to run the scraper using python src/main.py. However, it relies on Apify platform features (Actor.get_input(), Actor.push_data(), etc.). For local development, you might need to:

  • Set the APIFY_TOKEN environment variable.
  • Modify the code to read input from a local file (e.g., input.json) instead of Actor.get_input().
  • Modify the code to print output instead of using Actor.push_data().
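As a rough illustration of the last two points, a local harness could look like the sketch below. The scrape_imprint import is hypothetical; adapt it to whatever entry point src/main.py actually exposes.

# local_run.py - hypothetical local harness. The imported scrape_imprint
# function is an assumption; adapt it to the real entry point in src/main.py.
import asyncio
import json

from src.main import scrape_imprint  # hypothetical entry point


async def main():
    with open("input.json", encoding="utf-8") as f:
        actor_input = json.load(f)

    for entry in actor_input.get("start_urls", []):
        result = await scrape_imprint(
            entry["url"],
            search_decision_makers=actor_input.get("search_decision_makers", False),
        )
        # Print instead of Actor.push_data() when running outside the platform.
        print(json.dumps(result, ensure_ascii=False, indent=2))


if __name__ == "__main__":
    asyncio.run(main())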

Personal Data

This Actor is designed to scrape contact information, which may include personal data such as names, phone numbers, and email addresses. Please be aware that the scraping and processing of personal data are subject to various laws and regulations, including but not limited to the General Data Protection Regulation (GDPR) in the European Union and other national and regional data protection laws.

It is your responsibility to research and understand the laws that apply to your specific use case and jurisdiction before using this Actor to scrape personal data. Ensure that you have a lawful basis for collecting and processing this information and that you comply with all applicable legal requirements.

Use this Actor responsibly and ethically.

License

This code is licensed under the MIT License.

Developed by Azquaier and maintained by the community. The source code is available in the project's GitHub repository.

Pricing

Pricing model: pay per usage. The Actor itself is free to use; you only pay for the Apify platform usage it consumes.