Pricing

$10.00/month + usage

Gelbe Seiten Scraper

Developed by

Frederic

Gather leads and information from Gelbe Seiten, one of Germany's most comprehensive business directories. Download your data as an HTML table, JSON, CSV, XML, Excel, RSS, or JSONL.

5.0 (3)

Last modified

5 months ago

Lead generation

Automation

This scraper is a specialized information-gathering and lead-generation tool for the German market.
It is designed to extract all available information from Gelbe Seiten listings, including emails, phone numbers, addresses, reviews, Booking.com information, and more.
Use one of the most comprehensive business directories in Germany to gather all the information you need in one go.

Why choose this scraper?

Easy to use: Just enter your search query and let the scraper do the rest.
No page limit: This scraper can handle any number of pages, and will automatically stop when it reaches the end of the results.
Deep Extraction: This scraper can extract all information from a listing, including reviews, photos, and more.
Wide range of output formats: Export your data as CSV, Excel, JSON, XML, and more.
Committed to quality: We're constantly improving our scrapers to ensure you get the best data possible.
Technical support and continuous improvements: We're always here to help with any issues you might have. If the scraper encounters information it cannot yet handle, it logs a warning and continues scraping the rest of the data. Just open an issue with the log output and we'll get to work on it.
Fast: Our scrapers are designed to be fast, so you can get the data you need quickly. Even with detailed per-listing extraction, the scraper takes only ~45 s to scrape 1500 listings. That's over 30 listings per second!
Reliable: The scraper is resistant to malformed data and can automatically recover from most errors. If you stop and restart the scraper (or if Apify migrates it to a different server), it continues where it left off. Thanks to its built-in deduplication engine, it will not scrape the same listing twice, which reduces any post-processing work you might have to do.
Strong typing: We provide a strong typing system for the output, so you can be sure that the data you get is clean and consistent.
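
The deduplication mentioned under "Reliable" can be pictured with a minimal sketch. The scraper's internals are not documented here, so this is an illustration under assumptions rather than its actual engine: it assumes listings are keyed by the `id` field from the output schema, and `Listing`, `dedupeById`, and the sample data are hypothetical.

```typescript
// Hypothetical minimal listing shape; only `id` matters for deduplication.
interface Listing {
    id: string;
    name: string;
}

// Keeps the first occurrence of each id and drops later repeats.
// `seenIds` is passed in so the state can survive across batches
// (an assumption about how resuming after a restart might work).
function dedupeById(items: Listing[], seenIds: Set<string> = new Set()): Listing[] {
    const unique: Listing[] = [];
    for (const item of items) {
        if (seenIds.has(item.id)) continue;
        seenIds.add(item.id);
        unique.push(item);
    }
    return unique;
}

// Made-up sample data for illustration only.
const seenIds = new Set<string>();
const firstBatch = dedupeById(
    [{ id: 'a1', name: 'Bäckerei Muster' }, { id: 'a1', name: 'Bäckerei Muster' }],
    seenIds,
);
const secondBatch = dedupeById(
    [{ id: 'a1', name: 'Bäckerei Muster' }, { id: 'b2', name: 'Café Beispiel' }],
    seenIds,
);
console.log(firstBatch.length, secondBatch.length);
```

Sharing one `Set` between batches is what lets the second call skip a listing that was already stored by the first.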

Input

The scraper has a single required input, the query parameter: the search query you would enter on the Gelbe Seiten website. The other inputs are:

location: (optional) The location to search in. If not provided, the scraper searches all of Germany.
sort: (optional) The sort order of the results, either relevance or bewertung. If not provided, the scraper falls back to Gelbe Seiten's default sort order, relevance.
maxPages: (optional) The maximum number of pages to scrape. If not provided, the scraper scrapes all pages. (Note that a page holds ~10 listings on average; the last page may have fewer.)
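
As a rough illustration of the inputs above, the following sketch models them as a TypeScript type and builds an input object. `ScraperInput`, `buildInput`, and `estimateMaxListings` are hypothetical names, and the validation rules (non-empty query, positive integer `maxPages`) are assumptions, not documented behavior of the scraper.

```typescript
// Input shape as described above; only `query` is required.
interface ScraperInput {
    query: string;
    location?: string;
    sort?: 'relevance' | 'bewertung';
    maxPages?: number;
}

// Hypothetical helper: checks the values and omits unset optional fields.
// The checks themselves are assumptions about what a sensible input looks like.
function buildInput(input: ScraperInput): ScraperInput {
    if (input.query.trim() === '') {
        throw new Error('query must not be empty');
    }
    if (input.maxPages !== undefined && (!Number.isInteger(input.maxPages) || input.maxPages < 1)) {
        throw new Error('maxPages must be a positive integer');
    }
    const result: ScraperInput = { query: input.query.trim() };
    if (input.location !== undefined) result.location = input.location;
    if (input.sort !== undefined) result.sort = input.sort;
    if (input.maxPages !== undefined) result.maxPages = input.maxPages;
    return result;
}

// Rough upper bound on results, using the ~10 listings per page noted above.
function estimateMaxListings(maxPages: number, perPage = 10): number {
    return maxPages * perPage;
}

const exampleInput = buildInput({ query: 'Friseur', location: 'Berlin', sort: 'bewertung', maxPages: 5 });
console.log(JSON.stringify(exampleInput));
console.log(estimateMaxListings(5));
```

The resulting object can be pasted into the Actor's JSON input; the estimate is only an approximation, since the last page may hold fewer listings.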

Output format

The scraper returns a dataset with one item per listing. Each item is structured as follows:

    import { z } from 'zod';

    // Hotels etc. have a lot of additional information that is provided via the embedded Booking.com widget.
    // This part of the output is a parsed and cleaned version of that widget.
    const BookingComInfoSchema = z.object({
        images: z.array(z.string()),
        score: z.number(),
        description: z.string(),
        outdoorArea: z.array(z.string()).optional(),
        distancing: z.array(z.string()).optional(),
        foodAndDrink: z.array(z.string()).optional(),
        safety: z.array(z.string()).optional(),
        cleaningAndDisinfection: z.array(z.string()).optional(),
        services: z.array(z.string()).optional(),
        internet: z.array(z.string()).optional(),
        general: z.array(z.string()).optional(),
        parking: z.array(z.string()).optional(),
        rooms: z.array(z.string()).optional(),
        paymentMethods: z.array(z.string()).optional(),
        openingHours: z.array(z.string()).optional(),
        receptionService: z.array(z.string()).optional(),
        foodSafety: z.array(z.string()).optional(),
        safetyFacilities: z.array(z.string()).optional(),
        poolAndWellness: z.array(z.string()).optional(),
        access: z.array(z.string()).optional(),
        activities: z.array(z.string()).optional(),
        publicTransport: z.array(z.string()).optional(),
        entertainmentAndFamilyOffers: z.array(z.string()).optional(),
        membershipAndServiceLanguages: z.array(z.string()).optional(),
        kitchen: z.array(z.string()).optional(),
        businessFacilities: z.array(z.string()).optional(),
        cleaningServices: z.array(z.string()).optional(),
        generalFacilities: z.array(z.string()).optional(),
        miscellaneous: z.array(z.string()).optional(),
        shops: z.array(z.string()).optional(),
        ski: z.array(z.string()).optional(),
    });

    // This part of the output is directly taken from gelbeseiten.de, and just passed through
    const ReviewSchema = z.object({
        text: z.string(),
        erstellungsDatumIso: z.string(),
        erstellungsDatumFormatiert: z.string(),
        bearbeitungsDatum: z.string().nullable(),
        bewertungBeiAnbieter: z.number().nullable(),
        geraetetypBewertungsabgabe: z.string().nullable(),
        bewertungsbogenTextUrl: z.string().nullable(),
        bewertungsKriteriumListe: z.array(
            z.object({
                kriterium: z.string(),
                bewertung: z.number(),
                text: z.string().nullable(),
            }),
        ).nullable(),
        bewertungsId: z.string().nullable(),
        bewertungsportal: z.string().nullable(),
        likeListe: z.array(z.object({
            benutzer: z.object({
                name: z.string().nullable(),
                nutzerprofilUrl: z.string().nullable(),
                nutzerbildUrl: z.string().nullable(),
            }),
        })).nullable(),
        partnerBewertungstextUrl: z.string().nullable(),
        bewertungNormiert: z.number().min(0).max(5),
        anzahlLikes: z.string().nullable(),
        anzahlKommentare: z.number(),
        reaktionListe: z.array(
            z.object({
                text: z.string(),
                erstellungsDatumFormatiert: z.string(),
                reaktionsTyp: z.string(),
                anzahlLikes: z.string().nullable(),
                erstellungsDatum: z.string(),
                benutzer: z.object({
                    name: z.string().nullable(),
                    nutzerprofilUrl: z.string().nullable(),
                    nutzerbildUrl: z.string().nullable(),
                }),
                spamUrl: z.string().nullable(),
            }),
        ),
        verifikationListe: z.array(z.string()),
        bewertungTextAnzahl: z.number(),
        erstellungsDatum: z.string(),
        bewertungsbogenSterneUrl: z.string().nullable(),
        produkt: z.object({
            partnerName: z.string().nullable(),
            name: z.string().nullable(),
            information: z.string().nullable(),
        }),
        benutzer: z.object({
            name: z.string().nullable(),
            nutzerprofilUrl: z.string().nullable(),
            nutzerbildUrl: z.string().nullable(),
        }),
        titel: z.string().nullable(),
        partnerName: z.string().nullable(),
        spamUrl: z.string().nullable(),
    });

    const ExtraInfoSchema = z.object({
        brands: z.array(z.string()).optional(),
        memberships: z.array(z.string()).optional(),
        languages: z.array(z.string()).optional(),
        accessibility: z.array(z.string()).optional(),
    });

    const OutputSchema = z.object({
        // Search-results
        id: z.string(),
        memberId: z.string().optional(),
        name: z.string(),
        logoURL: z.string().optional(),
        bestIndustry: z.string(),
        googleMapsAddress: z.string().optional(),
        address: z.string(),
        phone: z.string(),
        website: z.string().optional(),
        shortDescription: z.string().optional(),
        highlightLevel: z.number().optional(),
        partnerLevel: z.string().optional(),
        rating: z.number().optional(),
        ratingCount: z.number().optional(),

        // Detail-page
        email: z.string().optional(),
        openingHours: z.array(z.object({
            day: z.string(),
            hours: z.array(
                z.object({
                    closed: z.boolean(),
                    from: z.string().optional(),
                    to: z.string().optional(),
                }),
            ),
        })).optional(),
        additionalPhoneNumbers: z.array(z.object({
            title: z.string(),
            number: z.string(),
        })).optional(),
        menu: z.string().optional(),
        // TODO:
        reviews: z.array(ReviewSchema).optional(),
        description: z.string().optional(),
        acceptedPaymentMethods: z.array(z.string()).optional(),
        images: z.array(z.object({
            src: z.string(),
            caption: z.string(),
        })).optional(),
        socialAccounts: z.array(z.object({
            url: z.string(),
            type: z.enum([
                'unknown', 'facebook', 'twitter', 'instagram',
                'linkedin', 'youtube', 'google', 'google_maps', 'pinterest',
                'tiktok', 'snapchat', 'whatsapp', 'telegram',
                'xing',
            ]),
        })).optional(),
        brochure: z.string().optional(),
        openPositions: z.array(z.record(z.string())).optional(),
        faq: z.array(z.object({
            question: z.string(),
            answer: z.string(),
        })).optional(),
        industries: z.array(z.string()).optional(),
        services: z.array(z.string()).optional(),
        extraInfo: ExtraInfoSchema.optional(),
        bookingInfo: BookingComInfoSchema.optional(),
        // The ids of any other listings, that are related to this one
        relatedIds: z.array(z.string()).optional(),
    });
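
To show how items shaped like the schema above might be consumed, here is a small sketch that keeps only listings with an email address and averages their review scores. It is an illustration, not part of the scraper: `ListingSubset`, `Lead`, `toLeads`, and the sample items are hypothetical, and only a handful of schema fields are modeled.

```typescript
// Subset of ReviewSchema: the normalized score, 0 to 5.
interface ReviewSubset {
    bewertungNormiert: number;
}

// Subset of OutputSchema relevant for a contact list.
interface ListingSubset {
    id: string;
    name: string;
    phone: string;
    email?: string;
    website?: string;
    reviews?: ReviewSubset[];
}

// Hypothetical flattened record for a sales or marketing list.
interface Lead {
    name: string;
    phone: string;
    email: string;
    website: string | null;
    averageReview: number | null;
}

// Mean of the normalized review scores, or null when there are no reviews.
function averageReview(reviews: ReviewSubset[] | undefined): number | null {
    if (!reviews || reviews.length === 0) return null;
    const sum = reviews.reduce((total, review) => total + review.bewertungNormiert, 0);
    return sum / reviews.length;
}

// Keeps only listings that have an email address.
function toLeads(items: ListingSubset[]): Lead[] {
    const leads: Lead[] = [];
    for (const item of items) {
        if (item.email === undefined) continue;
        leads.push({
            name: item.name,
            phone: item.phone,
            email: item.email,
            website: item.website ?? null,
            averageReview: averageReview(item.reviews),
        });
    }
    return leads;
}

// Made-up sample items for illustration only.
const sampleItems: ListingSubset[] = [
    {
        id: '1',
        name: 'Muster GmbH',
        phone: '030 1234567',
        email: 'info@example.com',
        reviews: [{ bewertungNormiert: 4 }, { bewertungNormiert: 5 }],
    },
    { id: '2', name: 'Beispiel KG', phone: '040 7654321' },
];
const leads = toLeads(sampleItems);
console.log(leads);
```

Because `email`, `website`, and `reviews` are optional in the schema, the sketch handles their absence explicitly instead of assuming they are present.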

Note that the schema is enforced, so you can be sure that the data you get is clean and consistent.
If the data changes, e.g. if additional properties are added, the schema will be updated accordingly and you'll be notified of the changes.
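
If you want to double-check downloaded items on your side, a consumer-side check could look like the sketch below. With zod you would normally call `safeParse` on the schema; to stay dependency-free, this sketch swaps in a hand-rolled check that covers only the five fields `OutputSchema` marks as required strings. `missingRequiredFields`, `hasRequiredFields`, and the sample objects are hypothetical.

```typescript
// The fields that OutputSchema declares as required (non-optional) strings.
const REQUIRED_FIELDS = ['id', 'name', 'bestIndustry', 'address', 'phone'] as const;

type RequiredListingFields = Record<(typeof REQUIRED_FIELDS)[number], string>;

// Returns the names of required fields that are missing or not strings.
function missingRequiredFields(item: unknown): string[] {
    if (typeof item !== 'object' || item === null) return [...REQUIRED_FIELDS];
    const record = item as Record<string, unknown>;
    return REQUIRED_FIELDS.filter((field) => typeof record[field] !== 'string');
}

// Type guard: narrows `item` when all required fields are present.
function hasRequiredFields(item: unknown): item is RequiredListingFields {
    return missingRequiredFields(item).length === 0;
}

// Made-up sample objects for illustration only.
const complete = {
    id: '1',
    name: 'Muster GmbH',
    bestIndustry: 'Bäckerei',
    address: 'Musterstraße 1, 10115 Berlin',
    phone: '030 1234567',
};
const incomplete = { id: '2', name: 'Beispiel KG' };

console.log(hasRequiredFields(complete));
console.log(missingRequiredFields(incomplete));
```

This does not replace the full schema; it only catches items where a required field is absent or has the wrong type.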

Target Audience

This scraper is designed for anyone who needs to gather information from Gelbe Seiten listings, including:

Marketing/Sales teams: Use the scraper to gather leads for your sales team or to find potential customers for your marketing campaigns.
Business owners: Use the scraper to gather information about your competitors or to find potential partners.
Researchers: Use the scraper to gather data for your research projects or to find information for your academic papers.
Journalists: Use the scraper to gather information for your articles or to find potential sources for your stories.
Data analysts: Use the scraper to gather data for your analysis projects or to find information for your reports.
Anyone else who needs to gather information from Gelbe Seiten listings: Use the scraper to gather information for any other purpose you might have.
