Yellow Pages Business Scraper Worldwide
Under maintenance

Pricing

from $0.50 / 1,000 results

Extract business leads from Yellow Pages directories in 10 countries. Scrape company names, phone numbers, physical addresses, websites, and (optionally) email addresses. Suited to B2B sales prospecting, lead generation, and market research. Export results to CSV or JSON via the API.

Rating: 5.0 (5)

Developer: Țugui Dragoș (Maintained by Community)

Actor stats: 11 bookmarked, 32 total users, 6 monthly active users, last modified 7 days ago

Built with: Apify Actor, Node.js, Crawlee, Playwright. License: Proprietary

Locate. Scan. Capture. Global Business Data & Direct Emails in One Flow.


Description

Yellow Pages Business Scraper Worldwide is an extraction tool for Yellow Pages directories in 10 countries. Unlike simple scrapers, this actor uses anti-blocking techniques, session warming, and fingerprint randomization to keep access consistent and success rates high.

It is ideal for:

  • B2B Lead Generation: Build targeted lists of businesses (gyms, restaurants, lawyers, etc.).
  • Market Research: Analyze competitor density and offerings in specific regions.
  • Direct Marketing: Optionally extract email addresses for outreach campaigns.

Key Features

  • Advanced Stealth: Uses Playwright with residential proxies, browser fingerprinting, and session warming to mimic human behavior.
  • Global Support: Specialized 10-country support with proper locale handling.
  • Email Discovery: Optional "deep scan" mode that visits business websites to find email addresses.
  • Smart Caching: Built-in deduplication and caching to save resources on repeated runs.
  • Auto-Retry: Resilient error handling with exponential backoff for network issues.
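
The Auto-Retry feature is described only at a high level. As a rough illustration of exponential backoff in general (not the actor's actual implementation), the sketch below doubles the wait after each failed attempt, caps it, and optionally adds random jitter. The function names, base delay, cap, and attempt count are all assumptions made for the example.

```python
import random
import time


def backoff_delay(attempt, base=1.0, cap=30.0, jitter=0.0):
    """Delay in seconds before retry number `attempt` (0-based).

    base, cap and jitter are illustrative defaults, not values taken
    from the actor.
    """
    delay = min(cap, base * (2 ** attempt))
    # Optional jitter spreads retries out so they do not all fire at once.
    return delay + random.uniform(0, jitter)


def retry(operation, max_attempts=4, sleep=time.sleep, **delay_kwargs):
    """Call `operation` until it succeeds or `max_attempts` is reached."""
    for attempt in range(max_attempts):
        try:
            return operation()
        except ConnectionError:
            if attempt == max_attempts - 1:
                raise  # out of retries: surface the last error
            sleep(backoff_delay(attempt, **delay_kwargs))
```

Passing `sleep` as a parameter keeps the helper easy to test, since a fake can record the waits instead of actually pausing.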

Supported Countries

| Country | Code | Website |
|---|---|---|
| 🇺🇸 USA | us | YellowPages.com |
| 🇨🇦 Canada | ca | YellowPages.ca |
| 🇬🇧 UK | uk | Yell.com |
| 🇩🇪 Germany | de | GelbeSeiten.de |
| 🇫🇷 France | fr | PagesJaunes.fr |
| 🇪🇸 Spain | es | PaginasAmarillas.es |
| 🇮🇹 Italy | it | PagineGialle.it |
| 🇷🇴 Romania | ro | PaginiAurii.ro |
| 🇦🇺 Australia | au | YellowPages.com.au |
| 🇧🇷 Brazil | br | GuiaMais.com.br |

Cost Estimation

Consumption varies heavily based on the "Operating Mode".

| Mode | Cost per 1k results | Speed | Notes |
|---|---|---|---|
| Standard | ~$2 - $4 | Fast | Extracts only directory info (Phone, Address, Name). |
| Email Extraction | ~$15 - $25 | Slow | Visits every business website to find emails. Uses significantly more bandwidth and compute time. |

Tip: Start with "Standard Mode" to verify your search query returns good results, then enable scanForEmails for the final scrape.
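
To make the arithmetic concrete, here is a small sketch that turns the per-1,000-result ranges quoted in the cost table into a rough budget for a run. The function and dictionary names are made up for this example, and the figures are only the approximate ranges listed in this section, so treat the output as an estimate.

```python
# Per-1,000-result cost ranges (USD) copied from the cost table in this section.
COST_PER_1K = {
    "standard": (2.0, 4.0),   # directory info only
    "email": (15.0, 25.0),    # scanForEmails enabled
}


def estimate_cost(results, mode="standard"):
    """Return a (low, high) cost estimate in USD for `results` records."""
    low, high = COST_PER_1K[mode]
    thousands = results / 1000
    return (thousands * low, thousands * high)


# Compare a 5,000-result run in both modes.
print(estimate_cost(5000))
print(estimate_cost(5000, "email"))
```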

Input Parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| targetSite | string | us | Target country code (e.g., us, de, uk). |
| searchQuery | string | - | Search terms, e.g., "Dentists in London" or "Pizza New York". |
| maxResults | integer | 100 | Soft limit for number of businesses to scrape. |
| scanForEmails | boolean | false | If true, visits business websites to extract emails. |
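
As an illustration of how these parameters fit together, the sketch below assembles an input object with the documented names and defaults. The `build_input` helper and its validation rules are assumptions for this example; the actor may validate its input differently.

```python
import json

# Country codes listed under "Supported Countries".
SUPPORTED_SITES = {"us", "ca", "uk", "de", "fr", "es", "it", "ro", "au", "br"}


def build_input(search_query, target_site="us", max_results=100, scan_for_emails=False):
    """Build an input dict using the documented parameter names and defaults."""
    if target_site not in SUPPORTED_SITES:
        raise ValueError(f"unsupported targetSite: {target_site!r}")
    query = search_query.strip()
    if not query:
        raise ValueError("searchQuery must not be empty")
    if max_results < 1:
        raise ValueError("maxResults must be a positive integer")
    return {
        "targetSite": target_site,
        "searchQuery": query,
        "maxResults": max_results,
        "scanForEmails": scan_for_emails,
    }


print(json.dumps(build_input("Dentists in London", target_site="uk"), indent=2))
```

The resulting dictionary can then be supplied as the run input in the Apify Console or through the Apify API.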

Output Example

The actor produces a clean, structured JSON dataset.

{
  "businessName": "Joe's Famous Pizza",
  "address": "123 Mulberry St, New York, NY 10013",
  "city": "New York",
  "country": "US",
  "phoneNumber": "+1 212-555-0199",
  "website": "https://www.joespizzeria-example.com",
  "emails": [
    "contact@joespizzeria-example.com",
    "info@joespizzeria-example.com"
  ],
  "categories": [
    "Pizza",
    "Italian Restaurant"
  ],
  "rank": 1,
  "sourceUrl": "https://www.yellowpages.com/new-york-ny/mip/joes-pizza-...",
  "scrapedAt": "2023-10-27T14:30:00.000Z"
}
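
Because `emails` and `categories` are lists, records like the one above need flattening before they fit a spreadsheet row. The following is a minimal local sketch of one way to do that; the `records_to_csv` helper, the column order, and the "; " separator are choices made for this example, not part of the actor.

```python
import csv
import io

# Column order follows the keys of the example record.
FIELDS = [
    "businessName", "address", "city", "country", "phoneNumber", "website",
    "emails", "categories", "rank", "sourceUrl", "scrapedAt",
]


def records_to_csv(records):
    """Serialize dataset records to CSV text, joining list fields with '; '."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for record in records:
        row = dict(record)
        # Lists cannot be stored in a single CSV cell, so join them.
        for key in ("emails", "categories"):
            row[key] = "; ".join(row.get(key) or [])
        writer.writerow(row)  # missing keys are written as empty cells
    return buffer.getvalue()


sample = {
    "businessName": "Joe's Famous Pizza",
    "address": "123 Mulberry St, New York, NY 10013",
    "emails": ["contact@joespizzeria-example.com", "info@joespizzeria-example.com"],
    "categories": ["Pizza", "Italian Restaurant"],
    "rank": 1,
}
print(records_to_csv([sample]))
```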

Performance & Limitations

  • Rate Limits: To maintain stealth, the actor limits concurrency. Do not force high concurrency (>5) or you risk 403 blocks.
  • Email Success: Email extraction success varies by region (Germany ~70%, US ~50%, UK ~45%) due to local laws and digital adoption.
  • Memory: Large email extraction jobs (>1000 items) need a substantial amount of memory. The actor is built to handle such runs, but expect them to take longer.
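
The approximate regional success rates quoted above can be used to set expectations before enabling email extraction. The sketch below does only that arithmetic; the helper name is hypothetical, and rates are included only for the three countries this section gives figures for.

```python
# Approximate email success rates quoted in this section.
EMAIL_SUCCESS_RATE = {"de": 0.70, "us": 0.50, "uk": 0.45}


def expected_email_hits(businesses, target_site):
    """Estimate how many scraped businesses will yield at least one email."""
    rate = EMAIL_SUCCESS_RATE.get(target_site)
    if rate is None:
        return None  # no figure is quoted for this country
    return round(businesses * rate)


for site in ("de", "us", "uk", "fr"):
    print(site, expected_email_hits(1000, site))
```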

Technical Architecture

This is not just a simple HTML parser. It is a full browser automation system:

  • Engine: Playwright (Chromium)
  • Proxy: Residential Proxies (Required for Yellow Pages)
  • Anti-Bot:
    • Canvas fingerprint noise
    • navigator.webdriver masking
    • Human-like mouse movements and scrolling (Session Warming)

This tool is for educational and legitimate business intelligence purposes only. You are responsible for complying with:

  • GDPR (Europe)
  • CCPA (California)
  • CAN-SPAM / CASL (Email marketing)
  • The Terms of Service of the target websites.

Built with 🩶 for the Apify community 🫡