Yellow Pages Business Scraper Worldwide avatar
Yellow Pages Business Scraper Worldwide
Under maintenance

Pricing

from $0.50 / 1,000 results

Go to Apify Store
Yellow Pages Business Scraper Worldwide

Yellow Pages Business Scraper Worldwide

Under maintenance

Extract business leads from Yellow Pages directories in over 50 countries. Scrape company names, phone numbers, verified emails, physical addresses, and websites. Perfect for B2B sales prospecting, lead generation, and market research. Fast, reliable data extraction. Export to CSV, JSON via API.

Pricing

from $0.50 / 1,000 results

Rating

5.0

(5)

Developer

Țugui Dragoș

Țugui Dragoș

Maintained by Community

Actor stats

11

Bookmarked

30

Total users

9

Monthly active users

6 days ago

Last modified

Share

Table of Contents

Description

Yellow Pages Scraper is a robust web scraping solution designed to extract business contact information from Yellow Pages directories worldwide. The actor supports 10 different country-specific Yellow Pages websites and provides comprehensive business data including contact details, addresses, and optionally extracted email addresses from business websites.

Supported Countries

CountryWebsiteCode
USAYellowPages.comus
CanadaYellowPages.caca
United KingdomYell.comuk
GermanyGelbeSeiten.dede
FrancePagesJaunes.frfr
SpainPaginasAmarillas.eses
ItalyPagineGialle.itit
RomaniaPaginiAurii.roro
AustraliaYellowPages.com.auau
BrazilGuiaMais.com.brbr

Features

  • Multi-country support with 10 Yellow Pages directories
  • Configurable result limits from 1 to 1000 entries
  • Optional email extraction from business websites
  • Multiple export formats: CSV, JSON, Excel
  • Automatic proxy rotation with residential proxies
  • Smart retry mechanisms with exponential backoff
  • Session persistence with fingerprint randomization
  • Intelligent caching and deduplication

Input Parameters

ParameterTypeRequiredDefaultDescription
targetSitestringYesusCountry code: us, ca, uk, de, fr, es, it, ro, au, br
searchQuerystringYes-Search query with business type and location
maxResultsintegerNo2Number of results to extract (range: 1-1000)
scanForEmailsbooleanNofalseEnable website scanning for email extraction

Output Schema

FieldDescription
Business NameCompany or business name
Full AddressComplete street address
CityExtracted city name
CountryCountry code identifier
Phone NumberPrimary contact telephone number
Website URLBusiness website address (when available)
Email AddressExtracted email (when scanForEmails is enabled)
Business CategoryIndustry or category classification
Status LabelBusiness status indicator
Source URLOriginal Yellow Pages listing URL

Supported export formats: CSV, JSON, Excel

Usage Examples

Extract restaurant listings from Germany without email scanning:

{
"targetSite": "de",
"searchQuery": "restaurants Berlin",
"maxResults": 50,
"scanForEmails": false
}

Estimated execution time: 1-2 minutes

Search with Email Extraction

Extract hotel listings from Canada with email scanning enabled:

{
"targetSite": "ca",
"searchQuery": "hotels Toronto",
"maxResults": 25,
"scanForEmails": true
}

Estimated execution time: 10-15 minutes

Large Dataset Extraction

Extract a large dataset of lawyer listings from the USA:

{
"targetSite": "us",
"searchQuery": "lawyers New York",
"maxResults": 500,
"scanForEmails": false
}

Estimated execution time: 10-12 minutes

Operating Modes

Standard Mode (Default)

  • Processing speed: approximately 1.4 seconds per business
  • Data source: Yellow Pages directory only
  • Email extraction: disabled
  • Recommended for: high-volume data collection

Email Extraction Mode

  • Processing speed: approximately 20-30 seconds per business
  • Data source: Yellow Pages directory and business websites
  • Email extraction: enabled
  • Scans: homepage, contact pages, legal/impressum pages
  • Recommended for: lead generation requiring email contacts

Performance Specifications

ModeResultsEstimated TimeRecommended RAM
Standard501-2 minutes4 GB
Standard2004-6 minutes8 GB
Standard50010-12 minutes8 GB
Email2510-15 minutes8 GB
Email5020-30 minutes16 GB
Email10040-60 minutes16 GB

Note: Allocate 16 GB RAM for email extraction tasks to ensure optimal performance.

Email Extraction Success Rates

CountrySuccess RateNotes
Germany60-75%Impressum law mandates contact information
Canada55-70%Strong business transparency standards
USA50-65%Extensive database coverage
Australia50-65%High-quality business listings
UK45-60%Strong B2B sector presence
France45-60%Comprehensive directory coverage
Italy45-60%Well-maintained business records
Spain40-55%Growing directory coverage
Romania40-55%Focus on local businesses
Brazil40-55%Leading Latin American directory

Best Practices

  1. Test with small datasets: Begin with 10-20 results to validate query accuracy before executing large extractions.

  2. Leverage German directory for emails: Germany has the highest email success rate due to Impressum legal requirements.

  3. Include location in queries: Specify city names in search queries for more targeted and relevant results.

  4. Segment large extractions: Execute multiple runs of 100 results rather than single runs of 500+ results for improved reliability.

  5. Monitor resource allocation: Increase RAM allocation when enabling email extraction mode.

Troubleshooting

No Results Returned

  • Verify the search query returns results on the Yellow Pages website directly
  • Check spelling and location format accuracy
  • Attempt broader search terms

Insufficient Results

  • Increase the maxResults parameter value
  • Use broader category terms (e.g., "restaurants" instead of "vegan restaurants")
  • Verify the location has sufficient business listings

Missing Email Addresses

  • Confirm scanForEmails parameter is set to true
  • Note that not all businesses have websites listed in their profiles
  • Some websites use contact forms exclusively without displaying email addresses
  • Consider targeting Germany for highest email extraction success rates

Actor Timeout Issues

  • Reduce the maxResults parameter value
  • Disable email extraction for faster execution
  • Divide large extractions into multiple smaller runs

Error Status Responses

  • Typically caused by proxy or rate limiting issues
  • The actor implements automatic retry mechanisms for failed requests
  • Verify Yellow Pages website accessibility from your region

Known Limitations

  • Data extraction limited to Yellow Pages listing pages
  • Website scanning requires a listed website URL in the business profile
  • Email extraction not possible when businesses use contact forms exclusively
  • Some websites implement scraping protection or JavaScript-based content loading
  • Email extraction success varies by website structure and regional regulations
  • Rate limiting may affect extractions exceeding 500 results

Technical Architecture

ComponentTechnology
FrameworkApify SDK with Crawlee
Browser EnginePlaywright (headless mode)
Proxy SystemResidential proxies with automatic rotation
Search MethodForm-based submission
Selectors5-7 fallback selectors per data field
SessionsPersistent with fingerprint randomization
CachingSmart deduplication system
Error HandlingExponential backoff with circuit breakers

This actor extracts publicly available business information from Yellow Pages directories. Users are responsible for:

  • Using extracted data for legitimate business purposes only
  • Compliance with applicable data protection regulations (GDPR, CCPA, and similar)
  • Adherence to anti-spam legislation (CAN-SPAM, CASL, and similar)
  • Respecting website terms of service
  • Obtaining appropriate consent before using data for marketing purposes

The developers assume no liability for misuse of this tool or extracted data.