Yellow Pages Business Scraper Worldwide
Pricing
from $0.50 / 1,000 results
Yellow Pages Business Scraper Worldwide
Extract business leads from Yellow Pages directories in over 50 countries. Scrape company names, phone numbers, verified emails, physical addresses, and websites. Perfect for B2B sales prospecting, lead generation, and market research. Fast, reliable data extraction. Export to CSV, JSON via API.
Pricing
from $0.50 / 1,000 results
Rating
5.0
(5)
Developer

Țugui Dragoș
Actor stats
11
Bookmarked
32
Total users
6
Monthly active users
7 days ago
Last modified
Categories
Share
Locate. Scan. Capture. Global Business Data & Direct Emails in One Flow.
Description
Yellow Pages Scraper is a professional-grade extraction tool designed for high-reliability data harvesting from Yellow Pages directories across 10 countries. Unlike simple scrapers, this actor employs advanced anti-blocking techniques, session warming, and fingerprint randomization to ensure consistent access and high success rates.
It is ideal for:
- B2B Lead Generation: Build targeted lists of businesses (gyms, restaurants, lawyers, etc.).
- Market Research: Analyze competitor density and offerings in specific regions.
- Direct Marketing: optional email extraction capabilities for outreach campaigns.
Key Features
- Advanced Stealth: Uses
Playwrightwith residential proxies, browser fingerprinting, and session warming to mimic human behavior. - Global Support: Specialized 10-country support with proper locale handling.
- Email Discovery: Optional "deep scan" mode that visits business websites to find email addresses.
- Smart Caching: Built-in deduplication and caching to save resources on repeated runs.
- Auto-Retry: Resilient error handling with exponential backoff for network issues.
Supported Countries
| Country | Code | Website |
|---|---|---|
| 🇺🇸 USA | us | YellowPages.com |
| 🇨🇦 Canada | ca | YellowPages.ca |
| 🇬🇧 UK | uk | Yell.com |
| 🇩🇪 Germany | de | GelbeSeiten.de |
| 🇫🇷 France | fr | PagesJaunes.fr |
| 🇪🇸 Spain | es | PaginasAmarillas.es |
| 🇮🇹 Italy | it | PagineGialle.it |
| 🇷🇴 Romania | ro | PaginiAurii.ro |
| 🇦🇺 Australia | au | YellowPages.com.au |
| 🇧🇷 Brazil | br | GuiaMais.com.br |
Cost Estimation
Consumption varies heavily based on the "Operating Mode".
| Mode | Cost Per 1k Results | Speed | Notes |
|---|---|---|---|
| Standard | ~$2 - $4 | Fast | Extracts only directory info (Phone, Address, Name). |
| Email Extraction | ~$15 - $25 | Slow | Visits every business website to find emails. Uses significantly more bandwidth and compute time. |
Tip: Start with "Standard Mode" to verify your search query returns good results, then enable
scanForEmailsfor the final scrape.
Input Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
targetSite | string | us | Target country code (e.g., us, de, uk). |
searchQuery | string | - | Search terms, e.g., "Dentists in London" or "Pizza New York". |
maxResults | integer | 100 | Soft limit for number of businesses to scrape. |
scanForEmails | boolean | false | If true, visits business websites to extract emails. |
Output Example
The actor produces a clean, structured JSON dataset.
{"businessName": "Joe's Famous Pizza","address": "123 Mulberry St, New York, NY 10013","city": "New York","country": "US","phoneNumber": "+1 212-555-0199","website": "https://www.joespizzeria-example.com","emails": ["contact@joespizzeria-example.com","info@joespizzeria-example.com"],"categories": ["Pizza","Italian Restaurant"],"rank": 1,"sourceUrl": "https://www.yellowpages.com/new-york-ny/mip/joes-pizza-...","scrapedAt": "2023-10-27T14:30:00.000Z"}
Performance & Limitations
- Rate Limits: To maintain stealth, the actor limits concurrency. Do not force high concurrency (>5) or you risk 403 blocks.
- Email Success: Email extraction success varies by region (Germany
~70%, US~50%, UK~45%) due to local laws and digital adoption. - Memory: For large email extraction jobs (>1000 items), meaningful memory is required. The actor handles this, but large runs may take time.
Technical Architecture
This is not just a simple HTML parser. It is a full browser automation system:
- Engine: Playwright (Chromium)
- Proxy: Residential Proxies (Required for Yellow Pages)
- Anti-Bot:
canvasfingerprinting noisenavigator.webdrivermasking- Human-like mouse movements and scrolling (Session Warming)
Legal Disclaimer
This tool is for educational and legitimate business intelligence purposes only. You are responsible for complying with:
- GDPR (Europe)
- CCPA (California)
- CAN-SPAM / CASL (Email marketing)
- The Terms of Service of the target websites.
Built with 🩶 for the Apify community 🫡
