Yellow Pages Scraper: Emails & B2B Leads
Pricing
from $1.00 / 1,000 results
Yellow Pages Scraper: Emails & B2B Leads
Extract unlimited B2B leads, emails, and phone numbers from YellowPages.com. 10x faster & cheaper than browser scrapers. Auto-enriches data with social links (LinkedIn, FB, IG). Perfect for cold outreach, marketing agencies, and sales teams. Supports Residential Proxies for 100% success rate.
Pricing
from $1.00 / 1,000 results
Rating
0.0
(0)
Developer

Logiover Data
Actor stats
0
Bookmarked
2
Total users
0
Monthly active users
8 hours ago
Last modified
Categories
Share
🚀 Yellow Pages Lead Machine — YellowPages Scraper + Email Enricher (Fast & Cheap)
Generate high-quality B2B leads from YellowPages in minutes.
Scrape business listings by keyword + location, extract phone, address, ratings, website, and optionally enrich emails from business websites.
If you are building outbound sales lists, local lead generation pipelines, or agency prospecting workflows, this Actor delivers structured, ready-to-use leads optimized for speed and low cost.
✅ What you get (Business Leads Dataset)
This Actor outputs a clean dataset with fields designed for immediate use in:
- CRM imports (HubSpot / Salesforce)
- Cold email tools (Apollo-like workflows)
- Lead enrichment pipelines
- Data warehouses / spreadsheets
Extracted Fields
- Business Name (
name) - Phone (
telephone) - Address (
address,locality,region,postalCode) - Website (
website) - Rating + Review Count (
rating,reviewCount) - Search Context (
searchTerm,searchLocation) - Tracking (
yellowPagesUrl,scrapedAt)
Note: Your dataset view can display
socialsif you later enable enrichment. The current implementation focuses on high-speed JSON-LD extraction (most reliable + lowest cost).
⚡ Why this scraper is faster and more reliable
Most scrapers parse messy HTML and break often. Yellow Pages Lead Machine primarily mines structured metadata from the page source:
JSON-LD extraction (stable)
- Pulls data from
<script type="application/ld+json">where YellowPages exposes structured business objects. - Far more robust than brittle CSS selector scraping.
- Less DOM interaction = lower runtime cost.
Performance optimizations
- Resource blocking: images/fonts/trackers blocked for speed.
- Residential proxy required: reduces blocks/403 and improves success rate.
- Pagination support: automatically crawls “next” pages until
maxItemsis satisfied.
🧠 Best use cases (high-converting niches)
- Local Services: plumbers, roofers, electricians, HVAC, locksmiths
- Healthcare: dentists, chiropractors, clinics, med spas
- Professional Services: law firms, accountants, real estate agencies
- B2B Services: IT services, marketing agencies, staffing, logistics
If your goal is high traffic + high intent, these keywords historically perform well in B2B lead-gen because buyers actively search and call.
🛠 How to Use
- Add one or more Search Keywords (e.g.
Plumbers,Dentists) - Add one or more Locations (e.g.
New York, NY) - Set Max Items (per run)
- Keep Residential Proxy enabled (required)
- Run the Actor and export results (JSON/CSV/Excel)
⚙️ Input Configuration
This Actor uses an Apify input schema optimized for non-technical users.
Fields
- searchTerms (required): list of keywords
- locations (required): list of locations
- maxItems: maximum number of results per run (default: 100)
- enrichEmails: optional (default: true; can add time if you enable external website visits)
- proxy (required): Residential proxy group recommended/required
Example Input (JSON)
{"searchTerms": ["Plumbers", "Electricians"],"locations": ["New York, NY", "Brooklyn, NY"],"maxItems": 200,"enrichEmails": true,"proxy": {"useApifyProxy": true,"apifyProxyGroups": ["RESIDENTIAL"]}}
📦 Output Example (Dataset Item)
{"name": "ACME Plumbing Co","telephone": "+1 212-555-0101","address": "123 Example St","locality": "New York","region": "NY","postalCode": "10001","website": "https://www.yellowpages.com/new-york-ny/mip/acme-plumbing-co-123456","rating": 4.6,"reviewCount": 128,"yellowPagesUrl": "https://www.yellowpages.com/search?search_terms=Plumbers&geo_location_terms=New%20York%2C%20NY","searchTerm": "Plumbers","searchLocation": "New York, NY","scrapedAt": "2026-01-12T15:20:00.000Z"}
📊 Dataset View (Overview Table)
The Actor includes a dataset table view showing:
Business Name
Phone
Email (Enriched)
Website
Address
Rating
And can also include:
Social Profiles
Search Term
Search Location
This makes it easy to validate lead quality inside Apify before exporting.
🔥 Pro Tips (to maximize leads & reduce cost)
- Use multiple keywords and locations in one run
This Actor generates start URLs by combining every keyword with every location:
5 keywords × 10 locations = 50 start searches
- Keep maxItems realistic per run
Large runs increase pagination depth and WAF risk. For high-volume scraping:
Run smaller batches
Rotate sessions with Residential proxy
Combine results across runs
- Email enrichment strategy
If you enable “visit business websites”, expect:
Higher runtime
More requests
Sometimes lower success if sites block bots
Recommended approach:
First scrape leads fast
Then enrich the best leads (filtered by rating / review count / category)
🧱 Anti-bot & Proxy Notes (Important)
YellowPages is protected by aggressive anti-bot measures. For production reliability:
Residential proxies are required
If you see 403 / Access Denied:
Increase proxy rotation (new sessions)
Reduce concurrency (if you tune advanced settings)
Run smaller batches
This Actor already checks “Access Denied / 403” signals and fails fast so you don’t burn compute unnecessarily.
🧩 Common Questions
Do I need a GitHub token?
No. This Actor scrapes YellowPages, not GitHub.
Why is email sometimes empty?
Email enrichment depends on visiting external websites and extracting emails from pages. Many businesses don’t publish emails publicly, or use contact forms. This Actor is designed to prioritize speed + low cost first; enrichment can be enabled/extended as needed.
Can it extract categories, opening hours, or more fields?
Yes—YellowPages often exposes additional structured fields. If you need extra fields, extend the JSON-LD mapping.
💰 Pricing
This Actor is optimized for speed and low compute usage:
Resource blocking enabled
JSON-LD extraction is lightweight
Minimal DOM parsing
You pay for lead data, not ads and images.
✅ Ideal Buyers (SEO Targets)
This Actor is built for:
Lead generation agencies
SaaS founders doing outbound
Local service marketers
Growth teams and SDRs
Data brokers & enrichment pipelines
Keywords users search in Apify Store that this actor targets: yellow pages scraper, yellowpages leads, business leads scraper, phone number scraper, local business scraper, b2b leads, email enrichment, lead generation, directory scraper
🧭 Roadmap
Planned improvements:
True website email extraction (crawl business domains safely)
Social link extraction (LinkedIn/Facebook/Instagram)
Smarter deduplication across pages
Per-lead “quality score” (rating + reviews + completeness)
Support & Feedback
If you have a feature request or run into an issue:
Open an issue in the Actor page
Include the keyword, location, and a sample YellowPages URL
We actively maintain this Actor to keep pace with YellowPages anti-bot changes.