Yellow Pages Scraper: Emails & B2B Leads

Pricing

from $1.00 / 1,000 results

Extract B2B leads, emails, and phone numbers from YellowPages.com. Built to run faster and cheaper than typical browser scrapers. Suited to cold outreach, marketing agencies, and sales teams. Residential proxies are required to keep the success rate high; social link enrichment (LinkedIn, Facebook, Instagram) is on the roadmap.

Rating: 0.0 (0)
Developer: Logiover Data (maintained by Community)
Actor stats: 0 bookmarked, 2 total users, 0 monthly active users
Last modified: 8 hours ago

🚀 Yellow Pages Lead Machine — YellowPages Scraper + Email Enricher (Fast & Cheap)

Generate high-quality B2B leads from YellowPages in minutes.
Scrape business listings by keyword + location, extract phone, address, ratings, website, and optionally enrich emails from business websites.

If you are building outbound sales lists, local lead generation pipelines, or agency prospecting workflows, this Actor delivers structured, ready-to-use leads optimized for speed and low cost.


✅ What you get (Business Leads Dataset)

This Actor outputs a clean dataset with fields designed for immediate use in:

  • CRM imports (HubSpot / Salesforce)
  • Cold email tools (Apollo-like workflows)
  • Lead enrichment pipelines
  • Data warehouses / spreadsheets

Extracted Fields

  • Business Name (name)
  • Phone (telephone)
  • Address (address, locality, region, postalCode)
  • Website (website)
  • Rating + Review Count (rating, reviewCount)
  • Search Context (searchTerm, searchLocation)
  • Tracking (yellowPagesUrl, scrapedAt)

Note: The dataset view has columns for email and social profiles, which are populated only when enrichment is enabled. The current implementation focuses on high-speed JSON-LD extraction, the most reliable and lowest-cost path.


⚡ Why this scraper is faster and more reliable

Most scrapers parse messy HTML and break often. Yellow Pages Lead Machine primarily mines structured metadata from the page source:

JSON-LD extraction (stable)

  • Pulls data from <script type="application/ld+json"> where YellowPages exposes structured business objects.
  • Far more robust than brittle CSS selector scraping.
  • Less DOM interaction = lower runtime cost.
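To illustrate the JSON-LD approach described above, here is a minimal sketch using only the Python standard library. It is not the Actor's actual code; the sample HTML and the names `JsonLdParser` and `extract_jsonld` are assumptions made for this example.

```python
import json
from html.parser import HTMLParser


class JsonLdParser(HTMLParser):
    """Collects the text of every <script type="application/ld+json"> tag."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.blocks.append("".join(self._buffer))


def extract_jsonld(html):
    """Return all JSON-LD objects found in the page, skipping malformed blocks."""
    parser = JsonLdParser()
    parser.feed(html)
    objects = []
    for raw in parser.blocks:
        try:
            parsed = json.loads(raw)
        except json.JSONDecodeError:
            continue  # ignore a broken block instead of failing the whole page
        # A block may hold a single object or a list of objects.
        objects.extend(parsed if isinstance(parsed, list) else [parsed])
    return objects


# Made-up page: one valid JSON-LD block, one broken block, unrelated markup.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
[{"@type": "LocalBusiness", "name": "ACME Plumbing Co", "telephone": "+1 212-555-0101"}]
</script>
<script type="application/ld+json">{not valid json}</script>
</head><body><div class="listing">ignored</div></body></html>
"""

businesses = extract_jsonld(SAMPLE_HTML)
```

Because the parser only looks at one kind of script tag, changes to the visible page layout do not affect it, which is the robustness argument made above.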

Performance optimizations

  • Resource blocking: images/fonts/trackers blocked for speed.
  • Residential proxy required: reduces blocks/403 and improves success rate.
  • Pagination support: automatically crawls “next” pages until maxItems is satisfied.
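The pagination behavior can be pictured with the sketch below. It assumes a hypothetical `fetch_page` callable that returns the items on one results page plus a flag saying whether a "next" page exists; the Actor's real crawling code is not shown in this document.

```python
def collect_until_max(fetch_page, max_items):
    """Walk result pages until max_items leads are collected or pages run out.

    fetch_page(page_number) is a hypothetical callable returning
    (items, has_next) for a single results page.
    """
    collected = []
    page = 1
    while len(collected) < max_items:
        items, has_next = fetch_page(page)
        collected.extend(items)
        if not has_next or not items:
            break  # no further pages, or an empty page: stop crawling
        page += 1
    # The last page may overshoot, so trim to the requested size.
    return collected[:max_items]


# Stand-in for real HTTP fetching: 3 pages of 30 fake listings each.
def fake_fetch(page):
    items = [{"name": f"Business {page}-{i}"} for i in range(30)]
    return items, page < 3


leads = collect_until_max(fake_fetch, max_items=50)
```

Stopping as soon as the limit is reached is what keeps request counts, and therefore cost, proportional to maxItems.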

🧠 Best use cases (high-converting niches)

  • Local Services: plumbers, roofers, electricians, HVAC, locksmiths
  • Healthcare: dentists, chiropractors, clinics, med spas
  • Professional Services: law firms, accountants, real estate agencies
  • B2B Services: IT services, marketing agencies, staffing, logistics

These niches combine high search volume with high intent: buyers actively search for these services and call, which is why these keywords have historically performed well for B2B lead generation.


🛠 How to Use

  1. Add one or more Search Keywords (e.g. Plumbers, Dentists)
  2. Add one or more Locations (e.g. New York, NY)
  3. Set Max Items (per run)
  4. Keep Residential Proxy enabled (required)
  5. Run the Actor and export results (JSON/CSV/Excel)

⚙️ Input Configuration

This Actor uses an Apify input schema optimized for non-technical users.

Fields

  • searchTerms (required): list of keywords
  • locations (required): list of locations
  • maxItems: maximum number of results per run (default: 100)
  • enrichEmails: optional (default: true); enabling it adds runtime because external business websites are visited
  • proxy (required): Apify proxy configuration using the Residential group

Example Input (JSON)

{
  "searchTerms": ["Plumbers", "Electricians"],
  "locations": ["New York, NY", "Brooklyn, NY"],
  "maxItems": 200,
  "enrichEmails": true,
  "proxy": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
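If you generate inputs programmatically, it can help to validate them before starting a run. The sketch below applies the defaults documented above (maxItems 100, enrichEmails true); `normalize_input` is a hypothetical helper written for this example, not part of the Actor.

```python
def normalize_input(raw):
    """Validate an input dict and fill in the documented defaults."""
    for field in ("searchTerms", "locations"):
        values = raw.get(field)
        if not isinstance(values, list) or not values:
            raise ValueError(f"{field} must be a non-empty list")
    if not raw.get("proxy"):
        raise ValueError("proxy is required")

    max_items = raw.get("maxItems", 100)  # documented default
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 1:
        raise ValueError("maxItems must be a positive integer")

    return {
        "searchTerms": [term.strip() for term in raw["searchTerms"]],
        "locations": [location.strip() for location in raw["locations"]],
        "maxItems": max_items,
        "enrichEmails": bool(raw.get("enrichEmails", True)),  # documented default
        "proxy": raw["proxy"],
    }


config = normalize_input({
    "searchTerms": [" Plumbers "],
    "locations": ["New York, NY"],
    "proxy": {"useApifyProxy": True, "apifyProxyGroups": ["RESIDENTIAL"]},
})
```

Failing early on a missing keyword list or proxy block avoids paying for a run that cannot produce results.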

📦 Output Example (Dataset Item)

{
  "name": "ACME Plumbing Co",
  "telephone": "+1 212-555-0101",
  "address": "123 Example St",
  "locality": "New York",
  "region": "NY",
  "postalCode": "10001",
  "website": "https://www.yellowpages.com/new-york-ny/mip/acme-plumbing-co-123456",
  "rating": 4.6,
  "reviewCount": 128,
  "yellowPagesUrl": "https://www.yellowpages.com/search?search_terms=Plumbers&geo_location_terms=New%20York%2C%20NY",
  "searchTerm": "Plumbers",
  "searchLocation": "New York, NY",
  "scrapedAt": "2026-01-12T15:20:00.000Z"
}
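A flat item like the one above could be produced from a JSON-LD business object roughly as follows. This sketch assumes the page uses standard schema.org property names (`address.streetAddress`, `aggregateRating.ratingValue`, and so on) and that `website` comes from the object's `url`; the Actor's real mapping may differ, and `to_dataset_item` is a hypothetical name.

```python
from datetime import datetime, timezone


def to_dataset_item(business, search_term, search_location, page_url):
    """Flatten a schema.org-style business object into the dataset fields."""
    address = business.get("address") or {}
    rating = business.get("aggregateRating") or {}
    return {
        "name": business.get("name"),
        "telephone": business.get("telephone"),
        "address": address.get("streetAddress"),
        "locality": address.get("addressLocality"),
        "region": address.get("addressRegion"),
        "postalCode": address.get("postalCode"),
        "website": business.get("url"),  # assumption: taken from the "url" property
        "rating": rating.get("ratingValue"),
        "reviewCount": rating.get("reviewCount"),
        "yellowPagesUrl": page_url,
        "searchTerm": search_term,
        "searchLocation": search_location,
        # UTC timestamp in ISO 8601 form (ends in +00:00 rather than Z).
        "scrapedAt": datetime.now(timezone.utc).isoformat(),
    }


sample_business = {
    "@type": "LocalBusiness",
    "name": "ACME Plumbing Co",
    "telephone": "+1 212-555-0101",
    "address": {
        "streetAddress": "123 Example St",
        "addressLocality": "New York",
        "addressRegion": "NY",
        "postalCode": "10001",
    },
    "aggregateRating": {"ratingValue": 4.6, "reviewCount": 128},
}

item = to_dataset_item(
    sample_business,
    "Plumbers",
    "New York, NY",
    "https://www.yellowpages.com/search?search_terms=Plumbers&geo_location_terms=New%20York%2C%20NY",
)
```

Missing properties come through as null values rather than errors, so sparse listings still yield a usable row.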

📊 Dataset View (Overview Table)

The Actor includes a dataset table view showing:

  • Business Name
  • Phone
  • Email (Enriched)
  • Website
  • Address
  • Rating

It can also include:

  • Social Profiles
  • Search Term
  • Search Location

This makes it easy to validate lead quality inside Apify before exporting.

🔥 Pro Tips (to maximize leads & reduce cost)

  1. Use multiple keywords and locations in one run

This Actor generates start URLs by combining every keyword with every location:

5 keywords × 10 locations = 50 start searches

  2. Keep maxItems realistic per run

Large runs increase pagination depth and WAF risk. For high-volume scraping:

  • Run smaller batches
  • Rotate sessions with Residential proxy
  • Combine results across runs

  3. Email enrichment strategy

If you enable “visit business websites”, expect:

  • Higher runtime
  • More requests
  • Sometimes lower success if sites block bots

Recommended approach:

  • First scrape leads fast
  • Then enrich the best leads (filtered by rating / review count / category)
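The keyword × location expansion from the first tip can be sketched as below. The URL pattern is taken from the yellowPagesUrl value in the output example; `build_start_urls` is a hypothetical helper, and the Actor's internal URL builder may differ.

```python
from itertools import product
from urllib.parse import quote, urlencode

SEARCH_URL = "https://www.yellowpages.com/search"


def build_start_urls(search_terms, locations):
    """Combine every keyword with every location into one search URL each."""
    urls = []
    for term, location in product(search_terms, locations):
        query = urlencode(
            {"search_terms": term, "geo_location_terms": location},
            quote_via=quote,  # encode spaces as %20, matching the sample URL
        )
        urls.append(f"{SEARCH_URL}?{query}")
    return urls


start_urls = build_start_urls(
    ["Plumbers", "Electricians"],
    ["New York, NY", "Brooklyn, NY"],
)
```

Two keywords and two locations give four start URLs, so the number of searches grows multiplicatively; keep that in mind when choosing maxItems.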

🧱 Anti-bot & Proxy Notes (Important)

YellowPages is protected by aggressive anti-bot measures. For production reliability:

Residential proxies are required.

If you see 403 / Access Denied:

  • Increase proxy rotation (new sessions)
  • Reduce concurrency (if you tune advanced settings)
  • Run smaller batches

This Actor already checks “Access Denied / 403” signals and fails fast so you don’t burn compute unnecessarily.
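A fail-fast check of this kind might look like the following sketch. The marker strings and the names `BlockedError`, `is_blocked`, and `ensure_not_blocked` are assumptions for illustration; the signals the Actor actually inspects are not listed in this document.

```python
class BlockedError(RuntimeError):
    """Raised when a response looks like an anti-bot block page."""


# Assumed markers; a real block page may use different wording.
BLOCK_MARKERS = ("access denied", "403 forbidden")


def is_blocked(status_code, body):
    """Return True if the status code or page text signals a block."""
    if status_code == 403:
        return True
    # Block pages are short, so checking the start of the body is enough.
    head = body[:2000].lower()
    return any(marker in head for marker in BLOCK_MARKERS)


def ensure_not_blocked(status_code, body):
    """Fail fast instead of parsing (and paying for) a useless page."""
    if is_blocked(status_code, body):
        raise BlockedError(f"blocked response (HTTP {status_code})")


ok_page = "<html><body><h1>Plumbers in New York, NY</h1></body></html>"
denied_page = "<html><body><h1>Access Denied</h1></body></html>"
```

Raising a dedicated exception lets the caller decide whether to retry with a fresh proxy session or stop the run.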

🧩 Common Questions

Do I need a GitHub token?

No. This Actor scrapes YellowPages, not GitHub.

Why is email sometimes empty?

Email enrichment depends on visiting external websites and extracting emails from pages. Many businesses don’t publish emails publicly, or use contact forms. This Actor is designed to prioritize speed + low cost first; enrichment can be enabled/extended as needed.
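As a rough picture of why results vary, the sketch below pulls email addresses out of a page with a simple regular expression. It is an illustrative assumption, not the Actor's enrichment code; `extract_emails` and the sample HTML are made up for this example.

```python
import re

# Deliberately simple pattern; it does not cover every valid address format.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Asset names such as logo@2x.png look like emails to the pattern above.
IGNORED_SUFFIXES = (".png", ".jpg", ".jpeg", ".gif", ".svg", ".webp")


def extract_emails(html):
    """Return unique, lowercased email addresses in order of appearance."""
    seen = set()
    emails = []
    for match in EMAIL_RE.findall(html):
        email = match.lower()
        if email.endswith(IGNORED_SUFFIXES) or email in seen:
            continue
        seen.add(email)
        emails.append(email)
    return emails


contact_page = (
    '<a href="mailto:Info@acme-plumbing.example">Info@acme-plumbing.example</a>'
    '<img src="/img/logo@2x.png">'
)
form_only_page = '<form action="/contact"><input name="message"></form>'

found = extract_emails(contact_page)
missing = extract_emails(form_only_page)
```

A site that only offers a contact form yields an empty list, which is the most common reason the email field stays empty.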

Can it extract categories, opening hours, or more fields?

Yes. YellowPages often exposes additional structured fields. If you need extra fields, extend the JSON-LD mapping.

💰 Pricing

This Actor is optimized for speed and low compute usage:

  • Resource blocking enabled
  • JSON-LD extraction is lightweight
  • Minimal DOM parsing

You pay for lead data, not ads and images.

✅ Ideal Buyers (SEO Targets)

This Actor is built for:

  • Lead generation agencies
  • SaaS founders doing outbound
  • Local service marketers
  • Growth teams and SDRs
  • Data brokers & enrichment pipelines

Keywords users search in Apify Store that this actor targets: yellow pages scraper, yellowpages leads, business leads scraper, phone number scraper, local business scraper, b2b leads, email enrichment, lead generation, directory scraper

🧭 Roadmap

Planned improvements:

  • True website email extraction (crawl business domains safely)
  • Social link extraction (LinkedIn/Facebook/Instagram)
  • Smarter deduplication across pages
  • Per-lead “quality score” (rating + reviews + completeness)

Support & Feedback

If you have a feature request or run into an issue:

  • Open an issue on the Actor page
  • Include the keyword, location, and a sample YellowPages URL

We actively maintain this Actor to keep pace with YellowPages anti-bot changes.