Yellow Pages US Scraper avatar

Yellow Pages US Scraper

Pricing

from $1.05 / 1,000 results

Go to Apify Store
Yellow Pages US Scraper

Yellow Pages US Scraper

Scrape US business listings from yellowpages.com — names, phone numbers, addresses, websites, categories, ratings, and reviews. Search by category + location or pass direct URLs. Production-grade with battle-tested anti-bot handling and daily fixture tests.

Pricing

from $1.05 / 1,000 results

Rating

0.0

(0)

Developer

Giga Data

Giga Data

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

📞 Yellow Pages US Scraper

Extract US business listings from yellowpages.com — names, phone numbers, addresses, websites, categories, ratings, and reviews. Built for lead-gen agencies, local-SEO teams, and B2B sales ops.

Yellow Pages US Scraper turns yellowpages.com into a clean, structured data feed. Provide a category and a city, and you get every business listed — with phones, websites, addresses, ratings, and category labels — pushed straight to a downloadable dataset (JSON, CSV, Excel).

Use it to:

  • Generate qualified B2B lead lists — plumbers in Boston, dentists in Denver, electricians in Austin — every business with phone and website in seconds.
  • Track local competitors across regions to spot expansion opportunities or gaps.
  • Seed your CRM or marketing tool with verified local business data.
  • Run market research — TAM sizing, density mapping, vertical surveys.
  • Power RAG / LLM datasets that need structured local-business records.

Unlike Google's Places API (which caps you at 60 results per query and charges per call) or one-off freebie scrapers (which break the moment Yellow Pages tweaks its HTML), this Actor ships with active maintenance: every selector is fixture-tested daily, breakage gets fixed within hours, and you only ever pay for results — never for failed runs.

💰 Pricing

$1.00 per 1,000 results — Pay-Per-Result. The first 100 results are free, and Apify's $5/month free credit covers a further ~5,000 results out of the box. Stops costing you anything if you stop running it.

🎥 Demo

Demo video coming soon — meanwhile, see "Sample output" below for what a single row looks like.

✨ What you get

Every business record contains:

FieldDescription
nameBusiness name as listed
phonePrimary phone (e.g. "(617) 555-1234")
websiteDirect website URL (when provided by the listing)
addressStreet address
city, state, zipParsed locality components
categoriesArray of Yellow Pages category labels
ratingStar rating (0–5, half-stars supported)
reviewCountNumber of reviews on Yellow Pages
yearsInBusinessSelf-reported years operating
isAdtrue for paid/sponsored listings, false for organic
businessUrlCanonical Yellow Pages business-detail URL
email, hours, servicesExtra fields populated when "Visit business detail pages" is enabled
searchTerm, searchLocationEcho of the input that produced this row
scrapedAtISO timestamp of extraction

🚀 How to use

  1. Pick what you want. Enter one or more search terms (e.g. plumber, italian restaurant) and one or more locations (e.g. Boston, MA, Austin, TX). The Actor crawls every combination.
  2. Cap the result count. Set maxItems and maxPagesPerSearch so the Actor stops exactly where you want.
  3. (Optional) toggle "Visit business detail pages" to pull richer fields per business (email, hours, services). Roughly 2× the runtime and cost.
  4. Click Start. Results stream into the Apify dataset — download as JSON, CSV, or Excel from the Actor's Storage tab when the run completes.

Advanced users can pass startUrls directly to control which exact category landing pages get crawled (e.g. https://www.yellowpages.com/boston-ma/plumbers).

📦 Sample output

{
"name": "A Hohmann & Co Inc",
"phone": "(844) 825-3055",
"website": "http://www.hohmannoilandplumbing.com",
"address": "1146 Dorchester Ave",
"city": "Boston",
"state": "MA",
"zip": "02125",
"categories": ["Plumbers"],
"rating": 3,
"reviewCount": 2,
"yearsInBusiness": 116,
"isAd": false,
"businessUrl": "https://www.yellowpages.com/boston-ma/mip/a-hohmann-co-inc-474353566",
"email": null,
"hours": null,
"services": [],
"searchTerm": "plumber",
"searchLocation": "Boston, MA",
"scrapedAt": "2026-05-12T14:23:17.504Z"
}

⚙️ Input

FieldTypeDefaultDescription
searchTermsstring[]["plumber"]Categories or keywords to search for.
locationsstring[]["Boston, MA"]US locations in "City, ST" format.
startUrlsURL[]Direct category URLs (advanced; overrides searchTerms × locations).
maxItemsint200Stop after N results. 0 = unlimited.
maxPagesPerSearchint5Max result pages crawled per query (YP caps near 30).
extractDetailsboolfalseFollow each business's detail page for email, hours, services.
proxyConfigurationproxyRESIDENTIALApify proxy settings — residential strongly recommended.

See the input schema panel for the full reference.

❓ FAQ

How is this different from the official Yellow Pages search? Yellow Pages' on-site search is built for human browsers — there's no bulk export, no structured fields, and no API to feed your CRM. This Actor gives you the same listings as clean rows you can pipe into Salesforce, Apollo, a spreadsheet, or your own dashboard.

How accurate is the data? Every field you see on a listing page is captured. Yellow Pages updates listings continuously, so freshness depends on when they were last updated by the business owner. Phone numbers and business names are the most reliable fields; emails and hours often live on the detail page (enable extractDetails to pull them).

Will it break when Yellow Pages changes their HTML? That's the bet most scrapers lose. This one ships with daily fixture tests that detect breakage within hours, and fixes are deployed fast — typically same-day. Run-success-rate is publicly visible on this page.

What about anti-bot? Will my runs get blocked? Yellow Pages is fronted by Cloudflare. This Actor uses Apify's residential proxy by default plus a Chrome-grade TLS fingerprint, which sails past Cloudflare's normal protections. We monitor block rates and adapt.

Does this scrape user-personal information? No. Every field is publicly-listed business data (business names, business phones, business addresses, business reviews). We don't extract reviewer profiles, individual emails, or anything Yellow Pages classifies as personal.

How big a job can it handle? Tens of thousands of records per run is comfortable. For multi-million-record jobs, contact us — we'll suggest sharding strategy and run dedicated infrastructure.

⚠️ Limits, ToS & compliance

  • Rate limits: Yellow Pages caps results around 30 pages per category-location pair. Run-pace is throttled by Apify residential proxy.
  • Anti-bot: Cloudflare protection bypassed via real browser TLS fingerprint + residential IPs. Datacenter proxies will not work.
  • Legal posture: Output is public business data — names, addresses, phones of registered businesses. No personal data, no protected content. GDPR-clean.
  • Source data ownership: Yellow Pages content is published by Thryv, Inc. This Actor sells the capability to extract that public data into your own workflow — buyers are responsible for how they use the results.

🏷️ Other Actors by Giga Data

  • More B2B-directory scrapers coming soon — bookmark our org page.

📬 Support & contact

  • Open an issue on the Apify support console.
  • Issue templates auto-route maintenance failures to a fix queue.

Built and maintained by Giga Data. We treat scraping as infrastructure — fixture-tested, monitored, and patched fast when target sites change.

Disclosure: this Actor is AI-assisted (the parser, schema, and listing copy were authored with Anthropic Claude). The maintainer reviews every change before deploy.