Yellow Pages Scraper — US Business Leads | from $1.50/1K avatar

Yellow Pages Scraper — US Business Leads | from $1.50/1K

Pricing

Pay per usage

Go to Apify Store
Yellow Pages Scraper — US Business Leads | from $1.50/1K

Yellow Pages Scraper — US Business Leads | from $1.50/1K

Scrape Yellow Pages US business listings — name, phone, address, website, categories, rating. Bulk lead-gen by search term + location. Dual address parsing, organic/ad flag, clean website URLs. Each record has parse_confidence.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Vitalii Bondarev

Vitalii Bondarev

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Yellow Pages Scraper — US Local Business Directory

Pay-per-result: $1.50/1K businesses scraped. Phone numbers and ratings included. No proxy, no monthly fee.

Yellow Pages still indexes 20 million+ US local businesses. This scraper gives you clean, structured leads — name, phone, address, website, categories, rating — in seconds. Pay only for results you get.

Scrape yellowpages.com search results into clean, lead-gen-ready JSON. Extract business name, phone, address, website, categories, rating, review count, and years in business — with a parse_confidence score on every record.

What you get

FieldDescription
business_nameFull business name
phonePrimary phone number
streetStreet address
cityCity
stateState (2-letter code)
zipZIP code
websiteBusiness website URL (external, not a YP redirect)
categoriesList of YP business categories
ratingStar rating (1.0–5.0, half-star precision)
review_countNumber of reviews
years_in_businessYears in business (from YP badge)
yp_urlCanonical Yellow Pages listing URL
is_adWhether this is a sponsored listing
parse_confidenceParse quality score (1.0 = perfect)
warningsList of machine-readable quality codes

Input

ParameterTypeDefaultDescription
searchTermsstring"plumbers"Business type or keyword
locationstring"Austin, TX"City, state, or ZIP
maxResultsinteger30Max records to return
includeAdsbooleanfalseInclude sponsored listings
startPageinteger1Start page (for resuming runs)

Why this scraper is different

Most Yellow Pages scrapers on the market break within weeks because they anchor on CSS class names that change with every site deploy. This scraper uses structural HTML anchors (div.v-card, a.business-name, p.adr, div.result-rating class words) that are tied to semantic meaning, not generated class names. When YP updates their CSS, this parser keeps working.

Every record ships a parse_confidence score (0.0–1.0). Below 0.7 is a machine-readable signal that the page structure has drifted — your data pipeline can filter automatically.

This actorGeneric YP scrapers
Parse methodStructural HTML anchors (resilient)CSS class names (breaks on redeploy)
parse_confidenceYes — per recordNo
Address parsingDual-pattern (service-area + physical)Often breaks on service-area listings
Proxy requiredNoOften yes
is_ad flagYes (filter sponsored)Rarely
years_in_businessYes (unique YP data point)Rarely
Price$1.50/1K$2-5/1K

Use with AI agents (MCP)

This scraper is callable as a tool by AI agents (Claude Desktop, Cursor, VS Code, n8n, LangGraph, CrewAI, or any MCP-compatible client) via Apify's hosted Model Context Protocol server. Any AI agent can look up US business contacts mid-conversation.

Point your MCP client at this tool:

{
"mcpServers": {
"apify": {
"command": "npx",
"args": [
"mcp-remote",
"https://mcp.apify.com/?tools=bovi/yellowpages-scraper",
"--header",
"Authorization: Bearer <YOUR_APIFY_TOKEN>"
]
}
}
}

Pricing example

Pay-per-result: $1.50 per 1,000 business records ($0.0015/record). You only pay for actual results scraped — no monthly fee, no minimum.

Run sizeCost
100 businesses$0.15
500 businesses$0.75
1,000 businesses$1.50
5,000 businesses$7.50

Output sample

{
"business_name": "Austin Plumbing Co",
"phone": "(512) 555-0147",
"street": "1423 S Lamar Blvd",
"city": "Austin",
"state": "TX",
"zip": "78704",
"website": "https://austinplumbing.com",
"categories": ["Plumbers", "Drain Cleaning"],
"rating": 4.5,
"review_count": 87,
"years_in_business": 12,
"yp_url": "https://www.yellowpages.com/austin-tx/mip/austin-plumbing-co-123456",
"is_ad": false,
"parse_confidence": 1.0,
"warnings": []
}

FAQ

Do I need a proxy or API key? No — Yellow Pages serves plain HTML without heavy bot protection. No proxy or API key required. The actor runs zero-COGS.

What export formats are available? JSON, CSV, Excel, and XML — downloadable from the dataset page or via the Apify REST API.

Can I schedule regular runs? Yes. Use Apify Scheduler (or n8n/Zapier) to run on a schedule and push new records to Google Sheets, a CRM, or a webhook.

What if the actor returns empty results? Confirm the searchTerms and location match Yellow Pages conventions (e.g. "Austin, TX" not "Austin Texas"). YP returns empty when there are no results for that exact search pair.

Use cases

  • Lead generation: build targeted prospect lists (plumbers in Austin, dentists in Chicago)
  • Local SEO research: audit competitors' listings, ratings, and categories
  • Market research: map service providers in a region
  • Data enrichment: match phone/address data to existing business lists

Technical notes

  • No proxy required for most runs — YP is plain HTML without heavy anti-bot
  • Handles both service-area listings (no street address) and physical locations
  • Filters YP redirect links to return the actual business website URL
  • Pagination via ?page=N — 30 results per page

Integrations

Built for lead-gen teams and local-market researchers extracting US business contacts and ratings by category and city — the JSON/dataset output drops into the tools you already run, no glue code:

  • n8n / Make / Zapier — trigger a run or pipe every new dataset item into 500+ apps (Google Sheets, Airtable, Slack, HubSpot, your database) with no code: n8n, Make, Zapier.
  • Webhooks — fire your own endpoint the moment a run finishes, to push results straight into your pipeline (docs).
  • MCP server — expose this actor as a tool to Claude, Cursor, or any MCP client so an AI agent can pull this data mid-conversation (guide).
  • API & SDKs — fetch the dataset as JSON, CSV, or Excel through the Apify REST API or the Python / JS SDKs.

See all Apify integrations.