🏢 Yellow Pages US Scraper

Pricing

from $4.99 / 1,000 results

Developer

API Empire

Maintained by Community

Scrape Yellow Pages US business listings into clean, structured data (business name, phone, full address, website, star rating, review count, review snippets, photos and categories) from a simple search term + location or from bulk search URLs.

⚡ Streams results into the dataset as each listing is parsed, so partial output is available before the run ends. Refresh the run page and watch rows appear live.


✨ Why choose this scraper?

  • 🧠 Smart proxy auto-escalation: starts direct (no proxy, cheapest and fastest), then automatically falls back to a datacenter proxy, then to a US residential proxy if Yellow Pages pushes back. Once on residential it stays there, so you never have to manage proxies yourself.
  • 🔎 Search OR bulk URLs: type Dentist + New York and hit Start, or paste a list of Yellow Pages search URLs. Both work, and you can mix them freely.
  • 📍 Location auto-correct: loose input like austin tx is resolved to the Yellow Pages canonical form (Austin, TX) via their own autosuggest service.
  • 🧱 Browser TLS impersonation: every request carries a genuine Chrome/Edge fingerprint (via curl_cffi), so Yellow Pages serves real HTML instead of a bot page, without the cost of a headless browser.
  • 📄 Automatic pagination: follows the "Next" link to the last page of every search.
  • 📊 Sectioned dataset tabs: Overview, Contact, Reviews and Media views built in.
  • 💳 Pay per event: you pay only for listings actually delivered.

🔑 Key Features

|  | Field extracted |
| --- | --- |
| 📢 | Paid-ad flag (isAd): sponsored vs organic result |
| 🏢 | Business name |
| 📍 | Street address |
| 📞 | Phone number |
| 🌐 | Business website |
| ⭐ | Star rating (1–5) |
| 🗳️ | Review count |
| 💬 | Customer review snippet |
| ℹ️ | "From Business" description snippet |
| 🖼️ | Business photo (full-size URL) |
| 🏷️ | Category list |
| 🔗 | Yellow Pages profile URL |

📥 Input

| Field | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| search | string | ❌* | Dentist | Business type, category or company name |
| location | string | ❌* | New York | US city, "City, ST" pair, or ZIP code |
| startUrls | array | ❌* | – | Bulk Yellow Pages search-result URLs |
| maxItems | integer | ❌ | 50 | Cap on total listings (0 = unlimited) |
| debugMode | boolean | ❌ | false | Verbose debug logging in the run log (troubleshooting) |
| requestDelay | number | ❌ | 1.0 | Base seconds between requests (jitter added) |
| proxyConfiguration | object | ❌ | direct | Proxy override; defaults to no proxy |

* Provide either search + location or startUrls (or both).
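
The rule in the footnote can be checked on the client side before a run is started. The sketch below is a minimal illustration, not the actor's own validation: `validate_input` is a hypothetical helper, and it assumes that an input with neither `search` + `location` nor `startUrls` should be rejected outright, even though the actor itself may fall back to the defaults listed in the table.

```python
from typing import Any


def validate_input(actor_input: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the input looks usable.

    Hypothetical client-side check based on the input table, not the actor's code.
    """
    problems: list[str] = []

    # Either a search term plus a location, or a non-empty list of start URLs.
    has_search = bool(actor_input.get("search")) and bool(actor_input.get("location"))
    has_urls = bool(actor_input.get("startUrls"))
    if not (has_search or has_urls):
        problems.append("provide search + location, startUrls, or both")

    # maxItems is an integer cap; 0 means unlimited, negatives make no sense.
    max_items = actor_input.get("maxItems", 50)
    if isinstance(max_items, bool) or not isinstance(max_items, int) or max_items < 0:
        problems.append("maxItems must be a non-negative integer (0 = unlimited)")

    return problems
```

Calling `validate_input({"search": "Dentist", "location": "New York, NY"})` returns an empty list, while an input that only sets `maxItems` reports the missing search mode.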

Example input

{
  "search": "Dentist",
  "location": "New York, NY",
  "maxItems": 50,
  "requestDelay": 1.0,
  "proxyConfiguration": { "useApifyProxy": false }
}

Bulk URL example

{
  "startUrls": [
    "https://www.yellowpages.com/search?search_terms=plumber&geo_location_terms=New+York%2C+NY",
    "https://www.yellowpages.com/search?search_terms=coffee&geo_location_terms=Austin%2C+TX"
  ],
  "maxItems": 200
}
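
For multi-city runs, the `startUrls` list can be generated rather than typed by hand. The sketch below assumes the URL shape shown in the bulk example (`search_terms` and `geo_location_terms` query parameters); `build_search_url` and `build_start_urls` are hypothetical helper names, not part of the actor.

```python
from urllib.parse import urlencode

BASE_URL = "https://www.yellowpages.com/search"


def build_search_url(term: str, location: str) -> str:
    """Build one search URL in the same shape as the bulk URL example."""
    # urlencode escapes spaces as "+" and commas as "%2C".
    query = urlencode({"search_terms": term, "geo_location_terms": location})
    return f"{BASE_URL}?{query}"


def build_start_urls(term: str, locations: list[str]) -> list[str]:
    """One search URL per location, ready to use as the startUrls input."""
    return [build_search_url(term, location) for location in locations]


actor_input = {
    "startUrls": build_start_urls("plumber", ["New York, NY", "Austin, TX"]),
    "maxItems": 200,
}
```

The first generated URL is identical to the first entry of the bulk example above.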

📤 Output

Each dataset row is one business listing. Empty fields are omitted rather than returned as null.

{
  "isAd": false,
  "url": "https://www.yellowpages.com/new-york-ny/mip/dr-kara-mason-dmd-5721648",
  "name": "Dr. Kara Mason, DMD",
  "address": "30 E 60th St Rm 503, New York, NY 10022",
  "phone": "(212) 355-2195",
  "website": "http://www.gentledentalnyc.com",
  "rating": 5,
  "ratingCount": 175,
  "reviewSnippet": "Extremely knowledgeable of craft with sincere care for her patients.\"",
  "image": "https://i4.ypcdn.com/blob/80cd8b97ee48d0b7715eacdc0801d77c44cc978b",
  "categories": ["Dentists", "Oral & Maxillofacial Surgery"]
}

| Field | Type | Description |
| --- | --- | --- |
| isAd | boolean | true for paid/sponsored listings, false for organic |
| url | string | Yellow Pages business profile URL |
| name | string | Business name |
| address | string | Street address |
| phone | string | Phone number |
| website | string | Business website (when listed) |
| rating | number | Star rating, 1–5 (when rated) |
| ratingCount | integer | Number of reviews (when rated) |
| reviewSnippet | string | A sample customer review (when present) |
| infoSnippet | string | "From Business" description (when present) |
| image | string | Business photo URL (when present) |
| categories | array | Yellow Pages category labels |

The dataset has four ready-made views (📋 Overview, 📞 Contact, ⭐ Ratings & Reviews and 🖼️ Media) so you can focus on one section at a time.
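
Because empty fields are omitted from a row, downstream code should not assume that every key exists. The sketch below shows one way to flatten rows into a fixed-column CSV; `rows_to_csv` and the `CONTACT_FIELDS` column choice are illustrative assumptions, with field names taken from the output table.

```python
import csv
import io

# Illustrative column selection; any fields from the output table would work.
CONTACT_FIELDS = ["name", "phone", "address", "website"]


def rows_to_csv(rows: list[dict], fields: list[str] = CONTACT_FIELDS) -> str:
    """Serialize dataset rows to CSV, tolerating omitted fields."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=fields,
        restval="",             # keys missing from a row become empty cells
        extrasaction="ignore",  # keys outside `fields` are silently dropped
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

A row that has no `website` or `address` simply produces empty cells in those columns instead of raising a `KeyError`.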


🚀 How to Use (Apify Console)

  1. Log in at console.apify.com → Actors.
  2. Open Yellow Pages US Scraper.
  3. Enter a 🔎 Search term and 📍 Location, or paste 🌐 URLs.
  4. Set 🔢 Maximum listings (default 50).
  5. Click Start.
  6. Watch listings stream into the run log and dataset in real time.
  7. When the run finishes, open the Output tab.
  8. Export to JSON / CSV / Excel or pull via API.

🤖 Use via API

curl -X POST "https://api.apify.com/v2/acts/YOUR~yellow-pages-us-scraper/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "search": "Plumber",
    "location": "Austin, TX",
    "maxItems": 50
  }'
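
The same call can be made from Python with only the standard library. This is a minimal sketch that mirrors the curl command above: `build_run_request` and `run_actor` are hypothetical helper names, the actor ID keeps the `YOUR~` placeholder from the curl example, and the 300-second timeout is an arbitrary assumption.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2/acts"


def build_run_request(actor_id: str, token: str, actor_input: dict) -> urllib.request.Request:
    """Prepare the POST request that the curl command above sends."""
    url = f"{API_BASE}/{actor_id}/run-sync-get-dataset-items?token={token}"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(actor_id: str, token: str, actor_input: dict, timeout: float = 300.0) -> list:
    """Send the request and decode the JSON response body."""
    request = build_run_request(actor_id, token, actor_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a real, billable run, so it is left commented out):
# import os
# rows = run_actor(
#     "YOUR~yellow-pages-us-scraper",
#     os.environ["APIFY_TOKEN"],
#     {"search": "Plumber", "location": "Austin, TX", "maxItems": 50},
# )
```

Splitting request construction from sending keeps the network call in one place and lets the request be inspected or logged before any cost is incurred.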

πŸ›‘οΈ How blocking is handled

Yellow Pages occasionally rate-limits aggressive traffic. This actor handles it automatically:

  1. Direct β€” requests go straight to Yellow Pages (no proxy cost).
  2. On a block β†’ datacenter proxy.
  3. On another block β†’ US residential proxy (sticky from here on).
  4. Residential is retried up to 3 times with a fresh IP and a cooldown.

Every escalation is logged clearly in the run log, e.g. πŸ”„ Yellow Pages blocked the request on DIRECT β€” switching to DATACENTER proxy.
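
The escalation steps above can be modelled as a small state machine. The sketch below is an illustration of the described behaviour only, not the actor's source code: `ProxyEscalator` and its attribute names are hypothetical, and the cooldown and IP rotation are left out.

```python
from dataclasses import dataclass

# Tier order as described in the numbered steps above.
TIERS = ("DIRECT", "DATACENTER", "RESIDENTIAL")
MAX_RESIDENTIAL_RETRIES = 3


@dataclass
class ProxyEscalator:
    """Tracks the current proxy tier; it only ever moves forward."""

    tier_index: int = 0
    residential_retries: int = 0

    @property
    def tier(self) -> str:
        return TIERS[self.tier_index]

    def on_block(self) -> bool:
        """Record a blocked request; return True if another attempt is allowed."""
        if self.tier_index < len(TIERS) - 1:
            # Steps 2 and 3: move to the next, more expensive tier.
            self.tier_index += 1
            return True
        if self.residential_retries < MAX_RESIDENTIAL_RETRIES:
            # Step 4: stay on residential and retry (fresh IP assumed).
            self.residential_retries += 1
            return True
        return False  # retries exhausted
```

Because `tier_index` never decreases, a run that has reached the residential tier stays there, which matches the "sticky" behaviour in step 3.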


💡 Best Use Cases

  • 📇 Lead generation: build B2B contact lists by category and city.
  • 📊 Local market research: map competitors in any US metro.
  • 🗺️ Local SEO audits: track ratings, reviews and categories.
  • 🤝 Sales prospecting: phone/website lists for outreach.

💳 Pricing

This actor uses the pay-per-event model. You are charged once per business listing successfully delivered to the dataset (listing-scraped). Runs that return no data cost nothing beyond the standard run start. The actor stops cleanly when a configured spend limit is reached.
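
As a rough budgeting aid, the per-listing charge can be estimated from the listed starting price of $4.99 per 1,000 results. The sketch below is an approximation under that assumption: the actual rate may differ by plan, the run-start charge is not included, and both function names are hypothetical.

```python
def estimate_listing_cost(listings: int, price_per_1000: float = 4.99) -> float:
    """Approximate charge in USD for a number of delivered listings."""
    if listings < 0:
        raise ValueError("listings must be non-negative")
    return round(listings * price_per_1000 / 1000, 4)


def max_listings_for_budget(budget_usd: float, price_per_1000: float = 4.99) -> int:
    """Largest listing count whose estimated charge stays within the budget."""
    if budget_usd < 0:
        raise ValueError("budget_usd must be non-negative")
    return int(budget_usd * 1000 / price_per_1000)
```

Under these assumptions the default `maxItems` of 50 costs about $0.25 in listing charges, and a $10 budget covers roughly 2,000 listings.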


❓ FAQ

Do I need a proxy? No. The actor runs direct by default and only turns proxies on if Yellow Pages blocks it, and it does that for you.

Can I scrape multiple cities at once? Yes. Use Bulk URL mode and paste one search URL per city.

Why are some fields missing on a row? Yellow Pages does not list a phone, website, rating or photo for every business. Empty fields are omitted rather than returned as null.

How many results per search? Yellow Pages caps each search at roughly 30 pages. Use a more specific location to surface different businesses.

Does it scrape individual business detail pages? No. It collects everything from the search-result cards, which is fast and covers all the fields above.


📨 Support & Feedback

Found a bug or need an extra field? Contact dev.scraperengine@gmail.com or open an issue on the actor's Apify page.


This scraper collects only publicly available data from Yellow Pages. You are responsible for using the data in compliance with applicable laws (GDPR, CCPA, anti-spam regulations) and the Yellow Pages Terms of Service.