π’ Yellow Pages US Scraper
Pricing
from $4.99 / 1,000 results
π’ Yellow Pages US Scraper
Pricing
from $4.99 / 1,000 results
Rating
0.0
(0)
Developer
API Empire
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Scrape Yellow Pages US business listings into clean, structured data β business name, phone, full address, website, star rating, review count, review snippets, photos and categories β from a simple search term + location or from bulk search URLs.
β‘ Streams results into the dataset as each listing is parsed β no waiting until the run ends for partial output. Refresh the run page and watch rows appear live.
β¨ Why choose this scraper?
- π§ Smart proxy auto-escalation β starts direct (no proxy, cheapest and fastest), then automatically falls back to a datacenter proxy, then a US residential proxy if Yellow Pages pushes back. Once on residential it stays there. You never have to think about it.
- π Search OR bulk URLs β type
Dentist+New Yorkand hit Start, or paste a list of Yellow Pages search URLs. Both work; mix them freely. - π Location auto-correct β loose input like
austin txis resolved to the Yellow Pages canonical form (Austin, TX) via their own autosuggest service. - π§± Browser TLS impersonation β every request carries a genuine Chrome/Edge fingerprint (via
curl_cffi), so Yellow Pages serves real HTML instead of a bot page β without the cost of a headless browser. - π Automatic pagination β follows the "Next" link to the last page of every search.
- π Sectioned dataset tabs β Overview, Contact, Reviews and Media views built in.
- π³ Pay per event β you pay only for listings actually delivered.
π Key Features
| Field extracted | |
|---|---|
| π’ | Paid-ad flag (isAd) β sponsored vs organic result |
| π’ | Business name |
| π | Street address |
| π | Phone number |
| π | Business website |
| β | Star rating (1β5) |
| π³οΈ | Review count |
| π¬ | Customer review snippet |
| βΉοΈ | "From Business" description snippet |
| πΌοΈ | Business photo (full-size URL) |
| π·οΈ | Category list |
| π | Yellow Pages profile URL |
π₯ Input
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
search | string | β* | Dentist | Business type, category or company name |
location | string | β* | New York | US city, City, ST, or ZIP code |
startUrls | array | β* | β | Bulk Yellow Pages search-result URLs |
maxItems | integer | β | 50 | Cap on total listings (0 = unlimited) |
debugMode | boolean | β | false | Verbose debug logging in the run log (troubleshooting) |
requestDelay | number | β | 1.0 | Base seconds between requests (jitter added) |
proxyConfiguration | object | β | direct | Proxy override β defaults to no proxy |
* Provide either search + location or startUrls (or both).
Example input
{"search": "Dentist","location": "New York, NY","maxItems": 50,"requestDelay": 1.0,"proxyConfiguration": { "useApifyProxy": false }}
Bulk URL example
{"startUrls": ["https://www.yellowpages.com/search?search_terms=plumber&geo_location_terms=New+York%2C+NY","https://www.yellowpages.com/search?search_terms=coffee&geo_location_terms=Austin%2C+TX"],"maxItems": 200}
π€ Output
Each dataset row is one business listing. Empty fields are omitted, exactly like the reference output.
{"isAd": false,"url": "https://www.yellowpages.com/new-york-ny/mip/dr-kara-mason-dmd-5721648","name": "Dr. Kara Mason, DMD","address": "30 E 60th St Rm 503, New York, NY 10022","phone": "(212) 355-2195","website": "http://www.gentledentalnyc.com","rating": 5,"ratingCount": 175,"reviewSnippet": "Extremely knowledgeable of craft with sincere care for her patients.\"","image": "https://i4.ypcdn.com/blob/80cd8b97ee48d0b7715eacdc0801d77c44cc978b","categories": ["Dentists", "Oral & Maxillofacial Surgery"]}
| Field | Type | Description |
|---|---|---|
isAd | boolean | true for paid/sponsored listings, false for organic |
url | string | Yellow Pages business profile URL |
name | string | Business name |
address | string | Street address |
phone | string | Phone number |
website | string | Business website (when listed) |
rating | number | Star rating, 1β5 (when rated) |
ratingCount | integer | Number of reviews (when rated) |
reviewSnippet | string | A sample customer review (when present) |
infoSnippet | string | "From Business" description (when present) |
image | string | Business photo URL (when present) |
categories | array | Yellow Pages category labels |
The dataset has four ready-made views β π Overview, π Contact, β Ratings & Reviews and πΌοΈ Media β so you can focus on one section at a time.
π How to Use (Apify Console)
- Log in at console.apify.com β Actors.
- Open Yellow Pages US Scraper.
- Enter a π Search term and π Location β or paste π URLs.
- Set π’ Maximum listings (default 50).
- Click Start.
- Watch listings stream into the run log and dataset in real time.
- When the run finishes, open the Output tab.
- Export to JSON / CSV / Excel or pull via API.
π€ Use via API
curl -X POST "https://api.apify.com/v2/acts/YOUR~yellow-pages-us-scraper/run-sync-get-dataset-items?token=$APIFY_TOKEN" \-H "Content-Type: application/json" \-d '{"search": "Plumber","location": "Austin, TX","maxItems": 50}'
π‘οΈ How blocking is handled
Yellow Pages occasionally rate-limits aggressive traffic. This actor handles it automatically:
- Direct β requests go straight to Yellow Pages (no proxy cost).
- On a block β datacenter proxy.
- On another block β US residential proxy (sticky from here on).
- Residential is retried up to 3 times with a fresh IP and a cooldown.
Every escalation is logged clearly in the run log, e.g.
π Yellow Pages blocked the request on DIRECT β switching to DATACENTER proxy.
π‘ Best Use Cases
- π Lead generation β build B2B contact lists by category and city.
- π Local market research β map competitors in any US metro.
- πΊοΈ Local SEO audits β track ratings, reviews and categories.
- π€ Sales prospecting β phone/website lists for outreach.
π³ Pricing
This actor uses the pay-per-event model. You are charged once per business listing successfully delivered to the dataset (listing-scraped). Runs that return no data cost nothing beyond the standard run start. The actor stops cleanly when a configured spend limit is reached.
β FAQ
Do I need a proxy? No. The actor runs direct by default and only turns proxies on if Yellow Pages blocks it β and it does that for you.
Can I scrape multiple cities at once? Yes β use Bulk URL mode and paste one search URL per city.
Why are some fields missing on a row? Yellow Pages does not list a phone, website, rating or photo for every business. Empty fields are omitted rather than returned as null.
How many results per search? Yellow Pages caps each search at roughly 30 pages. Use a more specific location to surface different businesses.
Does it scrape individual business detail pages? No β it collects everything from the search-result cards, which is fast and covers all the fields above.
π¨ Support & Feedback
Found a bug or need an extra field? Contact dev.scraperengine@gmail.com or open an issue on the actor's Apify page.
βοΈ Legal
This scraper collects only publicly available data from Yellow Pages. You are responsible for using the data in compliance with applicable laws (GDPR, CCPA, anti-spam regulations) and the Yellow Pages Terms of Service.