Yellow Pages Scraper
1 day trial then $25.00/month - No credit card required now
Yellow Pages Scraper
1 day trial then $25.00/month - No credit card required now
Crawl the Yellow Pages site and extract data about businesses. Scrape business details with unlimited options like search terms, location, sorting options, and many more.
Crawl the Yellow Pages site and extract data about businesses. Scrape business details for any search term, location, and sorting option. Download and use the data in whatever way you want.
Features
- Multiple business listings and details
- Search by term and location
- Auto-verifying and finding location
- Sorting listing results
Cost of usage
When running the actor with memory of 512 MB:
- Using datacenter proxies, average consumption is about $0.12 usage credits and 15 minutes per 1000 businesses.
- Using residential proxies, average consumption is about $1 usage credits and 20 minutes per 1000 businesses.
Bugs, issues, features, and feedback
You can report issues on the Actor tab "Issues" or here and discuss or leave your feedback here.
Input
You can provide input either through the editor on the Apify platform or as a JSON object.
The only mandatory fields you need to provide are at least one of: startUrls or searchTerm & searchLocation.
An example of minimal input:
1{ 2 "startUrls": [ 3 { 4 "url": "https://www.yellowpages.com/search?search_terms=air+conditioning+service+repair&geo_location_terms=San+Francisco%2C+CA" 5 }, 6 { 7 "url": "https://www.yellowpages.com/san-francisco-ca/mip/air-conditioning-service-repair-515780833" 8 } 9 ], 10 "searchTerm": "Air conditioning", 11 "searchLocation": "San Francisco", 12 "sortOption": "NO_SORTING", 13 "includeAllDetails": true, 14 "maxItems": 1000, 15 "proxyConfiguration": { 16 "useApifyProxy": true, 17 "apifyProxyGroups": [ 18 "RESIDENTIAL" 19 ] 20 } 21}
The scraper forcibly use Apify Proxy so that you don’t get blocked by Yellow Pages. It is better to use residential proxies to reduce blocking. Also, it is advised to use a low value for max concurrency.
Output
The output of each business looks like this:
- Table format
- JSON format
1[ 2 { 3 "searchTerm": "Air conditioning", 4 "searchLocation": "San Francisco", 5 "name": "Schmitt Heating & Air Conditioning Inc.", 6 "address": { 7 "addressCountry": "US", 8 "streetAddress": "1580 Tennessee St", 9 "addressLocality": "San Francisco", 10 "addressRegion": "CA", 11 "postalCode": "94107" 12 }, 13 "phoneNumber": "(415) 527-0730", 14 "imageUrl": "https://i2.ypcdn.com/blob/7581113edb7f0c86acf86704af03fe5b49745ed6", 15 "openingHours": [ 16 "Mo-Fr 07:00-16:30" 17 ], 18 "websiteUrl": "https://www.schmittheating.com", 19 "ratingValue": 5, 20 "reviewCount": 1, 21 "url": "https://www.yellowpages.com/san-francisco-ca/mip/schmitt-heating-air-conditioning-inc-497104287?lid=1001921646556" 22 } 23]
Actor Metrics
8 monthly users
-
3 stars
87% runs succeeded
Created in Feb 2024
Modified 8 months ago