Yellow Pages US Scraper
Pay $1.00 for 1,000 results
Yellow Pages US Scraper
Pay $1.00 for 1,000 results
Scrape addresses, phone numbers, categories, and names from Yellow Pages US listings. Customizable Yellow Pages API to crawl and download all contact data.
Yellow Pages Scraper
Yellow Pages Scraper is an Apify actor for scraping information from Yellow Pages listings. It allows you to search records based on a combination of search term and location or a list of URLs. It is build on top of Apify SDK and you can run it both on Apify platform and locally.
Features
Yellow Pages Scraper empowers you to:
- Scrape Yellow Pages Listings: Extract detailed information from Yellow Pages listings.
- Search by Term and Location: Customize your search by specifying search terms and locations.
- Deep Crawling: Scrape data from Yellow Pages record list pages.
- Proxy Support: Use Apify Proxy for efficient scraping.
What Data Can You Extract?
Yellow Pages Scraper allows you to extract a variety of data from Yellow Pages listings, including:
- Business Name
- Address
- Phone Number
- Ratings and Review Counts
- Additional Information Snippets
- Images
- Categories
Please note that the availability of these attributes may vary based on the data source.
Usage and Cost
Keep in mind that it is much more efficient to run one longer scrape (at least one minute) than more shorter ones because of the startup time.
The average consumption is about 0.04 Compute unit per 2000 results scraped. Running Yellow Pages Scraper on the Apify platform offers great value. For pricing details and cost estimates, refer to the Apify pricing page.
Getting Started
Using Yellow Pages Scraper is easy and doesn't require coding skills. Here's a quick guide:
- Create a free Apify account using your email.
- Access the YOUR_ACTOR_LINK.
- Configure your search terms, locations, and other parameters.
- Click "Start" and let the scraper do its job.
- Download the scraped data in JSON, XML, CSV, Excel, or HTML format.
Input
Field | Type | Description | Default value |
---|---|---|---|
search | string | Query string to be searched on the site | "Dentist" |
location | string | Location string to search the records in | "Los Angeles" |
startUrls | array | List of Request objects that will be deeply crawled. The URL can be any Yellowpages.com record list page | none |
maxItems | number | Maximum number of pages that will be scraped | 200 |
extendOutputFunction | string | Function that takes a Cheerio object and a Cheerio representation of the record element ($, record) as arguments and returns data that will be merged with the default output. More information in Extend output function | ($, record) => { return {}; } |
Either the search
and location
attributes or the startUrls
atrribute has to be set.
Output
Output is stored in a dataset. Each item is information about a record.
1{ 2 "isAd": true, 3 "url": "https://www.yellowpages.com/compton-ca/mip/golden-state-dental-group-18768214?lid=1001760866489", 4 "name": "Golden State Dental Group", 5 "address": "1601 N Long Beach Blvd, Compton, CA 90221", 6 "phone": "(310) 507-7718", 7 "rating": 4, 8 "ratingCount": 6, 9 "infoSnippet": "*Please contact us for more information", 10 "image": "https://i4.ypcdn.com/blob/00a40d49e577606be9d82ced5404696022a7e2a0", 11 "categories": [ 12 "Dentists", 13 "Implant Dentistry", 14 "Pediatric Dentistry", 15 "Periodontists", 16 "Cosmetic Dentistry" 17 ] 18}
Please note that not all of the attributes will be present with all the results.
Actor Metrics
103 monthly users
-
35 stars
82% runs succeeded
2.4 hours response time
Created in Mar 2020
Modified 3 months ago