Yellow Pages Scraper

1 day trial then $25.00/month - No credit card required now

onidivo/yellow-pages-scraper

Crawl the Yellow Pages site and extract data about businesses. Scrape business details for any search term, location, and sorting option, then download the data and use it however you want.

Features

  • Multiple business listings and details
  • Search by term and location
  • Automatic location lookup and verification
  • Sorting of listing results

Cost of usage

When running the actor with 512 MB of memory:

  • With datacenter proxies, average consumption is about $0.12 in usage credits and 15 minutes per 1000 businesses.
  • With residential proxies, average consumption is about $1 in usage credits and 20 minutes per 1000 businesses.
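
To budget a larger run, the averages above can be scaled up. The following is a rough sketch only: it assumes cost and duration grow linearly with the number of businesses, which the figures above do not guarantee, and the names `RATES` and `estimate_run` are hypothetical helpers, not part of the actor.

```python
# Per-1000-business averages quoted above for a 512 MB run.
RATES = {
    "datacenter": {"usd": 0.12, "minutes": 15},
    "residential": {"usd": 1.00, "minutes": 20},
}


def estimate_run(businesses: int, proxy: str = "datacenter") -> dict:
    """Estimate usage credits (USD) and duration for a run.

    Assumes linear scaling from the per-1000 averages; real runs
    may differ depending on blocking, retries, and concurrency.
    """
    if businesses < 0:
        raise ValueError("businesses must be non-negative")
    if proxy not in RATES:
        raise ValueError(f"unknown proxy type: {proxy!r}")
    factor = businesses / 1000
    rate = RATES[proxy]
    return {
        "usd": round(rate["usd"] * factor, 2),
        "minutes": round(rate["minutes"] * factor, 1),
    }


if __name__ == "__main__":
    print(estimate_run(5000, "residential"))
```

Treat the result as an order-of-magnitude estimate rather than a quote.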

Bugs, issues, features, and feedback

You can report issues on the Actor's "Issues" tab, where you can also discuss them or leave feedback.

Input

You can provide input either through the editor on the Apify platform or as a JSON object.

You must provide at least one of the following: startUrls, or both searchTerm and searchLocation.

An example input:

{
    "startUrls": [
        {
            "url": "https://www.yellowpages.com/search?search_terms=air+conditioning+service+repair&geo_location_terms=San+Francisco%2C+CA"
        },
        {
            "url": "https://www.yellowpages.com/san-francisco-ca/mip/air-conditioning-service-repair-515780833"
        }
    ],
    "searchTerm": "Air conditioning",
    "searchLocation": "San Francisco",
    "sortOption": "NO_SORTING",
    "includeAllDetails": true,
    "maxItems": 1000,
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": [
            "RESIDENTIAL"
        ]
    }
}
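
If you build the input programmatically, a small pre-flight check can catch a missing required field before a run starts. The sketch below is illustrative: `build_search_url` and `validate_input` are hypothetical helpers, the search URL pattern is inferred from the example above rather than from any documented Yellow Pages API, and the actor's own validation may be stricter.

```python
from urllib.parse import urlencode


def build_search_url(term: str, location: str) -> str:
    """Build a Yellow Pages search URL in the shape used by the example input.

    The query parameter names are copied from the example URL (an assumption
    that the site keeps this pattern).
    """
    query = urlencode({"search_terms": term, "geo_location_terms": location})
    return f"https://www.yellowpages.com/search?{query}"


def validate_input(run_input: dict) -> None:
    """Raise ValueError unless startUrls, or searchTerm plus searchLocation, is set."""
    has_urls = bool(run_input.get("startUrls"))
    has_search = bool(run_input.get("searchTerm")) and bool(
        run_input.get("searchLocation")
    )
    if not (has_urls or has_search):
        raise ValueError(
            "Provide startUrls, or both searchTerm and searchLocation."
        )


if __name__ == "__main__":
    run_input = {
        "startUrls": [
            {"url": build_search_url("air conditioning service repair",
                                     "San Francisco, CA")}
        ],
        "maxItems": 1000,
    }
    validate_input(run_input)  # passes: startUrls is present
    print(run_input["startUrls"][0]["url"])
```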

The scraper always uses Apify Proxy so that you don’t get blocked by Yellow Pages. Residential proxies are recommended to reduce blocking, as is a low value for max concurrency.

Output

The output for each business looks like this:

[
    {
        "searchTerm": "Air conditioning",
        "searchLocation": "San Francisco",
        "name": "Schmitt Heating & Air Conditioning Inc.",
        "address": {
            "addressCountry": "US",
            "streetAddress": "1580 Tennessee St",
            "addressLocality": "San Francisco",
            "addressRegion": "CA",
            "postalCode": "94107"
        },
        "phoneNumber": "(415) 527-0730",
        "imageUrl": "https://i2.ypcdn.com/blob/7581113edb7f0c86acf86704af03fe5b49745ed6",
        "openingHours": [
            "Mo-Fr 07:00-16:30"
        ],
        "websiteUrl": "https://www.schmittheating.com",
        "ratingValue": 5,
        "reviewCount": 1,
        "url": "https://www.yellowpages.com/san-francisco-ca/mip/schmitt-heating-air-conditioning-inc-497104287?lid=1001921646556"
    }
]
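
Because `address` is nested and `openingHours` is a list, the records need flattening before they fit a spreadsheet. The sketch below shows one way to do that with the Python standard library. The column selection in `FIELDS` and the helpers `flatten` and `to_csv` are illustrative choices, and the code assumes that any field may be missing from a given record.

```python
import csv
import io

# Column order for the flattened rows; names mirror the sample record above.
FIELDS = [
    "name", "phoneNumber", "streetAddress", "addressLocality",
    "addressRegion", "postalCode", "websiteUrl", "ratingValue",
    "reviewCount", "openingHours", "url",
]
# Fields that live inside the nested "address" object.
ADDRESS_KEYS = {"streetAddress", "addressLocality", "addressRegion", "postalCode"}


def flatten(item: dict) -> dict:
    """Turn one business record into a flat row, using "" for missing fields."""
    address = item.get("address") or {}
    row = {}
    for field in FIELDS:
        if field in ADDRESS_KEYS:
            row[field] = address.get(field, "")
        elif field == "openingHours":
            # Join multiple opening-hour entries into a single cell.
            row[field] = "; ".join(item.get("openingHours") or [])
        else:
            row[field] = item.get(field, "")
    return row


def to_csv(items: list) -> str:
    """Serialize a list of business records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(flatten(item) for item in items)
    return buffer.getvalue()
```

Call `to_csv` on the parsed JSON array and write the returned string to a file to open the results in a spreadsheet.
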
Developer
Maintained by Community

Actor Metrics

  • 6 monthly users

  • 3 stars

  • 91% runs succeeded

  • Created in Feb 2024

  • Modified 9 months ago