# Yelp Business Scraper (`dash_authority/yelp-business-scraper`) Actor

Extract business data from Yelp including name, phone, website, address, rating, reviews, categories, and hours. Search by keyword and location or scrape directly from business URLs. Perfect for lead generation, competitive analysis, and local market research.

- **URL**: https://apify.com/dash\_authority/yelp-business-scraper.md
- **Developed by:** [Dash Authority](https://apify.com/dash_authority) (community)
- **Categories:** Lead generation, Automation, Developer tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

$2.00 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Yelp Business Scraper

Extract business data from Yelp — names, phones, websites, ratings, reviews, and more. Bypasses Yelp's limited search interface and gives you structured data ready for lead lists, CRM imports, or competitive analysis.

### Use Cases

**Lead Generation:** Pull business names, phone numbers, websites, and addresses by keyword and location. Export straight to CSV for cold outreach or import into HubSpot/Salesforce.

**Competitor Tracking:** Monitor ratings, review counts, and category positioning across any market. See who's gaining reviews and who's slipping.

**Local Market Research:** Map out business density by category in any city. Find gaps — neighborhoods without a coffee shop, zip codes underserved by dentists.

---

### Input

Provide either a **search query** (keyword + location) or a list of **direct Yelp business URLs**.

| Field | Type | Description |
|-------|------|-------------|
| searchTerm | string | What to search for (e.g., "plumbers") |
| location | string | City, state, or zip code (e.g., "Austin, TX") |
| maxResults | integer | Max businesses to return (default: 30) |
| urls | array | Direct Yelp business URLs — bypasses search |
| extractReviews | boolean | Also pull top reviews per business |
| proxyConfiguration | object | Apify proxy settings (recommended) |

#### Search Tips
- Be specific with locations. "Austin, TX" works better than just "Austin."
- Use `urls` mode when you already have a list of Yelp pages — faster than searching.
- Set `extractReviews: true` if you need sentiment data, but expect 2-3x slower runs.

---

### Output

Each result is a business profile with contact info, ratings, and operational details.

#### Core Fields
- `name`, `url`, `phone`, `website` — the basics
- `address`, `city`, `state`, `zipCode`, `neighborhood` — full address breakdown
- `latitude`, `longitude` — for mapping or geo-analysis
- `rating`, `reviewCount` — current Yelp score
- `categories` — Yelp category tags (e.g., ["Coffee & Tea", "Breakfast"])
- `priceRange` — $, $$, $$$, or $$$$
- `hours` — opening hours by day

#### Review Data (when `extractReviews` is enabled)
- `reviews[]` — array of top reviews with `author`, `rating`, `text`, `date`

#### JSON Example
```json
{
  "url": "https://www.yelp.com/biz/joes-coffee-san-francisco",
  "name": "Joe's Coffee",
  "phone": "(415) 555-0123",
  "website": "https://joescoffee.com",
  "address": "123 Market St",
  "city": "San Francisco",
  "state": "CA",
  "zipCode": "94102",
  "neighborhood": "Financial District",
  "rating": 4.5,
  "reviewCount": 342,
  "categories": ["Coffee & Tea", "Breakfast & Brunch"],
  "priceRange": "$$",
  "hours": {"Monday": "7:00 AM - 7:00 PM", "Tuesday": "7:00 AM - 7:00 PM"},
  "latitude": 37.7749,
  "longitude": -122.4194
}
````

***

### Integrations & API

- **Zapier / Make:** Pipe results directly into Google Sheets, Airtable, or your CRM.
- **API Access:** Use the Apify Python (`apify-client` on PyPI) or Node.js (`apify-client` on NPM) client.
- **Scheduling:** Set up recurring runs to track rating changes over time.

***

### FAQ

**Is this legal?** The scraper only collects publicly available data from Yelp. You're responsible for complying with Yelp's Terms of Service and applicable data privacy laws.

**How fast is it?** A search for 30 results takes about 15-30 seconds. Larger runs (500+) take 2-5 minutes depending on proxy speed.

**Why use this instead of Yelp's API?** Yelp's Fusion API returns limited fields and has strict rate limits. This scraper gives you full business profiles, reviews, and hours — no API key needed.

**Can I scrape reviews?** Yes. Enable `extractReviews` to pull the top reviews for each business. Set a higher memory allocation (4GB+) for review-heavy runs.

***

### Support

For bugs or feature requests, open an issue in the **Issues** tab on this Actor's page.

# Actor input Schema

## `searchTerm` (type: `string`):

What to search for on Yelp (e.g., 'plumbers', 'coffee shops', 'dentists')

## `location` (type: `string`):

City, state, or zip code to search in (e.g., 'San Francisco, CA' or '94102')

## `maxResults` (type: `integer`):

Maximum number of businesses to return. Yelp shows 10 results per page.

## `urls` (type: `array`):

Direct Yelp business page URLs to scrape (one per line). Use this instead of search if you have specific businesses.

## `extractReviews` (type: `boolean`):

Also scrape the top reviews for each business. Adds ~2s per business.

## `proxyConfiguration` (type: `object`):

Use Apify Proxy for reliable scraping. Recommended for production use.

## Actor input object example

```json
{
  "maxResults": 30,
  "extractReviews": false
}
```

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {};

// Run the Actor and wait for it to finish
const run = await client.actor("dash_authority/yelp-business-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {}

# Run the Actor and wait for it to finish
run = client.actor("dash_authority/yelp-business-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{}' |
apify call dash_authority/yelp-business-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=dash_authority/yelp-business-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Yelp Business Scraper",
        "description": "Extract business data from Yelp including name, phone, website, address, rating, reviews, categories, and hours. Search by keyword and location or scrape directly from business URLs. Perfect for lead generation, competitive analysis, and local market research.",
        "version": "1.0",
        "x-build-id": "4nVC5gOzckXMeq0r4"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/dash_authority~yelp-business-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-dash_authority-yelp-business-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/dash_authority~yelp-business-scraper/runs": {
            "post": {
                "operationId": "runs-sync-dash_authority-yelp-business-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/dash_authority~yelp-business-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-dash_authority-yelp-business-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "searchTerm": {
                        "title": "Search Term",
                        "type": "string",
                        "description": "What to search for on Yelp (e.g., 'plumbers', 'coffee shops', 'dentists')"
                    },
                    "location": {
                        "title": "Location",
                        "type": "string",
                        "description": "City, state, or zip code to search in (e.g., 'San Francisco, CA' or '94102')"
                    },
                    "maxResults": {
                        "title": "Max Results",
                        "minimum": 1,
                        "maximum": 200,
                        "type": "integer",
                        "description": "Maximum number of businesses to return. Yelp shows 10 results per page.",
                        "default": 30
                    },
                    "urls": {
                        "title": "Business URLs",
                        "type": "array",
                        "description": "Direct Yelp business page URLs to scrape (one per line). Use this instead of search if you have specific businesses.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "extractReviews": {
                        "title": "Extract Reviews",
                        "type": "boolean",
                        "description": "Also scrape the top reviews for each business. Adds ~2s per business.",
                        "default": false
                    },
                    "proxyConfiguration": {
                        "title": "Proxy Configuration",
                        "type": "object",
                        "description": "Use Apify Proxy for reliable scraping. Recommended for production use."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
