Yelp Business Info Scraper

Pricing

$19.99/month + usage

Extract Yelp business data quickly and accurately. Collect business names, categories, ratings, reviews, addresses, phone numbers, hours, and website links. Perfect for local lead generation, market research, and competitor analysis with clean, structured output.

Developer

ScrapeLabs

Maintained by Community

Yelp Business Info Scraper

Yelp Business Info Scraper is a purpose-built data extraction tool that collects structured business details from public Yelp business pages at scale. It solves the repetitive, error-prone task of manual copy/paste by reliably returning names, categories, ratings, reviews, contact info, hours, and more — ideal for teams who need a Yelp business data scraper for lead generation, enrichment, and market research. Marketers, developers, data analysts, and researchers can use this Yelp business listings scraper and Yelp business info extractor to automate clean, structured data collection for pipelines and dashboards.

What data / output can you get?

| Data type | Description | Example value |
| --- | --- | --- |
| title | Business name extracted from the page | “Dandelion Cafe” |
| rating | Star rating (0–5, as a string) | “4.0” |
| reviewCount | Review count in human-readable format | “99 reviews” |
| isClaimed | Claimed status if detected from HTML | “Claimed” |
| priceLevel | Price indicator from page metadata | “$$” |
| categories | Comma-separated category labels | “Cafes,Breakfast & Brunch,Coffee & Tea” |
| fullAddress | Full street address (if found) | “5407 Bellaire Blvd, Bellaire” |
| city | City parsed from HTML/meta | “Houston” |
| state | State/region parsed from HTML/meta | “TX” |
| zipcode | ZIP/postal code (if present) | “77081” |
| phoneNumber | Formatted phone number when available | “(832) 888-1568” |
| website | External website URL (non‑Yelp) | “https://example.com” |
| hours | Object with daily opening hours | {"Monday": "7:00 am - 3:00 pm", "Tuesday": "7:00 am - 3:00 pm"} |

Bonus fields:

  • images: Array of image URLs derived from page assets
  • businessOwnerName and about: Owner name and “Specialties”/description text
  • businessServices: Map of service attributes (e.g., “Offers delivery”, “Vegan options”) to boolean
  • yelp_biz_id: Yelp’s internal business identifier
  • url and source_url: Target page URL and source reference
  • is_page_not_found and status: Page status flags
  • timestamp: When the record was scraped

You can export the Apify dataset to JSON or CSV for ingestion into CRMs, BI tools, or data pipelines.
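As a rough illustration of the export step, the sketch below builds a download URL for a dataset's items through the Apify API's dataset items endpoint. The helper name `dataset_export_url` and the dataset ID `abc123` are hypothetical; the `format` and `fields` query parameters are the ones the Apify API documents for this endpoint. Datasets that are not public will also need your API token when the URL is requested.

```python
from typing import List, Optional
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_export_url(dataset_id: str, fmt: str = "json",
                       fields: Optional[List[str]] = None) -> str:
    """Build a download URL for a dataset's items (hypothetical helper)."""
    if fmt not in ("json", "csv"):
        raise ValueError(f"Unsupported format in this sketch: {fmt!r}")
    params = {"format": fmt}
    if fields:
        # Limit the export to selected output fields, e.g. title and phoneNumber.
        params["fields"] = ",".join(fields)
    return f"{API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"


# "abc123" is a placeholder dataset ID, not a real one.
print(dataset_export_url("abc123", "csv", ["title", "phoneNumber", "website"]))
```

Restricting the export to a few fields keeps CSV files narrow, which can help when importing into a CRM that expects fixed columns.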

Key features

  • 🚀 Always-on residential proxy: Every request goes through the Apify residential proxy, with up to 3 retries per request. The proxyConfiguration input is accepted but ignored; the residential proxy is enforced internally.

  • 🧠 Stealth Playwright browser: Uses Playwright with anti-detection techniques and human-like behavior (mouse moves, scrolls) to reduce blocks when scraping Yelp business details.

  • 📦 Structured, consistent schema: Returns a predictable set of fields, including title, rating, reviewCount, categories, fullAddress, phoneNumber, website, hours, images, businessServices, yelp_biz_id, and more.

  • 📊 Real-time saving: Pushes each business record to the dataset as soon as it is scraped, so you retain partial results even if a run stops. This suits large Yelp data collection jobs.

  • 📋 Bulk URL processing: Paste many Yelp business URLs to run them as one batch, for example for lead enrichment and outreach workflows.

  • 🔌 Developer-friendly: Runs on Apify infrastructure and integrates via the Apify API, so it can plug into data pipelines or Python workflows as an alternative to the Yelp API.

  • ✅ Reliability by design: Single-session cookie handling for request IDs, up to three retries per step, and robust parsing logic are aimed at production use.
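The retry and real-time saving behavior described above can be pictured with a minimal sketch. This is not the actor's actual code: `scrape_all`, `fetch`, and `save` are hypothetical names, and the sketch assumes "up to 3 retries" means at most three attempts per URL.

```python
from typing import Callable, Iterable, List


def scrape_all(urls: Iterable[str],
               fetch: Callable[[str], dict],
               save: Callable[[dict], None],
               max_attempts: int = 3) -> List[str]:
    """Try each URL up to max_attempts times and save each record immediately.

    Returns the URLs that still failed after the last attempt.
    """
    failed = []
    for url in urls:
        for attempt in range(1, max_attempts + 1):
            try:
                record = fetch(url)
            except Exception:
                if attempt == max_attempts:
                    failed.append(url)  # give up on this URL, keep going
                continue
            # Save right away so earlier results survive a later crash.
            save(record)
            break
    return failed
```

Because `save` is called inside the loop rather than after it, a run that stops halfway still leaves every completed record in storage.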

How to use Yelp Business Info Scraper - step by step

  1. Create or log in to your Apify account at https://console.apify.com.
  2. Open the “Yelp Business Info Scraper” actor.
  3. Add input:
    • startUrls: Paste one or more Yelp business page URLs. You can provide a list of strings or objects with a url key.
    • proxyConfiguration: This field is available in the UI, but the actor will always run with Apify residential proxy internally.
  4. Click Start to launch the run.
  5. Watch the run logs for progress. Each business is processed with up to 3 retries and saved to the dataset as it’s scraped.
  6. When finished, open the Output (Dataset) to preview results.
  7. Export your dataset to JSON or CSV for analysis, enrichment, or CRM upload.

Pro tip: Automate runs via the Apify API and connect the dataset export to your internal data stack for continuous Yelp data extraction at scale.
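A minimal sketch of such an automated run, assuming the official `apify-client` Python package is installed and your API token is in an `APIFY_TOKEN` environment variable. `ACTOR_ID` is a placeholder, since the actor's exact ID is not given here, and the helper names are hypothetical.

```python
import os
from typing import Iterable, List, Optional

# Placeholder: copy the real actor ID from the actor's page in Apify Console.
ACTOR_ID = "<username>/<actor-name>"


def build_run_input(urls: Iterable[str]) -> dict:
    """Build the actor input in the same shape as the example JSON input."""
    return {
        "startUrls": list(urls),
        "proxyConfiguration": {"useApifyProxy": True},
    }


def run_scraper(urls: Iterable[str], token: Optional[str] = None) -> List[dict]:
    """Start a run, wait for it to finish, and return the dataset items."""
    # Imported here so the rest of the module works without the package.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token or os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=build_run_input(urls))
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

Calling `run_scraper(["https://www.yelp.com/biz/dandelion-cafe-houston-3"])` from a scheduler or cron job would then return one dictionary per scraped business, ready to pass on to your own storage.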

Use cases

| Use case name | Description |
| --- | --- |
| Local lead generation | Build prospect lists by extracting phone numbers, websites, and categories from multiple Yelp URLs with a Yelp contact info scraper. |
| Market research & benchmarking | Compare ratings, review counts, and price levels across competitors using a Yelp review scraper tool workflow. |
| Location analysis | Aggregate addresses, cities, and states to map coverage and identify gaps with a Yelp business directory scraper pipeline. |
| Service availability tagging | Flag “Offers delivery”, “Vegan options”, or “Wheelchair accessible” using businessServices for segmentation. |
| CRM enrichment | Append structured fields (website, phone, categories, hours) to existing accounts via a Yelp business info extractor. |
| Developer pipelines | Use the Apify API to run scheduled jobs and export JSON for a Yelp API alternative scraper in Python or ETL flows. |
| Academic & data journalism | Collect public business metadata from Yelp to support studies and reports with a Yelp data extraction tool. |
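For the service availability tagging use case, a short sketch of segmenting exported records by a businessServices flag. The helper name `filter_by_service` and the sample records are made up for illustration; only the field names come from the output schema.

```python
from typing import Iterable, List


def filter_by_service(records: Iterable[dict], service: str) -> List[dict]:
    """Keep records whose businessServices map marks `service` as true."""
    return [
        record for record in records
        # businessServices may be missing or empty on some pages.
        if (record.get("businessServices") or {}).get(service) is True
    ]


# Illustrative records only, not real scraped data.
sample = [
    {"title": "Cafe A", "businessServices": {"Offers delivery": True}},
    {"title": "Cafe B", "businessServices": {"Offers delivery": False}},
    {"title": "Cafe C"},
]
for record in filter_by_service(sample, "Offers delivery"):
    print(record["title"])
```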

Why choose Yelp Business Info Scraper?

Yelp Business Info Scraper is engineered for precision, automation, and reliability on public Yelp business pages.

  • 🎯 Accurate, structured fields ready for analysis
  • 🌍 Works on public Yelp pages without login or cookies
  • 📈 Scales to batches of many URLs with 3-retry resilience
  • 💻 Built for developers — Apify API and dataset exports
  • 🛡️ Ethical by design — public data only, no private access
  • 💰 Efficient operations — production-ready infrastructure on Apify
  • 🔐 Residential proxy baked in — consistent stability without manual tuning

Compared to browser extensions and unstable scripts, this Yelp business data scraper delivers consistent output and saves results in real time — ideal for pipelines and automated workflows.

Is it legal to scrape Yelp data?

Yes, when used responsibly. This actor scrapes only publicly available Yelp business pages and does not access private or authenticated content.

Guidelines for compliant use:

  • Collect data from public pages only and avoid private or password-protected content.
  • Ensure your use complies with applicable laws and regulations (e.g., GDPR, CCPA).
  • Review and respect the terms of use that apply to the pages you scrape.
  • Use scraped data responsibly and consult your legal team for edge cases.

Input parameters & output format

Example JSON input

{
  "startUrls": [
    "https://www.yelp.com/biz/dandelion-cafe-houston-3",
    "https://www.yelp.com/biz/credence-houston-3"
  ],
  "proxyConfiguration": {
    "useApifyProxy": true
  }
}

Input fields

  • startUrls (array)
    • Description: One or more Yelp business page URLs to scrape. You can paste many URLs for bulk scraping; the actor processes them one by one with anti-blocking delays. Accepts a list of strings or objects with a url key.
    • Default: ["https://www.yelp.com/biz/dandelion-cafe-houston-3"]
    • Required: Yes
  • proxyConfiguration (object)
    • Description: This actor always uses Apify residential proxy (your choice here is ignored). Each request is retried up to 3 times. Ensure Apify proxy is available in your plan.
    • Default: {"useApifyProxy": true}
    • Required: No
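Because startUrls accepts both plain strings and objects with a url key, it can help to normalize a mixed list before building the input. The sketch below is one way to do that on the client side; `normalize_start_urls` is a hypothetical helper, and the de-duplication step is an added convenience, not documented actor behavior.

```python
from typing import Iterable, List, Union


def normalize_start_urls(start_urls: Iterable[Union[str, dict]]) -> List[str]:
    """Flatten a mix of URL strings and {"url": ...} objects into unique strings."""
    urls: List[str] = []
    for entry in start_urls:
        url = entry.get("url") if isinstance(entry, dict) else entry
        if not isinstance(url, str) or not url.strip():
            raise ValueError(f"Invalid startUrls entry: {entry!r}")
        url = url.strip()
        if url not in urls:  # drop duplicates, keep first-seen order
            urls.append(url)
    return urls


mixed = [
    "https://www.yelp.com/biz/dandelion-cafe-houston-3",
    {"url": "https://www.yelp.com/biz/credence-houston-3"},
]
print(normalize_start_urls(mixed))
```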

Output item structure

The actor pushes one object per business to the default dataset with the following fields:

  • title, rating, reviewCount, isClaimed, priceLevel, categories
  • fullAddress, city, state, zipcode
  • phoneNumber, images (array), website
  • hours (object), businessOwnerName, about, reviewhighlights (array)
  • businessServices (object), yelp_biz_id
  • timestamp, url, source_url, is_page_not_found, status

Example JSON output

{
  "title": "Dandelion Cafe",
  "rating": "4.0",
  "reviewCount": "99 reviews",
  "city": "Houston",
  "state": "TX",
  "phoneNumber": "(832) 888-1568",
  "url": "https://www.yelp.com/biz/dandelion-cafe-houston-3",
  "yelp_biz_id": "h4UA0ul9Y3grjjQRBvcgXQ",
  "status": "SUCCEEDED",
  "source_url": "https://www.yelp.com/biz/dandelion-cafe-houston-3"
}

Notes:

  • Some fields may be empty if they aren’t present on the public page (e.g., website, hours, zipcode).
  • businessServices is a map of service names to boolean values (true/false).
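Since rating and reviewCount arrive as strings and categories as one comma-separated string, downstream code usually converts them before analysis. A minimal sketch, assuming the formats shown in the examples ("4.0", "99 reviews", comma-separated labels); `to_typed` is a hypothetical helper, and the handling of thousands separators in review counts is an assumption.

```python
import re
from typing import Optional


def to_typed(record: dict) -> dict:
    """Convert string fields of one output item into numbers and lists."""
    rating = record.get("rating")
    # Pull the first number out of text such as "99 reviews".
    match = re.search(r"\d[\d,]*", record.get("reviewCount") or "")
    review_count: Optional[int] = (
        int(match.group().replace(",", "")) if match else None
    )
    categories = record.get("categories") or ""
    return {
        "title": record.get("title"),
        "rating": float(rating) if rating else None,
        "review_count": review_count,
        # Naive split: a label that itself contains a comma would be broken up.
        "categories": [c.strip() for c in categories.split(",") if c.strip()],
    }


item = {
    "title": "Dandelion Cafe",
    "rating": "4.0",
    "reviewCount": "99 reviews",
    "categories": "Cafes,Breakfast & Brunch,Coffee & Tea",
}
print(to_typed(item))
```

Missing or empty fields map to None or an empty list, which matches the note above that some fields may be absent from the public page.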

FAQ

Does this actor require a proxy?

Yes. It always uses the Apify residential proxy internally for all requests, with up to 3 retries per step to improve stability.

Can I provide my own proxy settings?

You can pass proxyConfiguration in the input, but the actor ignores your choice and enforces the Apify residential proxy under the hood.

Do I need to log in to scrape Yelp?

No. The actor targets publicly available Yelp business pages and does not require login or cookies.

How many Yelp businesses can I scrape in one run?

There is no hardcoded item cap. Practical limits depend on your run’s time and resource allocation and the target pages’ availability. Each request is retried up to 3 times and results are saved as they’re scraped.

What data fields are included in the output?

The dataset includes fields such as title, rating, reviewCount, categories, fullAddress, city, state, zipcode, phoneNumber, website, hours, images, businessServices, yelp_biz_id, url, source_url, status, is_page_not_found, and timestamp.

Can I use this as a Yelp API alternative scraper in Python?

Yes. Run the actor programmatically through the Apify API and consume its JSON output in your Python workflows as an alternative to the Yelp API.

What export formats are supported?

You can export the Apify dataset to JSON or CSV for use in analytics, enrichment, or CRM systems.

Is it compliant to scrape Yelp data?

Yes, when you collect only public information and use it responsibly. You are responsible for compliance with applicable laws and terms of use. This tool does not access private or authenticated data.

Closing thoughts

Yelp Business Info Scraper is built for accurate, structured extraction of public Yelp business details at scale. With residential proxy baked in, Playwright-based stealth, and real-time dataset saving, it’s ideal for marketers, developers, analysts, and researchers who need dependable Yelp business data mining. Connect via the Apify API to automate runs, export JSON/CSV, and power a continuous Yelp scraping service in your stack. Start extracting smarter Yelp business insights and streamline your enrichment workflows today.