Yelp Business Info Scraper avatar

Yelp Business Info Scraper

Pricing

$19.99/month + usage

Go to Apify Store
Yelp Business Info Scraper

Yelp Business Info Scraper

🔎 Yelp Business Info Scraper pulls structured data from Yelp—business names, addresses, phones, ratings, reviews, categories, hours & websites. 🚀 Ideal for lead gen, local SEO, and market research. 📊 Keyword/location targeting. 📦 Exports CSV/JSON.

Pricing

$19.99/month + usage

Rating

0.0

(0)

Developer

ScrapePilot

ScrapePilot

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 hours ago

Last modified

Share

Yelp Business Info Scraper

Yelp Business Info Scraper is a production-ready Yelp scraper that extracts structured business details from Yelp business pages at scale. It solves the pain of manual copy-paste by delivering clean fields like name, rating, reviews, phone, address, hours, categories, services, images, and more — ideal for marketers, developers, data analysts, and researchers. Use this Yelp business data scraper to power lead gen, local SEO, and market research with repeatable, reliable outputs that scale.

What data / output can you get?

Data typeDescriptionExample value
titleBusiness name (from bizDetailsPageProps)Dandelion Cafe
ratingStar rating (0–5, parsed from page)"4.0"
reviewCountReviews count parsed from page title"99 reviews"
phoneNumberFormatted phone parsed from meta description"(832) 888-1568"
fullAddressStreet address parsed from page/meta"123 Main St, Suite 100"
cityCity parsed from page/meta"Houston"
stateState parsed from page/meta"TX"
zipcodeZIP parsed from meta description"77005"
categoriesComma-separated categories (derived from related businesses)"Breakfast & Brunch,Cafes"
websiteExternal website URL parsed from page (non-yelp)"https://business-website.com"
imagesArray of large image URLs (avatar + related businesses)["https://s3-media.../l.jpg"]
hoursObject of day -> time range (parsed from meta description){"Monday": "8:00 am - 5:00 pm"}
businessOwnerNameOwner/representative name (from The Business)"Jane Doe"
aboutSpecialties/description text"All-day breakfast and local coffee."
businessServicesMap of service flags (delivery, take-out, accessibility, etc.){"Offers delivery": true, "Bike parking": false, ...}
yelp_biz_idYelp internal business ID"h4UA0ul9Y3grjjQRBvcgXQ"
urlOriginal Yelp URL for the business"https://www.yelp.com/biz/dandelion-cafe-houston-3"
statusScrape status"SUCCEEDED"
timestampWhen the record was scraped"2026-04-13 10:05:00"
source_urlAlias of the source business URL"https://www.yelp.com/biz/dandelion-cafe-houston-3"
is_page_not_foundPage-not-found flagfalse

Note: Additional metadata includes isClaimed, priceLevel, and reviewhighlights. You can export results to CSV or JSON directly from the Apify dataset.

Key features

  • 🛡️ Always-on residential proxy reliability
    The actor always runs with Apify Residential Proxy and retries each request up to 3 times to reduce blocking and improve consistency.

  • 🧱 Structured, consistent schema
    Outputs a stable set of fields (title, rating, reviewCount, address, phoneNumber, hours, categories, businessServices, images, website, yelp_biz_id, etc.) that’s easy to map into CRMs, analytics, and pipelines.

  • 🧑‍🤝‍🧑 Bulk URL processing
    Paste one or many Yelp business URLs and scrape them sequentially. Live saving ensures partial results are preserved as they’re scraped.

  • 🕵️ Playwright stealth + human-like behavior
    Uses Playwright with anti-detection measures and randomized actions to better simulate real browsing.

  • ⚙️ Python-based, developer-friendly
    Built with the Apify Python SDK and Playwright. Ideal for teams integrating a Yelp scraper for business information or building a Yelp business scraper Python workflow.

  • 📝 Real-time dataset writes
    Each scraped business is immediately pushed to the dataset so you can monitor progress and export without waiting for the entire run.

  • 🔁 Smart session handling
    Pre-warms session, collects required cookies, and extracts a request ID to reliably fetch structured props JSON tied to the page session.

How to use Yelp Business Info Scraper - step by step

  1. Sign in to your Apify account at https://console.apify.com.
  2. Open the actor named “Yelp Business Info Scraper”.
  3. In the Input tab, add your Yelp business page URLs to startUrls. You can provide:
    • A list of URLs as strings, or
    • An array of objects with "url" keys.
  4. (Optional) proxyConfiguration shows a prefilled Apify proxy, but note the actor always uses Apify Residential Proxy internally.
  5. Click Start to run. The actor navigates with Playwright, extracts structured props and HTML signals, and saves results in real time.
  6. Check the Output tab to see saved items as they come in.
  7. Export the dataset to CSV or JSON for your workflow.

Pro tip: Automate runs via the Apify API to schedule and integrate this Yelp business listing scraper into enrichment or analytics pipelines.

Use cases

Use case nameDescription
Lead gen + enrichmentBuild prospect lists and enrich them with phoneNumber, categories, rating, and hours to prioritize outreach.
Local SEO benchmarkingTrack competitors’ ratings, reviews, and service attributes by location for SEO and growth insights.
Market research datasetsAggregate structured Yelp business data in bulk for analysis projects and trend reporting.
Data pipelines (API)Trigger runs via API and push results to a warehouse as a Yelp API alternative for business data.
Portfolio scoutingCompare businesses by categories, services, and sentiment signals for investment or partnership scouting.
Operations monitoringMonitor changes in hours, website links, or service flags to keep directories and apps up to date.

Why choose Yelp Business Info Scraper?

  • 🎯 Precision-first extraction using structured props, augmented with HTML parsing where needed.
  • 🧰 Python + Playwright foundation for teams who prefer a Yelp business data scraping tool that’s dev-friendly.
  • 📈 Built for batch runs — supports Yelp business data bulk scraping with live result streaming to datasets.
  • 🔒 Safer by design — always uses Apify Residential Proxy and applies stealth and delays to reduce blocks.
  • 💡 Consistent, schema-stable outputs that are easy to integrate into CRMs and analytics.
  • 🚫 No browser extensions or unstable hacks — a production-grade Yelp business directory scraper.
  • 🔗 Works great as a Yelp API alternative for business data when official endpoints don’t fit your use case.

In short: a reliable Yelp business details extractor designed for accuracy, scale, and automation.

Yes — when used responsibly. This actor collects data from publicly available Yelp business pages only. It does not access private or password-protected content.

Guidelines to follow:

  • Scrape only public business information.
  • Comply with applicable laws and regulations (e.g., GDPR, CCPA).
  • Respect website terms and use data responsibly.
  • Avoid scraping personal or sensitive data.

Always verify your specific use case with your legal team.

Input parameters & output format

Example input (JSON):

{
"startUrls": [
{ "url": "https://www.yelp.com/biz/dandelion-cafe-houston-3" },
{ "url": "https://www.yelp.com/biz/credence-houston-3" }
],
"proxyConfiguration": {
"useApifyProxy": true
}
}

Input parameter reference:

  • startUrls
    • Type: array
    • Description: One or more Yelp business page URLs to scrape. Example: https://www.yelp.com/biz/restaurant-name-city-1. You can paste many URLs for bulk scraping; the actor processes them one by one with anti-blocking delays.
    • Default: ["https://www.yelp.com/biz/dandelion-cafe-houston-3"]
    • Required: Yes
  • proxyConfiguration
    • Type: object
    • Description: This actor always uses Apify residential proxy (your choice here is ignored). Each request is retried up to 3 times. Ensure Apify proxy is available in your plan.
    • Default: { "useApifyProxy": true }
    • Required: No

Output format: One JSON object is pushed per business to the dataset with the fields below.

Example output item:

{
"title": "Dandelion Cafe",
"rating": "4.0",
"reviewCount": "99 reviews",
"city": "Houston",
"state": "TX",
"phoneNumber": "(832) 888-1568",
"url": "https://www.yelp.com/biz/dandelion-cafe-houston-3",
"yelp_biz_id": "h4UA0ul9Y3grjjQRBvcgXQ",
"status": "SUCCEEDED",
"source_url": "https://www.yelp.com/biz/dandelion-cafe-houston-3"
}

Complete field list (may be empty if unavailable on a page): title, rating, reviewCount, isClaimed, priceLevel, categories, fullAddress, city, state, zipcode, phoneNumber, images, website, hours, businessOwnerName, about, reviewhighlights, businessServices, yelp_biz_id, timestamp, url, is_page_not_found, status, source_url.

Notes:

  • Some fields can be empty strings, empty arrays, or empty objects when not present on the page.
  • Export your dataset to CSV or JSON from the Apify console.

FAQ

Does this actor use a proxy by default?

Yes. It always uses Apify Residential Proxy internally. Each request is retried up to 3 times to reduce blocking and improve reliability.

Can I provide URLs in different formats?

Yes. startUrls accepts a list of strings or an array of objects with a url key. The actor will normalize and process them sequentially.

Does it scrape Yelp search results or directories?

No. This is a Yelp business listing scraper for individual business pages. Provide direct business URLs (e.g., /biz/...) to scrape details.

Can it extract emails?

No. This is not a Yelp email extractor for businesses. It focuses on business information such as title, rating, reviews, phone, address, hours, categories, services, images, and website.

How many pages can I scrape at once?

You can paste many business URLs into startUrls. Results are saved live to the dataset as each page is processed, supporting Yelp business data bulk scraping workflows.

Is there an API or Python integration?

Yes. You can run the actor via the Apify API. The actor itself is implemented in Python with the Apify SDK and Playwright, making it a solid Yelp scraper for business information and a developer-friendly Yelp business scraper Python option.

What data types are included in the output?

Core fields include title, rating, reviewCount, phoneNumber, address components, website, images, hours, categories, businessServices, yelp_biz_id, url, status, and timestamp. See the Input parameters & output format section for details.

Is there a free trial or pricing?

The listing includes a monthly plan at $19.99 and a 120-minute trial window. Check the Apify actor page for the most up-to-date pricing and trial details.

Closing CTA / Final thoughts

Yelp Business Info Scraper is built for accurate, scalable extraction of business details from Yelp pages. With residential proxy enforcement, Playwright stealth, and a consistent schema, it’s ideal for marketers, developers, analysts, and researchers who need to scrape Yelp business listings reliably.

Use it as a Yelp API alternative for business data, enrich leads with phone and hours, and automate repeatable datasets. Developers can trigger runs via API and wire results into pipelines. Start extracting smarter business insights from Yelp at scale today.