USA Clinic Scraper avatar

USA Clinic Scraper

Under maintenance

Pricing

Pay per usage

Go to Apify Store
USA Clinic Scraper

USA Clinic Scraper

Under maintenance

Scrapes Yelp clinic listings and enriches them with email and social data from clinic websites.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Darshan Patel

Darshan Patel

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

7 days ago

Last modified

Categories

Share

What does USA Clinic Scraper do?

This Actor searches Yelp for clinic businesses in selected U.S. cities and specialties, then enriches each listing with contact details from the clinic website. It is built for lead generation workflows where you need clinic name, phone, address, website, emails, and social profiles in one dataset.

You can run it directly on Apify and immediately connect outputs to API-based systems, schedules, webhooks, integrations, monitoring, and proxy rotation. If you are testing locally, run with apify run; for production automation, deploy with apify push.

Why use USA Clinic Scraper?

  • Build targeted local outreach lists for healthcare marketing teams.
  • Find clinics with missing or outdated contact details and enrich records.
  • Track regional clinic coverage by specialty (for example, dermatology vs mental health).
  • Reduce manual research time by combining listing-page and website-level extraction.
  • Use one dataset output that can be consumed by sales ops, CRM imports, BI tools, and internal dashboards.

How to use USA Clinic Scraper

  1. Open the Actor in Apify Console and go to the Input tab.
  2. Add one or more cities (for example, "New York, NY").
  3. Add one or more specialties (for example, "dermatologist").
  4. Set maxPages to control Yelp pagination depth per city/specialty combination.
  5. Keep maxRequestsPerCrawl as a safety limit (or set 0 for no hard cap).
  6. Configure proxyConfiguration (Apify Proxy is recommended for stable runs).
  7. Click Start and wait for completion.
  8. Open the Output tab and download or consume data via API.

Input

Configure everything in the Actor Input tab.

{
"cities": ["New York, NY", "Los Angeles, CA"],
"specialties": ["dermatologist", "mental health clinic"],
"maxPages": 3,
"maxRequestsPerCrawl": 500,
"proxyConfiguration": {
"useApifyProxy": true
}
}

Input field notes:

  • cities (array of strings, required): U.S. city/state search locations. Default is preloaded with one major city from each U.S. state (50 total).
  • specialties (array of strings, required): Yelp search keywords.
  • maxPages (integer): Number of Yelp pages to scan for each city/specialty pair.
  • maxRequestsPerCrawl (integer): Global request cap for run safety.
  • proxyConfiguration (object): Proxy settings for better reliability.

Output

Each dataset item represents one clinic record with Yelp listing data and optional website enrichment.

[
{
"name": "Hudson Dermatology Center",
"city": "New York, NY",
"specialty": "dermatologist",
"phone": "(212) 555-1111",
"address": "123 Main St, New York, NY",
"website": "https://hudsonderm.com",
"emails": ["info@hudsonderm.com"],
"ownerEmails": ["dr.jane@hudsonderm.com"],
"facebook": "https://www.facebook.com/hudsonderm",
"instagram": "https://www.instagram.com/hudsonderm",
"twitter": null,
"linkedin": null,
"facebookProfiles": ["https://www.facebook.com/hudsonderm"],
"instagramProfiles": ["https://www.instagram.com/hudsonderm"],
"twitterProfiles": [],
"linkedinProfiles": [],
"youtubeProfiles": [],
"tiktokProfiles": [],
"allSocialProfiles": [
"https://www.facebook.com/hudsonderm",
"https://www.instagram.com/hudsonderm"
],
"source": "https://hudsonderm.com/contact"
},
{
"name": "MindBridge Wellness Clinic",
"city": "Los Angeles, CA",
"specialty": "mental health clinic",
"phone": "(310) 555-2222",
"address": "456 Ocean Ave, Los Angeles, CA",
"website": null,
"emails": [],
"ownerEmails": [],
"facebook": null,
"instagram": null,
"twitter": null,
"linkedin": null,
"facebookProfiles": [],
"instagramProfiles": [],
"twitterProfiles": [],
"linkedinProfiles": [],
"youtubeProfiles": [],
"tiktokProfiles": [],
"allSocialProfiles": [],
"source": "https://www.yelp.com/search?find_desc=mental%20health%20clinic"
}
]

You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Data table

FieldTypeDescription
namestringClinic business name
citystringCity used in search
specialtystringSpecialty keyword used
phonestring/nullPhone parsed from listing
addressstring/nullAddress parsed from listing
websitestring/nullClinic website URL if found
emailsarrayUnique emails detected on website body
ownerEmailsarrayLikely owner/decision-maker emails (best-effort heuristic)
facebookstring/nullFirst Facebook URL found
instagramstring/nullFirst Instagram URL found
twitterstring/nullFirst Twitter/X URL found
linkedinstring/nullFirst LinkedIn URL found
facebookProfilesarrayAll discovered Facebook profile/page links
instagramProfilesarrayAll discovered Instagram profile links
twitterProfilesarrayAll discovered Twitter/X profile links
linkedinProfilesarrayAll discovered LinkedIn profile links
youtubeProfilesarrayAll discovered YouTube links
tiktokProfilesarrayAll discovered TikTok profile links
allSocialProfilesarrayDe-duplicated union of all social links found
sourcestringURL where enrichment data was captured

Pricing / Cost estimation

How much does it cost to scrape Yelp clinic data? Cost depends mostly on:

  • Number of cities and specialties
  • maxPages value
  • Number of clinic websites visited for enrichment
  • Proxy usage and retries

For small runs (single city, 1-2 specialties, low page depth), usage is typically low and often fits within free-tier experimentation. For larger lead-generation batches, start with smaller test runs, validate output quality, then scale gradually.

Tips or Advanced options

  • Start with maxPages: 1 to validate keyword quality before scaling.
  • Use precise specialty terms to reduce irrelevant businesses.
  • Keep maxRequestsPerCrawl enabled as a budget guardrail.
  • Schedule recurring runs (daily/weekly) to keep outreach data fresh.
  • If contact details are critical, keep proxy enabled and run during off-peak hours for better response stability.

FAQ, disclaimers, and support

This Actor is a technical tool. You are responsible for complying with applicable laws, website Terms of Service, and privacy requirements in your jurisdiction and target market.

Why are some fields empty?

Not all listings publish full contact data. Some clinics hide email addresses, do not link websites, or block automated requests.

Does local apify run upload output to Apify Console?

No. Local storage remains in the local storage/ folder. To verify Cloud output in Console, deploy and run on Apify platform.

Need help or customizations?

Use the Actor Issues tab for bugs/feedback. For custom extraction logic, CRM mapping, or private deployments, you can request a tailored version.