Y Combinator Companies Directory Scraper avatar

Y Combinator Companies Directory Scraper

Pricing

from $2.00 / 1,000 results

Go to Apify Store
Y Combinator Companies Directory Scraper

Y Combinator Companies Directory Scraper

Scrape all 5900+ Y Combinator portfolio companies — every batch from 2005 to present. Filter by batch, industry, status, team size, location, tags, or free-text. Returns founders, website, description, industry, location, launch year. Optional per-company enrichment with LinkedIn/Twitter founders.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Logiover

Logiover

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Scrape all 5900+ Y Combinator portfolio companies across every batch from 2005 onward. Returns name, batch, status, founders (with LinkedIn/Twitter), website, descriptions, industry, subindustry, tags, team size, location, launch year, hiring status.

Uses a daily-refreshed open data mirror of YC's official directory (no auth, no key) plus optional per-company page scrape for founders + launches.

Features

  • Get every YC company in one call (~6000 records, ~3 MB)
  • Rich filtering: batch, industry, subindustry, status, team size range, country/location, tags, free-text
  • Optional: enrich each company with founders (name, role, LinkedIn, Twitter, bio) and launch posts
  • All YC batches: from S05 to W26
  • Stable schema across the run — handles YC's occasional field renames

Input

{
"filters": {
"batch": "W26",
"industry": "B2B",
"status": "Active",
"minTeamSize": 5,
"location": "United States",
"tags": ["AI"],
"isHiring": true
},
"searchQuery": "agent",
"maxResults": 100,
"fetchCompanyPageDetails": false
}
FieldTypeDefaultNotes
filtersobject{}Combined AND. Empty = all companies
searchQuerystring""Substring match on name + one_liner + long_description
maxResultsint100Cap returned (full directory ~6000)
fetchCompanyPageDetailsboolfalse+1 HTTP req per company for founders & launches

Filter fields

KeyMatchExample
batchexact"W26", "S25", "W24", etc.
statusexact"Active", "Acquired", "Inactive"
industrysubstring"B2B", "Consumer", "Fintech", "Healthcare"
subindustrysubstring"Marketing", "Devtools"
locationsubstring"San Francisco", "London"
countrysubstring"United States", "India"
minTeamSize / maxTeamSizerange5, 100
tagsall-required["AI", "Productivity"]
isHiringexacttrue

Output

{
"id": 12345,
"name": "Acme AI",
"slug": "acme-ai",
"formerNames": [],
"batch": "W26",
"status": "Active",
"oneLiner": "AI agents for customer support",
"longDescription": "Acme AI builds autonomous agents that...",
"website": "https://acme.ai",
"ycUrl": "https://www.ycombinator.com/companies/acme-ai",
"logoUrl": "https://bookface-images.s3.amazonaws.com/...",
"allLocations": "San Francisco, United States",
"country": "United States",
"industry": "B2B",
"subindustry": "B2B -> Sales",
"tags": ["AI", "Generative AI", "Customer Support"],
"tagsHighlighted": ["AI"],
"teamSize": 8,
"launchedAt": "2026-02-15T00:00:00.000Z",
"stage": "Seed",
"isHiring": true,
"regions": ["America / Canada", "Remote"],
"founders": [
{ "name": "Jane Doe", "title": "CEO", "bio": "...", "avatarUrl": "...", "linkedinUrl": "https://linkedin.com/in/janedoe", "twitterUrl": null }
],
"launches": [
{ "id": 99, "slug": "acme-ai", "title": "Acme AI: AI agents for support teams", "tagline": "...", "postedAt": "2026-02-20T...", "url": "https://www.ycombinator.com/launches/acme-ai" }
],
"scrapedAt": "2026-05-18T12:00:00.000Z"
}

Use cases

  • VC sourcing: find recent batches in specific industries (e.g. all W26 fintech with team>5)
  • Sales prospecting: export YC startups in your target market with founder LinkedIn URLs
  • Recruiting: filter isHiring=true to find startups currently building teams
  • Competitive intel: track competitors in your YC batch + sub-industry
  • Trend research: count companies per industry/batch over time
  • Investor pipeline: identify Series A candidates by team size + batch age
  • Founder outreach: filter by tag (e.g. "ML Infra") and bulk-email founders

Notes

  • Open data source: primary source is yc-oss.github.io/api/companies/all.json — community-maintained mirror updated daily from YC's official internal API. Reliable for last 24h freshness.
  • For fresher data (intra-day, breaking batch announcements), the actor scrapes individual YC company pages when fetchCompanyPageDetails=true.
  • No bot detection: YC's data is intentionally public for founder/funder visibility.
  • Founder enrichment is opt-in because it adds one HTTP request per company — for the full ~6000 directory that's significant. Filter first, enrich after.
  • Batch codes: W{YY} = Winter, S{YY} = Summer, F{YY} = Fall (newer), IK12 = old Imagine K12 cohort.
  • Status values: Active (still operating), Acquired, Inactive (defunct).

Changelog

  • 2026-05-20 — Maintenance pass: reviewed the input schema and default values for a smooth one-click start, and rebuilt the Actor on the latest base image.

Last reviewed: 2026-05-20.