Y Combinator Companies Scraper
Pricing
from $1.00 / 1,000 results
Y Combinator Companies Scraper
Scrape the full Y Combinator company directory with company profiles, founders, open jobs, batch, industry, status, and social links. HTTP-only, no login required.
Pricing
from $1.00 / 1,000 results
Rating
3.6
(10)
Developer
Crawler Bros
Actor stats
10
Bookmarked
1
Total users
0
Monthly active users
6 days ago
Last modified
Categories
Share
Scrape the complete Y Combinator company directory — 5,000+ startups across every batch from 2005 to today. Get company profiles, founders, open jobs, industry tags, batch, status, funding stage, team size, and social links. HTTP-only; no login, no cookies, no proxy required.
Output (per company)
type=yc_companyid(slug),slug,url,name,kind(startup / non-profit / etc.)shortDescription,longDescription,pitch(full pitch text when present)batch(e.g.S24),industry,subIndustry,status(Active / Inactive / Acquired / Public),stage(Seed / Series A / etc.)location,allLocations,regions,foundingYear,launchedAt(unix),teamSizewebsite,linkedin,twitter,facebook,crunchbase,wellfound,githublogo,logoThumb,demoDayVideoisHiring,isCompanyHiring,jobCounttags— when Algolia returns themformerNames— list, when the company has rebrandedtopCompany,topCompanyBadge,ycdcBadgeName,nonprofit— when flaggedquestionsAndAnswers—[{ question, answer }]short founder Q&A blocksteamHighlights— list of blurbs about the teamhighlightBlackFounders,highlightWomenFounders,highlightHispanicFounders— only when Algolia explicitly flags themfounders—[{ name, title, bio, email, linkedin, twitter, hackerNews, github, instagram }]whenscrapeFounders = trueopenJobs—[{ title, url, applyUrl, location, remote, type, role, team, yearsExperience, salaryMin, salaryMax, compensationCurrency, equity, equityRange, skills, experience, visa, visaSupported, englishFluent }]whenscrapeOpenJobs = truescrapedAt
If zero companies match the filters, a single yc_company_blocked sentinel record is emitted so runs always exit 0.
Input
| Field | Type | Description |
|---|---|---|
directoryUrl | string | YC directory URL. Default: https://www.ycombinator.com/companies. |
query | string | Optional free-text search (?q=<query>). |
batch | enum | any, S24, W24, F24, S23, W23, S22, W22, S21, W21. |
industry | string | Exact-match industry filter (e.g. B2B, Consumer, Fintech, Healthcare). |
status | enum | any, Active, Inactive, Acquired, Public. |
scrapeFounders | boolean | Fetch founder details per company. Default true. |
scrapeOpenJobs | boolean | Fetch open job postings per company. Default true. |
regions | string[] | Optional region filter (case-insensitive) — matched against location. |
tags | string[] | Optional tag filter (case-insensitive) — matched against industry, subIndustry, tags. |
highlightBlackFounders | boolean | Only include companies flagged with Black founders. |
highlightWomenFounders | boolean | Only include companies flagged with women founders. |
highlightHispanicFounders | boolean | Only include companies flagged with Hispanic / Latino founders. |
maxItems | integer | Max companies per run (1–5500). Default 3. |
How it works
- Query Y Combinator's public Algolia search index (
YCCompany_production) for companies matching your filters. Pagination is handled transparently. - For each company, optionally fetch its detail page (
/companies/<slug>) and parse the Inertiadata-pageJSON blob to get founders and open jobs. - Jobs expose salary range, equity, skills, visa policy, and the apply URL.
- Output uses a strict no-nulls contract — every field present is non-empty.
FAQ
Do I need a proxy? No. YC is publicly accessible from datacenter IPs.
Does the scraper need YC credentials? No. All data comes from public endpoints.
How many companies are in the directory? About 5,800 across all batches (growing each cycle). maxItems caps per run at 5,500.
Are historical founders included? Yes — every company's founder list is preserved on its public profile, including exits.
Why does jobCount sometimes differ from the directory badge? YC's directory badge counts only open job postings; we return the exact set embedded in the profile page.
What's the yc_company_blocked record? When your filter returns zero matches (e.g. a typo in industry), we emit one sentinel record so downstream pipelines never see an empty output.