Y Combinator Scraper avatar

Y Combinator Scraper

Pricing

from $2.00 / 1,000 results

Go to Apify Store
Y Combinator Scraper

Y Combinator Scraper

[πŸ’° $2.0 / 1K] Extract YC-funded companies, founders, and open jobs from the Y Combinator startup directory. Filter by batch, industry, region, status, team size, or hiring activity β€” or paste any directory URL with filters pre-applied.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

SolidCode

SolidCode

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Pull every YC-funded company, founder, and open job from ycombinator.com/companies at scale β€” complete with batch codes, industry verticals, hiring flags, founder LinkedIn and X profiles, salary and equity ranges, and seven social handles per company. Built for VC sourcing teams, recruiters, and B2B sales teams who need a fresh, structured YC alumni dataset without hand-crafting filter URLs or stitching together founder bios one tab at a time.

Why This Scraper?

  • Full directory coverage β€” 44 batches from W05 through F25 β€” every cohort in YC's public alumni list, from the 2005 founding class to the latest Fall batch, all in one run.
  • Eight industry verticals, six team-size bands, eleven regions β€” multi-select filters for B2B, Consumer, Fintech, Healthcare, Industrials, Education, Government, Real Estate & Construction, plus team-size bands from solo founder up to 500+ and regions including the auto-expanding "Asia" group (covers South Asia, Southeast Asia, and East Asia in one click).
  • Founders embedded on every row β€” full name, title, bio, LinkedIn URL, and X (Twitter) URL for every co-founder, typically 1 to 4 per company, no extra credits charged.
  • Open jobs with salary and equity ranges β€” when a company is hiring, every posting comes with title, location, role, salary range, equity range, required skills, years of experience, and visa-sponsorship status.
  • Seven social handles per company β€” website, LinkedIn, X, Facebook, Crunchbase, GitHub, plus the canonical YC profile URL.
  • isHiring quick-filter β€” flip a single toggle to return only companies actively recruiting today, no URL building required.
  • Three input modes in one actor β€” paste filter URLs (?batch=W24&industry=B2B), paste single-company URLs (/companies/airbnb), or use the guided multi-select filters. Mix all three in the same run.
  • Free-text search across the directory β€” search by company name, tagline, or technology keyword ("AI agents", "developer tools") and get exactly what the YC search box returns.
  • camelCase output, ready for production β€” clean, predictable field names that drop straight into databases, dashboards, and CRMs without renaming.

Use Cases

VC Sourcing & Pipeline Building

  • Track every new batch the moment it goes live and triage by vertical
  • Build a watchlist of Active companies in your thesis area
  • Surface acquired and public alumni to study exit patterns and founder follow-ons

Recruiting & Talent Sourcing

  • Pull every YC company actively hiring with isHiring: true plus includeJobs: true
  • Filter by team-size band (51 to 200) to target Series A-stage startups
  • Capture founder LinkedIn URLs to map alumni networks for warm intros

B2B Sales & ABM

  • Build a YC-backed account list for go-to-market motions targeting startup buyers
  • Segment by industry vertical (B2B, Fintech) and team size to qualify ICPs
  • Refresh on a schedule to catch new launches before competitors do

Competitive & Market Research

  • Map every company in a vertical (e.g. all 2024 fintech batch graduates)
  • Track industry composition trends across batches over time
  • Build adjacency maps from the tags and industries[] fields

Press & Media Monitoring

  • Watch for newly launched companies tagged Top Company
  • Build founder profile lookups across batches for feature pieces
  • Monitor acquisition status changes for scoop coverage

Getting Started

Quick Start

The simplest possible run β€” fetches the first 100 companies from the full YC directory with founders embedded:

{}

Filter by Batch, Industry, and Hiring Activity

Pull every B2B startup from the two most recent winter and summer batches that's actively hiring:

{
"batches": ["W24", "S24"],
"industries": ["B2B"],
"isHiring": true,
"maxResults": 200
}

Free-Text Search by Region

Search for AI-agent companies headquartered in the United States:

{
"searchQuery": "AI agents",
"regions": ["United States of America"],
"maxResults": 50
}

Search by URL with Full Job Postings

Paste any YC directory URL with filters pre-applied β€” the actor parses the querystring and runs the same search. Add includeJobs: true to capture the full open-jobs list:

{
"startUrls": [
"https://www.ycombinator.com/companies?batch=W24&industry=Fintech&isHiring=true"
],
"includeJobs": true,
"maxResults": 100
}

Single-Company Deep-Dive

Paste one or more individual company URLs to pull just those rows with full founder and job detail:

{
"startUrls": [
"https://www.ycombinator.com/companies/airbnb",
"https://www.ycombinator.com/companies/stripe"
],
"includeFounders": true,
"includeJobs": true
}

Input Reference

Sources

ParameterTypeDefaultDescription
startUrlsstring[][]YC directory URLs (https://www.ycombinator.com/companies?...) or individual company URLs (https://www.ycombinator.com/companies/<slug>). Mix any combination. Leave empty to use the guided filters below.
searchQuerystringnullFree-text query against the YC directory. Matches company name, tagline, and tags.

Filters

ParameterTypeDefaultDescription
batchesstring[][]Funding cohorts. Multi-select from 44 batches (Winter 2005 through Fall 2025, plus Imagine K12). Leave empty for all batches.
statusstring[][]Operating status: Active, Public, Acquired, or Inactive. Leave empty for all statuses.
regionsstring[][]Headquarters region. 11 options including United States, Canada, Europe, United Kingdom, Asia, India, Latin America, Africa, Middle East, Oceania, and Remote.
industriesstring[][]Industry vertical: B2B, Consumer, Education, Fintech, Government, Healthcare, Industrials, or Real Estate and Construction.
teamSizestring[][]Team-size band: 1 (solo founder), 2 to 10, 11 to 50, 51 to 200, 201 to 500, or 500+.
isHiringbooleanfalseIf on, return only companies with at least one open job listed on YC.

What to Extract

ParameterTypeDefaultDescription
includeFoundersbooleantrueEmbed each company's founders on the result row β€” name, title, bio, LinkedIn, and X profile.
includeJobsbooleanfalseEmbed each company's open job listings on the result row β€” title, location, salary, equity, required skills, years of experience, and visa sponsorship.

Limits

ParameterTypeDefaultDescription
maxResultsinteger100Hard cap on total company rows. Set to 0 for no cap (an internal upper limit of 10,000 still applies). The actor stops requesting new pages once this number is reached but keeps the full final page even if it slightly overshoots.

Output

One row per company. founders[] and jobs[] ride along as embedded arrays controlled by the toggles above.

{
"companyId": "1234",
"name": "Airbnb",
"slug": "airbnb",
"url": "https://www.ycombinator.com/companies/airbnb",
"logo": "https://bookface-images.s3.amazonaws.com/...",
"shortDescription": "Book accommodations around the world.",
"longDescription": "Founded in August of 2008 and based in San Francisco...",
"batch": "Winter 2009",
"status": "Public",
"industry": "Consumer",
"subindustry": "Travel, Leisure and Tourism",
"industries": ["Consumer", "Travel, Leisure and Tourism"],
"tags": ["Marketplace", "Travel"],
"location": "San Francisco, CA, USA",
"regions": ["United States of America"],
"country": "USA",
"yearFounded": 2008,
"teamSize": 6132,
"primaryPartner": "Paul Graham",
"website": "http://airbnb.com",
"linkedin": "https://www.linkedin.com/company/airbnb",
"twitter": "https://twitter.com/airbnb",
"facebook": "https://www.facebook.com/airbnb",
"crunchbase": "https://www.crunchbase.com/organization/airbnb",
"github": null,
"isHiring": false,
"openJobsCount": 0,
"nonprofit": false,
"topCompany": true,
"launchedAt": 1234567890,
"founders": [
{
"id": 5678,
"name": "Brian Chesky",
"title": "Co-Founder & CEO",
"bio": "Brian Chesky is the co-founder and CEO of Airbnb...",
"linkedin": "https://www.linkedin.com/in/brianchesky",
"twitter": "https://twitter.com/bchesky"
}
],
"jobs": [],
"scrapedAt": "2026-05-02T14:30:00Z"
}

Core Fields

FieldTypeDescription
companyIdstringYC's internal company identifier
namestringCompany name
slugstringURL slug used in /companies/<slug>
urlstringCanonical YC profile URL
batchstringBatch name (e.g. Winter 2024, Fall 2025)
statusstringActive, Public, Acquired, or Inactive
industrystringTop-level industry vertical
subindustrystringMore specific industry category
industriesstring[]All industry tags applied by YC
tagsstring[]Topic tags (e.g. Developer Tools, AI, Marketplace)
teamSizenumberSelf-reported employee count
nonprofitbooleanWhether the company is a nonprofit
topCompanybooleanYC's "Top Company" badge flag

Description & Branding

FieldTypeDescription
shortDescriptionstringOne-line tagline as shown in the directory
longDescriptionstringFull company blurb from the YC profile page
logostringCompany logo URL
primaryPartnerstringYC partner attached to the company
yearFoundednumberFounding year
launchedAtnumberUnix epoch when the YC profile went live

Location

FieldTypeDescription
locationstringHeadquarters city and region
regionsstring[]YC region tags (e.g. United States of America, Remote)
countrystringHeadquarters country parsed from the location string

Hiring

FieldTypeDescription
isHiringbooleanWhether the company is actively hiring on YC
openJobsCountnumberNumber of open jobs at scrape time
jobsobject[]Open job postings (when includeJobs: true)

Each jobs[] entry contains:

FieldTypeDescription
idstringYC job posting identifier
titlestringJob title
locationstringPosted location
remotebooleanWhether the role is remote-friendly
typestringFull-time, part-time, intern, contract, or co-founder
rolestringEngineering, Design, Product, etc.
salaryMin / salaryMaxstringSalary range as published
equityMin / equityMaxstringEquity range as published
skillsstring[]Required skills tags
yearsExperienceMinnumberMinimum years of experience
visaSponsorshipbooleanWhether the company sponsors work visas
applyUrlstringDirect application link
descriptionstringFull job description body

Founders

When includeFounders: true, each row carries a founders[] array (1 to 4 entries typical).

FieldTypeDescription
idstringYC founder identifier
namestringFull name
titlestringTitle at the company (e.g. Co-Founder & CEO)
biostringFounder biography from the YC profile
linkedinstringFounder LinkedIn URL
twitterstringFounder X (Twitter) URL
FieldTypeDescription
websitestringCompany website
linkedinstringCompany LinkedIn URL
twitterstringCompany X (Twitter) URL
facebookstringCompany Facebook URL
crunchbasestringCompany Crunchbase URL
githubstringCompany GitHub URL
scrapedAtstringISO-8601 UTC timestamp of the run

Tips for Best Results

  • Combine batches and industries for tighter cohorts β€” for example, "all W24 + S24 fintech startups" is one filter combination away.
  • Use searchQuery for fuzzy matching β€” works across company names, taglines, and tags. Great for thematic sweeps like "developer tools" or "AI agents".
  • Founder emails are not publicly listed by Y Combinator β€” the actor returns LinkedIn and X URLs instead, which are typically more useful for warm outreach anyway.
  • Turn includeJobs on for hiring sweeps β€” captures full job descriptions, salary ranges, equity ranges, and visa-sponsorship status in a single pass. Skip it when you only need company and founder data for a faster, leaner run.
  • Logo URLs are short-lived signed links β€” expect them to expire about an hour after the run. Download or rehost the images promptly if you plan to display them yourself.
  • Mix URL inputs and guided filters in one run β€” paste a YC filter URL alongside a few specific /companies/<slug> URLs and the guided filters; the actor handles all three input modes in the same dataset.
  • For full coverage of one batch, set maxResults: 0 β€” the actor automatically fans out across the directory's per-query ceiling and returns every company in the cohort.

Pricing

$2.00 per 1,000 companies returned. No compute charges β€” you only pay per result returned.

CompaniesEstimated Cost
100$0.20
1,000$2.00
10,000$20.00
100,000$200.00

A "result" is one company row, regardless of how many founders or open jobs ride along in the embedded arrays. Platform fees (compute, storage) are additional and depend on your Apify plan.

Integrations

Export data in JSON, CSV, Excel, XML, or RSS. Connect to 1,500+ apps via:

  • Zapier / Make / n8n β€” Workflow automation
  • Google Sheets β€” Direct spreadsheet export
  • Airtable β€” Structured database sync
  • Slack / Email β€” Notifications on new results
  • Webhooks β€” Trigger custom APIs on run completion
  • Apify API β€” Full programmatic access via REST, JavaScript, and Python SDKs

This actor is designed for legitimate sourcing, recruiting, market research, and competitive analysis use cases. The data extracted is published openly by Y Combinator on its public alumni directory at ycombinator.com/companies. Users are responsible for complying with applicable laws and Y Combinator's terms of service. Do not use extracted data for spam, harassment, or any illegal purpose. Be mindful of personal-data and outreach-consent regulations (GDPR, CCPA, CAN-SPAM) when contacting founders found in the dataset.