Y Combinator API Scraper

Pricing

Pay per event

Developed by

ClearPath

Maintained by Community

Extract complete Y Combinator ecosystem data - 5000+ companies, 8000+ founders, 3500+ jobs. Perfect for VCs, recruiters, and researchers. Get startup intelligence, funding trends, team data, and job listings. Reliable Python scraper with proxy support. Start at $3.50.

Total users

5

Monthly users

5

Runs succeeded

86%

Last modified

a day ago

Y Combinator API Scraper | Extract Companies, Founders & Jobs Data

Extract comprehensive startup ecosystem data from Y Combinator including companies, founders, and job listings with this reliable Python-based scraper.

Features

  • Complete Y Combinator data extraction - Companies, founders, and jobs from the entire YC ecosystem
  • Flexible data selection - Choose to scrape companies, founders, jobs, or any combination
  • Comprehensive company data - Batch info, industry, funding stage, team size, and more
  • Detailed founder profiles - Background, experience, current roles, and company associations
  • Job listings with full details - Positions, requirements, compensation, and company context
  • Proxy support - Built-in proxy configuration for reliable access
  • Data consolidation - Intelligent merging of data from multiple sources for enhanced insights

Use Cases

For Investors & VCs

  • Deal flow analysis - Track YC companies by batch, industry, and growth stage
  • Market intelligence - Analyze funding trends and startup ecosystem patterns
  • Due diligence - Research company backgrounds, team composition, and hiring activity
  • Portfolio monitoring - Track portfolio companies and competitive landscape

For Recruiters & Talent Acquisition

  • Startup talent sourcing - Access comprehensive founder and employee data
  • Hiring trend analysis - Monitor job postings and compensation across YC companies
  • Market research - Understand skill demands and role requirements in the startup ecosystem
  • Company intelligence - Research target companies, team sizes, and growth patterns

For Entrepreneurs & Founders

  • Competitive analysis - Research similar companies, team structures, and market positioning
  • Networking opportunities - Identify potential co-founders, advisors, and industry connections
  • Market validation - Analyze successful companies in your space and their approaches
  • Hiring insights - Understand compensation trends and role requirements

For Researchers & Analysts

  • Startup ecosystem research - Comprehensive data for academic and market research
  • Trend analysis - Track technology adoption, industry evolution, and geographic patterns
  • Longitudinal studies - Historical data for tracking company and founder journeys
  • Economic analysis - Job creation, funding patterns, and economic impact studies

Quick Start

Extract All Data Types

{
    "scrapeAllCompanies": true,
    "scrapeAllFounders": true,
    "scrapeAllJobs": true
}

Companies and Founders Only

{
    "scrapeAllCompanies": true,
    "scrapeAllFounders": true,
    "scrapeAllJobs": false
}

Test Mode Configuration

{
    "scrapeAllCompanies": true,
    "scrapeAllFounders": true,
    "scrapeAllJobs": false
}

Input Parameters

| Parameter | Type | Description | Default |
|---|---|---|---|
| scrapeAllCompanies | Boolean | Extract all Y Combinator companies | true |
| scrapeAllFounders | Boolean | Extract all Y Combinator founders | true |
| scrapeAllJobs | Boolean | Extract all Y Combinator jobs | false |
| proxyConfiguration | Object | Proxy settings for reliable access | Optional |
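
As a rough sketch, the parameters above can be assembled and checked in Python before a run is started. The helper name `build_run_input` is hypothetical and not part of the actor; the defaults are copied from the table, and the validation rules are assumptions about what a sensible input looks like.

```python
from typing import Optional

# Defaults copied from the Input Parameters table.
DEFAULTS = {
    "scrapeAllCompanies": True,
    "scrapeAllFounders": True,
    "scrapeAllJobs": False,
}


def build_run_input(proxy: Optional[dict] = None, **flags: bool) -> dict:
    """Hypothetical helper: overlay caller flags on the documented defaults."""
    unknown = set(flags) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"Unknown parameter(s): {sorted(unknown)}")
    for name, value in flags.items():
        if not isinstance(value, bool):
            raise TypeError(f"{name} must be a boolean, got {type(value).__name__}")
    run_input: dict = {**DEFAULTS, **flags}
    if proxy is not None:
        # proxyConfiguration is optional, so it is only added when supplied.
        run_input["proxyConfiguration"] = proxy
    return run_input


# Jobs are off by default, so they have to be switched on explicitly.
print(build_run_input(scrapeAllJobs=True))
```

Rejecting unknown keys early catches typos such as `scrapeAllJob` before they reach the actor.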

Proxy Configuration

Proxies are recommended for reliable access:

{
    "proxyConfiguration": {
        "useApifyProxy": true,
        "apifyProxyGroups": []
    }
}

Output example

{
"data_source": "jobs",
"id": 64,
"name": "Y Combinator",
"slug": "y-combinator",
"website": "https://www.ycombinator.com",
"description": "Not your average fixed point combinator.",
"long_description": "Y Combinator has a small ~12 person team that makes the software that runs YC. Hardly any investors write software, but YC was started by programmers so it's natural for us to solve our problems that way. We believe our software is a key competitive advantage and we are investing aggressively in new software products.\r\n\r\nAs a member of the software team, you'll get full access to the YC program, just like founders do. You'll learn the ins and outs of how YC works, and you'll get to follow and learn from hundreds of companies. You'll meet some of the most successful people in the startup world and get exposed to the best startup ideas. And of course, if you ever want to start your own company, you’ll learn a lot about that from working at YC.\r\n ",
"team_size": 100,
"batch": null,
"industry": "Financial Technology and Services",
"subindustry": null,
"primary_vertical": "Financial Technology and Services",
"regions": [],
"tags": [],
"top_company": false,
"is_hiring": true,
"launched_at": null,
"status": null,
"small_logo_thumb_url": "https://bookface-images.s3.amazonaws.com/small_logos/2db694dd7905db37d037821a2fdaf9fa0708a964.png",
"all_locations": null,
"stage": null,
"nonprofit": false,
"pretty_location": null,
"lat": null,
"lng": null,
"country": null,
"website_display": "ycombinator.com",
"one_liner": "Not your average fixed point combinator.",
"twitter_url": "https://twitter.com/ycombinator",
"fb_url": "https://www.facebook.com/YCombinator/",
"tech_description": "The YC internal software is mostly written in Ruby on Rails and React. It runs on AWS and PostgreSQL. Previous experience developing web apps will be helpful, but we don't care if you've used React or Rails specifically.\r\n\r\nNote: [Hacker News](https://news.ycombinator.com/) is written in Lisp, but Hacker News is run by a [separate team](https://blog.ycombinator.com/two-hn-announcements/).",
"hiring_description": "Y Combinator has a small ~12 person team that makes the software that runs YC. Hardly any investors write software, but YC was started by programmers so it's natural for us to solve our problems that way. We believe our software is a key competitive advantage and we are investing aggressively in new software products.\r\n\r\nAs a member of the software team, you'll get full access to the YC program, just like founders do. You'll learn the ins and outs of how YC works, and you'll get to follow and learn from hundreds of companies. You'll meet some of the most successful people in the startup world and get exposed to the best startup ideas. And of course, if you ever want to start your own company, you’ll learn a lot about that from working at YC.\r\n ",
"jobs": [
{
"id": 78637,
"company_id": 64,
"state": "visible",
"title": "Product Engineer - App ops",
"skills": [
{
"id": 111,
"name": "React",
"popularity": 174
},
{
"id": 117,
"name": "Ruby on Rails",
"popularity": 69
},
{
"id": 99,
"name": "PostgreSQL",
"popularity": 79
}
],
"description": "Y Combinator is run by a small team that is committed to helping founders build their startup and make it take off like the next Airbnb, Stripe, Reddit or Doordash (all companies that previously went through YC). \r\n\r\nWhile YC is primarily known for it’s [batch program for founders](https://www.ycombinator.com/about), what’s lesser known is that we have our own small-but-mighty software team. We build tools to help run YC, as well as to help our founders have an unfair advantage — through our unparalleled founder network, expert advice from YC General Partners, exclusive funding opportunities, and much more.\r\n\r\nWorking at YC is a unique place to operating like a startup — talking to customers and shipping code rapidly — and also be surrounded and learning at the frontier of the new startup world.\r\n\r\n**About the role**\r\n\r\nWhile the software team itself is very small (\\~15 people), the impact of our software for our founders and for YC itself is quite massive. Some examples of the software you’d get to work on:\r\n\r\n* **YC’s application software.** Every batch, we review over 20K applicants, manage interviews with thousands of founders, and select just over 100 companies — all within 3 weeks. To accomplish this feat, we write serious power tools to help partners like Garry and Harj manage the process and make decisions, and we’re constantly working to make the software better.\r\n* **Founder fundraising software.** We’ve built the most critical tools for founders to raise a successful seed round: a leads pipeline from Demo Day, legal tools to execute [YC safes](https://www.ycombinator.com/documents), and an all-in-one CRM to manage investor relations. 
Founders regularly tell us that this is the best part of YC.\r\n* **Investment software.** YC manages over $6B in our fund, and we write software to help us manage it end-to-end: how to invest it, know where it’s allocated, and even use AI to keep our records up-to-date with what founders need from us.\r\n* **Hiring platform.** We’ve built a job marketplace with over 2M job seekers for founders to engage, as well a full suite of messaging and AI-assisted recruiting tools to help founders find great people and hire faster.\r\n\r\nAnd despite all of this, we still feel like we’re just getting started. We have a lot of great ideas on every front to improve how YC operates and to bring tremendous value to our startups, and many of them are 🤫.\r\n\r\n**About the team**\r\n\r\nAs a company, we largely abide by the same principles we teach our startups - so if you've [read Paul Graham's essays](https://paulgraham.com/hwh.html) or [watched our videos on YouTube](https://www.youtube.com/watch?v=z1iF1c8w5Lg), you can already predict what it would be like to work here.  \r\n\r\nMany of our software teammates have worked at or started startups themselves, while also having experience from bigger companies like Meta and Google. This tends to be a good balance to know how to ship quickly, but also make strong product and technical decisions that don’t leave us with dead ends. True to YC advice, our product engineers talk to our customers regularly and ship fast. We also define our own roadmap and often design our own products when needed. \r\n\r\nOur stack is pretty straightforward (Rails, React, Postgres), and the last three engineers have learned it on the job. That said, we’ve found that enjoying front-end software development is key to being successful here, so it’s almost a requirement for the role. Our needs as an organization require a large degree of flexibility, and people change teams quite often. 
\r\n\r\nMore and more, our founders are introducing AI in all they do — how they code, how their product works, and how they streamline their own operations. At YC, we’re similarly pushing the boundaries of how AI can improve our software and help our companies. We’re excited to share more in person — so please take a moment to apply. ;-)\r\n\r\n**Lastly:** If you’re interested in starting your own startup one day: eight former engineers on our team have gone on to start YC-funded startups.  If you want to start your own startup someday but aren’t ready to do it now, working at YC could be an excellent way to get there, provided you’re willing to be here for at least 2-3 years.\r\n\r\n**Location:** This is an in-person role at YC's campus in Dogpatch, San Francisco. This is where our users (founders, partners, and employees) are five days a week, so it’s optimal for you to be here with them. You must live in the SF Bay Area or be willing to relocate.  We offer generous relocation support for those who want to move to SF to work here.\r\n\r\n**Compensation:** $180,000 to $270,000 base (depending on skills and experience) plus bonus. \r\n\r\n**Benefits** : YC has a profit-sharing program, which is comparable to equity in an early-stage startup or carry in a VC fund.\r\n\r\nOur full benefits package includes medical, vision, and dental plans, infertility benefit, STD/LTD, life insurance, commuter benefits, flexible spending account, health savings account, 401(k) + 4% matching, generous parental leave, paid holidays, and flexible paid time off policy.\r\n\r\n**Work Authorization:** This position does not support work authorization/visa sponsorship. \r\n\r\n**Legal note:** Y Combinator considers qualified applicants with criminal histories, consistent with applicable federal, state, and local law, including San Francisco’s Fair Chance Ordinance. 
Y Combinator is committed to protecting the privacy of the personal information of job applicants and complying with the California Consumer Privacy Act. The privacy policy of Ashby, Inc., the hiring platform used by Y Combinator, governs the collection of such data and can be found [here](https://www.ashbyhq.com/resources/privacy).",
"salary_min": 180000,
"salary_max": 270000,
"equity_min": null,
"equity_max": null,
"show_path": "https://www.workatastartup.com/jobs/78637",
"interview_process": "Our interview process includes:\r\n\r\n* Phone screen with a member of the YC software team about your background and interest/fit for YC’s team. (30 minutes)\r\n* Technical screen in which you build a full stack app with an engineer on our team (60 minutes)\r\n* 1-1 with a General Partner at YC (60 minutes)\r\n* Full round, including interviews in React, Schema Design, Debugging, Scripting and lunch with the team. All technical prompts are with an engineer, and feel more like pair programming than puzzles or algorithms.\r\n* 1-1 with Garry (30 minutes)\r\n\r\nWe are able to go as fast as the candidate might need, and have extended (and had an offer accepted) in as few as 7 days.",
"time_to_hire": 7,
"pretty_job_type": "Full-time",
"pretty_eng_type": "Full stack",
"pretty_design_type": "",
"pretty_science_type": "",
"pretty_recruiting_type": "",
"pretty_sub_type": "Full stack",
"pretty_min_experience": "3+ years",
"pretty_min_school_year": "",
"pretty_location_or_remote": "San Francisco, CA, US",
"pretty_salary_range": "$180K - $270K",
"pretty_equity_range": "",
"pretty_role": "Engineering",
"pretty_updated_at": "07/16/2025",
"pretty_sponsors_visa": "US citizen/visa only",
"job_type": "fulltime",
"eng_type": [
"fs"
],
"design_type": null,
"design_portfolio_only": null,
"science_type": null,
"recruiting_type": null,
"min_experience": 3,
"location": null,
"locations": [
"San Francisco, CA, US"
],
"remote": "no",
"local_only": null,
"visa": "yes",
"hiring_manager": {
"first_name": "Jared",
"last_name": "Friedman",
"avatar_thumb": "https://bookface-images.s3.amazonaws.com/avatars/72ea194512fa6b6cbd5979e7e4e1e209513aa311.jpg"
}
},
{
"id": 77068,
"company_id": 64,
"state": "visible",
"title": "Product Engineer",
"skills": [
{
"id": 111,
"name": "React",
"popularity": 174
},
{
"id": 117,
"name": "Ruby on Rails",
"popularity": 69
},
{
"id": 99,
"name": "PostgreSQL",
"popularity": 79
}
],
"description": "Y Combinator is run by a small team that is committed to helping founders build their startup and make it take off like the next Airbnb, Stripe, Reddit or Doordash (all companies that previously went through YC). \r\n\r\nWhile YC is primarily known for it’s [batch program for founders](https://www.ycombinator.com/about), what’s lesser known is that we have our own small-but-mighty software team. We build tools to help run YC, as well as to help our founders have an unfair advantage — through our unparalleled founder network, expert advice from YC General Partners, exclusive funding opportunities, and much more.\r\n\r\nWorking at YC is a unique place to operating like a startup — talking to customers and shipping code rapidly — and also be surrounded and learning at the frontier of the new startup world.\r\n\r\n**About the role**\r\n\r\nWhile the software team itself is very small (\\~15 people), the impact of our software for our founders and for YC itself is quite massive. Some examples of the software you’d get to work on:\r\n\r\n* **YC’s application software.** Every batch, we review over 20K applicants, manage interviews with thousands of founders, and select just over 100 companies — all within 3 weeks. To accomplish this feat, we write serious power tools to help partners like Garry and Harj manage the process and make decisions, and we’re constantly working to make the software better.\r\n* **Founder fundraising software.** We’ve built the most critical tools for founders to raise a successful seed round: a leads pipeline from Demo Day, legal tools to execute [YC safes](https://www.ycombinator.com/documents), and an all-in-one CRM to manage investor relations. 
Founders regularly tell us that this is the best part of YC.\r\n* **Investment software.** YC manages over $6B in our fund, and we write software to help us manage it end-to-end: how to invest it, know where it’s allocated, and even use AI to keep our records up-to-date with what founders need from us.\r\n* **Hiring platform.** We’ve built a job marketplace with over 2M job seekers for founders to engage, as well a full suite of messaging and AI-assisted recruiting tools to help founders find great people and hire faster.\r\n\r\nAnd despite all of this, we still feel like we’re just getting started. We have a lot of great ideas on every front to improve how YC operates and to bring tremendous value to our startups, and many of them are 🤫.\r\n\r\n**About the team**\r\n\r\nAs a company, we largely abide by the same principles we teach our startups - so if you've [read Paul Graham's essays](https://paulgraham.com/hwh.html) or [watched our videos on YouTube](https://www.youtube.com/watch?v=z1iF1c8w5Lg), you can already predict what it would be like to work here.  \r\n\r\nMany of our software teammates have worked at or started startups themselves, while also having experience from bigger companies like Meta and Google. This tends to be a good balance to know how to ship quickly, but also make strong product and technical decisions that don’t leave us with dead ends. True to YC advice, our product engineers talk to our customers regularly and ship fast. We also define our own roadmap and often design our own products when needed. \r\n\r\nOur stack is pretty straightforward (Rails, React, Postgres), and the last three engineers have learned it on the job. That said, we’ve found that enjoying front-end software development is key to being successful here, so it’s almost a requirement for the role. Our needs as an organization require a large degree of flexibility, and people change teams quite often. 
\r\n\r\nMore and more, our founders are introducing AI in all they do — how they code, how their product works, and how they streamline their own operations. At YC, we’re similarly pushing the boundaries of how AI can improve our software and help our companies. We’re excited to share more in person — so please take a moment to apply. ;-)\r\n\r\n**Lastly:** If you’re interested in starting your own startup one day: eight former engineers on our team have gone on to start YC-funded startups.  If you want to start your own startup someday but aren’t ready to do it now, working at YC could be an excellent way to get there, provided you’re willing to be here for at least 2-3 years.\r\n\r\n**Location:** This is an in-person role at YC's campus in Dogpatch, San Francisco. This is where our users (founders, partners, and employees) are five days a week, so it’s optimal for you to be here with them. You must live in the SF Bay Area or be willing to relocate.  We offer generous relocation support for those who want to move to SF to work here.\r\n\r\n**Compensation:** $180,000 to $270,000 base (depending on skills and experience) plus bonus. \r\n\r\n**Benefits** : YC has a profit-sharing program, which is comparable to equity in an early-stage startup or carry in a VC fund.\r\n\r\nOur full benefits package includes medical, vision, and dental plans, infertility benefit, STD/LTD, life insurance, commuter benefits, flexible spending account, health savings account, 401(k) + 4% matching, generous parental leave, paid holidays, and flexible paid time off policy.\r\n\r\n**Work Authorization:** This position does not support work authorization/visa sponsorship. \r\n\r\n**Legal note:** Y Combinator considers qualified applicants with criminal histories, consistent with applicable federal, state, and local law, including San Francisco’s Fair Chance Ordinance. 
Y Combinator is committed to protecting the privacy of the personal information of job applicants and complying with the California Consumer Privacy Act. The privacy policy of Ashby, Inc., the hiring platform used by Y Combinator, governs the collection of such data and can be found [here](https://www.ashbyhq.com/resources/privacy).",
"salary_min": 180000,
"salary_max": 270000,
"equity_min": null,
"equity_max": null,
"show_path": "https://www.workatastartup.com/jobs/77068",
"interview_process": "Our interview process includes:\r\n\r\n* Phone screen with a member of the YC software team about your background and interest/fit for YC’s team. (30 minutes)\r\n* Technical screen in which you build a full stack app with an engineer on our team (60 minutes)\r\n* 1-1 with a General Partner at YC (60 minutes)\r\n* Full round, including interviews in React, Schema Design, Debugging, Scripting and lunch with the team. All technical prompts are with an engineer, and feel more like pair programming than puzzles or algorithms.\r\n* 1-1 with Garry (30 minutes)\r\n\r\nWe are able to go as fast as the candidate might need, and have extended (and had an offer accepted) in as few as 7 days.",
"time_to_hire": 7,
"pretty_job_type": "Full-time",
"pretty_eng_type": "Full stack",
"pretty_design_type": "",
"pretty_science_type": "",
"pretty_recruiting_type": "",
"pretty_sub_type": "Full stack",
"pretty_min_experience": "3+ years",
"pretty_min_school_year": "",
"pretty_location_or_remote": "San Francisco, CA, US",
"pretty_salary_range": "$180K - $270K",
"pretty_equity_range": "",
"pretty_role": "Engineering",
"pretty_updated_at": "07/17/2025",
"pretty_sponsors_visa": "US citizen/visa only",
"job_type": "fulltime",
"eng_type": [
"fs"
],
"design_type": null,
"design_portfolio_only": null,
"science_type": null,
"recruiting_type": null,
"min_experience": 3,
"location": null,
"locations": [
"San Francisco, CA, US"
],
"remote": "no",
"local_only": null,
"visa": "yes",
"hiring_manager": {
"first_name": "Ryan",
"last_name": "Choi",
"avatar_thumb": "https://bookface-images.s3.amazonaws.com/avatars/43ea37a60668944c16f04b954b3f5114059217b9.jpg"
}
}
],
"founders": [
{
"id": 936,
"first_name": "Paul",
"last_name": "Graham",
"full_name": "Paul Graham",
"founder_bio": "",
"linkedin": "",
"avatar_thumb": "https://bookface-images.s3.amazonaws.com/avatars/3ef7989989265a8648c1c18e2c69d926eb001df3.jpg",
"past_companies": null,
"past_schools": null
},
{
"id": 1132,
"first_name": "Jessica",
"last_name": "Livingston",
"full_name": "Jessica Livingston",
"founder_bio": "",
"linkedin": "https://www.linkedin.com/in/jessicalivingston1",
"avatar_thumb": "https://bookface-images.s3.amazonaws.com/avatars/0a2ef071d73847ee50df8efea844874a0283c8d9.jpg",
"past_companies": null,
"past_schools": null
}
],
"company_news": [
{
"id": 3530,
"title": "The Circuit - Inside Silicon Valley's Famous Startup School",
"url": "https://www.bloomberg.com/news/videos/2023-08-11/the-circuit-inside-silicon-valley-s-famous-startup-school",
"image_url": "",
"posted_at": "2023-08-11T19:00:00.000Z",
"created_at": "2023-08-11T17:08:09.799Z",
"updated_at": "2023-08-16T14:47:58.479Z",
"domain": "www.bloomberg.com"
},
{
"id": 3524,
"title": "Getting Into Y Combinator Is Tougher Than It’s Ever Been",
"url": "https://www.bloomberg.com/news/articles/2023-08-10/y-combinator-applications-show-access-is-the-toughest-ever",
"image_url": "",
"posted_at": "2023-08-10T19:00:00.000Z",
"created_at": "2023-08-10T14:17:59.055Z",
"updated_at": "2023-08-10T14:17:59.055Z",
"domain": "www.bloomberg.com"
},
{
"id": 3492,
"title": "Want to get in Y Combinator? Don't do this | Sifted",
"url": "https://sifted.eu/articles/applying-to-y-combinator-tips",
"image_url": "https://images.sifted.eu/wp-content/uploads/2023/07/28171807/nicolas-dessaigne-e1690561294565.jpg?w=899&h=454&q=75&fit=crop&auto=compress,format",
"posted_at": "2023-07-30T19:00:00.000Z",
"created_at": "2023-07-31T16:30:02.835Z",
"updated_at": "2023-07-31T16:30:02.835Z",
"domain": "sifted.eu"
},
{
"id": 3456,
"title": "Early interviews for AI companies - apply by Tuesday, July 18, 2023 ",
"url": "https://www.ycombinator.com/blog/early-interviews-for-ai-companies",
"image_url": "https://www.ycombinator.com/blog/content/images/2023/07/Early-interviews-for-AI-companies---apply-by-Tuesday-July-18--2023---Y-Combinator-2023-07-14-16-56-16.png",
"posted_at": "2023-07-15T19:00:00.000Z",
"created_at": "2023-07-15T10:40:22.946Z",
"updated_at": "2023-07-16T13:19:44.657Z",
"domain": "www.ycombinator.com"
},
{
"id": 3400,
"title": "Garry Tan breaks down the key metric in Y Combinator's new list of top startups | Fortune",
"url": "https://fortune.com/2023/06/27/garry-tan-revenue-y-combinator-list-top-startups/?tpcc=NL_Marketing",
"image_url": "https://content.fortune.com/wp-content/uploads/2023/06/Term-Sheet-Garry-Tan-CEO-president-of-Y-Combinator.jpg?resize=1200,600",
"posted_at": "2023-06-27T19:00:00.000Z",
"created_at": "2023-06-28T13:11:47.851Z",
"updated_at": "2023-06-29T14:46:39.231Z",
"domain": "fortune.com"
}
],
"_source_companies": false,
"_source_jobs": true,
"_source_founders_enhanced": false,
"_metadata": {
"scraped_at": "2025-07-17T11:17:34.552308Z",
"scraping_options": {
"companies": true,
"founders": true,
"jobs": true
},
"proxy_used": false
}
}
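
The record above nests job listings inside a company object, which is awkward for spreadsheets or data frames. The following is a minimal sketch of flattening one record into one row per job. It assumes every record has the shape shown in the sample; the function name `job_rows` and the choice of columns are illustrative only, and the sample record is trimmed to the fields the sketch reads.

```python
def job_rows(record: dict) -> list:
    """Flatten one consolidated company record into one dict per job."""
    rows = []
    # `or []` / `or {}` guard against keys that are present but null.
    for job in record.get("jobs") or []:
        manager = job.get("hiring_manager") or {}
        name_parts = (manager.get("first_name"), manager.get("last_name"))
        rows.append({
            "company": record.get("name"),
            "title": job.get("title"),
            "salary_min": job.get("salary_min"),
            "salary_max": job.get("salary_max"),
            "locations": ", ".join(job.get("locations") or []),
            "skills": ", ".join(s["name"] for s in job.get("skills") or []),
            "hiring_manager": " ".join(p for p in name_parts if p),
            "url": job.get("show_path"),
        })
    return rows


# Trimmed copy of the sample record, keeping only the fields used above.
sample = {
    "name": "Y Combinator",
    "jobs": [{
        "title": "Product Engineer - App ops",
        "skills": [{"name": "React"}, {"name": "Ruby on Rails"}, {"name": "PostgreSQL"}],
        "salary_min": 180000,
        "salary_max": 270000,
        "show_path": "https://www.workatastartup.com/jobs/78637",
        "locations": ["San Francisco, CA, US"],
        "hiring_manager": {"first_name": "Jared", "last_name": "Friedman"},
    }],
}

for row in job_rows(sample):
    print(row)
```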

Data Sources

The scraper intelligently consolidates data from multiple Y Combinator sources:

  • Companies: Core company information from YC's main database
  • Founders: Detailed founder profiles and background information
  • Jobs: Current job listings from the Work at a Startup platform
  • Data Consolidation: Intelligent merging of founder data into company records
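
The scraper's actual merging logic is not documented here. The sketch below only illustrates the general idea of folding founder data into company records, under the assumption that companies and founders arrive as separate lists. The `company_id` field on founders and the function name `merge_founders` are hypothetical.

```python
from collections import defaultdict


def merge_founders(companies: list, founders: list) -> list:
    """Attach each founder to its company record (illustrative only)."""
    by_company = defaultdict(list)
    for founder in founders:
        # Assumption: each founder carries the id of the company it belongs to.
        by_company[founder["company_id"]].append(founder)

    merged = []
    for company in companies:
        record = dict(company)  # copy so the input list is left untouched
        record["founders"] = by_company.get(company["id"], [])
        merged.append(record)
    return merged


companies = [{"id": 64, "name": "Y Combinator"}]
founders = [
    {"id": 936, "full_name": "Paul Graham", "company_id": 64},
    {"id": 1132, "full_name": "Jessica Livingston", "company_id": 64},
]
print(merge_founders(companies, founders))
```

Grouping founders by company first keeps the merge to a single pass over each list instead of scanning all founders for every company.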

Pricing

Pay-per-extraction pricing model:

| Data Type | Description | Count | Price |
|---|---|---|---|
| Companies | All YC companies with detailed information | 5,000+ | $5.00 |
| Founders | All YC founders with profiles and background | 8,000+ | $3.50 |
| Jobs | All current job listings on Work at a Startup | 3,500+ | $7.50 |

Pricing Examples

  • Companies only: $5.00
  • Founders only: $3.50
  • Jobs only: $7.50
  • Companies + Founders: $8.50
  • Complete dataset: $16.00

Note: You only pay for the data types you choose to extract. Failed extractions are not charged.
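
The arithmetic behind these examples is a simple sum over the selected data types. Here is a small sketch using the listed prices; the function name `estimate_cost` is hypothetical, and the figures may differ from what the platform actually bills.

```python
from decimal import Decimal

# Per-extraction prices as listed in the pricing table.
PRICES = {
    "companies": Decimal("5.00"),
    "founders": Decimal("3.50"),
    "jobs": Decimal("7.50"),
}


def estimate_cost(companies: bool = False, founders: bool = False, jobs: bool = False) -> Decimal:
    """Sum the listed price of every selected data type."""
    selected = {"companies": companies, "founders": founders, "jobs": jobs}
    return sum((PRICES[name] for name, on in selected.items() if on), Decimal("0.00"))


print(estimate_cost(companies=True, founders=True))             # 8.50
print(estimate_cost(companies=True, founders=True, jobs=True))  # 16.00
```

`Decimal` is used instead of `float` so the totals stay exact to the cent.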

API Integration

Python Example

from apify_client import ApifyClient

client = ApifyClient("your_token")

# Extract all data types (the Python client takes the input via run_input)
run = client.actor("ycombinator-api-scraper").call(run_input={
    "scrapeAllCompanies": True,
    "scrapeAllFounders": True,
    "scrapeAllJobs": True,
})

# Process results
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    if "companies" in item:
        print(f"Found {len(item['companies'])} companies")
    if "founders" in item:
        print(f"Found {len(item['founders'])} founders")
    if "jobs" in item:
        print(f"Found {len(item['jobs'])} jobs")

JavaScript Example

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'your_token' });

// Extract companies and founders only
const run = await client.actor('ycombinator-api-scraper').call({
    scrapeAllCompanies: true,
    scrapeAllFounders: true,
    scrapeAllJobs: false
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Extracted ${items.length} consolidated records`);

Data Filtering Examples

1. Investment Analysis - Filter High-Growth Companies

Use Case: VCs and investors looking for Series A+ companies from recent batches in specific industries.

Python Example:

from apify_client import ApifyClient

client = ApifyClient("your_token")

# Extract all company data
run = client.actor("ycombinator-api-scraper").call(run_input={
    "scrapeAllCompanies": True,
    "scrapeAllFounders": True,
    "scrapeAllJobs": False,
})

# Filter for investment opportunities
target_companies = []
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    # Focus on recent batches with a team of at least 10 that is still hiring.
    # `or 0` / `or []` guard against fields that are present but null.
    if (item.get('batch') in ['Summer 2023', 'Winter 2024', 'Summer 2024'] and
            (item.get('team_size') or 0) >= 10 and
            item.get('industry') in ['Fintech', 'Healthcare', 'Developer Tools'] and
            item.get('is_hiring') is True):
        target_companies.append({
            'name': item['name'],
            'batch': item['batch'],
            'industry': item['industry'],
            'team_size': item['team_size'],
            'website': item['website'],
            'founders_count': len(item.get('founders') or []),
            'active_jobs': len(item.get('jobs') or []),
        })

print(f"Found {len(target_companies)} investment targets")

JavaScript Example:

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'your_token' });

const run = await client.actor('ycombinator-api-scraper').call({
    scrapeAllCompanies: true,
    scrapeAllFounders: true,
    scrapeAllJobs: false
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();

// Filter for investment opportunities
const targetCompanies = items.filter(item => {
    return ['Summer 2023', 'Winter 2024', 'Summer 2024'].includes(item.batch) &&
        (item.team_size || 0) >= 10 &&
        ['Fintech', 'Healthcare', 'Developer Tools'].includes(item.industry) &&
        item.is_hiring === true;
}).map(item => ({
    name: item.name,
    batch: item.batch,
    industry: item.industry,
    team_size: item.team_size,
    website: item.website,
    founders_count: (item.founders || []).length,
    active_jobs: (item.jobs || []).length
}));

console.log(`Found ${targetCompanies.length} investment targets`);

2. Talent Acquisition - Remote Engineering Opportunities

Use Case: Engineers and recruiters looking for remote opportunities in specific tech stacks.

Python Example:

from apify_client import ApifyClient

client = ApifyClient("your_token")

# Extract companies with jobs data
run = client.actor("ycombinator-api-scraper").call(run_input={
    "scrapeAllCompanies": True,
    "scrapeAllFounders": False,
    "scrapeAllJobs": True,
})

# Filter for remote engineering jobs
remote_opportunities = []
target_skills = ['React', 'Python', 'Node.js', 'TypeScript', 'PostgreSQL']

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    for job in item.get('jobs') or []:
        # Salary fields can be null, so fall back to 0 before comparing
        salary_min = job.get('salary_min') or 0
        salary_max = job.get('salary_max') or 0
        # Check if job is remote and matches tech stack
        if (job.get('remote') in ['yes', 'only'] and
                job.get('pretty_role') == 'Engineering' and
                salary_min >= 100000):
            # Check if job requires target skills
            job_skills = [skill['name'] for skill in job.get('skills') or []]
            matching_skills = [skill for skill in target_skills if skill in job_skills]
            if matching_skills:
                remote_opportunities.append({
                    'company': item['name'],
                    'job_title': job['title'],
                    # Integer division avoids output such as "$180.0K"
                    'salary_range': f"${salary_min // 1000}K - ${salary_max // 1000}K",
                    'skills': job_skills,
                    'matching_skills': matching_skills,
                    'location': job.get('pretty_location_or_remote'),
                    'job_url': job.get('show_path'),
                })

print(f"Found {len(remote_opportunities)} remote engineering opportunities")

JavaScript Example:

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'your_token' });

const run = await client.actor('ycombinator-api-scraper').call({
    scrapeAllCompanies: true,
    scrapeAllFounders: false,
    scrapeAllJobs: true
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();

// Filter for remote engineering jobs
const targetSkills = ['React', 'Python', 'Node.js', 'TypeScript', 'PostgreSQL'];
const remoteOpportunities = [];

items.forEach(item => {
    (item.jobs || []).forEach(job => {
        if (['yes', 'only'].includes(job.remote) &&
            job.pretty_role === 'Engineering' &&
            (job.salary_min || 0) >= 100000) {
            const jobSkills = (job.skills || []).map(skill => skill.name);
            const matchingSkills = targetSkills.filter(skill => jobSkills.includes(skill));
            if (matchingSkills.length > 0) {
                remoteOpportunities.push({
                    company: item.name,
                    job_title: job.title,
                    salary_range: `$${(job.salary_min || 0) / 1000}K - $${(job.salary_max || 0) / 1000}K`,
                    skills: jobSkills,
                    matching_skills: matchingSkills,
                    location: job.pretty_location_or_remote,
                    job_url: job.show_path
                });
            }
        }
    });
});

console.log(`Found ${remoteOpportunities.length} remote engineering opportunities`);

3. Market Research - Founder Experience Analysis

Use Case: Researchers analyzing founder backgrounds and geographic distribution for market insights.

Python Example:

from apify_client import ApifyClient
from collections import Counter
client = ApifyClient("your_token")
# Extract founders data
run = client.actor("ycombinator-api-scraper").call({
"scrapeAllCompanies": True,
"scrapeAllFounders": True,
"scrapeAllJobs": False
})
# Analyze founder patterns
founder_analysis = {
    'by_industry': Counter(),
    'by_batch': Counter(),
    'by_region': Counter(),
    'repeat_founders': []
}
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    company_batch = item.get('batch')
    company_industry = item.get('industry')
    for founder in item.get('founders', []):
        # Count founders by industry and batch
        if company_industry:
            founder_analysis['by_industry'][company_industry] += 1
        if company_batch:
            founder_analysis['by_batch'][company_batch] += 1
        # Geographic analysis
        region = founder.get('current_region')
        if region:
            founder_analysis['by_region'][region] += 1
        # Identify repeat founders (multiple batches)
        founder_batches = founder.get('batches', [])
        if len(founder_batches) > 1:
            founder_analysis['repeat_founders'].append({
                'name': founder.get('full_name'),
                'batches': founder_batches,
                'companies': founder.get('all_companies_text'),
                'current_company': item['name']
            })
# Print insights
print("Top 5 Industries by Founder Count:")
for industry, count in founder_analysis['by_industry'].most_common(5):
    print(f"  {industry}: {count} founders")
print(f"\nFound {len(founder_analysis['repeat_founders'])} repeat founders")

JavaScript Example:

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'your_token' });
const run = await client.actor('ycombinator-api-scraper').call({
  scrapeAllCompanies: true,
  scrapeAllFounders: true,
  scrapeAllJobs: false
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
// Analyze founder patterns
const founderAnalysis = {
  byIndustry: {},
  byBatch: {},
  byRegion: {},
  repeatFounders: []
};
items.forEach(item => {
  const companyBatch = item.batch;
  const companyIndustry = item.industry;
  (item.founders || []).forEach(founder => {
    // Count founders by industry and batch
    if (companyIndustry) {
      founderAnalysis.byIndustry[companyIndustry] = (founderAnalysis.byIndustry[companyIndustry] || 0) + 1;
    }
    if (companyBatch) {
      founderAnalysis.byBatch[companyBatch] = (founderAnalysis.byBatch[companyBatch] || 0) + 1;
    }
    // Geographic analysis
    const region = founder.current_region;
    if (region) {
      founderAnalysis.byRegion[region] = (founderAnalysis.byRegion[region] || 0) + 1;
    }
    // Identify repeat founders (multiple batches)
    const founderBatches = founder.batches || [];
    if (founderBatches.length > 1) {
      founderAnalysis.repeatFounders.push({
        name: founder.full_name,
        batches: founderBatches,
        companies: founder.all_companies_text,
        current_company: item.name
      });
    }
  });
});
// Print insights
const topIndustries = Object.entries(founderAnalysis.byIndustry)
  .sort(([, a], [, b]) => b - a)
  .slice(0, 5);
console.log("Top 5 Industries by Founder Count:");
topIndustries.forEach(([industry, count]) => {
  console.log(`  ${industry}: ${count} founders`);
});
console.log(`\nFound ${founderAnalysis.repeatFounders.length} repeat founders`);

Advanced Usage

Research Configuration

For academic research with the complete dataset:

{
"scrapeAllCompanies": true,
"scrapeAllFounders": true,
"scrapeAllJobs": true,
"proxyConfiguration": {
"useApifyProxy": true
}
}

Recruitment Focus

For talent acquisition and hiring intelligence:

{
"scrapeAllCompanies": true,
"scrapeAllFounders": true,
"scrapeAllJobs": true
}

Investment Analysis

For VC and investment research:

{
"scrapeAllCompanies": true,
"scrapeAllFounders": true,
"scrapeAllJobs": false
}
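
The three presets above differ only in their boolean flags and the optional proxy block, so they can be generated from one small helper. This is a minimal sketch: `build_run_input` and its parameter names are hypothetical, and only the input keys shown in the presets above are used.

```python
import json


def build_run_input(companies=True, founders=True, jobs=True, use_apify_proxy=False):
    """Build an input object with the same keys as the presets above.

    Hypothetical helper, not part of the scraper itself.
    """
    run_input = {
        "scrapeAllCompanies": companies,
        "scrapeAllFounders": founders,
        "scrapeAllJobs": jobs,
    }
    # Only add the proxy block when requested, as in the research preset
    if use_apify_proxy:
        run_input["proxyConfiguration"] = {"useApifyProxy": True}
    return run_input


research_input = build_run_input(use_apify_proxy=True)
investment_input = build_run_input(jobs=False)
print(json.dumps(investment_input, indent=2))
```

The resulting dictionary can be passed as the input argument of `client.actor(...).call(...)` in the Python examples earlier in this document.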

Data Export

Export results in multiple formats:

  • JSON - Structured data for programmatic analysis
  • CSV - Spreadsheet analysis and reporting
  • Excel - Advanced data manipulation and visualization
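
Because jobs are nested inside each company item, a flat one-row-per-job CSV can be easier to work with in a spreadsheet. The following is a minimal sketch using only the Python standard library. It assumes items shaped like those in the examples above (`name`, `batch`, `jobs`, `title`, `remote`, `salary_min`, `salary_max`); the function name `jobs_to_csv` and the sample data are hypothetical.

```python
import csv
import io


def jobs_to_csv(items):
    """Flatten company items into CSV text with one row per job listing."""
    fieldnames = ["company", "batch", "job_title", "remote", "salary_min", "salary_max"]
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for item in items:
        # Companies without jobs contribute no rows
        for job in item.get("jobs") or []:
            writer.writerow({
                "company": item.get("name"),
                "batch": item.get("batch"),
                "job_title": job.get("title"),
                "remote": job.get("remote"),
                "salary_min": job.get("salary_min"),
                "salary_max": job.get("salary_max"),
            })
    return buffer.getvalue()


# Hypothetical sample data for illustration only
sample_items = [
    {"name": "ExampleCo", "batch": "W21", "jobs": [
        {"title": "Backend Engineer", "remote": "yes",
         "salary_min": 120000, "salary_max": 160000},
    ]},
    {"name": "NoJobsInc", "batch": "S19", "jobs": []},
]
csv_text = jobs_to_csv(sample_items)
print(csv_text)
```

In practice, `sample_items` would be replaced by the items iterated from the run's dataset.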

Data Quality

  • Comprehensive coverage - All YC companies from 2005 to present
  • Fresh data - Job listings and company information are current as of each run
  • Data consolidation - Intelligent merging of founder profiles with company data
  • Validation - Automatic data quality checks and error handling

Technical Requirements

  • Proxy: Apify proxy recommended for reliable access
  • Runtime: Typically 1-2 minutes for a complete extraction, plus about 2 minutes to store the results in the dataset (~45 MB for the full dataset)
  • Rate limits: Built-in request management and retry logic

Automation

Set up automated data collection:

  • Scheduled runs - Weekly or monthly extraction for trend analysis
  • Webhooks - Real-time notifications for new data
  • API integration - Connect with CRM, ATS, or research tools
  • Data pipelines - Automated processing and analysis workflows
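
For trend analysis across scheduled runs, one simple approach is to compare the items from two runs. The sketch below is illustrative only: `diff_snapshots` is a hypothetical helper, the sample data is made up, and it assumes the company `name` field is unique enough to serve as a key.

```python
def diff_snapshots(previous_items, current_items):
    """Report companies added, removed, or with a changed number of job listings."""
    # Map company name -> number of job listings for each run
    prev = {item["name"]: len(item.get("jobs") or []) for item in previous_items}
    curr = {item["name"]: len(item.get("jobs") or []) for item in current_items}
    return {
        "new_companies": sorted(set(curr) - set(prev)),
        "removed_companies": sorted(set(prev) - set(curr)),
        "job_count_changes": {
            name: curr[name] - prev[name]
            for name in sorted(set(curr) & set(prev))
            if curr[name] != prev[name]
        },
    }


# Hypothetical items from two consecutive scheduled runs
last_week = [
    {"name": "Alpha", "jobs": [{"title": "Engineer"}]},
    {"name": "Beta", "jobs": []},
]
this_week = [
    {"name": "Alpha", "jobs": [{"title": "Engineer"}, {"title": "Designer"}]},
    {"name": "Gamma"},
]
changes = diff_snapshots(last_week, this_week)
print(changes)
```

A processing step triggered after each scheduled run could apply a comparison like this to the previous and current datasets.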

FAQ

Q: What data is included for companies? A: Company name, batch, industry, description, team size, funding stage, website, location, company news, and more.

Q: How current is the job data? A: Job listings are current as of the extraction time.

Q: Can I get historical data? A: Yes, the scraper includes companies from all YC batches since 2005, providing historical perspective.

Q: How does data consolidation work? A: The scraper intelligently merges founder profiles with company records, providing enhanced insights into team composition.

Q: How reliable is the extraction? A: The scraper includes retry logic, error handling, and proxy support. With the companies, founders, and jobs options all enabled, a run typically takes 1-2 minutes to generate the complete dataset and about 2 more minutes to store it (~45 MB for the full dataset).

Q: Can I extract only specific data types? A: Yes, you can choose any combination of companies, founders, and jobs based on your needs.

Q: Is the data suitable for academic research? A: Absolutely! The comprehensive dataset is perfect for startup ecosystem research, economic analysis, and longitudinal studies.

Getting Started

  1. Create your Apify account
  2. Configure your extraction - Choose data types and settings
  3. Run the scraper - Extract Y Combinator data in minutes
  4. Export results - Download as JSON, CSV, or Excel
  5. Set up automation - Schedule regular runs for ongoing analysis

Use Cases by Industry

Investment & VC

  • Deal flow analysis and company tracking
  • Market intelligence and trend identification
  • Portfolio monitoring and competitive analysis

Recruitment & HR

  • Startup talent sourcing and market research
  • Compensation analysis and hiring trends
  • Company intelligence and team composition

Research & Academia

  • Startup ecosystem analysis and economic research
  • Entrepreneurship studies and trend analysis
  • Innovation patterns and geographic insights

Business Development

  • Partnership opportunities and market analysis
  • Competitive intelligence and industry mapping
  • Customer discovery and market validation

Support

  • 📧 Email: max@mapa.slmail.me
  • 📖 Found a bug? Use the issues tab and describe your issue
  • 🔧 Feature Requests: Contact via email or issues tab for additional features

This Y Combinator API scraper extracts publicly available data from Y Combinator's websites and platforms. Users must comply with Y Combinator's terms of service and applicable data protection regulations for their intended use.


🚀 Start Extracting Y Combinator Data