✨ Y Combinator Scraper Apify avatar
✨ Y Combinator Scraper Apify

Pricing

$5.00 / 1,000 results

Go to Store
✨ Y Combinator Scraper Apify

✨ Y Combinator Scraper Apify

Developed by

Imad

Imad

Maintained by Community

⚡ Scrape Y Combinator’s startup directory with rich data: company info, founders, job postings, batch, team size, social links, and more. Filter by industry, region, or hiring status. Ideal for lead gen, VC scouting, and recruiting. Clean JSON output ready for Airtable, Notion, or Excel.

5.0 (1)

Pricing

$5.00 / 1,000 results

1

Total users

3

Monthly users

3

Runs succeeded

>99%

Last modified

2 days ago

Y Combinator Scraper

Easily extract structured data on thousands of startups from YCombinator.com — including company descriptions, websites, founders, job postings, batch info, team size, LinkedIn, Crunchbase links, and more.

🚀 Whether you're a VC, recruiter, founder, journalist, or researcher — this scraper saves hours of manual work.


✅ What This Scraper Does

This Apify actor lets you scrape Y Combinator startup profiles in bulk using their public company directory. With just a few clicks, you'll get:

  • ✅ Company name, website, and YC page link
  • ✅ Short and long descriptions
  • ✅ Batch (e.g. Summer 2024), stage, status
  • ✅ Team size, location, founding year
  • ✅ Founders with social media profiles
  • ✅ Active job postings (title, location, type, salary)
  • ✅ Social links: LinkedIn, X (Twitter), Facebook, Crunchbase
  • ✅ Industry tags and advanced filters (like isHiring, region, nonprofit)

📦 Example Use Cases

  • Lead generation: Build targeted B2B lead lists based on industry, team size, and region
  • Recruiting: Find hiring startups and their open roles
  • Investor research: Track new YC companies, founder bios, and product pitches
  • Market intelligence: Analyze trends across batches, industries, or geographies
  • News & media: Source verified startup info for journalism or newsletters

🛠️ Input Options

Inputs The scraper is flexible. You can control the data you want via the following inputs:

directUrls (array of URLs)

Paste search result URLs from YCombinator.com, with filters applied.

Example:

[
"https://www.ycombinator.com/companies?batch=Winter%202025&industry=B2B&isHiring=true"
]

searchQuery (optional, string)

Search for specific keywords in the company name, description, or tags.

sort (optional, string)

Choose how results are sorted:

  • default: Y Combinator’s default sort
  • launch_date: Newest companies first

JSON example:

{
"directUrls": [
"https://www.ycombinator.com/companies?batch=Winter%202025&batch=Summer%202024&industry=B2B&isHiring=true&query=arv&team_size=%5B%225%22%2C%22500%22%5D"
],
"searchQuery": "ar",
"sort": "default"
}

📤 Output Data Format

Each scraped company will be saved as a JSON object in the Apify dataset. Here's a breakdown of the fields you’ll receive:

FieldTypeDescription
idstringUnique identifier of the company from Y Combinator
logostringURL to the company's logo image
namestringCompany name
yc_urlstringDirect link to the company's Y Combinator profile page
websitestringCompany’s official website URL
short_descriptionstringOne-liner description from YC
long_descriptionstringDetailed description of what the company does
batchstringY Combinator batch (e.g. "Winter 2025")
statusstringStartup status in the YC ecosystem (e.g. "active", "dead", "acquired")
stagestringStage of the company, if available (e.g. "pre-launch", "launched")
tagsarrayArray of industry tags assigned by YC
locationstringHQ or main location of the company
year_foundedintegerYear the company was founded
team_sizeintegerReported team size (min/max if available)
linkedinstringLink to company’s LinkedIn page
xstringLink to company’s X (Twitter) profile
facebookstringLink to company’s Facebook page
crunchbasestringLink to Crunchbase profile
job_postingsarrayList of current job openings posted on YC
titlestringJob title
urlstringDirect link to the job posting
locationstringJob location
typestringEmployment type (e.g. full-time, intern)
salarystringSalary range (if available)
experiencestringRequired experience (if listed)
foundersarrayList of founders and their public profiles
full_namestringFounder’s full name
titlestringTitle or role at the company
avatarstringThumbnail/avatar image of the founder
linkedinstringFounder’s LinkedIn profile
xstringFounder’s Twitter (X) profile

You can download the dataset in JSON, CSV, or XLSX format — or use the Apify API to fetch the data programmatically.


Example JSON Output:

[
{
"id": 29973,
"logo": "https://bookface-images.s3.us-west-2.amazonaws.com/logos/4e428f0bdd367f8e923ff5d314b4a4394e11ccd7.png?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAHGNCKMI3%2F20250716%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20250716T214732Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEE0aCXVzLXdlc3QtMiJGMEQCIDjs%2FQg4zO30Z%2BYa80v4ekkNTPAWdSYtLHkZmV8YlhG0AiB89MskitEQd1Z6AZJ0LYx%2BGRpTDbY%2B63hu02UyMfFukSrlAwhmEAAaDDAwNjIwMTgxMTA3MiIM8AFe%2F52HPppDdE%2BoKsIDquMtiUdKBV9U9j7zhu3XXuenlRfreSsmF6kZPvt%2F7qxiHkskqyqOMLPQ1zAHH5E7YmonqIVMDwHMSXNOQsy4iYU7q%2B%2FhxjhpK39iHA51ZzL08hg4eB%2Fo5viH%2FxtbNBqCbikbOV%2FTWIeKaQcGqaiKFrSyKla6MPwtKSDQkAIhYeo4raPIM7Uw8thv6nXZfv2B4o97kPRoh7L41lHMUnzzthJdlfpXnZ3RUcB75kGM%2Bz9CKcaKrYM33%2F2FmePZdgPrbWM7uia8zGoZJ6My9wUQe%2BW33ar4R78lY2FvRdHjHNlLjhTJQnNe0qBfUBXRYx3BBs6v20j9XZVvIMR4LiFqY9JlmGpf4I4FPnUFF6v4Sowrjqar7Rz35KywqLLwIYsy%2F9pA6SpUNGsmtKhCJIW3%2BuwUyljiuzjDjehnPfjQeLgUMwte60FpC3eVPCMYkczCdLB%2BM7IwIJqsXBKk6QzF99hp9Oge3hQlyx43CVmthA2TgAtme1cnKbVrHy5tsM8QDNFIJB1%2BcVT7rjBxFu8EAVR5y2cEza0%2FbEavD1ccrqPySVTgpfjEASeAF1QTdVcaRmB%2F5dLMUxAkceEUxlVaxHNjMLeW4MMGOqYByse2D31Es9Uff1lmcvouVTicar2hp9fzsZecqKh46D%2FF3KA2FVj8HCx6gYPpx93RUukCJiG2gOmIcMysXAaATJ%2FInCvqGYFpAFycCekMDvbLsC3O%2FpTLNKsbatGD%2FSrRs7wqIC4pFC5llKnNItpgV4kCKElrFocxBWZ07o%2B5ZWAXYr247Kvfy%2BIQq9plaID%2Fn2D14TeFMDTM6HIkAeONzC9liPZpOQ%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=d007f9865e5e9c7f404243aeffa7c9bf0c8b9f5fd919fc8b6929ef4e4b3fe7e9",
"name": "Arva AI",
"yc_url": "https://www.ycombinator.com/companies/arva-ai",
"website": "https://www.arva.ai",
"short_description": "AI Agents to scale AML, KYB and KYC operations",
"long_description": "Banks and fintechs have large teams of human analysts who conduct manual checks and reviews. We replace those human compliance analysts with AI agents that automate 80% of the manual work for faster and more compliant financial crime (AML) reviews.\r\n\r\nOur first three products are: Screening AI (discount 91% of screening alerts), KYB/C AI (speed up onboarding), Transaction Monitoring AI (handle AML transaction monitoring alerts)\r\n\r\nOur end goal is to have a whole suite of AI workers that can handle manual compliance work for banks and fintechs. A $24B opportunity in the US alone.",
"batch": "Summer 2024",
"status": "Active",
"stage": "Early",
"tags": [
"Artificial Intelligence",
"Fintech",
"B2B",
"Compliance",
"Regtech"
],
"location": null,
"year_founded": 2024,
"team_size": 8,
"linkedin": "https://www.linkedin.com/company/arva-ai/",
"x": "https://x.com/arva_ai",
"facebook": null,
"crunchbase": "https://www.crunchbase.com/organization/arva-ai",
"job_postings": [
{
"title": "AI Research Engineer",
"url": "http://ycombinator.com/companies/arva-ai/jobs/f6APYpN-ai-research-engineer",
"location": "London, England, GB",
"type": "Full-time",
"salary": "£80K - £120K GBP",
"experience": "3+ years"
},
{
"title": "GTM Lead",
"url": "http://ycombinator.com/companies/arva-ai/jobs/dT2oqNZ-gtm-lead",
"location": "London, England, GB",
"type": "Full-time",
"salary": "£75K - £95K GBP",
"experience": "3+ years"
}
],
"founders": [
{
"avatar": "https://bookface-images.s3.us-west-2.amazonaws.com/avatars/560201fdbc53ae709c42ebe943c4888d076e5e7a.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAHGNCKMI3%2F20250716%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20250716T214732Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEE0aCXVzLXdlc3QtMiJGMEQCIDjs%2FQg4zO30Z%2BYa80v4ekkNTPAWdSYtLHkZmV8YlhG0AiB89MskitEQd1Z6AZJ0LYx%2BGRpTDbY%2B63hu02UyMfFukSrlAwhmEAAaDDAwNjIwMTgxMTA3MiIM8AFe%2F52HPppDdE%2BoKsIDquMtiUdKBV9U9j7zhu3XXuenlRfreSsmF6kZPvt%2F7qxiHkskqyqOMLPQ1zAHH5E7YmonqIVMDwHMSXNOQsy4iYU7q%2B%2FhxjhpK39iHA51ZzL08hg4eB%2Fo5viH%2FxtbNBqCbikbOV%2FTWIeKaQcGqaiKFrSyKla6MPwtKSDQkAIhYeo4raPIM7Uw8thv6nXZfv2B4o97kPRoh7L41lHMUnzzthJdlfpXnZ3RUcB75kGM%2Bz9CKcaKrYM33%2F2FmePZdgPrbWM7uia8zGoZJ6My9wUQe%2BW33ar4R78lY2FvRdHjHNlLjhTJQnNe0qBfUBXRYx3BBs6v20j9XZVvIMR4LiFqY9JlmGpf4I4FPnUFF6v4Sowrjqar7Rz35KywqLLwIYsy%2F9pA6SpUNGsmtKhCJIW3%2BuwUyljiuzjDjehnPfjQeLgUMwte60FpC3eVPCMYkczCdLB%2BM7IwIJqsXBKk6QzF99hp9Oge3hQlyx43CVmthA2TgAtme1cnKbVrHy5tsM8QDNFIJB1%2BcVT7rjBxFu8EAVR5y2cEza0%2FbEavD1ccrqPySVTgpfjEASeAF1QTdVcaRmB%2F5dLMUxAkceEUxlVaxHNjMLeW4MMGOqYByse2D31Es9Uff1lmcvouVTicar2hp9fzsZecqKh46D%2FF3KA2FVj8HCx6gYPpx93RUukCJiG2gOmIcMysXAaATJ%2FInCvqGYFpAFycCekMDvbLsC3O%2FpTLNKsbatGD%2FSrRs7wqIC4pFC5llKnNItpgV4kCKElrFocxBWZ07o%2B5ZWAXYr247Kvfy%2BIQq9plaID%2Fn2D14TeFMDTM6HIkAeONzC9liPZpOQ%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=b249eba9cdcf43f1ab2627cbbb53171996338006b2c69fcf2bba16c68154f618",
"full_name": "Rhim Shah",
"title": "CEO",
"linkedin": "https://linkedin.com/in/rhim-shah",
"x": ""
},
{
"avatar": "https://bookface-images.s3.us-west-2.amazonaws.com/avatars/58517dc8081e6d7ecd4040bf285d2d77cd205ed1.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAHGNCKMI3%2F20250716%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20250716T214732Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEE0aCXVzLXdlc3QtMiJGMEQCIDjs%2FQg4zO30Z%2BYa80v4ekkNTPAWdSYtLHkZmV8YlhG0AiB89MskitEQd1Z6AZJ0LYx%2BGRpTDbY%2B63hu02UyMfFukSrlAwhmEAAaDDAwNjIwMTgxMTA3MiIM8AFe%2F52HPppDdE%2BoKsIDquMtiUdKBV9U9j7zhu3XXuenlRfreSsmF6kZPvt%2F7qxiHkskqyqOMLPQ1zAHH5E7YmonqIVMDwHMSXNOQsy4iYU7q%2B%2FhxjhpK39iHA51ZzL08hg4eB%2Fo5viH%2FxtbNBqCbikbOV%2FTWIeKaQcGqaiKFrSyKla6MPwtKSDQkAIhYeo4raPIM7Uw8thv6nXZfv2B4o97kPRoh7L41lHMUnzzthJdlfpXnZ3RUcB75kGM%2Bz9CKcaKrYM33%2F2FmePZdgPrbWM7uia8zGoZJ6My9wUQe%2BW33ar4R78lY2FvRdHjHNlLjhTJQnNe0qBfUBXRYx3BBs6v20j9XZVvIMR4LiFqY9JlmGpf4I4FPnUFF6v4Sowrjqar7Rz35KywqLLwIYsy%2F9pA6SpUNGsmtKhCJIW3%2BuwUyljiuzjDjehnPfjQeLgUMwte60FpC3eVPCMYkczCdLB%2BM7IwIJqsXBKk6QzF99hp9Oge3hQlyx43CVmthA2TgAtme1cnKbVrHy5tsM8QDNFIJB1%2BcVT7rjBxFu8EAVR5y2cEza0%2FbEavD1ccrqPySVTgpfjEASeAF1QTdVcaRmB%2F5dLMUxAkceEUxlVaxHNjMLeW4MMGOqYByse2D31Es9Uff1lmcvouVTicar2hp9fzsZecqKh46D%2FF3KA2FVj8HCx6gYPpx93RUukCJiG2gOmIcMysXAaATJ%2FInCvqGYFpAFycCekMDvbLsC3O%2FpTLNKsbatGD%2FSrRs7wqIC4pFC5llKnNItpgV4kCKElrFocxBWZ07o%2B5ZWAXYr247Kvfy%2BIQq9plaID%2Fn2D14TeFMDTM6HIkAeONzC9liPZpOQ%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=e1fba3c71b330c4637371d1dd5138f5384a2dcda5a3aa0d8eabcbdc430840dcd",
"full_name": "Oli Wales",
"title": "Founder/CTO",
"linkedin": "https://linkedin.com/in/oliverfwales",
"x": ""
}
]
}
...
]

⚙️ How It Works

  1. Uses Apify's proxy infrastructure to handle Y Combinator traffic cleanly
  2. Parses YC search data for batch filtering
  3. Crawls each startup’s profile page to extract job postings, founders, and full metadata
  4. Outputs structured JSON data you can use in Airtable, Notion, CRMs, or spreadsheets

⚡ Performance Tips

  • For large-scale scraping, filter by region, industry, or batch to reduce load
  • Schedule the actor to run weekly or monthly to track new YC startups over time

🧠 Built With ❤️ by Damilo

I build fast, scalable Apify scrapers for serious use cases — B2B lead gen, research, job mining, content aggregation, and beyond.

If this actor saved you time, please consider leaving a ⭐⭐⭐⭐⭐ review to help others find it!


Happy scraping!