
✨ Y Combinator Scraper Apify
Pricing
$5.00 / 1,000 results

✨ Y Combinator Scraper Apify
⚡ Scrape Y Combinator’s startup directory with rich data: company info, founders, job postings, batch, team size, social links, and more. Filter by industry, region, or hiring status. Ideal for lead gen, VC scouting, and recruiting. Clean JSON output ready for Airtable, Notion, or Excel.
5.0 (1)
Pricing
$5.00 / 1,000 results
1
Total users
3
Monthly users
3
Runs succeeded
>99%
Last modified
2 days ago
Y Combinator Scraper
Easily extract structured data on thousands of startups from YCombinator.com — including company descriptions, websites, founders, job postings, batch info, team size, LinkedIn, Crunchbase links, and more.
🚀 Whether you're a VC, recruiter, founder, journalist, or researcher — this scraper saves hours of manual work.
✅ What This Scraper Does
This Apify actor lets you scrape Y Combinator startup profiles in bulk using their public company directory. With just a few clicks, you'll get:
- ✅ Company name, website, and YC page link
- ✅ Short and long descriptions
- ✅ Batch (e.g. Summer 2024), stage, status
- ✅ Team size, location, founding year
- ✅ Founders with social media profiles
- ✅ Active job postings (title, location, type, salary)
- ✅ Social links: LinkedIn, X (Twitter), Facebook, Crunchbase
- ✅ Industry tags and advanced filters (like isHiring, region, nonprofit)
📦 Example Use Cases
- Lead generation: Build targeted B2B lead lists based on industry, team size, and region
- Recruiting: Find hiring startups and their open roles
- Investor research: Track new YC companies, founder bios, and product pitches
- Market intelligence: Analyze trends across batches, industries, or geographies
- News & media: Source verified startup info for journalism or newsletters
🛠️ Input Options
The scraper is flexible. You can control the data you want via the following inputs:
directUrls
(array of URLs)
Paste search result URLs from YCombinator.com, with filters applied.
Example:
["https://www.ycombinator.com/companies?batch=Winter%202025&industry=B2B&isHiring=true"]
searchQuery
(optional, string)
Search for specific keywords in the company name, description, or tags.
sort
(optional, string)
Choose how results are sorted:
default
: Y Combinator’s default sortlaunch_date
: Newest companies first
JSON example:
{"directUrls": ["https://www.ycombinator.com/companies?batch=Winter%202025&batch=Summer%202024&industry=B2B&isHiring=true&query=arv&team_size=%5B%225%22%2C%22500%22%5D"],"searchQuery": "ar","sort": "default"}
📤 Output Data Format
Each scraped company will be saved as a JSON object in the Apify dataset. Here's a breakdown of the fields you’ll receive:
Field | Type | Description |
---|---|---|
id | string | Unique identifier of the company from Y Combinator |
logo | string | URL to the company's logo image |
name | string | Company name |
yc_url | string | Direct link to the company's Y Combinator profile page |
website | string | Company’s official website URL |
short_description | string | One-liner description from YC |
long_description | string | Detailed description of what the company does |
batch | string | Y Combinator batch (e.g. "Winter 2025") |
status | string | Startup status in the YC ecosystem (e.g. "active", "dead", "acquired") |
stage | string | Stage of the company, if available (e.g. "pre-launch", "launched") |
tags | array | Array of industry tags assigned by YC |
location | string | HQ or main location of the company |
year_founded | integer | Year the company was founded |
team_size | integer | Reported team size (min/max if available) |
linkedin | string | Link to company’s LinkedIn page |
x | string | Link to company’s X (Twitter) profile |
facebook | string | Link to company’s Facebook page |
crunchbase | string | Link to Crunchbase profile |
job_postings | array | List of current job openings posted on YC |
→ title | string | Job title |
→ url | string | Direct link to the job posting |
→ location | string | Job location |
→ type | string | Employment type (e.g. full-time, intern) |
→ salary | string | Salary range (if available) |
→ experience | string | Required experience (if listed) |
founders | array | List of founders and their public profiles |
→ full_name | string | Founder’s full name |
→ title | string | Title or role at the company |
→ avatar | string | Thumbnail/avatar image of the founder |
→ linkedin | string | Founder’s LinkedIn profile |
→ x | string | Founder’s Twitter (X) profile |
You can download the dataset in JSON, CSV, or XLSX format — or use the Apify API to fetch the data programmatically.
Example JSON Output:
[{"id": 29973,"logo": "https://bookface-images.s3.us-west-2.amazonaws.com/logos/4e428f0bdd367f8e923ff5d314b4a4394e11ccd7.png?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAHGNCKMI3%2F20250716%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20250716T214732Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEE0aCXVzLXdlc3QtMiJGMEQCIDjs%2FQg4zO30Z%2BYa80v4ekkNTPAWdSYtLHkZmV8YlhG0AiB89MskitEQd1Z6AZJ0LYx%2BGRpTDbY%2B63hu02UyMfFukSrlAwhmEAAaDDAwNjIwMTgxMTA3MiIM8AFe%2F52HPppDdE%2BoKsIDquMtiUdKBV9U9j7zhu3XXuenlRfreSsmF6kZPvt%2F7qxiHkskqyqOMLPQ1zAHH5E7YmonqIVMDwHMSXNOQsy4iYU7q%2B%2FhxjhpK39iHA51ZzL08hg4eB%2Fo5viH%2FxtbNBqCbikbOV%2FTWIeKaQcGqaiKFrSyKla6MPwtKSDQkAIhYeo4raPIM7Uw8thv6nXZfv2B4o97kPRoh7L41lHMUnzzthJdlfpXnZ3RUcB75kGM%2Bz9CKcaKrYM33%2F2FmePZdgPrbWM7uia8zGoZJ6My9wUQe%2BW33ar4R78lY2FvRdHjHNlLjhTJQnNe0qBfUBXRYx3BBs6v20j9XZVvIMR4LiFqY9JlmGpf4I4FPnUFF6v4Sowrjqar7Rz35KywqLLwIYsy%2F9pA6SpUNGsmtKhCJIW3%2BuwUyljiuzjDjehnPfjQeLgUMwte60FpC3eVPCMYkczCdLB%2BM7IwIJqsXBKk6QzF99hp9Oge3hQlyx43CVmthA2TgAtme1cnKbVrHy5tsM8QDNFIJB1%2BcVT7rjBxFu8EAVR5y2cEza0%2FbEavD1ccrqPySVTgpfjEASeAF1QTdVcaRmB%2F5dLMUxAkceEUxlVaxHNjMLeW4MMGOqYByse2D31Es9Uff1lmcvouVTicar2hp9fzsZecqKh46D%2FF3KA2FVj8HCx6gYPpx93RUukCJiG2gOmIcMysXAaATJ%2FInCvqGYFpAFycCekMDvbLsC3O%2FpTLNKsbatGD%2FSrRs7wqIC4pFC5llKnNItpgV4kCKElrFocxBWZ07o%2B5ZWAXYr247Kvfy%2BIQq9plaID%2Fn2D14TeFMDTM6HIkAeONzC9liPZpOQ%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=d007f9865e5e9c7f404243aeffa7c9bf0c8b9f5fd919fc8b6929ef4e4b3fe7e9","name": "Arva AI","yc_url": "https://www.ycombinator.com/companies/arva-ai","website": "https://www.arva.ai","short_description": "AI Agents to scale AML, KYB and KYC operations","long_description": "Banks and fintechs have large teams of human analysts who conduct manual checks and reviews. We replace those human compliance analysts with AI agents that automate 80% of the manual work for faster and more compliant financial crime (AML) reviews.\r\n\r\nOur first three products are: Screening AI (discount 91% of screening alerts), KYB/C AI (speed up onboarding), Transaction Monitoring AI (handle AML transaction monitoring alerts)\r\n\r\nOur end goal is to have a whole suite of AI workers that can handle manual compliance work for banks and fintechs. A $24B opportunity in the US alone.","batch": "Summer 2024","status": "Active","stage": "Early","tags": ["Artificial Intelligence","Fintech","B2B","Compliance","Regtech"],"location": null,"year_founded": 2024,"team_size": 8,"linkedin": "https://www.linkedin.com/company/arva-ai/","x": "https://x.com/arva_ai","facebook": null,"crunchbase": "https://www.crunchbase.com/organization/arva-ai","job_postings": [{"title": "AI Research Engineer","url": "http://ycombinator.com/companies/arva-ai/jobs/f6APYpN-ai-research-engineer","location": "London, England, GB","type": "Full-time","salary": "£80K - £120K GBP","experience": "3+ years"},{"title": "GTM Lead","url": "http://ycombinator.com/companies/arva-ai/jobs/dT2oqNZ-gtm-lead","location": "London, England, GB","type": "Full-time","salary": "£75K - £95K GBP","experience": "3+ years"}],"founders": [{"avatar": "https://bookface-images.s3.us-west-2.amazonaws.com/avatars/560201fdbc53ae709c42ebe943c4888d076e5e7a.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAHGNCKMI3%2F20250716%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20250716T214732Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEE0aCXVzLXdlc3QtMiJGMEQCIDjs%2FQg4zO30Z%2BYa80v4ekkNTPAWdSYtLHkZmV8YlhG0AiB89MskitEQd1Z6AZJ0LYx%2BGRpTDbY%2B63hu02UyMfFukSrlAwhmEAAaDDAwNjIwMTgxMTA3MiIM8AFe%2F52HPppDdE%2BoKsIDquMtiUdKBV9U9j7zhu3XXuenlRfreSsmF6kZPvt%2F7qxiHkskqyqOMLPQ1zAHH5E7YmonqIVMDwHMSXNOQsy4iYU7q%2B%2FhxjhpK39iHA51ZzL08hg4eB%2Fo5viH%2FxtbNBqCbikbOV%2FTWIeKaQcGqaiKFrSyKla6MPwtKSDQkAIhYeo4raPIM7Uw8thv6nXZfv2B4o97kPRoh7L41lHMUnzzthJdlfpXnZ3RUcB75kGM%2Bz9CKcaKrYM33%2F2FmePZdgPrbWM7uia8zGoZJ6My9wUQe%2BW33ar4R78lY2FvRdHjHNlLjhTJQnNe0qBfUBXRYx3BBs6v20j9XZVvIMR4LiFqY9JlmGpf4I4FPnUFF6v4Sowrjqar7Rz35KywqLLwIYsy%2F9pA6SpUNGsmtKhCJIW3%2BuwUyljiuzjDjehnPfjQeLgUMwte60FpC3eVPCMYkczCdLB%2BM7IwIJqsXBKk6QzF99hp9Oge3hQlyx43CVmthA2TgAtme1cnKbVrHy5tsM8QDNFIJB1%2BcVT7rjBxFu8EAVR5y2cEza0%2FbEavD1ccrqPySVTgpfjEASeAF1QTdVcaRmB%2F5dLMUxAkceEUxlVaxHNjMLeW4MMGOqYByse2D31Es9Uff1lmcvouVTicar2hp9fzsZecqKh46D%2FF3KA2FVj8HCx6gYPpx93RUukCJiG2gOmIcMysXAaATJ%2FInCvqGYFpAFycCekMDvbLsC3O%2FpTLNKsbatGD%2FSrRs7wqIC4pFC5llKnNItpgV4kCKElrFocxBWZ07o%2B5ZWAXYr247Kvfy%2BIQq9plaID%2Fn2D14TeFMDTM6HIkAeONzC9liPZpOQ%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=b249eba9cdcf43f1ab2627cbbb53171996338006b2c69fcf2bba16c68154f618","full_name": "Rhim Shah","title": "CEO","linkedin": "https://linkedin.com/in/rhim-shah","x": ""},{"avatar": "https://bookface-images.s3.us-west-2.amazonaws.com/avatars/58517dc8081e6d7ecd4040bf285d2d77cd205ed1.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAHGNCKMI3%2F20250716%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20250716T214732Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEE0aCXVzLXdlc3QtMiJGMEQCIDjs%2FQg4zO30Z%2BYa80v4ekkNTPAWdSYtLHkZmV8YlhG0AiB89MskitEQd1Z6AZJ0LYx%2BGRpTDbY%2B63hu02UyMfFukSrlAwhmEAAaDDAwNjIwMTgxMTA3MiIM8AFe%2F52HPppDdE%2BoKsIDquMtiUdKBV9U9j7zhu3XXuenlRfreSsmF6kZPvt%2F7qxiHkskqyqOMLPQ1zAHH5E7YmonqIVMDwHMSXNOQsy4iYU7q%2B%2FhxjhpK39iHA51ZzL08hg4eB%2Fo5viH%2FxtbNBqCbikbOV%2FTWIeKaQcGqaiKFrSyKla6MPwtKSDQkAIhYeo4raPIM7Uw8thv6nXZfv2B4o97kPRoh7L41lHMUnzzthJdlfpXnZ3RUcB75kGM%2Bz9CKcaKrYM33%2F2FmePZdgPrbWM7uia8zGoZJ6My9wUQe%2BW33ar4R78lY2FvRdHjHNlLjhTJQnNe0qBfUBXRYx3BBs6v20j9XZVvIMR4LiFqY9JlmGpf4I4FPnUFF6v4Sowrjqar7Rz35KywqLLwIYsy%2F9pA6SpUNGsmtKhCJIW3%2BuwUyljiuzjDjehnPfjQeLgUMwte60FpC3eVPCMYkczCdLB%2BM7IwIJqsXBKk6QzF99hp9Oge3hQlyx43CVmthA2TgAtme1cnKbVrHy5tsM8QDNFIJB1%2BcVT7rjBxFu8EAVR5y2cEza0%2FbEavD1ccrqPySVTgpfjEASeAF1QTdVcaRmB%2F5dLMUxAkceEUxlVaxHNjMLeW4MMGOqYByse2D31Es9Uff1lmcvouVTicar2hp9fzsZecqKh46D%2FF3KA2FVj8HCx6gYPpx93RUukCJiG2gOmIcMysXAaATJ%2FInCvqGYFpAFycCekMDvbLsC3O%2FpTLNKsbatGD%2FSrRs7wqIC4pFC5llKnNItpgV4kCKElrFocxBWZ07o%2B5ZWAXYr247Kvfy%2BIQq9plaID%2Fn2D14TeFMDTM6HIkAeONzC9liPZpOQ%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=e1fba3c71b330c4637371d1dd5138f5384a2dcda5a3aa0d8eabcbdc430840dcd","full_name": "Oli Wales","title": "Founder/CTO","linkedin": "https://linkedin.com/in/oliverfwales","x": ""}]}...]
⚙️ How It Works
- Uses Apify's proxy infrastructure to handle Y Combinator traffic cleanly
- Parses YC search data for batch filtering
- Crawls each startup’s profile page to extract job postings, founders, and full metadata
- Outputs structured JSON data you can use in Airtable, Notion, CRMs, or spreadsheets
⚡ Performance Tips
- For large-scale scraping, filter by region, industry, or batch to reduce load
- Schedule the actor to run weekly or monthly to track new YC startups over time
🧠 Built With ❤️ by Damilo
I build fast, scalable Apify scrapers for serious use cases — B2B lead gen, research, job mining, content aggregation, and beyond.
If this actor saved you time, please consider leaving a ⭐⭐⭐⭐⭐ review to help others find it!
Happy scraping!