Y Combinator Scraper

Pricing

from $3.00 / 1,000 results

The Y Combinator Scraper is an Apify actor for scraping the YC companies directory. It extracts detailed company profiles, founder information, and batch data, and suits market research, lead generation, and investment analysis. To reduce the chance of being blocked, use residential proxies.

  • Rating: 0.0 (0)
  • Developer: Shahid Irfan (Maintained by Community)
  • Bookmarked: 2
  • Total users: 8
  • Monthly active users: 5
  • Last modified: 18 days ago

Y Combinator Companies Scraper

Overview

Discover and extract comprehensive data from the Y Combinator companies directory with this powerful scraper. Whether you're conducting market research, analyzing startup trends, or scouting talent, this tool provides structured access to YC-funded companies, their founders, and open job opportunities. Perfect for investors, recruiters, and analysts seeking insights into the startup ecosystem.

Keywords: Y Combinator scraper, YC companies scraper, startup directory scraper, scrape Y Combinator companies, YC batch scraper, founder data extraction, job listings scraper.

Features

  • Complete Company Profiles: Extract detailed information about Y Combinator-backed startups, including company names, descriptions, locations, funding batches, and status.
  • Founder Insights: Optionally scrape founder details with names and social media links for deeper analysis.
  • Job Market Intelligence: Access open positions at YC companies, including job titles, descriptions, locations, and salary ranges.
  • Batch-Specific Scraping: Target specific Y Combinator batches (e.g., Summer 2025, Winter 2024) or scrape the entire directory.
  • Flexible Data Collection: Customize what data to collect based on your research needs.
  • Reliable Extraction: Built-in mechanisms to handle dynamic content and ensure comprehensive data retrieval.
  • Structured JSON Output: Clean, consistent data format ready for analysis or integration.

How It Works

This scraper navigates the Y Combinator companies directory, systematically collecting data from company profiles. It handles pagination automatically and can visit individual company pages for additional details like founders and jobs. The process is optimized for efficiency while respecting website guidelines.
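
The actor's internals are not published here, so the following is only a minimal sketch of how a paginated crawl bounded by results_wanted and max_pages could behave. The fetch_page callable and the fake_fetch_page stand-in are hypothetical names introduced for illustration; they are not part of the actor.

```python
from typing import Callable, Dict, List


def collect_companies(
    fetch_page: Callable[[int], List[Dict]],
    results_wanted: int = 100,
    max_pages: int = 20,
) -> List[Dict]:
    """Walk directory pages until a limit is hit or the listing runs out."""
    collected: List[Dict] = []
    for page in range(1, max_pages + 1):
        companies = fetch_page(page)
        if not companies:
            # An empty page is treated as the end of the directory.
            break
        collected.extend(companies)
        if len(collected) >= results_wanted:
            break
    # Trim any overshoot from the last page.
    return collected[:results_wanted]


def fake_fetch_page(page: int) -> List[Dict]:
    """Hypothetical stand-in: three pages of five companies, then nothing."""
    if page > 3:
        return []
    return [{"company_id": f"p{page}-c{i}"} for i in range(5)]


if __name__ == "__main__":
    print(len(collect_companies(fake_fetch_page, results_wanted=7)))
```

Under these assumptions, whichever limit is reached first ends the crawl, which is why max_pages is described below as a safety limit.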

Input Parameters

Configure the scraper using the following parameters:

| Parameter | Type | Description | Default |
|---|---|---|---|
| url | string | Specific Y Combinator directory URL to start scraping (e.g., https://www.ycombinator.com/companies?batch=Summer%202025). Overrides the default start page if provided. | - |
| scrape_all_companies | boolean | If true, scrape companies from the entire directory. If false, use the provided URL or the default page. | false |
| scrape_founders | boolean | Extract founder information from individual company pages. | true |
| scrape_open_jobs | boolean | Extract open job listings from company pages. | true |
| results_wanted | integer | Maximum number of companies to collect. Use a high number to collect all available companies. | 100 |
| max_pages | integer | Maximum number of directory pages to visit, as a safety limit. | 20 |
| proxyConfiguration | object | Proxy settings for reliable scraping. Use Apify Proxy for best results. | {"useApifyProxy": true, "apifyProxyGroups": ["RESIDENTIAL"]} |
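
As a rough illustration of how these parameters combine, the sketch below merges caller overrides onto the documented defaults and rejects unknown keys. The helper name build_input and its validation rules are assumptions made for this example; the actor itself may validate input differently.

```python
import copy
from typing import Any, Dict

# Defaults copied from the parameter table; "url" has no default.
DEFAULT_INPUT: Dict[str, Any] = {
    "scrape_all_companies": False,
    "scrape_founders": True,
    "scrape_open_jobs": True,
    "results_wanted": 100,
    "max_pages": 20,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}
ALLOWED_KEYS = set(DEFAULT_INPUT) | {"url"}


def build_input(**overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: documented defaults plus caller overrides."""
    unknown = set(overrides) - ALLOWED_KEYS
    if unknown:
        raise ValueError(f"Unknown input parameter(s): {sorted(unknown)}")
    merged = copy.deepcopy(DEFAULT_INPUT)
    merged.update(overrides)
    for key in ("results_wanted", "max_pages"):
        value = merged[key]
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            raise ValueError(f"{key} must be a positive integer")
    return merged
```

For example, build_input(results_wanted=500, scrape_open_jobs=False) returns a dictionary that can be serialized with json.dumps and submitted as the actor input.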

Output Data

The scraper outputs structured JSON records to an Apify dataset. Each record represents a company with the following fields:

Core Company Fields

  • company_image: Company logo URL
  • company_id: Unique company identifier
  • company_name: Official company name
  • url: Y Combinator profile URL
  • short_description: Brief company overview
  • long_description: Detailed company description
  • batch: YC batch (e.g., "S25", "W24")
  • status: Company status (Active, Acquired, etc.)
  • tags: Industry/technology tags array
  • company_location: Geographic location
  • year_founded: Founding year
  • team_size: Number of team members
  • primary_partner: Associated YC partner
  • website: Official website URL
  • company_linkedin: LinkedIn profile
  • company_x: X (Twitter) profile

Founders Data (when enabled)

  • founders: Array of founder objects with id, name, linkedin, x

Jobs Data (when enabled)

  • open_jobs: Array of job objects with id, title, description_url, description, location, salary, years_experience

Sample Output Record

{
  "company_image": "https://example.com/logo.png",
  "company_id": "company-123",
  "company_name": "Example Startup",
  "url": "https://www.ycombinator.com/companies/example-startup",
  "short_description": "Building the future of AI",
  "long_description": "Example Startup is revolutionizing artificial intelligence with cutting-edge machine learning solutions.",
  "batch": "S25",
  "status": "Active",
  "tags": ["AI", "Machine Learning", "SaaS"],
  "company_location": "San Francisco, CA",
  "year_founded": "2023",
  "team_size": "15",
  "primary_partner": "John Doe",
  "website": "https://examplestartup.com",
  "company_linkedin": "https://linkedin.com/company/example-startup",
  "company_x": "https://x.com/example_startup",
  "founders": [
    {
      "id": "founder-1",
      "name": "Jane Smith",
      "linkedin": "https://linkedin.com/in/jane-smith",
      "x": "https://x.com/janesmith"
    }
  ],
  "open_jobs": [
    {
      "id": "job-456",
      "title": "Senior Software Engineer",
      "description_url": "https://www.ycombinator.com/companies/example-startup/jobs/senior-software-engineer",
      "description": "We are looking for a talented software engineer to join our growing team...",
      "location": "Remote",
      "salary": "$150k - $200k",
      "years_experience": "5+"
    }
  ]
}
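
Because the sample stores year_founded and team_size as strings and nests founders inside each company, downstream analysis usually needs a small normalization step. The sketch below shows one possible way to flatten a record into one row per founder; the helper names to_int and founder_rows are hypothetical, and the record is a trimmed copy of the sample above.

```python
from typing import Any, Dict, List, Optional


def to_int(value: Any) -> Optional[int]:
    """Convert numeric strings such as "15" to int; return None otherwise."""
    try:
        return int(str(value).strip())
    except (TypeError, ValueError):
        return None


def founder_rows(record: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Flatten one company record into one row per founder."""
    rows = []
    # "founders" is only present when founder scraping is enabled.
    for founder in record.get("founders") or []:
        rows.append({
            "company_name": record.get("company_name"),
            "batch": record.get("batch"),
            "team_size": to_int(record.get("team_size")),
            "founder_name": founder.get("name"),
            "founder_linkedin": founder.get("linkedin"),
        })
    return rows


record = {
    "company_name": "Example Startup",
    "batch": "S25",
    "team_size": "15",
    "founders": [
        {
            "id": "founder-1",
            "name": "Jane Smith",
            "linkedin": "https://linkedin.com/in/jane-smith",
            "x": "https://x.com/janesmith",
        }
    ],
}

if __name__ == "__main__":
    for row in founder_rows(record):
        print(row)
```

Records scraped with founder extraction disabled simply produce no rows, since the missing founders key is treated as an empty list.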

Usage

Running on Apify Platform

  1. Navigate to the Y Combinator Companies Scraper on Apify.
  2. Configure input parameters in the form.
  3. Click "Run" to start the actor.
  4. Monitor progress and view results in the dataset.

API Usage

Use the Apify API to run the scraper programmatically:

curl -X POST "https://api.apify.com/v2/acts/shahidirfan100~ycombinator-companies-scraper/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "scrape_all_companies": true,
    "scrape_founders": true,
    "results_wanted": 500
  }'
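
The same request can be prepared from Python with only the standard library. This is a minimal sketch rather than an official client: the function name build_run_request is hypothetical, and the token is sent in an Authorization: Bearer header instead of the query string, which the Apify API also accepts. The request is only built here; sending it is left commented out.

```python
import json
import urllib.request
from typing import Any, Dict

# Actor ID taken from the curl example above.
ACTOR_ID = "shahidirfan100~ycombinator-companies-scraper"


def build_run_request(token: str, actor_input: Dict[str, Any]) -> urllib.request.Request:
    """Prepare (but do not send) a POST request that starts an actor run."""
    url = f"https://api.apify.com/v2/acts/{ACTOR_ID}/runs"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


request = build_run_request(
    "YOUR_API_TOKEN",
    {"scrape_all_companies": True, "scrape_founders": True, "results_wanted": 500},
)

# To start a run for real, replace the token and uncomment:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```

Keeping the token in a header avoids leaving it in shell history or server logs that record full URLs.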

Configuration Examples
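
Two illustrative inputs are sketched below using the parameters documented above: one targets a single batch through the url parameter, the other walks the whole directory. The batch_url helper is hypothetical, and the numeric limits are example values, not recommendations from the actor's author.

```python
import json
from urllib.parse import quote


def batch_url(batch: str) -> str:
    """Build a directory URL filtered to one batch, e.g. "Summer 2025"."""
    return "https://www.ycombinator.com/companies?batch=" + quote(batch)


# Example 1: one batch, with founders but without job listings.
single_batch = {
    "url": batch_url("Summer 2025"),
    "scrape_founders": True,
    "scrape_open_jobs": False,
    "results_wanted": 200,
}

# Example 2: entire directory, company profiles only.
# The limits are illustrative and may need tuning.
full_directory = {
    "scrape_all_companies": True,
    "scrape_founders": False,
    "scrape_open_jobs": False,
    "results_wanted": 10000,
    "max_pages": 500,
}

if __name__ == "__main__":
    print(json.dumps(single_batch, indent=2))
    print(json.dumps(full_directory, indent=2))
```

Disabling founder and job extraction avoids the additional page visits, so the second configuration should run faster per company than the first.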

Cost and Performance

  • Compute Units: Approximately 0.01-0.05 CU per company depending on data depth.
  • Estimated Cost: $0.001-$0.005 per company (based on Apify pricing).
  • Performance: Processes 10-50 companies per minute, depending on configuration and proxy settings.
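
To turn these ranges into a rough budget, the arithmetic can be sketched as follows. The per-company cost and throughput bounds are taken from the figures above, and the listed price is the "from $3.00 / 1,000 results" rate shown on the store page; the function name estimate_run is hypothetical, and actual bills depend on the pricing in effect for your account.

```python
from typing import Dict, Tuple

# Bounds from the list above, expressed per 1,000 companies.
COST_USD_PER_1000: Tuple[float, float] = (1.0, 5.0)  # $0.001-$0.005 per company
COMPANIES_PER_MINUTE: Tuple[int, int] = (10, 50)
LISTED_USD_PER_1000 = 3.0  # "from $3.00 / 1,000 results"


def estimate_run(companies: int) -> Dict[str, object]:
    """Rough lower and upper bounds for cost and duration of one run."""
    low_cost, high_cost = (companies * rate / 1000 for rate in COST_USD_PER_1000)
    slow, fast = COMPANIES_PER_MINUTE
    return {
        "cost_usd": (low_cost, high_cost),
        "listed_price_usd": companies * LISTED_USD_PER_1000 / 1000,
        # Fastest throughput gives the shortest duration.
        "minutes": (companies / fast, companies / slow),
    }


if __name__ == "__main__":
    print(estimate_run(500))
```

For 500 companies this gives an estimated cost between $0.50 and $2.50, a listed price of $1.50, and a run time between 10 and 50 minutes.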

Limitations and Best Practices

  • Respects Y Combinator's terms of service with appropriate request delays.
  • Some data may be incomplete if not publicly available.
  • Founder and job data requires additional page visits, increasing processing time.
  • Results reflect the state of the directory at the time of the run and may not include the most recently added companies.
  • For large-scale scraping, use residential proxies and monitor rate limits.

Support and Resources

  • Documentation: Refer to Apify Docs for platform guidance.
  • Issues: Report bugs or request features via the actor page.
  • Updates: Check changelog for new features and improvements.

Changelog

  • v1.0.0: Initial release with comprehensive Y Combinator directory scraping capabilities, including company profiles, founders, and job listings.

Discover more YC insights with this scraper. Start extracting valuable startup data today!