a16z Portfolio Scraper avatar

a16z Portfolio Scraper

Pricing

from $750.00 / 1,000 portfolio companies

Go to Apify Store
a16z Portfolio Scraper

a16z Portfolio Scraper

Canonical scraper for the full Andreessen Horowitz (a16z) portfolio: 800+ companies with sector, stage, year, founders, status, website, description. Built for VC sourcing analysts, M&A analysts, biz dev, recruiters.

Pricing

from $750.00 / 1,000 portfolio companies

Rating

0.0

(0)

Developer

Stephan Corbeil

Stephan Corbeil

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

a16z Portfolio Scraper — Andreessen Horowitz Investments Directory

The canonical sourcing dataset for the a16z portfolio. Pulls all ~836 Andreessen Horowitz portfolio companies straight from a16z.com/portfolio with sector, stage, year of investment, founders, status, website, and description. Filter by focus area, stage, or vintage year.

Built for analysts who need clean, structured a16z portfolio data on demand — no scraping infrastructure, no maintenance, no broken selectors.

Who this is for

  • VC sourcing analysts — map a16z's bets by sector and vintage to find similar-stage companies your fund should be looking at
  • M&A analysts — surface a16z portfolio companies in a given sector with current status (active vs exited) for landscape decks
  • Biz dev / partnerships — target a16z-backed startups in your category for outbound; pre-qualified by a top-tier VC
  • Recruiters & founders — identify hiring companies inside a high-signal portfolio
  • Founders & operators — competitor and adjacency mapping for diligence and positioning

Output fields

Every record includes:

FieldDescription
nameCompany name
sector / sectorsPrimary sector + full list (Crypto, Enterprise, Bio + Health, Fintech, Consumer, American Dynamism, Games, Infra, CLF)
focus_areasa16z fund focus area tags
stage / stagesInvestment stage (Seed, Venture, Growth, M&A, IPO)
investment_yearYear of initial a16z check
investment_dateRaw date string of initial investment
foundersFounder names (when published by a16z)
websiteCompany URL
statusActive / Exits / mixed
is_active / is_exitedBoolean convenience flags
acquirerAcquiring company name if exited via M&A
ticker_symbolPublic ticker if IPO'd
descriptionCompany description
logo, linkedin, twitter, instagramBrand assets and social handles
permalinka16z's company profile URL
a16z_company_idStable a16z internal ID

Input

{
"sectorFilter": "Enterprise", // optional — Crypto, Enterprise, Bio + Health, Fintech, ...
"stageFilter": "Growth", // optional — Seed, Venture, Growth, M&A, IPO
"yearFromTo": { "from": 2020, "to": 2025 }, // optional — investment year range
"maxResults": 100 // 1–1000 (default 100; full portfolio is ~836)
}

All filters are optional. Combine them: e.g. sectorFilter: "Crypto" + yearFromTo: {from: 2023} returns every crypto company a16z has backed since 2023.

Pricing — Pay-Per-Event

EventPrice
Actor start$0.00005
Per portfolio company record$0.75

Cost examples

  • 50 companies in your sector → $37.50
  • Full a16z portfolio (~836) → ~$627
  • 25 companies (one Series B sector slice) → $18.75

Premium pricing reflects the sourcing-analyst use case: VC associates bill $200–500/hr; a single qualified company surfaced from this data covers the entire run cost many times over.

Companion actors

For broader sourcing coverage, pair with:

  • YC Companies Directory — every Y Combinator company by batch and industry
  • Startup Funding Tracker — venture funding rounds across firms

Together these three give you a complete top-tier-VC + accelerator picture of the early-stage landscape.

Notes on data freshness

  • Pulls live from a16z.com on every run — no cache, no stale data
  • a16z publishes the portfolio as static JSON embedded in the page, so this scraper is fast and resilient (no headless browser, no selector drift)
  • Partner names are not published on a16z's portfolio index — that field is returned as an empty string. If a16z adds it, this actor will populate it without a schema change.

Example use cases

  1. "Pull every a16z American Dynamism company from 2022 onward"sectorFilter: "American Dynamism", yearFromTo: {from: 2022}
  2. "All a16z-backed crypto companies still active"sectorFilter: "Crypto", filter is_active: true on the dataset
  3. "a16z's bio + health exits"sectorFilter: "Bio + Health", stageFilter: "M&A"
  4. "Full vintage 2024 cohort"yearFromTo: {from: 2024, to: 2024}, maxResults: 1000