# a16z Portfolio Scraper (`nexgendata/a16z-portfolio-scraper`) Actor

Canonical scraper for the full Andreessen Horowitz (a16z) portfolio: 800+ companies with sector, stage, year, founders, status, website, description. Built for VC sourcing analysts, M\&A analysts, biz dev, recruiters.

- **URL**: https://apify.com/nexgendata/a16z-portfolio-scraper.md
- **Developed by:** [Stephan Corbeil](https://apify.com/nexgendata) (community)
- **Categories:** Business
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $750.00 / 1,000 portfolio companies

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## a16z Portfolio Scraper — Andreessen Horowitz Investments Directory

**The canonical sourcing dataset for the a16z portfolio.** Pulls all ~836 Andreessen Horowitz portfolio companies straight from [a16z.com/portfolio](https://a16z.com/portfolio/) with sector, stage, year of investment, founders, status, website, and description. Filter by focus area, stage, or vintage year.

Built for analysts who need clean, structured a16z portfolio data on demand — no scraping infrastructure, no maintenance, no broken selectors.

### Who this is for

- **VC sourcing analysts** — map a16z's bets by sector and vintage to find similar-stage companies your fund should be looking at
- **M&A analysts** — surface a16z portfolio companies in a given sector with current status (active vs exited) for landscape decks
- **Biz dev / partnerships** — target a16z-backed startups in your category for outbound; pre-qualified by a top-tier VC
- **Recruiters & founders** — identify hiring companies inside a high-signal portfolio
- **Founders & operators** — competitor and adjacency mapping for diligence and positioning

### Output fields

Every record includes:

| Field | Description |
|---|---|
| `name` | Company name |
| `sector` / `sectors` | Primary sector + full list (Crypto, Enterprise, Bio + Health, Fintech, Consumer, American Dynamism, Games, Infra, CLF) |
| `focus_areas` | a16z fund focus area tags |
| `stage` / `stages` | Investment stage (Seed, Venture, Growth, M&A, IPO) |
| `investment_year` | Year of initial a16z check |
| `investment_date` | Raw date string of initial investment |
| `founders` | Founder names (when published by a16z) |
| `website` | Company URL |
| `status` | Active / Exits / mixed |
| `is_active` / `is_exited` | Boolean convenience flags |
| `acquirer` | Acquiring company name if exited via M&A |
| `ticker_symbol` | Public ticker if IPO'd |
| `description` | Company description |
| `logo`, `linkedin`, `twitter`, `instagram` | Brand assets and social handles |
| `permalink` | a16z's company profile URL |
| `a16z_company_id` | Stable a16z internal ID |

### Input

```jsonc
{
  "sectorFilter": "Enterprise",          // optional — Crypto, Enterprise, Bio + Health, Fintech, ...
  "stageFilter": "Growth",               // optional — Seed, Venture, Growth, M&A, IPO
  "yearFromTo": { "from": 2020, "to": 2025 }, // optional — investment year range
  "maxResults": 100                       // 1–1000 (default 100; full portfolio is ~836)
}
````

All filters are optional. Combine them: e.g. `sectorFilter: "Crypto"` + `yearFromTo: {from: 2023}` returns every crypto company a16z has backed since 2023.

### Pricing — Pay-Per-Event

| Event | Price |
|---|---|
| Actor start | $0.00005 |
| Per portfolio company record | **$0.75** |

**Cost examples**

- 50 companies in your sector → $37.50
- Full a16z portfolio (~836) → ~$627
- 25 companies (one Series B sector slice) → $18.75

Premium pricing reflects the sourcing-analyst use case: VC associates bill $200–500/hr; a single qualified company surfaced from this data covers the entire run cost many times over.

### Companion actors

For broader sourcing coverage, pair with:

- **YC Companies Directory** — every Y Combinator company by batch and industry
- **Startup Funding Tracker** — venture funding rounds across firms

Together these three give you a complete top-tier-VC + accelerator picture of the early-stage landscape.

### Notes on data freshness

- Pulls live from a16z.com on every run — no cache, no stale data
- a16z publishes the portfolio as static JSON embedded in the page, so this scraper is fast and resilient (no headless browser, no selector drift)
- Partner names are **not** published on a16z's portfolio index — that field is returned as an empty string. If a16z adds it, this actor will populate it without a schema change.

### Example use cases

1. **"Pull every a16z American Dynamism company from 2022 onward"**
   → `sectorFilter: "American Dynamism"`, `yearFromTo: {from: 2022}`
2. **"All a16z-backed crypto companies still active"**
   → `sectorFilter: "Crypto"`, filter `is_active: true` on the dataset
3. **"a16z's bio + health exits"**
   → `sectorFilter: "Bio + Health"`, `stageFilter: "M&A"`
4. **"Full vintage 2024 cohort"**
   → `yearFromTo: {from: 2024, to: 2024}`, `maxResults: 1000`

# Actor input Schema

## `sectorFilter` (type: `string`):

Optional. Filter companies by a16z focus area or vertical. Matches case-insensitively against the company's sectors (e.g. 'Bio + Health', 'Crypto', 'Enterprise', 'Fintech', 'Consumer', 'American Dynamism', 'Games', 'Infra'). Leave blank to include all sectors.

## `stageFilter` (type: `string`):

Optional. Filter by investment stage. Accepts 'Seed', 'Venture' (series A/B), 'Growth', 'M\&A' (acquired/exit), 'IPO'. Case-insensitive. Leave blank to include all stages.

## `yearFromTo` (type: `object`):

Optional. Restrict to companies whose initial a16z investment year falls in \[from, to] inclusive. Either bound may be omitted. Year is parsed from the a16z 'initial\_a16z\_date\_funded' field.

## `maxResults` (type: `integer`):

Maximum number of portfolio companies to emit (1-1000). Default 100. Each record charges $0.75. Set to 1000 to pull the entire a16z portfolio (~836 companies as of mid-2026).

## Actor input object example

```json
{
  "sectorFilter": "Enterprise",
  "stageFilter": "Growth",
  "yearFromTo": {
    "from": 2020,
    "to": 2025
  },
  "maxResults": 100
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "sectorFilter": "Enterprise",
    "stageFilter": "",
    "yearFromTo": {
        "from": 2020,
        "to": 2025
    },
    "maxResults": 100
};

// Run the Actor and wait for it to finish
const run = await client.actor("nexgendata/a16z-portfolio-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "sectorFilter": "Enterprise",
    "stageFilter": "",
    "yearFromTo": {
        "from": 2020,
        "to": 2025,
    },
    "maxResults": 100,
}

# Run the Actor and wait for it to finish
run = client.actor("nexgendata/a16z-portfolio-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "sectorFilter": "Enterprise",
  "stageFilter": "",
  "yearFromTo": {
    "from": 2020,
    "to": 2025
  },
  "maxResults": 100
}' |
apify call nexgendata/a16z-portfolio-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=nexgendata/a16z-portfolio-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "a16z Portfolio Scraper",
        "description": "Canonical scraper for the full Andreessen Horowitz (a16z) portfolio: 800+ companies with sector, stage, year, founders, status, website, description. Built for VC sourcing analysts, M&A analysts, biz dev, recruiters.",
        "version": "0.0",
        "x-build-id": "8COSESSTc9BtZPw4x"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/nexgendata~a16z-portfolio-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-nexgendata-a16z-portfolio-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/nexgendata~a16z-portfolio-scraper/runs": {
            "post": {
                "operationId": "runs-sync-nexgendata-a16z-portfolio-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/nexgendata~a16z-portfolio-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-nexgendata-a16z-portfolio-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "sectorFilter": {
                        "title": "Sector / Focus Area filter",
                        "type": "string",
                        "description": "Optional. Filter companies by a16z focus area or vertical. Matches case-insensitively against the company's sectors (e.g. 'Bio + Health', 'Crypto', 'Enterprise', 'Fintech', 'Consumer', 'American Dynamism', 'Games', 'Infra'). Leave blank to include all sectors."
                    },
                    "stageFilter": {
                        "title": "Stage filter",
                        "type": "string",
                        "description": "Optional. Filter by investment stage. Accepts 'Seed', 'Venture' (series A/B), 'Growth', 'M&A' (acquired/exit), 'IPO'. Case-insensitive. Leave blank to include all stages."
                    },
                    "yearFromTo": {
                        "title": "Investment year range",
                        "type": "object",
                        "description": "Optional. Restrict to companies whose initial a16z investment year falls in [from, to] inclusive. Either bound may be omitted. Year is parsed from the a16z 'initial_a16z_date_funded' field."
                    },
                    "maxResults": {
                        "title": "Maximum results",
                        "minimum": 1,
                        "maximum": 1000,
                        "type": "integer",
                        "description": "Maximum number of portfolio companies to emit (1-1000). Default 100. Each record charges $0.75. Set to 1000 to pull the entire a16z portfolio (~836 companies as of mid-2026).",
                        "default": 100
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
