# Greylock Portfolio Scraper — Reid Hoffman VC Companies (`nexgendata/greylock-portfolio-scraper`) Actor

Scrape the complete Greylock Partners portfolio (Reid Hoffman, David Sze, Saam Motamedi, Sarah Guo). Returns name, sector, stage, partner, founders, website, status, exit details per company.

- **URL**: https://apify.com/nexgendata/greylock-portfolio-scraper.md
- **Developed by:** [Stephan Corbeil](https://apify.com/nexgendata) (community)
- **Categories:** Business, Lead generation
- **Stats:** 2 total users, 1 monthly user, 100.0% runs succeeded
- **User rating**: No ratings yet

## Pricing

from $500.00 / 1,000 portfolio companies

This Actor is paid per event. You are not charged for Apify platform usage; you pay only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Greylock Portfolio Scraper

Scrape the complete **Greylock Partners** portfolio — Reid Hoffman, David Sze, Asheem Chandna, Saam Motamedi, Sarah Guo, Jerry Chen, Mike Duboe, Seth Rosenberg, and the rest of the Greylock partnership. Get structured data on every company they've publicly invested in: **Airbnb, LinkedIn, Workday, Discord, Coinbase, Roblox, Figma, Cloudera, AppDynamics, Palo Alto Networks, Abnormal AI, Adept, Cresta, dbt, Tecton, Inflection AI, Cato Networks**, and the rest of their ~157-company portfolio.

Greylock's portfolio is **deliberately small** — they fund fewer companies than a16z or Sequoia, so every line item is high-signal. The firm skews **enterprise SaaS, cybersecurity, AI, and infrastructure**, making this feed especially valuable for B2B sourcing and competitive intelligence.

### What you get per company

Every dataset row contains:

| Field | Description |
| --- | --- |
| `name` | Official company name |
| `slug` | Greylock URL slug |
| `greylock_url` | Canonical Greylock portfolio entry URL |
| `tagline` / `sector` | Short positioning tagline (Greylock's portfolio page does not expose an explicit sector taxonomy, so the tagline doubles as the sector signal) |
| `description` | Full company description as published by Greylock |
| `hq` | Headquarters (city, state, country) |
| `stage_invested` | Stage Greylock came in at — e.g. `Seed`, `Series A`, `Series B`, `Series C`, `Series D`, `Series E`, `Series F` |
| `stages` | List of all stage labels (typically a single stage) |
| `partner` | Lead Greylock partner on the deal — e.g. `Reid Hoffman`, `David Sze`, `Asheem Chandna`, `Saam Motamedi`, `Sarah Guo` |
| `partners` | List of all Greylock partners associated with the deal |
| `founders` | List of named founders (extracted from the leadership block with "Founder" or "Co-Founder" titles) |
| `leadership` | Full leadership team — list of `{name, title}` objects |
| `website` | Company homepage URL |
| `linkedin_url`, `twitter_url`, `facebook_url`, `instagram_url`, `crunchbase_url` | Social and database links when present |
| `status` | `active` (operating private company), `acquired` (M&A exit), or `public` (IPO) |
| `status_raw` | Greylock's own label — verbatim |
| `exit_class` | `active` or `exited` (acquired + public collapsed) |
| `exit_details` | Short phrase describing the exit when applicable |
| `year_of_investment` | Year Greylock added the company to its public portfolio page |
| `added_to_portfolio_date` | ISO date the company was added |
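
The `founders` row above is described as being derived from leadership entries titled "Founder" or "Co-Founder". A minimal sketch of that rule follows, assuming `leadership` is a list of `{name, title}` dictionaries as listed in the table. The function name `extract_founders`, the case-insensitive match, and the sample people are hypothetical and are not taken from the Actor's source.

```python
from typing import TypedDict


class Leader(TypedDict):
    name: str
    title: str


def extract_founders(leadership: list[Leader]) -> list[str]:
    """Return names whose title mentions 'founder' (this also covers 'Co-Founder')."""
    return [
        person["name"]
        for person in leadership
        if "founder" in person.get("title", "").lower()
    ]


# Hypothetical sample data, not real Greylock output.
sample_leadership: list[Leader] = [
    {"name": "Alice Example", "title": "Co-Founder & CEO"},
    {"name": "Bob Example", "title": "Founder, CTO"},
    {"name": "Carol Example", "title": "VP Engineering"},
]
print(extract_founders(sample_leadership))
```

Running the same function over a row's `leadership` list is a cheap way to sanity-check the `founders` field after a scrape.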

### Pricing (Pay-Per-Event)

| Event | Price |
| --- | --- |
| Per portfolio company | **$0.50** |
| Per Actor start | $0.00005 |

A full scrape of the entire Greylock portfolio (~157 companies) costs roughly **$78.50** — for a complete, structured snapshot of one of the most selective venture portfolios in the industry. Greylock's hit rate (Airbnb, LinkedIn, Workday, Palo Alto Networks, Coinbase, Figma, Discord, Roblox) is unusually high; every entry is worth indexing.
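
The cost arithmetic can be checked with a short calculation. This sketch assumes the two listed event prices apply uniformly with no tiering or discounts; `estimate_cost` is a hypothetical helper, not part of any Apify library.

```python
from decimal import Decimal

PRICE_PER_COMPANY = Decimal("0.50")   # "Per portfolio company" in the pricing table
PRICE_PER_START = Decimal("0.00005")  # per-start event in the pricing table


def estimate_cost(companies: int, starts: int = 1) -> Decimal:
    """Estimate the pay-per-event charge in USD for a number of rows and runs."""
    if companies < 0 or starts < 0:
        raise ValueError("companies and starts must be non-negative")
    return companies * PRICE_PER_COMPANY + starts * PRICE_PER_START


print(estimate_cost(157))  # 78.50005
```

The start fee is negligible next to the per-company price, so `maxResults` is the practical cost control.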

### Input

```json
{
  "sectorFilter": ["security"],
  "partnerFilter": ["Asheem Chandna"],
  "statusFilter": "active",
  "yearFromTo": "2020-2026",
  "maxResults": 100
}
````

- `sectorFilter` (array of strings, optional) — case-insensitive substring match against each company's tagline and description. Greylock has no explicit sector field, so this matches on positioning language. Examples: `["AI"]` for Adept / Inflection / Cresta, `["security"]` for Abnormal AI / Cato / Sublime, `["data"]` for dbt / Tecton.
- `partnerFilter` (array of strings, optional) — case-insensitive match against the lead Greylock investor. Known partners: Reid Hoffman, David Sze, Asheem Chandna, Saam Motamedi, Sarah Guo, Jerry Chen, Mike Duboe, Seth Rosenberg, Corinne Riley, Jason Risch, John Lilly, Aneel Bhusri, James Slavet, Josh McFarland, Christine Kim, David Thacker, David Wadhwani.
- `statusFilter` (`all` | `active` | `acquired` | `public` | `exited`) — `exited` = acquired + public combined. Default `all`.
- `yearFromTo` (string, optional) — year range applied to the added-to-portfolio date. Formats: `"2022-2024"` (range), `"2020-"` (from year onward), `"-2023"` (up to year). Empty = no year filter.
- `maxResults` (integer, default 100, max 1000) — cap the dataset size.

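Empty input applies no filters. Because `maxResults` defaults to 100 and the portfolio holds ~157 companies, set `maxResults` to 157 or higher to scrape the entire portfolio.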
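
Validating input before starting a paid run can catch typos early. The sketch below builds the input object from the parameters listed above and checks the two constraints the list states explicitly (the `statusFilter` values and the `maxResults` range). `build_input` is a hypothetical helper, and the Actor may perform its own validation differently.

```python
from typing import Any, Optional

VALID_STATUSES = {"all", "active", "acquired", "public", "exited"}


def build_input(
    sector_filter: Optional[list[str]] = None,
    partner_filter: Optional[list[str]] = None,
    status_filter: str = "all",
    year_from_to: str = "",
    max_results: int = 100,
) -> dict[str, Any]:
    """Return an input dict using the Actor's documented parameter names."""
    if status_filter not in VALID_STATUSES:
        raise ValueError(f"statusFilter must be one of {sorted(VALID_STATUSES)}")
    if not 1 <= max_results <= 1000:
        raise ValueError("maxResults must be between 1 and 1000")
    return {
        "sectorFilter": list(sector_filter or []),
        "partnerFilter": list(partner_filter or []),
        "statusFilter": status_filter,
        "yearFromTo": year_from_to,
        "maxResults": max_results,
    }


print(build_input(partner_filter=["Asheem Chandna"], status_filter="active"))
```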

### Use cases

- **Enterprise SaaS sourcing** — Greylock skews heavily enterprise. Filter to recent active investments to surface mature B2B targets backed by one of the most pattern-matching investors in the category.
- **Cybersecurity competitive intelligence** — Asheem Chandna's portfolio alone (Palo Alto Networks, Cato Networks, Abnormal AI, Sumo Logic) is a top-tier security map. Filter on his name to pull it.
- **AI investment thesis tracking** — Saam Motamedi and Sarah Guo's AI bets reveal where Greylock's AI thesis is converging.
- **Founder profile enrichment** — Pull founders + leadership team from every Greylock company in one shot for recruiting, outreach, or due diligence.
- **M&A target discovery** — Set `statusFilter` to `active` for mature private targets, or to `acquired` to study Greylock's exit playbook.
- **Partner attribution analysis** — Group by partner to study individual GP track records inside Greylock.
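
As an illustration of the partner attribution use case, the following sketch groups dataset rows by the `partner` field. The rows shown are hypothetical placeholders, and the `"Unknown"` bucket for rows without a lead partner is an assumption about how missing values might be handled.

```python
from collections import defaultdict


def group_by_partner(rows: list[dict]) -> dict[str, list[str]]:
    """Map each lead partner to the names of their portfolio companies."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for row in rows:
        # Rows without a lead partner are bucketed under a placeholder key.
        grouped[row.get("partner") or "Unknown"].append(row["name"])
    return dict(grouped)


# Hypothetical rows for illustration; real rows come from the Actor's dataset.
sample_rows = [
    {"name": "ExampleCo A", "partner": "Partner One", "status": "active"},
    {"name": "ExampleCo B", "partner": "Partner Two", "status": "acquired"},
    {"name": "ExampleCo C", "partner": "Partner One", "status": "public"},
    {"name": "ExampleCo D", "partner": None, "status": "active"},
]
print(group_by_partner(sample_rows))
```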

### Related Actors

- **a16z Portfolio Scraper** — Andreessen Horowitz's full portfolio (~836 companies).
- **Sequoia Capital Portfolio Scraper** — Sequoia's US + global portfolios (~406 companies).
- **Founders Fund Portfolio Scraper** — Peter Thiel's VC firm portfolio (~62 companies).

Pair Greylock with a16z + Sequoia + Founders Fund for the cleanest possible "elite VC portfolio" feed across the four most brand-recognizable firms in Silicon Valley.

### How it works

The Greylock portfolio page at `greylock.com/portfolio/` is a WordPress site (WP Engine) that inlines its full portfolio dataset as a single JavaScript variable assignment: `var data_portfolio_<5char> = [ ... ];`. This Actor fetches the HTML once, extracts the JSON, and emits one dataset row per company. In normal operation the page requires no anti-bot handling, no JavaScript rendering, and no proxy; the Apify residential proxy is wired in only as a safety net.

If Greylock ever pulls the inline JSON or blocks all scrapers, this Actor will be deprecated rather than silently emit placeholder data.
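
The extraction step could look roughly like the sketch below. It assumes the inlined array is strict JSON (not a looser JavaScript literal) and uses a made-up HTML snippet; the function name, the regular expression, and the sample data are illustrative and are not the Actor's actual code.

```python
import json
import re

# Matches the assignment up to, but not including, the opening bracket.
_ASSIGNMENT = re.compile(r"var\s+data_portfolio_\w+\s*=\s*(?=\[)")


def extract_inline_portfolio(html: str) -> list:
    """Locate the inline variable and decode the JSON array that follows it."""
    match = _ASSIGNMENT.search(html)
    if match is None:
        raise ValueError("inline portfolio variable not found")
    # raw_decode parses one JSON value starting at the given index and
    # ignores whatever comes after it (the trailing ';' and markup).
    data, _end = json.JSONDecoder().raw_decode(html, match.end())
    return data


# Hypothetical page fragment for illustration only.
sample_html = """
<script>
var data_portfolio_ab12c = [{"name": "ExampleCo", "note": "has ]; inside"}];
</script>
"""
print(extract_inline_portfolio(sample_html))
```

Letting the JSON decoder find the end of the array avoids the failure mode of a regex that stops at the first `];`, which can also appear inside a string value.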

### Output format

Returns a clean JSON array via the standard Apify dataset. Use it directly in Make.com, Zapier, n8n, or any tool that consumes JSON.

# Actor input Schema

## `sectorFilter` (type: `array`):

Case-insensitive substring match against each company's subtitle / short description (Greylock's portfolio page does not expose an explicit sector taxonomy, so the tagline is used as the sector proxy). Examples: \['security'] catches Abnormal AI / Cato Networks / etc., \['AI'] catches Adept / Inflection / Cresta, \['data'] catches dbt / Tecton / etc. Empty = ALL sectors.
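
A case-insensitive substring filter of this kind might be sketched as follows. The field names `tagline` and `description` come from the output table; the function name and the assumption that multiple terms are OR-ed together are hypothetical.

```python
def matches_sector(row: dict, sector_filter: list[str]) -> bool:
    """Return True if any filter term appears in the row's tagline or description."""
    if not sector_filter:
        return True  # an empty filter means all sectors
    haystack = " ".join(
        str(row.get(key) or "") for key in ("tagline", "description")
    ).lower()
    return any(term.lower() in haystack for term in sector_filter)


# Hypothetical row for illustration.
row = {"tagline": "Cloud email security", "description": None}
print(matches_sector(row, ["SECURITY"]))  # True
print(matches_sector(row, ["AI"]))        # True, because "email" contains "ai"
```

Plain substring matching means short terms such as `AI` can also hit unrelated words, so results for short terms are worth reviewing before use.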

## `partnerFilter` (type: `array`):

Case-insensitive match against the lead Greylock investor on each deal. Known partners (popular leads): 'Reid Hoffman', 'David Sze', 'Asheem Chandna', 'Saam Motamedi', 'Sarah Guo', 'Jerry Chen', 'Mike Duboe', 'Seth Rosenberg', 'Corinne Riley', 'Jason Risch', 'John Lilly', 'Aneel Bhusri', 'James Slavet', 'Josh McFarland', 'Christine Kim', 'David Thacker', 'David Wadhwani'. Multiple values OR together. Empty = ALL partners.

## `statusFilter` (type: `string`):

Restrict by current company status. 'active' = operating private companies; 'acquired' = M\&A exits; 'public' = IPO'd companies (e.g. Airbnb, LinkedIn, Workday); 'exited' = acquired + public combined; 'all' = everything. Default 'all'.
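
The status semantics can be expressed as two small functions. This is a sketch of the rules as documented (`exited` means `acquired` or `public`), with hypothetical function names; it is not the Actor's implementation.

```python
EXITED_STATUSES = {"acquired", "public"}
KNOWN_STATUSES = {"active"} | EXITED_STATUSES


def exit_class(status: str) -> str:
    """Collapse a row's status into 'active' or 'exited'."""
    if status not in KNOWN_STATUSES:
        raise ValueError(f"unknown status: {status!r}")
    return "exited" if status in EXITED_STATUSES else "active"


def passes_status_filter(status: str, status_filter: str = "all") -> bool:
    """Apply the documented statusFilter values to a single row status."""
    if status_filter == "all":
        return True
    if status_filter == "exited":
        return status in EXITED_STATUSES
    return status == status_filter


print(exit_class("public"), passes_status_filter("acquired", "exited"))
```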

## `yearFromTo` (type: `string`):

Optional year range applied to the date Greylock added the company to its portfolio. Format: 'YYYY-YYYY' (inclusive on both ends), 'YYYY-' (from year onward), or '-YYYY' (up to year). Examples: '2022-2024' for deals from 2022 through 2024, '2020-' for everything from 2020 onward. Empty = no year filter.
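
The three accepted formats can be parsed with a single pattern. The sketch below is one possible reading of the format description, with hypothetical function names; how the Actor treats malformed values (rejected here with `ValueError`) is an assumption.

```python
import re
from typing import Optional

_YEAR_RANGE = re.compile(r"^\s*(\d{4})?\s*-\s*(\d{4})?\s*$")

Bounds = tuple[Optional[int], Optional[int]]


def parse_year_range(value: str) -> Bounds:
    """Parse 'YYYY-YYYY', 'YYYY-' or '-YYYY' into (start, end); None means open."""
    if not value or not value.strip():
        return (None, None)  # empty string means no year filter
    match = _YEAR_RANGE.match(value)
    if match is None or (match.group(1) is None and match.group(2) is None):
        raise ValueError(f"invalid year range: {value!r}")
    start = int(match.group(1)) if match.group(1) else None
    end = int(match.group(2)) if match.group(2) else None
    if start is not None and end is not None and start > end:
        raise ValueError(f"start year after end year: {value!r}")
    return (start, end)


def year_in_range(year: int, bounds: Bounds) -> bool:
    """Check a year against parsed bounds, inclusive on both ends."""
    start, end = bounds
    return (start is None or year >= start) and (end is None or year <= end)


print(parse_year_range("2022-2024"), parse_year_range("2020-"), parse_year_range("-2023"))
```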

## `maxResults` (type: `integer`):

Maximum number of companies to return (1–1000). Greylock currently publishes ~157 portfolio companies, so any value ≥157 returns everything that passes the other filters. Each company is one dataset row priced at $0.50 (premium small-portfolio VC tier).

## Actor input object example

```json
{
  "sectorFilter": [],
  "partnerFilter": [],
  "statusFilter": "all",
  "maxResults": 10
}
```

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "sectorFilter": [],
    "partnerFilter": [],
    "statusFilter": "all",
    "yearFromTo": "",
    "maxResults": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("nexgendata/greylock-portfolio-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "sectorFilter": [],
    "partnerFilter": [],
    "statusFilter": "all",
    "yearFromTo": "",
    "maxResults": 10,
}

# Run the Actor and wait for it to finish
run = client.actor("nexgendata/greylock-portfolio-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "sectorFilter": [],
  "partnerFilter": [],
  "statusFilter": "all",
  "yearFromTo": "",
  "maxResults": 10
}' |
apify call nexgendata/greylock-portfolio-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=nexgendata/greylock-portfolio-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Greylock Portfolio Scraper — Reid Hoffman VC Companies",
        "description": "Scrape the complete Greylock Partners portfolio (Reid Hoffman, David Sze, Saam Motamedi, Sarah Guo). Returns name, sector, stage, partner, founders, website, status, exit details per company.",
        "version": "0.0",
        "x-build-id": "tabGrs6OgZ9xesVzw"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/nexgendata~greylock-portfolio-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-nexgendata-greylock-portfolio-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/nexgendata~greylock-portfolio-scraper/runs": {
            "post": {
                "operationId": "runs-sync-nexgendata-greylock-portfolio-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/nexgendata~greylock-portfolio-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-nexgendata-greylock-portfolio-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "sectorFilter": {
                        "title": "Sector / tagline filter (optional)",
                        "type": "array",
                        "description": "Case-insensitive substring match against each company's subtitle / short description (Greylock's portfolio page does not expose an explicit sector taxonomy, so the tagline is used as the sector proxy). Examples: ['security'] catches Abnormal AI / Cato Networks / etc., ['AI'] catches Adept / Inflection / Cresta, ['data'] catches dbt / Tecton / etc. Empty = ALL sectors.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "partnerFilter": {
                        "title": "Greylock partner filter (optional)",
                        "type": "array",
                        "description": "Case-insensitive match against the lead Greylock investor on each deal. Known partners (popular leads): 'Reid Hoffman', 'David Sze', 'Asheem Chandna', 'Saam Motamedi', 'Sarah Guo', 'Jerry Chen', 'Mike Duboe', 'Seth Rosenberg', 'Corinne Riley', 'Jason Risch', 'John Lilly', 'Aneel Bhusri', 'James Slavet', 'Josh McFarland', 'Christine Kim', 'David Thacker', 'David Wadhwani'. Multiple values OR together. Empty = ALL partners.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "statusFilter": {
                        "title": "Company status filter",
                        "enum": [
                            "all",
                            "active",
                            "exited",
                            "acquired",
                            "public"
                        ],
                        "type": "string",
                        "description": "Restrict by current company status. 'active' = operating private companies; 'acquired' = M&A exits; 'public' = IPO'd companies (e.g. Airbnb, LinkedIn, Workday); 'exited' = acquired + public combined; 'all' = everything. Default 'all'.",
                        "default": "all"
                    },
                    "yearFromTo": {
                        "title": "Year range filter (optional)",
                        "type": "string",
                        "description": "Optional year range applied to the date Greylock added the company to its portfolio. Format: 'YYYY-YYYY' (inclusive on both ends), 'YYYY-' (from year onward), or '-YYYY' (up to year). Examples: '2022-2024' for recent deals only, '2020-' for the last five years. Empty = no year filter."
                    },
                    "maxResults": {
                        "title": "Max companies",
                        "minimum": 1,
                        "maximum": 1000,
                        "type": "integer",
                        "description": "Maximum number of companies to return (1–1000). Greylock currently publishes ~157 portfolio companies, so any value ≥157 returns everything that passes the other filters. Each company is one dataset row priced at $0.50 (premium small-portfolio VC tier).",
                        "default": 100
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
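
For projects that call the REST API directly, the `run-sync-get-dataset-items` path from the specification above can be exercised with the Python standard library alone. This is a minimal sketch: `build_request` and `run_sync` are hypothetical helper names, the 300-second timeout is an arbitrary choice, and the assumption that the response body is JSON is not stated in the specification, which describes the 200 response only as "OK".

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/nexgendata~greylock-portfolio-scraper/run-sync-get-dataset-items"


def build_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build the POST request: token as a query parameter, input as a JSON body."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        f"{BASE_URL}{ACTOR_PATH}?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_sync(token: str, actor_input: dict, timeout: float = 300.0) -> list:
    """Send the request and decode the response. Calling this starts a paid run."""
    request = build_request(token, actor_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request performs no network traffic and incurs no charge.
request = build_request("<YOUR_API_TOKEN>", {"statusFilter": "all", "maxResults": 10})
print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes it possible to inspect or log the exact URL and payload before committing to a billed run.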
