Pricing

Pay per event

500 Global Portfolio Scraper

Extract public 500 Global portfolio companies with websites, industries, locations, stages, batches, and investment metadata.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Stas Persiianenko

Actor stats

Bookmarked

Total users

Monthly active users

12 days ago

Last modified

What does 500 Global Portfolio Scraper do?

500 Global Portfolio Scraper collects public startup records from 500 Global's portfolio. It reads the same public data source used by the 500 Global companies page. It returns clean dataset rows for portfolio companies, industries, stages, locations, websites, LinkedIn URLs, batches, and investment metadata when available.

Who is it for?

Venture capital analysts building startup market maps.
Startup scouts tracking 500 Global-backed companies.
Recruiters sourcing venture-backed companies by country or industry.
Agencies building prospect lists for sales or partnerships.
Researchers comparing accelerator and seed-stage ecosystems.
Data teams enriching CRM or warehouse records.

Why use this actor?

HTTP-only collection keeps runs fast and cost-efficient.
The output is already normalized for exports and APIs.
Filters let you narrow records by tenant, keyword, country, industry, and stage.
Every row includes a sourceUrl and scrapedAt timestamp for traceability.

What data can you extract?

Field	Description
`companyName`	Best available company display name
`description`	Public one-line company description
`websiteUrl`	Company website
`linkedinUrl`	Company LinkedIn page when available
`logoUrl`	Company logo image URL
`country`	Country of operation
`region`	Region of operation
`stage`	Investment stage
`businessModel`	Business model tag
`industries`	Industry tags
`tenants`	500 Global tenant/program tags
`batches`	Batch names
`initialInvestmentDates`	Initial investment dates
`companyId`	Source company ID
`organizationId`	Source organization ID
`alternativeNames`	Alternative names from the source
`sourceUrl`	API URL used for the scrape
`scrapedAt`	Collection timestamp

How much does it cost to scrape 500 Global portfolio companies?

This actor uses pay-per-event pricing. You pay a $0.005 start fee for each run and a per-result fee for each saved company record. Formula-derived per-result pricing starts at $0.000024241 for BRONZE users, with canonical tier discounts for higher Apify plans. A 1,000-company export costs about $0.029 before any plan-tier discount, plus the $0.005 run start fee. For small tests, keep maxItems low. For production exports, increase maxItems to the number of rows you need.

Quick start

Open the actor on Apify.
Keep the default tenant value or leave it empty for all public records.
Add an optional keyword like fintech, ai, or Brazil.
Set maxItems to the number of companies you want.
Click Start.
Export the dataset as JSON, CSV, Excel, or via API.

Input options

tenant

Optional 500 Global tenant or program filter. Examples include global, latam, sea, mena, thailand, korea, taiwan, and istambul. Leave it empty to request all public portfolio records.

search

Optional case-insensitive keyword. The actor searches company name, alternative names, description, website, LinkedIn URL, country, region, stage, business model, industries, tenants, and batches.

maxItems

Maximum number of records to save. Use a low value for testing. Use a larger value when you need a full export.

countries

Optional list of country names. If provided, the actor saves only companies whose country exactly matches one of the names.

industries

Optional list of industry names. If provided, the actor saves companies with at least one matching industry.

stages

Optional list of stage names. If provided, the actor saves only matching stages.

Example input

{
  "tenant": "global",
  "search": "fintech",
  "maxItems": 25,
  "countries": [],
  "industries": [],
  "stages": []
}

Example output

{
  "companyName": "AA Audience",
  "description": "Developer of a mobile application designed to generate customers for the finance industry via mobile devices.",
  "websiteUrl": "https://aaaudi.com/",
  "linkedinUrl": null,
  "logoUrl": "https://founder-hub-public.s3.us-west-1.amazonaws.com/example.png",
  "country": "United States",
  "region": "North America",
  "stage": "Seed",
  "businessModel": "B2B",
  "industries": ["FinTech"],
  "tenants": ["taiwan", "global"],
  "batches": ["GA 21"],
  "initialInvestmentDates": ["2017-05-16T00:00:00.000Z"],
  "companyId": 4502,
  "organizationId": 1810,
  "businessName": "AA Audience",
  "alternativeNames": ["AA Audience"],
  "sourceUrl": "https://500.co/api/startups?tenant=global",
  "scrapedAt": "2026-05-23T08:49:09.741Z"
}

Tips for best results

Start with maxItems set to 25 or less.
Use search to narrow broad portfolio exports.
Use country filters only after checking the exact source spelling.
Leave optional filters empty when you want the broadest dataset.
Combine tenant and search for focused lists, such as latam plus fintech.

Integrations

Use this actor with CRM imports, spreadsheet enrichment, BI dashboards, lead generation workflows, or startup intelligence pipelines. Common workflows include:

Export CSV into Google Sheets.
Sync dataset items into Airtable.
Feed company domains into enrichment actors.
Monitor new records over time by comparing dataset exports.
Build market maps by country, industry, or stage.

API usage with Node.js

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: process.env.APIFY_TOKEN });
const run = await client.actor('automation-lab/500-global-portfolio-scraper').call({
  tenant: 'global',
  search: 'fintech',
  maxItems: 25,
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

API usage with Python

from apify_client import ApifyClient
import os

client = ApifyClient(os.environ['APIFY_TOKEN'])
run = client.actor('automation-lab/500-global-portfolio-scraper').call(run_input={
    'tenant': 'global',
    'search': 'fintech',
    'maxItems': 25,
})
items = client.dataset(run['defaultDatasetId']).list_items().items
print(items)

API usage with cURL

curl -X POST 'https://api.apify.com/v2/acts/automation-lab~500-global-portfolio-scraper/runs?token=YOUR_APIFY_TOKEN' \
  -H 'Content-Type: application/json' \
  -d '{"tenant":"global","search":"fintech","maxItems":25}'

MCP usage

Use this actor through Apify MCP with Claude Code or Claude Desktop. Configure the MCP URL with the actor tool parameter:

https://mcp.apify.com/?tools=automation-lab/500-global-portfolio-scraper

Claude Code setup:

$claude mcp add apify https://mcp.apify.com/?tools=automation-lab/500-global-portfolio-scraper

Claude Desktop JSON setup:

{
  "mcpServers": {
    "apify": {
      "url": "https://mcp.apify.com/?tools=automation-lab/500-global-portfolio-scraper"
    }
  }
}

Example prompts:

"Run 500 Global Portfolio Scraper for fintech companies and summarize the top countries."
"Collect 50 global 500 Global portfolio companies and make a table of websites and industries."
"Find 500 Global companies in Brazil and group them by industry."

Data freshness

The actor retrieves the public 500 Global API during each run. Rows reflect the source data available at run time. The scrapedAt field records when the run collected the data.

Limitations

The actor extracts public portfolio fields exposed by 500 Global. It does not log in, bypass access controls, or infer hidden fields. Founder-level details are not included because they are not present in the public API response used by the MVP.

FAQ and troubleshooting

Why did my run return fewer rows than maxItems?

Your filters may be too narrow, or the selected tenant may contain fewer matching companies. Remove search, countries, industries, or stages to broaden the result set.

Why are some website or LinkedIn fields null?

Some portfolio records do not include those URLs in the public source data. The actor keeps null values rather than inventing or guessing data.

Legality and ethics

This actor collects publicly available business information from 500 Global. Use the data responsibly and comply with applicable laws, platform terms, and privacy rules. Do not use scraped data for spam, harassment, or unlawful profiling.

You may also find these Apify actors useful:

Support

If the source page changes or you need additional fields, open an issue on Apify. Include your run ID, input, and a short description of the expected output.

Changelog

Initial version extracts public portfolio company records from 500 Global's API.

500 Global Scraper

michael.g/500-global-scraper

Scrape data on 500 Global companies from the 500 Global portfolio. Export scraped data, schedule via API, and integrate with other tools or AI workflows.

Michael G

5.0

500 Global Scraper With Emails

fatihtahta/500-global-scraper

Scrape company data from 500 Global’s portfolio (500.co) including names, descriptions, founders, emails, investments, hiring signals, and more. Ideal for deal sourcing, market mapping, or outreach.

Fatih Tahta

5.0

500 Global Companies Directory — Startup API

nexgendata/500-global-companies-directory

Scrape the 500 Global company directory for deal sourcing. Clean JSON for VCs, deal scouts and AI agents.

NexGenData

Sequoia Portfolio Scraper — VC Portfolio Companies API

nexgendata/sequoia-portfolio-scraper

Scrape the Sequoia Capital portfolio companies. Clean JSON for VCs, deal scouts and AI agents.

NexGenData

Founders Fund Portfolio Scraper — Portfolio API

nexgendata/founders-fund-portfolio-scraper

Scrape the Founders Fund portfolio companies. Clean JSON for VCs, deal scouts and AI agents.

NexGenData

Global

variety/global

Variety

Techstars Portfolio Companies Scraper

scraped/techstars-portfolio-companies-scraper

Scrape all Portfolio Companies from Techstars

scraped

Greylock Portfolio Scraper — VC Portfolio API

nexgendata/greylock-portfolio-scraper

Scrape the Greylock Partners portfolio companies. Clean JSON for VCs, deal scouts and AI agents.

NexGenData

Lightspeed Portfolio Scraper — VC Portfolio API

nexgendata/lightspeed-portfolio-scraper

Scrape the Lightspeed Venture Partners portfolio companies. Clean JSON for VCs, deal scouts and AI agents.

NexGenData

TinySeed Portfolio Scraper

automation-lab/tinyseed-portfolio-scraper

🌱 Scrape TinySeed portfolio companies with cohorts, locations, categories, descriptions, logos, and websites for startup research.

Stas Persiianenko

500 Global Portfolio Scraper

What does 500 Global Portfolio Scraper do?

Who is it for?

Why use this actor?

What data can you extract?

How much does it cost to scrape 500 Global portfolio companies?

Quick start

Input options

tenant

search

maxItems

countries

industries

stages

Example input

Example output

Tips for best results

Integrations

API usage with Node.js

API usage with Python

API usage with cURL

MCP usage

Data freshness

Limitations

FAQ and troubleshooting

Why did my run return fewer rows than maxItems?

Why are some website or LinkedIn fields null?

Legality and ethics

Related scrapers

Support

Changelog

You might also like

500 Global Scraper

500 Global Scraper With Emails

500 Global Companies Directory — Startup API

Sequoia Portfolio Scraper — VC Portfolio Companies API

Founders Fund Portfolio Scraper — Portfolio API

Global

Techstars Portfolio Companies Scraper

Greylock Portfolio Scraper — VC Portfolio API

Lightspeed Portfolio Scraper — VC Portfolio API

TinySeed Portfolio Scraper