500 Global Portfolio Scraper

Extract public 500 Global portfolio companies with websites, industries, locations, stages, batches, and investment metadata.

Pricing

Pay per event

Developer

Stas Persiianenko

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 4 days ago

Extract structured company data from the public 500 Global portfolio directory.

What does 500 Global Portfolio Scraper do?

500 Global Portfolio Scraper collects public startup records from 500 Global's portfolio. It reads the same public data source used by the 500 Global companies page. It returns clean dataset rows for portfolio companies, industries, stages, locations, websites, LinkedIn URLs, batches, and investment metadata when available.

Who is it for?

  • Venture capital analysts building startup market maps.
  • Startup scouts tracking 500 Global-backed companies.
  • Recruiters sourcing venture-backed companies by country or industry.
  • Agencies building prospect lists for sales or partnerships.
  • Researchers comparing accelerator and seed-stage ecosystems.
  • Data teams enriching CRM or warehouse records.

Why use this actor?

  • HTTP-only collection keeps runs fast and cost-efficient.
  • The output is already normalized for exports and APIs.
  • Filters let you narrow records by tenant, keyword, country, industry, and stage.
  • Every row includes a sourceUrl and scrapedAt timestamp for traceability.

What data can you extract?

| Field | Description |
| --- | --- |
| companyName | Best available company display name |
| description | Public one-line company description |
| websiteUrl | Company website |
| linkedinUrl | Company LinkedIn page when available |
| logoUrl | Company logo image URL |
| country | Country of operation |
| region | Region of operation |
| stage | Investment stage |
| businessModel | Business model tag |
| industries | Industry tags |
| tenants | 500 Global tenant/program tags |
| batches | Batch names |
| initialInvestmentDates | Initial investment dates |
| companyId | Source company ID |
| organizationId | Source organization ID |
| alternativeNames | Alternative names from the source |
| sourceUrl | API URL used for the scrape |
| scrapedAt | Collection timestamp |

How much does it cost to scrape 500 Global portfolio companies?

This actor uses pay-per-event pricing. You pay a $0.005 start fee for each run plus a per-result fee for each saved company record. Per-result pricing starts at $0.000024241 for BRONZE users, with discounts on higher Apify plan tiers. A 1,000-company export costs about $0.029 before any plan-tier discount, plus the $0.005 run start fee. For small tests, keep maxItems low. For production exports, raise maxItems to the number of rows you need.
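As a rough check on the arithmetic, the sketch below multiplies the quoted BRONZE per-result rate by the row count and adds the start fee. It uses only the two figures given in the paragraph above; the rate actually billed depends on your Apify plan, and `estimate_run_cost` is a hypothetical helper, not part of the actor or the Apify client.

```python
from decimal import Decimal

# Figures quoted in the pricing paragraph (BRONZE tier); other tiers differ.
START_FEE = Decimal("0.005")
PER_RESULT_BRONZE = Decimal("0.000024241")


def estimate_run_cost(results: int, per_result: Decimal = PER_RESULT_BRONZE) -> Decimal:
    """Return the estimated USD cost of one run that saves `results` rows."""
    if results < 0:
        raise ValueError("results must be non-negative")
    # One start fee per run, plus the per-result fee for every saved row.
    return START_FEE + per_result * results


for rows in (25, 1000):
    print(rows, "rows:", estimate_run_cost(rows), "USD")
```

At the BRONZE rate, 1,000 rows come to about $0.024 in result fees, or about $0.029 once the start fee is included.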

Quick start

  1. Open the actor on Apify.
  2. Keep the default tenant value or leave it empty for all public records.
  3. Add an optional keyword like fintech, ai, or Brazil.
  4. Set maxItems to the number of companies you want.
  5. Click Start.
  6. Export the dataset as JSON, CSV, Excel, or via API.

Input options

tenant

Optional 500 Global tenant or program filter. Examples include global, latam, sea, mena, thailand, korea, taiwan, and istambul. Leave it empty to request all public portfolio records.

search

Optional case-insensitive keyword. The actor searches company name, alternative names, description, website, LinkedIn URL, country, region, stage, business model, industries, tenants, and batches.
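To make the keyword behavior concrete, here is a minimal sketch of a case-insensitive match over the fields listed above, which you could apply to exported rows yourself. It assumes substring matching and maps the listed fields to the output field names; the actor's own matching rules are not documented here, and `matches_keyword` is a hypothetical helper.

```python
from typing import Any

# Output field names corresponding to the searched fields listed above.
SEARCH_FIELDS = (
    "companyName", "alternativeNames", "description", "websiteUrl",
    "linkedinUrl", "country", "region", "stage", "businessModel",
    "industries", "tenants", "batches",
)


def matches_keyword(row: dict[str, Any], keyword: str) -> bool:
    """Return True if `keyword` appears in any searched field (assumed substring match)."""
    needle = keyword.casefold()
    for field in SEARCH_FIELDS:
        value = row.get(field)
        if value is None:
            continue  # null fields cannot match
        # List fields (industries, tenants, ...) are checked item by item.
        values = value if isinstance(value, list) else [value]
        if any(needle in str(item).casefold() for item in values):
            return True
    return False
```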

maxItems

Maximum number of records to save. Use a low value for testing. Use a larger value when you need a full export.

countries

Optional list of country names. If provided, the actor saves only companies whose country exactly matches one of the names.

industries

Optional list of industry names. If provided, the actor saves companies with at least one matching industry.

stages

Optional list of stage names. If provided, the actor saves only matching stages.
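The three list filters can be pictured with the following sketch. It treats an empty list as "no filter", uses exact matching for countries as described above, and assumes the same exact, case-sensitive comparison for industries and stages, which the descriptions do not spell out. `passes_filters` is a hypothetical helper for post-processing exported rows, not the actor's code.

```python
from typing import Any, Sequence


def passes_filters(
    row: dict[str, Any],
    countries: Sequence[str] = (),
    industries: Sequence[str] = (),
    stages: Sequence[str] = (),
) -> bool:
    """Return True if the row satisfies every non-empty filter."""
    # Country must match one of the given names exactly.
    if countries and row.get("country") not in countries:
        return False
    # At least one of the row's industries must be in the requested list.
    if industries and not set(industries) & set(row.get("industries") or []):
        return False
    # Stage must be one of the requested stages (exact match is an assumption).
    if stages and row.get("stage") not in stages:
        return False
    return True
```

Because country matching is exact, a filter of `["USA"]` would not match a row whose country is `"United States"`.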

Example input

{
    "tenant": "global",
    "search": "fintech",
    "maxItems": 25,
    "countries": [],
    "industries": [],
    "stages": []
}

Example output

{
    "companyName": "AA Audience",
    "description": "Developer of a mobile application designed to generate customers for the finance industry via mobile devices.",
    "websiteUrl": "https://aaaudi.com/",
    "linkedinUrl": null,
    "logoUrl": "https://founder-hub-public.s3.us-west-1.amazonaws.com/example.png",
    "country": "United States",
    "region": "North America",
    "stage": "Seed",
    "businessModel": "B2B",
    "industries": ["FinTech"],
    "tenants": ["taiwan", "global"],
    "batches": ["GA 21"],
    "initialInvestmentDates": ["2017-05-16T00:00:00.000Z"],
    "companyId": 4502,
    "organizationId": 1810,
    "businessName": "AA Audience",
    "alternativeNames": ["AA Audience"],
    "sourceUrl": "https://500.co/api/startups?tenant=global",
    "scrapedAt": "2026-05-23T08:49:09.741Z"
}
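If you consume rows in Python, a small typed wrapper can make nullable fields explicit. The sketch below covers only a subset of fields, and its types are inferred from the single example row above, so treat them as assumptions; `PortfolioCompany` is a hypothetical class, not something the actor provides.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass(frozen=True)
class PortfolioCompany:
    """A subset of one dataset row; types inferred from the example output."""

    company_id: int
    company_name: str
    website_url: Optional[str]
    linkedin_url: Optional[str]
    country: Optional[str]
    stage: Optional[str]
    industries: tuple[str, ...] = ()
    tenants: tuple[str, ...] = ()

    @classmethod
    def from_row(cls, row: dict[str, Any]) -> "PortfolioCompany":
        # URL and location fields may be null in the source, so use .get().
        return cls(
            company_id=row["companyId"],
            company_name=row["companyName"],
            website_url=row.get("websiteUrl"),
            linkedin_url=row.get("linkedinUrl"),
            country=row.get("country"),
            stage=row.get("stage"),
            industries=tuple(row.get("industries") or ()),
            tenants=tuple(row.get("tenants") or ()),
        )
```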

Tips for best results

  • Start with maxItems set to 25 or less.
  • Use search to narrow broad portfolio exports.
  • Use country filters only after checking the exact source spelling.
  • Leave optional filters empty when you want the broadest dataset.
  • Combine tenant and search for focused lists, such as latam plus fintech.
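These tips can be folded into a small input builder. The sketch below uses the input field names from the example input and defaults maxItems to 25; it assumes that an empty string or empty list counts as "left empty", and `build_input` is a hypothetical helper rather than part of the Apify client.

```python
from typing import Any, Sequence


def build_input(
    tenant: str = "",
    search: str = "",
    max_items: int = 25,
    countries: Sequence[str] = (),
    industries: Sequence[str] = (),
    stages: Sequence[str] = (),
) -> dict[str, Any]:
    """Build a run input dict; empty values mean 'no filter' (assumption)."""
    if max_items < 1:
        raise ValueError("max_items must be at least 1")
    return {
        "tenant": tenant,
        "search": search,
        "maxItems": max_items,
        "countries": list(countries),
        "industries": list(industries),
        "stages": list(stages),
    }


# Focused list: the latam tenant combined with a fintech keyword.
print(build_input(tenant="latam", search="fintech"))
```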

Integrations

Use this actor with CRM imports, spreadsheet enrichment, BI dashboards, lead generation workflows, or startup intelligence pipelines. Common workflows include:

  • Export CSV into Google Sheets.
  • Sync dataset items into Airtable.
  • Feed company domains into enrichment actors.
  • Monitor new records over time by comparing dataset exports.
  • Build market maps by country, industry, or stage.
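Monitoring new records by comparing exports can be sketched as a set difference on an identifier. The example assumes companyId is stable between runs, which the source does not state explicitly; `new_records` is a hypothetical helper.

```python
from typing import Any


def new_records(
    previous: list[dict[str, Any]],
    current: list[dict[str, Any]],
) -> list[dict[str, Any]]:
    """Return rows in `current` whose companyId was absent from `previous`."""
    seen = {row["companyId"] for row in previous}
    return [row for row in current if row["companyId"] not in seen]


# Toy data standing in for two dataset exports taken at different times.
last_week = [{"companyId": 1, "companyName": "Alpha"}]
today = [
    {"companyId": 1, "companyName": "Alpha"},
    {"companyId": 2, "companyName": "Beta"},
]
print(new_records(last_week, today))
```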

API usage with Node.js

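import { ApifyClient } from 'apify-client';

// Read the API token from the environment rather than hard-coding it.
const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

// Start the actor and wait for the run to finish.
const run = await client.actor('automation-lab/500-global-portfolio-scraper').call({
    tenant: 'global',
    search: 'fintech',
    maxItems: 25,
});

// Fetch the saved company records from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);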

API usage with Python

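import os

from apify_client import ApifyClient

# Read the API token from the environment rather than hard-coding it.
client = ApifyClient(os.environ['APIFY_TOKEN'])

# Start the actor and wait for the run to finish.
run = client.actor('automation-lab/500-global-portfolio-scraper').call(run_input={
    'tenant': 'global',
    'search': 'fintech',
    'maxItems': 25,
})

# Fetch the saved company records from the run's default dataset.
items = client.dataset(run['defaultDatasetId']).list_items().items
print(items)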

API usage with cURL

curl -X POST 'https://api.apify.com/v2/acts/automation-lab~500-global-portfolio-scraper/runs?token=YOUR_APIFY_TOKEN' \
    -H 'Content-Type: application/json' \
    -d '{"tenant":"global","search":"fintech","maxItems":25}'

MCP usage

Use this actor through Apify MCP with Claude Code or Claude Desktop. Configure the MCP URL with the actor tool parameter:

https://mcp.apify.com/?tools=automation-lab/500-global-portfolio-scraper

Claude Code setup:

claude mcp add --transport http apify "https://mcp.apify.com/?tools=automation-lab/500-global-portfolio-scraper"

Claude Desktop JSON setup:

{
    "mcpServers": {
        "apify": {
            "url": "https://mcp.apify.com/?tools=automation-lab/500-global-portfolio-scraper"
        }
    }
}

Example prompts:

  • "Run 500 Global Portfolio Scraper for fintech companies and summarize the top countries."
  • "Collect 50 global 500 Global portfolio companies and make a table of websites and industries."
  • "Find 500 Global companies in Brazil and group them by industry."

Data freshness

The actor retrieves the public 500 Global API during each run. Rows reflect the source data available at run time. The scrapedAt field records when the run collected the data.
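To act on freshness downstream, you can parse scrapedAt and compare it to a maximum age. The sketch assumes the timestamp always has the ISO 8601 UTC form shown in the example output (ending in `Z`); `parse_scraped_at` and `is_stale` are hypothetical helpers.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Optional


def parse_scraped_at(value: str) -> datetime:
    """Parse a timestamp like '2026-05-23T08:49:09.741Z' into an aware datetime."""
    # Older Python versions do not accept a trailing 'Z', so normalize it.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def is_stale(row: dict[str, Any], max_age: timedelta, now: Optional[datetime] = None) -> bool:
    """Return True if the row was collected more than `max_age` before `now`."""
    now = now or datetime.now(timezone.utc)
    return now - parse_scraped_at(row["scrapedAt"]) > max_age


row = {"scrapedAt": "2026-05-23T08:49:09.741Z"}
print(is_stale(row, timedelta(days=7)))
```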

Limitations

The actor extracts public portfolio fields exposed by 500 Global. It does not log in, bypass access controls, or infer hidden fields. Founder-level details are not included because they are not present in the public API response used by the MVP.

FAQ and troubleshooting

Why did my run return fewer rows than maxItems?

Your filters may be too narrow, or the selected tenant may contain fewer matching companies. Remove search, countries, industries, or stages to broaden the result set.

Why are some website or LinkedIn fields null?

Some portfolio records do not include those URLs in the public source data. The actor keeps null values rather than inventing or guessing data.
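Before relying on the URL fields for enrichment, it can help to measure how many rows actually carry them. This is a minimal sketch over exported rows; `url_coverage` is a hypothetical helper, and the field names are taken from the output table.

```python
from typing import Any

URL_FIELDS = ("websiteUrl", "linkedinUrl")


def url_coverage(rows: list[dict[str, Any]]) -> dict[str, int]:
    """Count how many rows have a non-null, non-empty value for each URL field."""
    counts = {field: 0 for field in URL_FIELDS}
    for row in rows:
        for field in URL_FIELDS:
            if row.get(field):
                counts[field] += 1
    return counts


# Toy rows standing in for dataset items.
sample = [
    {"websiteUrl": "https://aaaudi.com/", "linkedinUrl": None},
    {"websiteUrl": None, "linkedinUrl": None},
]
print(url_coverage(sample))
```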

Legality and ethics

This actor collects publicly available business information from 500 Global. Use the data responsibly and comply with applicable laws, platform terms, and privacy rules. Do not use scraped data for spam, harassment, or unlawful profiling.

Support

If the source page changes or you need additional fields, open an issue on Apify. Include your run ID, input, and a short description of the expected output.

Changelog

Initial version extracts public portfolio company records from 500 Global's API.