BGCI GlobalTreeSearch Tree Species Scraper avatar

BGCI GlobalTreeSearch Tree Species Scraper

Pricing

Pay per event

Go to Apify Store
BGCI GlobalTreeSearch Tree Species Scraper

BGCI GlobalTreeSearch Tree Species Scraper

Scrapes the BGCI GlobalTreeSearch database — the authoritative list of all ~58,000 tree species worldwide, with conservation status, native-country distribution, and taxonomic data from the Botanic Gardens Conservation International.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

BowTiedRaccoon

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Scrapes the BGCI GlobalTreeSearch database — the authoritative, comprehensive list of the world's ~58,000 tree species maintained by the Botanic Gardens Conservation International (BGCI). Each record includes the full scientific name, plant family, taxonomic author, data source, native-country distribution, and known synonyms.

GlobalTreeSearch is the primary reference behind the Global Tree Assessment and is used by arboreta, reforestation programmes, carbon and forestry projects, and conservation NGOs worldwide. This actor extracts the complete database into a flat, queryable dataset.

Features

  • Scrapes all ~58,000 tree species from the BGCI GlobalTreeSearch JSON API
  • Supports targeted scraping by genus — specify one or more genera to limit the dataset
  • Extracts native-country distribution for every species (TSGeolinks)
  • Extracts taxonomic synonyms (TSTaxas)
  • No authentication required — uses the BGCI public data API
  • Polite rate limiting (200ms delay between genus requests)

Use cases

  • Carbon and reforestation projects — verify species eligibility and native range for project countries
  • Conservation NGOs — track the full tree species inventory with distribution metadata
  • ESG and biodiversity risk analytics — build species datasets for portfolio screening
  • Arboreta and botanical gardens — collection planning and completeness audits
  • AI training data — authoritative tree taxonomy + geography for forestry models

Input

FieldTypeDescriptionDefault
generaArray of stringsList of genus names to scrape (e.g. ["Quercus", "Abies"]). Leave empty to scrape all ~4,162 genera.["Quercus", "Abies"]
maxItemsIntegerMaximum number of records to collect10

Example input (targeted):

{
"genera": ["Quercus", "Abies", "Pinus"],
"maxItems": 1000
}

Example input (full database):

{
"genera": [],
"maxItems": 100000
}

Output

Each record in the dataset represents one tree species:

FieldTypeDescription
bgci_idIntegerGlobalTreeSearch internal record ID
taxonStringFull scientific name (genus + species + infraspecific epithets)
genusStringGenus name (derived from the taxon name)
familyStringPlant family (e.g. Fagaceae, Pinaceae)
authorStringTaxonomic author citation
sourceStringData source (e.g. IUCN RL tree review 2020, WCSP)
problemsString or nullData-quality or nomenclatural notes
distribution_doneStringWhether distribution mapping is complete (yes / no)
noteString or nullAdditional remarks
native_countriesStringSemicolon-separated list of countries where the species is native
synonymsStringSemicolon-separated list of known taxonomic synonyms
bgci_urlStringLink to the species record on the BGCI GlobalTreeSearch website

Example record:

{
"bgci_id": 151050178,
"taxon": "Quercus acatenangensis",
"genus": "Quercus",
"family": "Fagaceae",
"author": "Trel.",
"source": "IUCN RL tree review 2020",
"problems": null,
"distribution_done": "yes",
"note": null,
"native_countries": "El Salvador; Guatemala; Mexico",
"synonyms": null,
"bgci_url": "https://www.bgci.org/resources/bgci-databases/globaltreesearch/?species=Quercus%20acatenangensis"
}

Performance and pricing

  • Full database (all ~58k species): requires iterating ~4,162 genus endpoints. Each genus request takes ~2–3 seconds including the polite delay, so a full run takes several hours. Set a high maxItems and plan for a longer run.
  • Targeted genus scraping: very fast — each genus returns instantly (e.g. Quercus returns ~424 species in one request).
  • Pricing: Pay-per-event (PPE) — you are charged only for records collected, not for run time.

Data source

Data is sourced from the BGCI GlobalTreeSearch API — the public JSON endpoint of the Botanic Gardens Conservation International's GlobalTreeSearch database. No API key or authentication is required.

GlobalTreeSearch is distinct from:

  • GBIF — occurrence/observation points (where specimens have been found)
  • IPNI — nomenclature (plant name registry)
  • GlobalTreeSearch — tree-conservation status + native country distribution (this actor)

Limitations

  • Distribution data (native_countries) reflects the BGCI's geo-linking records and may not be exhaustive for all species.
  • Conservation status (e.g. IUCN Red List category) is referenced in the source field but the specific category is not returned by the genus endpoint. This is a GlobalTreeSearch API limitation.
  • The full ~58k species database requires a long runtime. For large-scale use, run with generous maxItems and timeoutSecs.