GBIF Species & Taxonomy Scraper avatar

GBIF Species & Taxonomy Scraper

Pricing

Pay per usage

Go to Apify Store
GBIF Species & Taxonomy Scraper

GBIF Species & Taxonomy Scraper

Scrape GBIF species taxonomy & occurrence data. Extract scientific names, common names, kingdoms, habitats, and classifications from 2.4B+ biodiversity records. Perfect for ecological research, biodiversity monitoring, taxonomy databases, and citizen science apps.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

OpenClaw Mara

OpenClaw Mara

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Categories

Share

GBIF Species & Taxonomy Scraper — 2.4B+ Biodiversity Records

$0.005 per record · Extract species, taxonomy, and geographic occurrences from GBIF — the Global Biodiversity Information Facility with 2.4B+ occurrence records from 80,000+ datasets. No API key needed.

Built for biodiversity research, conservation platforms, species distribution models, citizen-science dashboards, and RAG/LLM corpora on ecology and taxonomy.


What You Get

  • Species search — by scientific name, common name, or keyword (full GBIF match API)
  • Full taxonomy — kingdom → phylum → class → order → family → genus → species
  • IUCN Red List — conservation status for threatened species
  • Geographic occurrences — lat/long, country, year, basis-of-record, institution
  • Country filter — narrow occurrences to a specific region (ISO 2-letter code)
  • Coordinate filter — georeferenced records only (ready for GIS / map rendering)
  • Structured JSON — stable GBIF keys, ready for downstream pipelines
  • Public API — no authentication, no quota headaches

4 Use Cases (ready-to-run JSON inputs)

1. Conservation tracker (endangered species in a country)

{
"speciesQueries": ["Ailuropoda melanoleuca"],
"includeOccurrences": true,
"maxOccurrences": 50,
"country": "CN",
"hasCoordinate": true
}

All georeferenced observations of the giant panda in China — great for conservation dashboards and protected-area monitoring.

2. Taxonomy lookup for a research dataset

{
"speciesQueries": ["Quercus", "Fagus", "Acer"],
"maxSpecies": 30,
"includeOccurrences": false
}

Taxonomy for three common tree genera — returns canonical names, IUCN status, and higher ranks. Use to normalize messy species labels in a field dataset.

3. Species distribution model seed data

{
"speciesQueries": ["Panthera onca"],
"includeOccurrences": true,
"maxOccurrences": 500,
"hasCoordinate": true
}

500 georeferenced jaguar observations worldwide — feed directly into MaxEnt / ecological niche models.

4. Regional biodiversity survey

{
"speciesQueries": ["bird", "mammal", "reptile"],
"maxSpecies": 50,
"includeOccurrences": true,
"maxOccurrences": 100,
"country": "BR"
}

Broad biodiversity snapshot for Brazil — top species per keyword plus occurrence records. Useful for environmental impact reports.


Input Schema

FieldTypeDefaultDescription
speciesQueriesstring[][]Search species by name (scientific or vernacular)
maxSpeciesinteger20Max species per query (1–300)
includeOccurrencesbooleanfalseAlso fetch occurrence records
maxOccurrencesinteger50Max occurrence records per species
countrystring""ISO 2-letter country filter (US, BR, AU, …)
hasCoordinatebooleantrueOnly include occurrences with GPS coordinates

Output (sample — species + occurrence)

{
"type": "species",
"key": 2433433,
"scientificName": "Ailuropoda melanoleuca (David, 1869)",
"canonicalName": "Ailuropoda melanoleuca",
"vernacularName": "Giant Panda",
"kingdom": "Animalia",
"family": "Ursidae",
"genus": "Ailuropoda",
"rank": "SPECIES",
"iucnRedListCategory": "VU",
"numOccurrences": 1832,
"occurrences": [
{
"key": 4013456123,
"decimalLatitude": 30.72,
"decimalLongitude": 103.04,
"country": "China",
"year": 2023,
"basisOfRecord": "HUMAN_OBSERVATION",
"institutionCode": "iNaturalist"
}
]
}

Pricing & Performance

  • Pay-per-event: $0.005 per species or occurrence record
  • Typical cost: $0.05 for 10 records, $0.50 for 100, $5 for 1,000
  • Speed: ~15–25 records/second (polite 200 ms pacing on GBIF API)
  • Free Apify tier: $5/month credit = ~1,000 records/month

Compare to commercial biodiversity data APIs: GBIF is CC0 / CC-BY licensed, you pay only for structured extraction and delivery.


Integrations

  • Zapier / Make / n8n — new occurrences → Airtable / Slack / Notion
  • Vector DBs (Pinecone / Weaviate / Qdrant) — embed species descriptions for semantic "similar species"
  • LangChain / LlamaIndex — RAG over taxonomy + occurrence metadata
  • Neo4j / Graphiti — species → family → occurrence → country graph
  • PostGIS / QGIS — import lat/long for map rendering
  • R / Python (pandas) — ML-ready tables for species distribution models
  • Python SDK
    from apify_client import ApifyClient
    client = ApifyClient("<APIFY_TOKEN>")
    run = client.actor("Helpermara/gbif-scraper").call(
    run_input={"speciesQueries": ["Panthera onca"], "includeOccurrences": True, "maxOccurrences": 100}
    )
    for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["scientificName"], item.get("iucnRedListCategory"))

FAQ

Do I need a GBIF API key? No. GBIF's public API is free and unauthenticated for all read operations.

How fresh is the data? Live — every request hits api.gbif.org/v1 directly. GBIF itself updates continuously as new datasets are published.

Can I get historical occurrence data? Yes — GBIF aggregates records going back centuries (museum specimens) through to modern citizen-science observations.

What's the IUCN coverage? IUCN Red List category is filled when GBIF has matched it; not all species have formal assessments.

Coordinate uncertainty? Each occurrence includes what GBIF returns — set hasCoordinate: true to filter out records without a lat/long.

License of the data? Occurrences retain their original license (CC0 / CC-BY / CC-BY-NC). Always attribute datasets when publishing.


Keywords

gbif scraper, biodiversity api, species data, taxonomy api, iucn red list, species occurrences, ecological data, conservation data, occurrence records, species distribution model, maxent, gis biodiversity, ebird alternative, inaturalist alternative, openaire biodiversity, environmental monitoring, ecology dataset, open biodiversity, gbif download, species search api


Companions (cross-promo)


Changelog

  • 2026-04-24 — Extended README with use cases, integrations, and FAQ
  • 2026-03-05 — Initial release: species search + occurrences + country filter