Zenodo Scraper avatar

Zenodo Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Zenodo Scraper

Zenodo Scraper

Scrape Zenodo, CERN's open science repository with 3M+ research records including papers, datasets, software, posters, and presentations. Search by query, resource type, access rights, or fetch by record ID, DOI, or community.

Pricing

from $3.00 / 1,000 results

Rating

5.0

(7)

Developer

Crawler Bros

Crawler Bros

Maintained by Community

Actor stats

7

Bookmarked

2

Total users

1

Monthly active users

7 days ago

Last modified

Share

Extract research records from Zenodo — CERN's open-access repository with 3M+ scholarly outputs including publications, datasets, software, posters, presentations, and more. No API key required.

What Does Zenodo Scraper Do?

Zenodo Scraper gives you structured access to the Zenodo research repository via its public REST API. You can:

  • Search the full Zenodo catalog by any keyword or phrase
  • Filter by resource type (dataset, paper, software, etc.), access rights, date range, and sort order
  • Fetch a specific record by its Zenodo ID or DOI
  • Browse all records belonging to a Zenodo community

Each record includes the title, authors, description, keywords, resource type, access rights, license, publication date, community membership, file counts, and the direct Zenodo URL.


Output Fields

FieldTypeDescription
zenodoIdIntegerZenodo numeric record identifier
doiStringDigital Object Identifier (DOI)
titleStringRecord title
creatorsArrayList of creators with name, affiliation, orcid
descriptionStringAbstract or description (HTML stripped)
keywordsArrayAuthor-supplied keywords
resourceTypeStringType of resource (publication, dataset, software, etc.)
subtypeStringSub-type (e.g. article, preprint, figure)
accessRightStringAccess level (open, closed, embargoed, restricted)
licenseStringLicense identifier (e.g. cc-by-4.0)
publicationDateStringPublication date (YYYY-MM-DD)
communitiesArrayZenodo community IDs this record belongs to
fileCountIntegerNumber of attached files
totalFileSizeBytesIntegerTotal size of all attached files in bytes
zenodoUrlStringDirect URL to the record on zenodo.org
scrapedAtStringUTC timestamp when the record was scraped

Input Configuration

Modes

ModeDescription
searchFull-text search across all Zenodo records
byRecordIdFetch one specific record by its Zenodo numeric ID
byCommunityBrowse all records within a specific Zenodo community
byDOIFind a record by its DOI string

Input Parameters

ParameterTypeDefaultDescription
modeEnumsearchOperating mode
queryStringSearch query (mode=search)
recordIdStringZenodo record ID (mode=byRecordId)
communityIdStringCommunity slug (mode=byCommunity)
doiStringFull DOI (mode=byDOI)
resourceTypeEnumAnyFilter by type (publication, dataset, software, etc.)
accessRightEnumAnyFilter by access (open, closed, embargoed, restricted)
sortByEnumbestmatchSort order for results
fromDateStringFilter by publication date ≥ (YYYY-MM-DD)
toDateStringFilter by publication date ≤ (YYYY-MM-DD)
maxItemsInteger50Maximum number of records to return (1–1000)

Example Inputs

Search for climate change datasets

{
"mode": "search",
"query": "climate change",
"resourceType": "dataset",
"accessRight": "open",
"maxItems": 100
}

Fetch a specific record by ID

{
"mode": "byRecordId",
"recordId": "10234567"
}

Browse a community

{
"mode": "byCommunity",
"communityId": "zenodo",
"maxItems": 50
}

Find by DOI

{
"mode": "byDOI",
"doi": "10.5281/zenodo.10234567"
}

Search within a date range

{
"mode": "search",
"query": "machine learning",
"fromDate": "2023-01-01",
"toDate": "2024-12-31",
"sortBy": "mostrecent",
"maxItems": 200
}

Use Cases

  • Literature reviews: Collect papers and preprints across any research domain
  • Dataset discovery: Find open datasets for any scientific field
  • Software citation: Locate software deposits and their metadata
  • Community monitoring: Track new uploads to specific Zenodo communities
  • Open science analytics: Analyze publication trends, license adoption, and access patterns
  • Research metadata enrichment: Enrich reference lists with full Zenodo metadata

Frequently Asked Questions

Do I need a Zenodo account or API key? No. Zenodo's public API is free and requires no authentication.

How many records can I scrape? Up to 1,000 records per run. For larger datasets, Zenodo provides OAI-PMH and full data dumps at zenodo.org/oai2d.

What does the open access right mean? Open access records have files freely available to download. Closed and restricted records may only return metadata.

Can I search by author name? Yes — use the query field with Elasticsearch syntax: author:"Smith, John" or creators.name:Smith.

Are file download URLs included? Not directly. Use the zenodoId to construct the API URL: https://zenodo.org/api/records/{zenodoId}/files.

How fresh is the data? Zenodo's API serves live data. Records scraped reflect the current state of the repository.

What communities are available? Visit zenodo.org/communities to browse thousands of communities across all scientific disciplines.