Zenodo Scraper
Pricing
from $3.00 / 1,000 results
Zenodo Scraper
Scrape Zenodo, CERN's open science repository with 3M+ research records including papers, datasets, software, posters, and presentations. Search by query, resource type, access rights, or fetch by record ID, DOI, or community.
Pricing
from $3.00 / 1,000 results
Rating
5.0
(7)
Developer
Crawler Bros
Maintained by CommunityActor stats
7
Bookmarked
2
Total users
1
Monthly active users
7 days ago
Last modified
Categories
Share
Extract research records from Zenodo — CERN's open-access repository with 3M+ scholarly outputs including publications, datasets, software, posters, presentations, and more. No API key required.
What Does Zenodo Scraper Do?
Zenodo Scraper gives you structured access to the Zenodo research repository via its public REST API. You can:
- Search the full Zenodo catalog by any keyword or phrase
- Filter by resource type (dataset, paper, software, etc.), access rights, date range, and sort order
- Fetch a specific record by its Zenodo ID or DOI
- Browse all records belonging to a Zenodo community
Each record includes the title, authors, description, keywords, resource type, access rights, license, publication date, community membership, file counts, and the direct Zenodo URL.
Output Fields
| Field | Type | Description |
|---|---|---|
zenodoId | Integer | Zenodo numeric record identifier |
doi | String | Digital Object Identifier (DOI) |
title | String | Record title |
creators | Array | List of creators with name, affiliation, orcid |
description | String | Abstract or description (HTML stripped) |
keywords | Array | Author-supplied keywords |
resourceType | String | Type of resource (publication, dataset, software, etc.) |
subtype | String | Sub-type (e.g. article, preprint, figure) |
accessRight | String | Access level (open, closed, embargoed, restricted) |
license | String | License identifier (e.g. cc-by-4.0) |
publicationDate | String | Publication date (YYYY-MM-DD) |
communities | Array | Zenodo community IDs this record belongs to |
fileCount | Integer | Number of attached files |
totalFileSizeBytes | Integer | Total size of all attached files in bytes |
zenodoUrl | String | Direct URL to the record on zenodo.org |
scrapedAt | String | UTC timestamp when the record was scraped |
Input Configuration
Modes
| Mode | Description |
|---|---|
search | Full-text search across all Zenodo records |
byRecordId | Fetch one specific record by its Zenodo numeric ID |
byCommunity | Browse all records within a specific Zenodo community |
byDOI | Find a record by its DOI string |
Input Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
mode | Enum | search | Operating mode |
query | String | — | Search query (mode=search) |
recordId | String | — | Zenodo record ID (mode=byRecordId) |
communityId | String | — | Community slug (mode=byCommunity) |
doi | String | — | Full DOI (mode=byDOI) |
resourceType | Enum | Any | Filter by type (publication, dataset, software, etc.) |
accessRight | Enum | Any | Filter by access (open, closed, embargoed, restricted) |
sortBy | Enum | bestmatch | Sort order for results |
fromDate | String | — | Filter by publication date ≥ (YYYY-MM-DD) |
toDate | String | — | Filter by publication date ≤ (YYYY-MM-DD) |
maxItems | Integer | 50 | Maximum number of records to return (1–1000) |
Example Inputs
Search for climate change datasets
{"mode": "search","query": "climate change","resourceType": "dataset","accessRight": "open","maxItems": 100}
Fetch a specific record by ID
{"mode": "byRecordId","recordId": "10234567"}
Browse a community
{"mode": "byCommunity","communityId": "zenodo","maxItems": 50}
Find by DOI
{"mode": "byDOI","doi": "10.5281/zenodo.10234567"}
Search within a date range
{"mode": "search","query": "machine learning","fromDate": "2023-01-01","toDate": "2024-12-31","sortBy": "mostrecent","maxItems": 200}
Use Cases
- Literature reviews: Collect papers and preprints across any research domain
- Dataset discovery: Find open datasets for any scientific field
- Software citation: Locate software deposits and their metadata
- Community monitoring: Track new uploads to specific Zenodo communities
- Open science analytics: Analyze publication trends, license adoption, and access patterns
- Research metadata enrichment: Enrich reference lists with full Zenodo metadata
Frequently Asked Questions
Do I need a Zenodo account or API key? No. Zenodo's public API is free and requires no authentication.
How many records can I scrape? Up to 1,000 records per run. For larger datasets, Zenodo provides OAI-PMH and full data dumps at zenodo.org/oai2d.
What does the open access right mean?
Open access records have files freely available to download. Closed and restricted records may only return metadata.
Can I search by author name?
Yes — use the query field with Elasticsearch syntax: author:"Smith, John" or creators.name:Smith.
Are file download URLs included?
Not directly. Use the zenodoId to construct the API URL: https://zenodo.org/api/records/{zenodoId}/files.
How fresh is the data? Zenodo's API serves live data. Records scraped reflect the current state of the repository.
What communities are available? Visit zenodo.org/communities to browse thousands of communities across all scientific disciplines.