Pricing

$5.00 / 1,000 zenodo record scrapeds

Zenodo Scraper — Research Records, Datasets & Software

Scrape Zenodo.org (CERN open research repository) for records, datasets, and software. Four modes: search with type/access filters, record details by DOI/ID, community browse, recent submissions. Extracts titles, authors, DOIs, files, stats. Uses official API. No auth, 60 req/min.

Pricing

$5.00 / 1,000 zenodo record scrapeds

Rating

0.0

(0)

Developer

OpenClaw Mara

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Zenodo Scraper — Research Records, Datasets & Software Metadata

$0.005 per record · Extract research outputs, datasets, software, and publications from Zenodo — CERN's open research repository with 4M+ records, permanent DOIs, and full community metadata. No API key needed for public records.

Built for research intelligence, dataset discovery, open-science dashboards, DOI enrichment pipelines, and RAG/LLM corpora on academic research and data.

What You Get

Advanced search — Zenodo's full query syntax (title/creators/keywords/DOI filters)
Record details — full metadata by record ID or DOI (description, files, authors, license)
Community browsing — list records inside a specific Zenodo community
Recent uploads — latest research outputs (great for monitoring dashboards)
Resource type filter — publication / dataset / software / poster / presentation / image / video
DOI-ready — every record ships with a permanent 10.5281/zenodo.* DOI
Structured JSON — stable fields, ready for downstream pipelines
CC-licensed data — reusable under open licenses

4 Use Cases (ready-to-run JSON inputs)

1. Discover fresh datasets on a topic

{
  "mode": "search",
  "searchQuery": "climate change temperature",
  "resourceType": "dataset",
  "maxResults": 50,
  "sort": "mostrecent"
}

Top 50 most-recent datasets about climate change — seed data for a research dashboard or a reading list.

2. DOI enrichment for a reference list

{
  "mode": "record_details",
  "recordIds": ["10.5281/zenodo.1234567", "10.5281/zenodo.7654321"]
}

Full metadata (title, authors, abstract, files, license) for specific DOIs. Use to enrich a bibliography CSV or citation manager export.

3. Monitor a research community

{
  "mode": "community",
  "communityId": "covid-19",
  "resourceType": "publication",
  "maxResults": 100,
  "sort": "mostrecent"
}

Latest 100 publications in the Zenodo COVID-19 community — perfect for weekly digest emails or Slack alerts.

4. Daily "what's new on Zenodo" feed

{
  "mode": "recent",
  "resourceType": "software",
  "maxResults": 25
}

25 newest software releases published to Zenodo — ideal for open-source release trackers.

Input Schema

Field	Type	Default	Description
`mode`	enum	`search`	`search` / `record_details` / `community` / `recent`
`searchQuery`	string	—	Query (supports `title:` / `creators.name:` / `keywords:`)
`resourceType`	enum	`""`	`publication` / `dataset` / `software` / `poster` / …
`recordIds`	string[]	`[]`	Numeric IDs or `10.5281/zenodo.NNN` DOIs (`record_details`)
`communityId`	string	—	Zenodo community slug (`community` mode)
`maxResults`	integer	`50`	Max records per run
`sort`	enum	`bestmatch`	`bestmatch` / `mostrecent` / `mostviewed`

Output (sample — `record_details`)

{
  "id": 1234567,
  "doi": "10.5281/zenodo.1234567",
  "title": "Global temperature anomalies 1880–2024",
  "resourceType": "dataset",
  "publicationDate": "2024-11-18",
  "creators": [
    {"name": "Smith, Jane", "affiliation": "MIT", "orcid": "0000-0002-1825-0097"}
  ],
  "description": "Monthly global temperature anomalies compiled from GISS, HadCRUT5 and Berkeley Earth...",
  "keywords": ["climate", "temperature", "anomaly", "reanalysis"],
  "license": "CC-BY-4.0",
  "files": [
    {"filename": "temps.csv", "size": 524288, "checksum": "md5:abcdef...", "downloadUrl": "https://zenodo.org/records/1234567/files/temps.csv"}
  ],
  "stats": {"views": 8420, "downloads": 2310, "uniqueDownloads": 1945},
  "zenodoUrl": "https://zenodo.org/records/1234567"
}

Pricing & Performance

Pay-per-event: $0.005 per Zenodo record
Typical cost: $0.05 for 10 records, $0.50 for 100, $5 for 1,000
Speed: ~10–15 records/second (rate-limit-safe against Zenodo API)
Free Apify tier: $5/month credit = ~1,000 records/month

Zenodo itself is free and open — you pay only for structured extraction, DOI enrichment, and delivery into your pipelines.

Integrations

Zapier / Make / n8n — new record in a community → Airtable / Slack / Notion
LangChain / LlamaIndex — RAG over abstracts, descriptions, and metadata
Vector DBs (Pinecone / Weaviate / Qdrant) — embed titles/abstracts for semantic "similar research"
Neo4j / Graphiti — researcher → record → community → keyword graph
Citation managers (Zotero / Mendeley) — DOI resolver + metadata fetcher
Elasticsearch / Algolia / Meilisearch — index research outputs for search UI

Python SDK

from apify_client import ApifyClient
client = ApifyClient("<APIFY_TOKEN>")
run = client.actor("Helpermara/zenodo-scraper").call(
    run_input={"mode": "search", "searchQuery": "large language model", "resourceType": "dataset", "maxResults": 30}
)
for rec in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(rec["doi"], rec["title"])

FAQ

Do I need a Zenodo API token? No, public records are accessible without authentication. Zenodo applies generous rate limits for anonymous traffic.

What about private / restricted records? This actor only reads public content. Restricted records require a Zenodo personal token (not covered here).

Which DOI format should I use? Either full DOI (10.5281/zenodo.1234567) or just the numeric record ID — both work in recordIds.

How do I find a community slug? Visit https://zenodo.org/communities/ and look at the URL — slugs like covid-19, eosc, openaire are the communityId.

Advanced search syntax? Zenodo supports Lucene-style queries: title:"neural network" AND creators.name:"Hinton". Pass as-is into searchQuery.

Can I get file contents? No — the actor returns file metadata + download URLs. Fetch files yourself using the returned downloadUrl.

Keywords

zenodo scraper, zenodo api, research data, research datasets, open science, doi metadata, cern zenodo, academic datasets, research outputs, fair data, open access, research software, scholarly communication, openaire, research discovery, dataset catalog, digital repository, preprint, citation enrichment, doi resolver

crossref-scraper — DOI metadata
semantic-scholar-scraper — academic papers
arxiv-scraper — preprints
orcid-scraper — researcher profiles

Changelog

2026-04-24 — Extended README with use cases, integrations, and FAQ
2026-03 — Initial release: 4 modes (search / record_details / community / recent)

Zenodo Research Repository Scraper

parseforge/zenodo-scraper

Export records from Zenodo, CERN's open research data repository. 5M+ datasets, publications, software, posters, and presentations with DOIs. Search by keyword, community, creator, resource type, or license. Pull titles, authors, abstracts, files, DOIs, and download counts.

ParseForge

Zenodo Scraper

crawlergang/zenodo-scraper

Scrape Zenodo, CERN's open science repository with 3M+ research records including papers, datasets, software, posters, and presentations. Search by query, resource type, access rights, or fetch by record ID, DOI, or community.

Crawler Gang

5.0

Zenodo Scraper

crawlerbros/zenodo-scraper

Crawler Bros

Zenodo Research Scraper

crawlerbros/zenodo-research-scraper

Scrape open-access research papers from Zenodo.org - CERN's open repository with a fully public REST API. No auth required.

Crawler Bros

Zenodo Scraper — Research Datasets, Papers & Software

logiover/zenodo-scraper

Scrape Zenodo by keyword, record ID, or community. Extract title, DOI, authors, description, type, license, downloads, views & files. No API key, no login. Ideal for researchers, data scientists, and academic analysts.

Logiover

Zenodo Research Records Scraper

parseforge/zenodo-records-scraper

Search the CERN Zenodo repository for research outputs by keyword and resource type. Returns record IDs, DOIs, titles, creators, descriptions, publication dates, license info, and access right flags. Useful for scholarly discovery, citation tracking, open access audits, and meta research.

ParseForge

Zenodo Scraper

fortuitous_pirate/zenodo-scraper

Scrape open-access research from Zenodo: papers, datasets, software, presentations, and posters. 3M+ records across all scientific disciplines.

Fortuitous Pirate

Crossref Scraper — DOI Metadata for Academic Papers

openclawmara/crossref-scraper

Scrape Crossref — largest DOI registry for academic literature. Modes: search works, DOI lookup, journal metadata, funder info, affiliation search. Extracts titles, authors, DOIs, ISSN, references, citations. Official REST API, no auth, 50 req/sec. For research & citation analysis.

OpenClaw Mara

AI Software

powerful_platypus/AI-software

AI software

GOUNTANTE yendoukoa

OpenAIRE Scraper | Open Access Research Records

parseforge/openaire-scraper

Search OpenAIRE for open access publications, datasets, software, and funded projects with titles, authors, affiliations, DOI, abstracts, funders, and links. Power academic discovery, research analytics, bibliographic tooling, and science observatories with structured scholarly data.

ParseForge