Pricing

from $27.60 / 1,000 results

Europe PMC Literature Scraper

Scrape Europe PMC for biomedical research papers. Search by title, author, MeSH terms, journal. Get DOI, abstract, full-text URLs, citations, references, open-access status. No API key required.

Pricing

from $27.60 / 1,000 results

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

21 hours ago

Last modified

🧬 Europe PMC Literature Scraper

🚀 Export the biomedical literature index in seconds. Search 40+ million records across PubMed, PubMed Central, life-science preprints, agricultural literature, and patents. Filter by title, author, MeSH term, DOI, journal, open access, or free-text. No API key, no registration.

The Europe PMC Literature Scraper wraps the official Europe PMC REST API (ebi.ac.uk/europepmc/webservices/rest/search) and returns one row per article with 40+ fields, including DOI, PMID, PMCID, abstract, full-text URLs, MeSH terms, keywords, journal, citation count, open-access status, and licensing. The underlying corpus is published by Europe PMC, the European mirror of PubMed Central, maintained by EMBL-EBI and funded by 32 life-science research funders worldwide.

The index covers MEDLINE/PubMed, PubMed Central (full text), Agricola (USDA agricultural literature), bioRxiv and medRxiv preprints, CTX patents, and Europe PMC-curated content. Free-text and field-qualified queries (TITLE, AUTH, MESH, DOI, PMID, AFFILIATION, JOURNAL) compose freely with boolean operators. This Actor returns structured records ready to download as CSV, Excel, JSON, or XML.

🎯 Target Audience	💡 Primary Use Cases
Biomedical researchers, systematic-review teams, bibliometrics analysts, pharma intelligence, scientific publishers, science journalists, OA advocacy, ML training pipelines	Literature reviews, MeSH-term mining, author publication tracking, journal impact studies, drug-target evidence harvesting, training-set assembly

📋 What the Europe PMC Scraper does

One programmable interface to the full Europe PMC search service:

🔍 Field-qualified queries. TITLE:, AUTH:, AFFILIATION:, JOURNAL:, MESH:, DOI:, PMID:, PMCID:, OPEN_ACCESS:, plus boolean operators (AND, OR, NOT) and quoted phrases.
📚 Three response shapes. core returns the full record with abstract, full-text URLs, and metadata. lite returns compact fields. idlist returns IDs only for ultra-fast scans.
⏱️ Sort options. Relevance (default), newest first, oldest first, or most cited.
🔁 Cursor-mark pagination. Fully automatic. Walks the entire result set efficiently for large queries.

Output captures the publication metadata (PMID, PMCID, DOI, source, journal title, ISSN, volume, issue, page info, publication year and date), full author list, abstract text, affiliation, language, publication types, MeSH headings, keywords, grant count, citation count, full-text URLs, license, open-access flag, and indexing dates.

💡 Why it matters: Europe PMC is the deepest open-access biomedical literature index in the world. The web UI is great for one-off lookups, but systematic reviews, bibliometric studies, and ML training-set assembly need flat rows. This Actor turns the search service into a downloadable dataset in one run.

📊 Data fields

Each record includes: abstractText, affiliation, authorList, authorString, citedByCount, dateOfRevision, doi, firstIndexDate, firstPublicationDate, fullTextUrls, grantsCount, hasBook, hasDbCrossReferences, hasPDF, hasReferences, hasSuppl, hasTextMinedTerms, id, inEPMC, inPMC, isOpenAccess, issue, journalIssn, journalTitle, journalVolume, keywords, language, license, meshTerms, pageInfo, pmcid, pmid, pubDate, pubYear, publicationStatus, publicationTypes, scrapedAt, source, title, url. All 40 field names come from a real production run, so what you see here is what lands in your dataset.

🚀 How to use

📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
🌐 Open the Actor. Go to the Europe PMC Literature Scraper page on the Apify Store.
🔍 Build a query. Free-text or use field qualifiers (AUTH:"Doudna J" AND CRISPR).
📚 Pick a response shape. core for full metadata, lite for compact, idlist for IDs only.
🚀 Run it. Click Start and let the Actor collect your data.
📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.

🔗 Recommended Actors

🤖 Hugging Face Model Scraper - ML model registry metadata
🇪🇺 Eurostat Statistics Scraper - 7,500+ Eurostat datasets
📊 ClinicalTrials.gov Scraper - Clinical trial registry
📚 Figshare Research Output Scraper - Open research datasets
🔬 OSF Open Science Framework Scraper - Open-science project metadata

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.

⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Europe PMC, EMBL-EBI, the European Bioinformatics Institute, the National Center for Biotechnology Information, or any of the 32 funders supporting Europe PMC. All trademarks mentioned are the property of their respective owners. Only publicly available open data from the official Europe PMC REST API is collected.

🆘 Need Help?

If you hit a bug, have questions about setup, or need a scraper we haven't built yet, open our contact form or write to parseforge@protonmail.com. We also take on paid custom data projects.

For faster answers, join our Discord. It's the best place to get support and suggest new actors.

Europe PMC Papers Scraper - Biomedical Literature Data

benthepythondev/europepmc-papers-scraper

Scrape Europe PMC paper search results: titles, authors, abstracts, journals, citations, DOI and PubMed IDs.

Ben

Europe PMC Biomedical Papers

scrupulous_waterbird_m4w/europe-pmc-papers

Search Europe PMC biomedical literature and return structured papers with abstracts, authors, identifiers, citations, full-text availability, journals, grants, and publication dates. No API key or proxy required.

Mori

Europe PMC Scientific Literature Scraper

parseforge/europe-pmc-scraper

Query Europe PMC with the full TITLE, AUTH, JOURNAL, and DOI syntax. Returns PMID, DOI, title, authors, abstract, journal, publication year, citation count, open access flag, and source. Useful for systematic reviews, literature mining, and biomedical research workflows.

ParseForge

Europe PMC Scraper

crawlergang/europe-pmc-scraper

Scrape Europe PMC, 42M+ biomedical literature records including PubMed, PubMed Central, patents, and preprints. Search publications, get article details by PMID or DOI, and retrieve citation/reference lists.

Crawler Gang

5.0

Europe PMC Scraper

crawlerbros/europe-pmc-scraper

Crawler Bros

Europe PMC Articles Scraper

parseforge/europe-pmc-articles-scraper

Search Europe PMC across millions of life sciences articles with any free text query. Returns PMID, PMCID, DOI, title, authors, journal, year, and abstract snippet. Useful for systematic reviews, citation harvesting, drug target evidence collection, and literature monitoring.

ParseForge

PubMed Scraper: Biomedical Articles & MeSH

themineworks/pubmed-ncbi-scraper

Scrape 36M+ PubMed/NCBI biomedical articles: title, abstract, authors, journal, PMID, DOI, MeSH terms. No API key needed. Build literature reviews & AI training corpora. Works in Claude, ChatGPT & any MCP agent.

The Mine Works

Europe PMC — Biomedical Knowledge Graph & Literature Mining

ryanclinton/europe-pmc-search

Turn a biomedical topic into a knowledge graph and evidence corpus from Europe PMC. Mines genes, diseases, chemicals, organisms and deposited datasets (GEO, ENA, PDB) from full text, builds entity co-occurrence networks, tracks emerging entities, and exports Neo4j/Gephi CSV. No API key.

Ryan Clinton

PubMed Biomedical Literature Scraper

meticulous_sweetwilliam/pubmed-biomedical-literature

Query PubMed via NCBI API for biomedical papers. Extract title, authors, abstract, MeSH terms, DOI, PMID. For pharma R&D, biotech, medical AI pipelines, and systematic reviews.

Leo

PubMed Scraper — Abstracts, Authors & MeSH Terms

logiover/pubmed-scraper

Scrape PubMed by keyword query or direct PMIDs. Extract title, abstract, authors, journal, DOI, MeSH terms, keywords, and publication date via NCBI E-utilities. No API key required.