PubMed Biomedical Paper Scraper

Pricing

from $30.00 / 1,000 results

Scrapes PubMed biomedical papers using the official NCBI Entrez API. Extracts full metadata including abstracts, MeSH terms, authors with affiliations, citations, grants, and more. Includes smart analytics for author networks, topic trends, and geographic distribution.

Developer

Yuliia Kulakova

Maintained by Community

PubMed Scraper — Biomedical Research, Abstracts & Citations

Search and extract biomedical research papers from PubMed — the world's largest database of life sciences and biomedical literature. Get full metadata, abstracts, MeSH terms, citations, author networks, and smart analytics. Powered by the official NCBI Entrez API.

What can this scraper do?

  • Search by any query — use simple keywords or advanced PubMed syntax with MeSH terms and field tags
  • Full metadata extraction — authors with affiliations and ORCID, journal info, DOI, publication types, grants, chemicals
  • Abstracts — complete structured abstracts for every article
  • MeSH terms — Medical Subject Headings with major topic indicators and qualifiers
  • Citation data — cited-by counts and reference PMIDs
  • Smart analytics — author networks, topic trends by year, top journals, funding agencies, geographic distribution
  • Multiple databases — PubMed, PubMed Central (PMC), Gene, Protein
  • No API key required — works out of the box, optional API key for faster rate limits

Why use PubMed Scraper?

Use case | How it helps
Literature review | Collect hundreds of papers on any topic with full abstracts and metadata
Competitive intelligence | Track research output from specific institutions or authors
Pharma & biotech research | Monitor clinical trials, drug targets, and therapeutic areas
Grant landscape analysis | Identify top funding agencies and grant patterns in your field
Bibliometric analysis | Analyze publication trends, co-authorship networks, and journal distributions
Academic research | Build structured datasets for systematic reviews and meta-analyses
Trend monitoring | Track emerging topics through MeSH term trends over time

Output data

Article data

Each article includes all available metadata:

Field | Description
PMID & PMCID | PubMed and PubMed Central identifiers
Title | Article title
Authors | Full list with last name, first name, affiliations, and ORCID
Abstract | Complete abstract text
Journal | Name, abbreviation, ISSN, volume, issue, pages
Publication date | ISO format date
DOI | Digital Object Identifier
Publication types | Journal Article, Review, Clinical Trial, Meta-Analysis, etc.
MeSH terms | Descriptors with qualifiers and major topic flags
Keywords | Author-provided keywords
Grant info | Grant IDs, agencies, and countries
Chemicals | Registry numbers and substance names
Full text URL | PMC link for open access articles
Cited-by count | Number of citing articles (when enabled)
References | First 10 reference PMIDs (when enabled)
Language & country | Publication language and journal country
Clinical trial ID | Associated trial registry number
Retraction status | Whether the article has been retracted
Conflict of interest | Author disclosures

Smart analytics

Every run can include an automatic analytics summary:

Analytics | What you get
Top authors | Most prolific authors with collaborator counts
Author networks | Co-authorship connections and top collaborators
Topic trends | MeSH major topics by year
Journal distribution | Top journals by article count
Funding agencies | Top grant sources
Geographic distribution | Articles by country
Publication types | Breakdown by article type
Open access rate | Percentage of open access articles
Publications by year | Yearly publication volume
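
The Limitations section notes that the analytics summary arrives as the last record in the dataset, marked with type "analytics". The sketch below shows one way to separate it from the article records once the dataset has been loaded as a list of dictionaries. It assumes the marker is stored under a key named type. The helper split_analytics and the sample keys (pmid, topJournals) are hypothetical and only illustrate the shape of the data.

```python
from typing import Optional


def split_analytics(items: list) -> tuple:
    """Return (article_records, analytics_record_or_None)."""
    # Assumption: the summary record carries {"type": "analytics"}.
    articles = [item for item in items if item.get("type") != "analytics"]
    analytics: Optional[dict] = next(
        (item for item in reversed(items) if item.get("type") == "analytics"),
        None,
    )
    return articles, analytics


# Made-up sample data; real records contain many more fields.
sample = [
    {"pmid": "111"},
    {"pmid": "222"},
    {"type": "analytics", "topJournals": []},
]
articles, analytics = split_analytics(sample)
print(len(articles), "articles; analytics present:", analytics is not None)
```

If analytics are disabled for a run, the second return value is simply None.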

Filters & options

Filter | Options
Date range | From/to dates (YYYY/MM/DD format)
Sort by | Relevance, Publication date, Most cited
Species | All, Human only, Animal only
Language | Filter by publication language (eng, fre, ger, etc.)
Publication type | Journal Article, Review, Clinical Trial, Meta-Analysis, etc.
Database | PubMed, PMC, Gene, Protein
Include abstract | Full abstract text (on by default)
Include MeSH | Medical Subject Headings (on by default)
Include citations | Cited-by counts and references
Include full text links | PMC open access URLs
Include analytics | Smart analytics summary

How much does it cost?

PubMed Scraper uses a pay-per-result model at $0.03 per result. No monthly fees — you only pay for the data you collect.

Results | Cost
100 articles | $3.00
500 articles | $15.00
1,000 articles | $30.00
10,000 articles | $300.00

Platform costs are included in the per-result price.
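
Because pricing is strictly per result, a run's cost can be estimated before launching it. The sketch below assumes the $0.03 rate stated above. The helper estimate_cost is hypothetical and not part of the actor.

```python
from decimal import Decimal

PRICE_PER_RESULT = Decimal("0.03")  # USD per result, as stated above


def estimate_cost(num_results: int) -> Decimal:
    """Return the estimated run cost in USD, rounded to cents."""
    if num_results < 0:
        raise ValueError("num_results must be non-negative")
    return (PRICE_PER_RESULT * num_results).quantize(Decimal("0.01"))


for n in (100, 500, 1_000, 10_000):
    print(f"{n:>6} articles: ${estimate_cost(n)}")
```

Decimal is used instead of float so that the totals match the table exactly.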

Input examples

Basic keyword search

{
  "queries": ["breast cancer immunotherapy"],
  "maxResults": 100,
  "includeAbstract": true,
  "includeMeshTerms": true,
  "includeAnalytics": true
}

Advanced PubMed syntax with date filter

{
  "queries": ["COVID-19[MeSH] AND vaccine[tiab]"],
  "maxResults": 200,
  "dateFrom": "2024/01/01",
  "dateTo": "2026/05/18",
  "sortBy": "pub_date",
  "species": "human",
  "language": ["eng"]
}

Multi-query research with citations

{
  "queries": ["CRISPR gene therapy", "base editing clinical trials"],
  "maxResults": 50,
  "includeCitations": true,
  "includeFullTextLinks": true,
  "includeAnalytics": true
}

Systematic review — specific publication types

{
  "queries": ["diabetes type 2 metformin"],
  "maxResults": 500,
  "pubTypes": ["Clinical Trial", "Randomized Controlled Trial"],
  "species": "human",
  "sortBy": "cited_by_count",
  "dateFrom": "2020/01/01"
}

PubMed Central (PMC) search

{
  "queries": ["machine learning drug discovery"],
  "database": "pmc",
  "maxResults": 100,
  "includeFullTextLinks": true,
  "includeAnalytics": true
}
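
Inputs like the ones above can also be assembled in code, which helps catch malformed dates before a paid run starts. The sketch below uses only field names that appear in the examples (queries, maxResults, dateFrom, dateTo, sortBy) and the YYYY/MM/DD format from the filters table. The helper build_input is hypothetical, and the actor may apply its own validation rules.

```python
import json
from datetime import datetime
from typing import Optional


def build_input(
    queries: list,
    max_results: int = 100,
    date_from: Optional[str] = None,
    date_to: Optional[str] = None,
    **extra,
) -> dict:
    """Assemble an input object using field names from the examples above."""
    if not queries:
        raise ValueError("at least one query is required")
    run_input = {"queries": list(queries), "maxResults": max_results}
    parsed = {}
    for key, value in (("dateFrom", date_from), ("dateTo", date_to)):
        if value is not None:
            # strptime raises ValueError unless the value parses as YYYY/MM/DD.
            parsed[key] = datetime.strptime(value, "%Y/%m/%d")
            run_input[key] = parsed[key].strftime("%Y/%m/%d")
    if len(parsed) == 2 and parsed["dateFrom"] > parsed["dateTo"]:
        raise ValueError("dateFrom must not be later than dateTo")
    run_input.update(extra)  # pass-through options, e.g. sortBy="pub_date"
    return run_input


print(json.dumps(build_input(
    ["COVID-19[MeSH] AND vaccine[tiab]"],
    max_results=200,
    date_from="2024/01/01",
    sortBy="pub_date",
), indent=2))
```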

Tips for best results

  1. Start small: test with 10-20 results to verify your query before scaling up
  2. Use PubMed syntax: field tags like [MeSH], [tiab], [au] give precise control over search results
  3. Get an API key: a free NCBI API key raises the rate limit from 3 to 10 requests/second
  4. Use citations sparingly: enable includeCitations only when needed, because it requires extra API calls
  5. Species filter: use human for clinical research, animal for preclinical studies
  6. Sort by citations: cited_by_count helps find the most impactful papers in your field
  7. Combine queries: use multiple queries to cover different aspects of your research topic
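
The field-tag syntax from tip 2 can be composed programmatically when queries are generated from lists of terms. The sketch below builds query strings with the [MeSH] and [tiab] tags mentioned above. The helpers tagged, any_of, and all_of are hypothetical names introduced for illustration.

```python
def tagged(term: str, tag: str) -> str:
    """Attach a PubMed field tag, quoting multi-word terms as a phrase."""
    term = term.strip()
    if " " in term:
        term = f'"{term}"'
    return f"{term}[{tag}]"


def any_of(*clauses: str) -> str:
    """Join clauses with OR, parenthesised so they group correctly."""
    if len(clauses) == 1:
        return clauses[0]
    return "(" + " OR ".join(clauses) + ")"


def all_of(*clauses: str) -> str:
    """Join clauses with AND."""
    return " AND ".join(clauses)


query = all_of(
    tagged("COVID-19", "MeSH"),
    any_of(tagged("vaccine", "tiab"), tagged("immunization", "tiab")),
)
print(query)  # COVID-19[MeSH] AND (vaccine[tiab] OR immunization[tiab])
```

The resulting string can be placed directly in the queries array of the input.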

Limitations

  • PubMed API returns up to 9,999 results per query (NCBI hard cap). For larger datasets, split into narrower date windows
  • Citation counts are approximate and may not reflect the latest data
  • Full text is only available for open access articles in PMC
  • Rate limits: 3 requests/second without API key, 10 requests/second with key
  • Analytics summary is included as the last record in the dataset (field type: "analytics")
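
The first limitation suggests splitting large searches into narrower date windows. The sketch below generates consecutive, non-overlapping windows in the YYYY/MM/DD format and attaches them to a base input. It assumes dateFrom and dateTo are both inclusive, which the source does not state, and the 90-day window size is an arbitrary starting point to tune until each window stays under the cap. The helper date_windows is hypothetical.

```python
from datetime import date, timedelta


def date_windows(start: date, end: date, days: int):
    """Yield (dateFrom, dateTo) strings covering start..end without overlap."""
    if days < 1:
        raise ValueError("days must be at least 1")
    if start > end:
        raise ValueError("start must not be after end")
    cursor = start
    while cursor <= end:
        # Assumption: both bounds are inclusive, so a window spans `days` days.
        window_end = min(cursor + timedelta(days=days - 1), end)
        yield cursor.strftime("%Y/%m/%d"), window_end.strftime("%Y/%m/%d")
        cursor = window_end + timedelta(days=1)


base_input = {"queries": ["diabetes type 2 metformin"]}
inputs = [
    {**base_input, "dateFrom": d_from, "dateTo": d_to}
    for d_from, d_to in date_windows(date(2020, 1, 1), date(2020, 12, 31), days=90)
]
for run_input in inputs:
    print(run_input["dateFrom"], "to", run_input["dateTo"])
```

Each generated input can then be run separately and the resulting datasets merged, deduplicating on PMID if the date bounds turn out to overlap.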