Pricing

$2.00 / 1,000 records

PubMed Scraper: Biomedical Articles & MeSH

Scrape 36M+ PubMed/NCBI biomedical articles: title, abstract, authors, journal, PMID, DOI, MeSH terms. No API key needed. Build literature reviews & AI training corpora. Works in Claude, ChatGPT & any MCP agent.

Pricing

$2.00 / 1,000 records

Rating

0.0

(0)

Developer

The Mine Works

Actor stats

Bookmarked

Total users

Monthly active users

4 days ago

Last modified

🧬 PubMed NCBI Scraper: Biomedical Articles & MeSH (No Key)

Overview

PubMed NCBI Scraper pulls biomedical literature from the NCBI PubMed database, which indexes 36M+ articles across medicine, life sciences, and healthcare. Search by keyword or by PubMed field tags ([ti] title, [au] author, [ta] journal, [mh] MeSH) with optional publication date range. Get back PMID, DOI, title, abstract, authors, journal, ISSN, MeSH terms, and publication year for every matching article. No API key required, with an optional free NCBI key that raises the rate limit.

It's the fastest way to build a biomedical literature corpus, feed a clinical RAG system, or run automated literature reviews.

Reliability posture: blocked, empty, or failed searches are never charged. You only pay for an article record that was actually delivered.

✅ No API key required | ✅ Full abstracts + MeSH | ✅ 36M+ articles | ✅ MCP-ready for AI agents

Features

Field-tagged search. PubMed's native tag syntax ([ti], [au], [ta], [mh]) supported end to end. Date range. Filter by publication date (YYYY/MM/DD). Full abstracts. Complete abstract text on every record. MeSH terms. Standardised biomedical vocabulary tags for every article. Optional NCBI key. Free key lifts the rate limit from 3 to 10 requests per second.

How it works

The actor calls the official NCBI E-utilities API (ESearch, EFetch, ESummary), which is the source PubMed itself is built on. Your query is submitted via ESearch to get the matching PMIDs, then EFetch pulls the full XML record for each PMID and normalises it into a flat JSON row.

Runs work without any credentials against NCBI's public rate limit (3 requests per second). Supplying a free NCBI API key in ncbiApiKey lifts the limit to 10 requests per second, which makes large runs meaningfully faster.

🧾 Input configuration

{
  "query": "GLP-1 receptor agonist[ti] AND diabetes[mh]",
  "dateFrom": "2020/01/01",
  "dateTo": "2026/06/30",
  "maxResults": 500
}

📤 Output format

{
  "pmid": "42446258",
  "title": "Evaluation of insulin, leptin, ghrelin, and adiponectin levels in type 2 diabetic patients receiving combined metformin-sulfonylurea therapy.",
  "abstract": "Type 2 diabetes mellitus (T2DM) is a chronic metabolic disorder, which is commonly related to reduced insulin responsiveness and altered appetite-related hormones. In this study, we have evaluated how combined metformin and sulfonylurea therapy affects glycemic indicators and appetite hormones, with special focus on hormone ratios...",
  "authors": ["S M Hussein", "N K Zaidan"],
  "journal": "Biomeditsinskaia khimiia",
  "issn": "2310-6972",
  "year": "2026",
  "doi": "10.18097/PBMCE0047",
  "mesh_terms": [
    "Humans",
    "Diabetes Mellitus, Type 2",
    "Metformin",
    "Adiponectin",
    "Leptin",
    "Ghrelin",
    "Insulin",
    "Female",
    "Male",
    "Middle Aged",
    "Sulfonylurea Compounds",
    "Insulin Resistance",
    "Hypoglycemic Agents",
    "Blood Glucose",
    "Drug Therapy, Combination",
    "Adult",
    "Glycated Hemoglobin"
  ],
  "url": "https://pubmed.ncbi.nlm.nih.gov/42446258/",
  "scraped_at": "2026-07-15T04:22:06.073Z"
}

This is a genuine article record from a live run against the real PubMed database. Note: the actor's field is url, not article_url, and there is no separate publication_date field; year is the field actually returned.

Every article record contains these fields:

Field	Description
🆔 `pmid`	PubMed identifier
📄 `title`	Article title
📝 `abstract`	Full abstract text
👥 `authors`	Array of author names
📰 `journal`	Journal name
🔢 `issn`	Journal ISSN
📅 `year`	Publication year
🔗 `doi`	Digital Object Identifier
🏷️ `mesh_terms`	Array of MeSH (Medical Subject Headings) terms
🌐 `url`	Canonical PubMed URL
🕒 `scraped_at`	ISO timestamp of when the record was captured

💼 Common use cases

Systematic reviews & meta-analyses Pull every article on a topic in a date range and load into a review-management tool. Filter by MeSH term for reproducible search strategies.

Clinical RAG & AI assistants Feed biomedical abstracts into a retrieval-augmented generation system for a clinical decision-support tool. Build a specialty-specific corpus (cardiology, oncology, endocrinology) for an AI agent.

Pharma competitive intel Track publications on a molecule or mechanism across time. Monitor a competitor's key opinion leaders by author search.

Grant & academic research support Build reference lists for a grant application or a paper's introduction. Monitor a lab's or institution's publication output.

🚀 Getting started

Open the actor and enter a PubMed query. Use tags for precision (GLP-1 receptor agonist[ti] AND diabetes[mh]).
Optionally set dateFrom and dateTo (YYYY/MM/DD).
Set maxResults to control cost.
Optionally paste a free NCBI key in ncbiApiKey for higher rate limits.
Click Start. Records stream to the dataset as pages parse.

FAQ

Do I need an NCBI API key? No. Runs work keyless at NCBI's public rate limit. A free NCBI key (from ncbi.nlm.nih.gov/account/settings/) lifts the rate limit and makes large runs significantly faster.

What PubMed field tags are supported? All of them. Use [ti] title, [au] author, [ta] journal abbreviation, [mh] MeSH, [dp] publication date, and any others documented by PubMed. Combine with AND, OR, NOT and parentheses.

How much does it cost? Pay per article returned, pay as you go. No subscription, no monthly minimum.

Can I use it in an AI agent? Yes. It's exposed as an MCP tool. See below.

Use in Claude, ChatGPT & any MCP agent

https://mcp.apify.com/?tools=themineworks/pubmed-ncbi-scraper

Or call it programmatically with the Apify client:

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_APIFY_TOKEN' });

const run = await client.actor('themineworks/pubmed-ncbi-scraper').call({
  query: 'GLP-1 receptor agonist[ti] AND diabetes[mh]',
  dateFrom: '2023/01/01',
  maxResults: 100,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

🛠️ Complete your biomedical research pipeline

Got the articles. Now widen the corpus:

arXiv Paper Scraper: pull preprints on the same topic before they hit a journal.
Crossref Scholarly Metadata: pull citation counts and journal metadata for any DOI.
FDA Recalls Scraper: cross-reference drug or device safety history.

Typical flow: pubmed pulls the peer-reviewed evidence, arxiv adds preprints, fda-recalls checks safety signals.

Questions or need a custom field set? Reach out through the Apify profile.

PubMed Biomedical Literature Scraper

meticulous_sweetwilliam/pubmed-biomedical-literature

Query PubMed via NCBI API for biomedical papers. Extract title, authors, abstract, MeSH terms, DOI, PMID. For pharma R&D, biotech, medical AI pipelines, and systematic reviews.

Leo

PubMed Search Scraper

crawlerbros/pubmed-search-scraper

Search PubMed (NCBI E-utilities) for biomedical articles by keyword, date range, and article type. Returns title, authors, journal, abstract, DOI, MeSH terms, keywords, and citation. Free public API, no proxy, no cookies. Optional NCBI API key for higher rate limits.

Crawler Bros

PubMed Scraper

lulzasaur/pubmed-scraper

Search and scrape PubMed biomedical literature via NCBI E-utilities. Get titles, authors, abstracts, journals, MeSH terms, DOIs. Search by keyword or fetch by PMID.

lulz bot

PubMed Scraper — Abstracts, Authors & MeSH Terms

logiover/pubmed-scraper

Scrape PubMed by keyword query or direct PMIDs. Extract title, abstract, authors, journal, DOI, MeSH terms, keywords, and publication date via NCBI E-utilities. No API key required.

Logiover

PubMed Scraper — Papers, DOI & MeSH to JSON

devilscrapes/pubmed-papers-scraper

Search PubMed by query and export structured paper rows — title, authors, abstract, journal, DOI, PMID, MeSH terms, publication date — to JSON or CSV. A clean PubMed API wrapper that handles NCBI pagination, rate limits, and retries for research and ML pipelines.

DevilScrapes

PubMed Biomedical Article Search

fit_melon/pubmed-biomedical-article-search

Search PubMed's 35M+ biomedical citations by keyword: title, authors, journal, year, DOI, PMID and abstract for each match. Official NCBI E-utilities API. Free — you only pay Apify usage.

D N

🧬 PubMed Scraper - Biomedical Literature & Citations

benthepythondev/pubmed-scraper

PubMed Scraper for the official NCBI PubMed API. Search 37M+ biomedical citations; extract title, authors, journal, publication date, DOI, PMID, article type and links. Supports PubMed field tags and sorting. For systematic reviews, medical research and bibliometrics. Keyless and fast.

Ben

PubMed Articles Scraper

scrapers_lat/pubmed-scraper

Scrape biomedical articles with title, authors, journal, publication date, DOI and a direct link. Search by keyword. Export to JSON, CSV or Excel.

Scrapers Lat

PubMed Scraper — Biomedical Research Papers

du7chmaniac/pubmed-scraper

Scrape biomedical research papers from PubMed via the NCBI E-utilities API. Search by keyword with optional date range, retrieve article metadata, abstracts, authors, MeSH terms, and DOIs. Supports both summary (fast) and full (with abstract) retrieval modes.

Joren Maurissen