# PubMed Scraper - Low-cost💲🔥📚🔬 (`delectable_incubator/pubmed-scraper-low-cost`) Actor

📚🔍 Extract academic articles from PubMed by keyword, query, or search URL. Collect publication titles, authors, abstracts, journals, publication dates, PMIDs, citations, and article URLs. Ideal for medical research, literature reviews, scientific intelligence, and academic reference gathering 📊⚡

- **URL**: https://apify.com/delectable_incubator/pubmed-scraper-low-cost.md
- **Developed by:** [Prime Scrape](https://apify.com/delectable_incubator) (community)
- **Categories:** Automation, SEO tools
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, 0 bookmarks
- **User rating**: No ratings yet

## Pricing

from $0.00005 / actor start

This Actor is paid per event and usage. You are charged both the fixed price for specific events and for Apify platform usage.
This Actor supports Apify Store discounts, so the higher your subscription plan, the lower the price.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action that can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in a key-value store.
In Standby mode, an Actor provides a web server that can be used as a website, an API, or an MCP server.
Actors are written with a capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use the official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use the official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use the [Apify CLI](https://docs.apify.com/cli/docs.md):

```bash
# macOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

<p align="center">
<img src="https://i.ibb.co/jkNS73wX/readme.png" alt="PubMed Articles Scraper" width="100%">
</p>

---

## 📚🔬 PubMed Articles Scraper | Bulk Medical Research & Scientific Papers Scraper | Apify Actor

### 🚀 Extract PubMed Articles in Seconds (No Code)

The **PubMed Articles Scraper (Apify Actor)** is a powerful, scalable, and SEO-optimized academic research scraping tool designed to extract scientific publications, medical research papers, biomedical studies, and scholarly articles directly from PubMed.

Scrape PubMed articles in bulk, collect structured research metadata, and export it for systematic reviews, scientific analysis, AI training datasets, healthcare intelligence, literature reviews, and academic research projects.

---

### 🔥 Why This PubMed Scraper?

✔ Best PubMed scraper on Apify

✔ Supports Bulk Keyword Scraping (Multi-Search)

✔ Fast & scalable scientific data extraction

✔ Extract medical research papers in bulk

✔ JSON / CSV / Excel export ready

✔ Perfect for researchers, universities & healthcare organizations

✔ No coding required

✔ Clean structured datasets

---

### 🎯 What This Scraper Does (PubMed Research Data Extraction)

This Apify Actor extracts real-time article data directly from PubMed search results.

#### 📌 Core Features

✅ Scrape PubMed articles

✅ Bulk keyword scraping support (SEO BOOST 🚀)

✅ Medical research paper extraction

✅ Scientific publication extraction

✅ Biomedical literature scraping

✅ Article metadata collection

✅ DOI extraction

✅ Authors extraction

✅ Journal information extraction

✅ Abstract extraction

✅ Publication date extraction

✅ Direct PubMed article URLs

✅ Clean structured dataset output

---

### ⚡ Input Configuration (Simple & Powerful)

#### 🔥 BULK KEYWORD MODE (SEO BOOST 🚀)

```json
{
  "keywords": [
    "heart disease",
    "diabetes",
    "artificial intelligence healthcare",
    "cancer treatment",
    "mental health",
    "covid-19"
  ],
  "sort_by": "Best match",
  "maxitems": 500
}
```

Scrape multiple PubMed topics and research queries during a single run.

Perfect for large-scale literature reviews and scientific data collection.
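
Since each keyword is scraped separately, related keywords may return some of the same articles. That overlap is an assumption, not something the Actor documents. The sketch below is a minimal way to merge results and keep the first record per `link`; `dedupe_by_link` is a hypothetical helper, not part of the Actor or the Apify client.

```python
from typing import Iterable


def dedupe_by_link(items: Iterable[dict]) -> list[dict]:
    """Keep the first record seen for each PubMed `link`.

    Records without a `link` cannot be compared, so they are kept as-is.
    """
    seen: set[str] = set()
    unique: list[dict] = []
    for item in items:
        link = item.get("link")
        if link is None:
            unique.append(item)
            continue
        if link in seen:
            continue  # already collected under an earlier keyword
        seen.add(link)
        unique.append(item)
    return unique
```

Keying on `link` rather than `doi` is a deliberate choice here, because the DOI is only present when available.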

---

### 📊 Extracted Article Data (Structured Output)

| Field            | Description        |
| ---------------- | ------------------ |
| title            | Article title      |
| authors          | Article authors    |
| journal_date_doi | Journal citation   |
| date             | Publication year   |
| doi              | DOI identifier     |
| volume_issue     | Volume & issue     |
| pages            | Page numbers       |
| abstract         | Article abstract   |
| link             | PubMed article URL |

---

### 💡 Use Cases (High Demand SEO Keywords)

This PubMed scraper is ideal for:

📚 Medical literature reviews

🔬 Scientific research analysis

🏥 Healthcare intelligence

🤖 AI training datasets

🧬 Biomedical research collection

📈 Academic publication monitoring

🎓 University research projects

📊 Citation and trend analysis

💡 Evidence-based medicine research

⚡ Bulk scientific article extraction

---

### 🚀 Key Features (Apify SEO Optimized)

⚡ Bulk keyword scraping support

🔬 PubMed research database coverage

📌 Smart search engine

🧠 Structured research datasets

📊 High-speed extraction engine

🔁 Auto retry system

💾 Export-ready datasets

⚙️ Scalable cloud execution

🚀 Built for Apify

---

### 📤 Output Formats Supported

✔ JSON

✔ CSV

✔ Excel XLSX

✔ XML

✔ HTML
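
The Apify platform produces these exports for you. If you have already fetched the items through the API client and only need a local CSV, the standard library is enough. This is a minimal sketch: the column list is taken from the output table in this README, and `items_to_csv` is a hypothetical helper name.

```python
import csv
import io

# Column order follows the "Extracted Article Data" table in this README.
FIELDS = [
    "title", "authors", "journal_date_doi", "date", "doi",
    "volume_issue", "pages", "abstract", "link",
]


def items_to_csv(items: list[dict]) -> str:
    """Serialize dataset items to CSV text; missing fields become empty cells."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for item in items:
        # Select only the known columns so unexpected keys are dropped.
        writer.writerow({field: item.get(field, "") for field in FIELDS})
    return buffer.getvalue()
```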

---

### 📦 Example Output

```json
{
  "title": "Heart Disease in Children",
  "authors": "Garcia RU, Peddy SB",
  "journal_date_doi": "Prim Care. 2018 Mar;45(1):143-154. doi:10.1016/j.pop.2017.10.005",
  "date": "2018",
  "doi": "10.1016/j.pop.2017.10.005",
  "volume_issue": "45(1)",
  "pages": "143-154",
  "abstract": "More children than ever born with congenital heart disease are growing into adulthood...",
  "link": "https://pubmed.ncbi.nlm.nih.gov/29406940/"
}
```
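
The example record has no separate PMID field, but its `link` ends in a numeric identifier, which is where PubMed article URLs carry the PMID. A minimal sketch for recovering it, assuming every `link` follows the shape shown in the example; `pmid_from_link` is a hypothetical helper.

```python
import re
from typing import Optional

# Matches the trailing numeric ID of a PubMed article URL (trailing slash optional).
_PMID_IN_LINK = re.compile(r"pubmed\.ncbi\.nlm\.nih\.gov/(\d+)/?$")


def pmid_from_link(link: str) -> Optional[str]:
    """Return the PMID embedded in a PubMed article URL, or None if absent."""
    match = _PMID_IN_LINK.search(link.strip())
    return match.group(1) if match else None
```

For the record above, `pmid_from_link("https://pubmed.ncbi.nlm.nih.gov/29406940/")` returns `"29406940"`.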

---

### 🔥 Why This is the BEST PubMed Scraper on Apify?

✔ Optimized for Apify marketplace ranking

✔ Bulk keyword support

✔ High-performance extraction engine

✔ Enterprise-ready scalability

✔ Clean structured article datasets

✔ Perfect for SEO traffic & marketplace visibility

✔ Designed for researchers and healthcare professionals

---

### 💸 Pricing

This scraper runs on a pay-per-result pricing model.

You only pay for successfully extracted records.

💳 Price: $2.89 / 1,000 results
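
To budget a run, the per-result price can be turned into an upper bound. This sketch assumes `maxitems` caps the results per keyword and that every keyword reaches the cap. It covers only the per-result price stated above, not the Actor start event or Apify platform usage. `max_result_cost` is a hypothetical helper.

```python
from decimal import Decimal

PRICE_PER_1000_RESULTS = Decimal("2.89")  # USD, from the pricing line above


def max_result_cost(keyword_count: int, maxitems: int) -> Decimal:
    """Upper bound on per-result charges if every keyword returns `maxitems` results."""
    max_results = keyword_count * maxitems
    cost = PRICE_PER_1000_RESULTS * max_results / 1000
    return cost.quantize(Decimal("0.01"))  # round to whole cents
```

For example, 6 keywords at 500 results each is at most 3,000 results, so `max_result_cost(6, 500)` gives `Decimal("8.67")`.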

---

### ❓ FAQ (SEO BOOST SECTION)

#### Can I scrape multiple PubMed keywords at once?

Yes — Bulk Keyword Mode is fully supported.

#### Can I extract article abstracts?

Yes — abstracts are extracted whenever available.

#### Does the scraper collect DOI information?

Yes — DOI data is included when available.

#### Is this scraper fast?

Yes — optimized for high-speed extraction.

#### Can I export data?

Yes — JSON, CSV, Excel, XML and HTML are supported.

#### Is coding required?

No — 100% no-code Apify Actor.

#### Can I use this for systematic reviews?

Absolutely — this scraper is ideal for literature reviews and academic research.

---

### ⚠️ Disclaimer

This tool is not affiliated with, endorsed by, or sponsored by PubMed or the National Library of Medicine (NLM).

All trademarks belong to their respective owners.

---

### 🔗 Related Actors (PrimeScrape Research Suite)

👉 Google Scholar Scraper

👉 Google News Scraper

👉 Coursera Course Scraper

👉 Udemy Reviews Scraper

👉 More academic intelligence tools coming soon 🚀

---

### 🌍 PrimeScrape Ecosystem

Built for data scraping, research intelligence, AI datasets, automation and analytics at scale.

📚 Academic Research

🔬 Scientific Intelligence

🏥 Healthcare Analytics

📊 Data Collection

🤖 AI Training Datasets

⚙️ Automation Pipelines

🚀 Enterprise Research Solutions

---

### 📬 Support

⭐⭐⭐⭐⭐ Leave a review if you like this scraper

📩 Contact us for custom scraping solutions, healthcare intelligence projects, scientific data extraction, academic research automation, and enterprise-scale collection requirements.

# Actor input Schema

## `keywords` (type: `array`):

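One or more keywords to search on PubMed. Each keyword is scraped separately. Required.

## `maxitems` (type: `integer`):

Maximum number of results to scrape per keyword 📊 Default: `50`.

## `sort_by` (type: `string`):

Sort order for the results 🌍 Allowed values: `Best match`, `Most recent`, `Publication date`, `First Author`, `Journal`. Default: `Best match`.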

## Actor input object example

```json
{
  "keywords": [
    "Cancer"
  ],
  "maxitems": 50,
  "sort_by": "Best match"
}
````
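
Checking the input before starting a run helps avoid paying for a start that fails on a typo. The sketch below mirrors the input schema: `keywords` is required and `sort_by` must be one of the values listed in the OpenAPI specification. The positive-`maxitems` check is an assumption, since the schema states no minimum. `build_input` is a hypothetical helper, not part of the Apify client.

```python
# Allowed values copied from the `sort_by` enum in the OpenAPI specification.
ALLOWED_SORTS = {"Best match", "Most recent", "Publication date", "First Author", "Journal"}


def build_input(keywords: list[str], maxitems: int = 50, sort_by: str = "Best match") -> dict:
    """Return an Actor input dict, raising ValueError on obviously invalid values."""
    cleaned = [k.strip() for k in keywords if k and k.strip()]
    if not cleaned:
        raise ValueError("`keywords` is required and needs at least one non-empty string")
    if maxitems < 1:  # assumption: the schema does not define a minimum
        raise ValueError("`maxitems` must be a positive integer")
    if sort_by not in ALLOWED_SORTS:
        raise ValueError(f"`sort_by` must be one of {sorted(ALLOWED_SORTS)}")
    return {"keywords": cleaned, "maxitems": maxitems, "sort_by": sort_by}
```

The returned dict can be passed as `run_input` to the Python client's `call` method.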

# Actor output Schema

## `overview` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "keywords": [
        "Cancer"
    ],
    "maxitems": 50,
    "sort_by": "Best match"
};

// Run the Actor and wait for it to finish
const run = await client.actor("delectable_incubator/pubmed-scraper-low-cost").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "keywords": ["Cancer"],
    "maxitems": 50,
    "sort_by": "Best match",
}

# Run the Actor and wait for it to finish
run = client.actor("delectable_incubator/pubmed-scraper-low-cost").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "keywords": [
    "Cancer"
  ],
  "maxitems": 50,
  "sort_by": "Best match"
}' |
apify call delectable_incubator/pubmed-scraper-low-cost --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=delectable_incubator/pubmed-scraper-low-cost",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "PubMed Scraper - Low-cost💲🔥📚🔬",
        "description": "📚🔍 Extract academic articles from PubMed by keyword, query, or search URL. Collect publication titles, authors, abstracts, journals, publication dates, PMIDs, citations, and article URLs. Ideal for medical research, literature reviews, scientific intelligence, and academic reference gathering 📊⚡",
        "version": "0.0",
        "x-build-id": "xBx2Yh41W3Yqm2R3T"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/delectable_incubator~pubmed-scraper-low-cost/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-delectable_incubator-pubmed-scraper-low-cost",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/delectable_incubator~pubmed-scraper-low-cost/runs": {
            "post": {
                "operationId": "runs-sync-delectable_incubator-pubmed-scraper-low-cost",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/delectable_incubator~pubmed-scraper-low-cost/run-sync": {
            "post": {
                "operationId": "run-sync-delectable_incubator-pubmed-scraper-low-cost",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "keywords"
                ],
                "properties": {
                    "keywords": {
                        "title": "Keywords to search on PubMed 🔍🎓",
                        "type": "array",
                        "description": "One or more keywords to search on PubMed. Each keyword is scraped separately.",
                        "default": [
                            "Cancer"
                        ],
                        "items": {
                            "type": "string"
                        }
                    },
                    "maxitems": {
                        "title": "Maximum number of results you want to scrape per keyword 📊",
                        "type": "integer",
                        "description": "Maximum number of results you want to scrape per keyword 📊",
                        "default": 50
                    },
                    "sort_by": {
                        "title": "Select a Sort type 🌍",
                        "enum": [
                            "Best match",
                            "Most recent",
                            "Publication date",
                            "First Author",
                            "Journal"
                        ],
                        "type": "string",
                        "description": "Select a Sort type 🌍",
                        "default": "Best match"
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
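
For projects that cannot use a client library, the `run-sync-get-dataset-items` path from the specification above can be called directly. The sketch below uses only the Python standard library and builds the request without sending it. The URL, the `token` query parameter, and the JSON body come from the specification; `build_run_request` is a hypothetical helper name.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_PATH = "/acts/delectable_incubator~pubmed-scraper-low-cost/run-sync-get-dataset-items"


def build_run_request(token: str, actor_input: dict) -> urllib.request.Request:
    """Build a POST request that runs the Actor and asks for its dataset items."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{API_BASE}{ACTOR_PATH}?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

To send it, pass the request to `urllib.request.urlopen` and decode the response body. The response is assumed here to be a JSON array of dataset items, and the call blocks until the run finishes, so set a generous timeout.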
