Pubmed Citation Scraper

Pricing

Pay per event

Automate collection of detailed citation information from the world's largest biomedical literature database. Extract complete citation data including titles, authors, abstracts, publication dates, journals, DOIs, MeSH terms, and more from NCBI's PubMed database.

Rating

5.0

(1)

Developer

ParseForge

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 7
  • Monthly active users: 0
  • Last modified: 4 days ago

📚 PubMed Citation Scraper

Automate your biomedical literature research with the PubMed Citation Scraper. It extracts comprehensive citation data from the world's largest biomedical database without manual work. Whether you're conducting systematic reviews, tracking research trends, or building a custom publication database, this tool helps you collect structured research data in minutes. It suits researchers who need PubMed data exported to CSV, literature collected for a meta-analysis, or academic citations monitored over time.

The PubMed Citation Scraper collects detailed publication metadata from NCBI's PubMed database, up to 1,000,000 records per run, with no coding required.

✨ What Does It Do?

  • 📝 Title - Extract full publication titles for accurate literature cataloging and citation management
  • 👤 Authors - Collect complete author names and affiliations to identify key researchers and track collaboration networks
  • 📄 Abstract - Download abstracts for content analysis, topic modeling, and research methodology review
  • 📅 Publication Date - Retrieve exact publication dates to filter research by time period and track publication trends
  • 📊 Journal Name - Extract journal information for impact factor analysis and publication venue assessment
  • 🔗 DOI and Links - Capture persistent identifiers and PMC/PMID links for direct access to full articles

🔧 Input

  • Search Term - Use PubMed's advanced search syntax to query the database. Examples: 'cancer AND therapy', 'Smith J[Author]', 'Nature[Journal]'. Leave blank if using a direct URL instead.
  • Start URL - Paste a pre-built PubMed search URL directly (e.g. https://pubmed.ncbi.nlm.nih.gov/?term=cancer+AND+therapy). If provided, all other filters are ignored.
  • Date From - Filter results to publications from this date onward. Format: YYYY/MM/DD or just YYYY (example: 2020 or 2020/01/01)
  • Date To - Filter results to publications up to this date. Format: YYYY/MM/DD or just YYYY (example: 2023 or 2023/12/31)
  • Publication Type - Narrow results to specific types like Review, Clinical Trial, Meta-Analysis, or Case Reports
  • Journal - Filter by specific journal name (example: Nature, Science, The Lancet)
  • Author - Search by author surname and first initial (example: Smith J)
  • Sort Order - Choose how results are ranked: relevance (default), publication date, first author, or journal name
  • Max Items - Limit the number of citations to collect. Free users: up to 100. Paid users: up to 1,000,000
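
As a rough illustration of how the Search Term, Author, and Journal inputs relate to a Start URL, the sketch below joins field-tagged terms and encodes them into the URL form shown in the list above. The helper names are hypothetical, and the Actor's own query building is not documented here and may differ.

```python
from urllib.parse import urlencode

PUBMED_BASE = "https://pubmed.ncbi.nlm.nih.gov/"


def build_search_term(term="", author="", journal=""):
    """Combine a free-text term with optional [Author] and [Journal] tags."""
    tags = []
    if author:
        tags.append(f"{author}[Author]")
    if journal:
        tags.append(f"{journal}[Journal]")
    if term and tags:
        # Parenthesize the free-text part so its own AND/OR operators
        # stay grouped when more clauses are appended.
        return " AND ".join([f"({term})"] + tags)
    return term or " AND ".join(tags)


def build_start_url(term):
    """Encode a search term as a PubMed search URL (spaces become '+')."""
    return PUBMED_BASE + "?" + urlencode({"term": term})


print(build_start_url("cancer AND therapy"))
# https://pubmed.ncbi.nlm.nih.gov/?term=cancer+AND+therapy
```

Building the URL with `urlencode` rather than string concatenation keeps brackets and other special characters in field tags correctly escaped.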

Example input:

{
  "searchTerm": "machine learning AND diagnosis",
  "dateFrom": "2022",
  "dateTo": "2024",
  "sort": "pub_date",
  "maxItems": 50
}
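
Before starting a run, an input like the one above can be checked locally. The following is a minimal sketch that assumes the date formats and item limits described in the Input list; the `startUrl` key and the `validate_input` helper are hypothetical names, not part of the Actor's documented schema.

```python
import re

DATE_RE = re.compile(r"^\d{4}(/\d{2}/\d{2})?$")  # YYYY or YYYY/MM/DD
FREE_TIER_MAX = 100
PAID_TIER_MAX = 1_000_000


def validate_input(run_input, paid=False):
    """Return a list of problems found in a run-input dict (empty if none)."""
    problems = []

    # Either a search term or a direct URL is needed ("startUrl" is an
    # assumed key name for the Start URL field).
    if not run_input.get("searchTerm") and not run_input.get("startUrl"):
        problems.append("provide searchTerm or startUrl")

    for key in ("dateFrom", "dateTo"):
        value = run_input.get(key)
        if value is not None and not (isinstance(value, str) and DATE_RE.match(value)):
            problems.append(f"{key} must be YYYY or YYYY/MM/DD")

    limit = PAID_TIER_MAX if paid else FREE_TIER_MAX
    max_items = run_input.get("maxItems")
    if max_items is not None and not 1 <= max_items <= limit:
        problems.append(f"maxItems must be between 1 and {limit}")

    return problems
```

The check covers format only: a string such as `2020/13/45` matches the pattern even though it is not a real calendar date.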

📊 Output

Each citation includes up to 13 data fields. Download as JSON, CSV, or Excel.

  • 📝 Publication ID
  • 📄 Title
  • 👤 Authors
  • 📅 Publication Date
  • 📊 Journal Name
  • 📋 Volume
  • 🔢 Issue Number
  • 📖 Page Range
  • 📚 Abstract
  • 🔗 DOI
  • 📌 PMID
  • 🎯 PMC ID
  • ⏱️ Scraped Timestamp
  • ⚠️ Error Status
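
Once a JSON export has been downloaded, records can be cleaned up with the standard library. The sketch below drops duplicate PMIDs and failed records, then writes a reduced CSV. The key names (`pmid`, `title`, `journal`, `publicationDate`, `doi`, `error`) are assumptions for illustration; check a real export for the actual field names.

```python
import csv
import io

# Hypothetical key names; a real export may label these fields differently.
COLUMNS = ["pmid", "title", "journal", "publicationDate", "doi"]


def dedupe_by_pmid(items):
    """Keep the first record per PMID; skip records with an error or no PMID."""
    seen, kept = set(), []
    for item in items:
        pmid = item.get("pmid")
        if item.get("error") or pmid is None or pmid in seen:
            continue
        seen.add(pmid)
        kept.append(item)
    return kept


def to_csv(items, columns=COLUMNS):
    """Serialize the selected columns to CSV text; missing values stay empty."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(items)
    return buffer.getvalue()
```

Typical use would be `to_csv(dedupe_by_pmid(json.load(f)))` on the downloaded file, after importing `json`.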

💎 Why Choose the PubMed Citation Scraper?

Feature | Our Actor | Similar Tools
Direct PubMed integration | ✔️ | ❌
Advanced search syntax support | ✔️ | Partial
No authentication setup required | ✔️ | ❌
CSV, JSON, Excel export | ✔️ | ✔️
Date range filtering | ✔️ | Partial
Journal and publication type filters | ✔️ | ❌
Author name search | ✔️ | Partial
Up to 1,000,000 results per run | ✔️ | ❌
Automatic pagination handling | ✔️ | ✔️
Detailed metadata (DOI, PMC ID, abstracts) | ✔️ | Partial
Free tier support (up to 100 results) | ✔️ | ❌
Flexible billing options | ✔️ | ❌

📋 How to Use

No technical skills required. Follow these simple steps:

  1. Sign Up - Create a free account with $5 credit
  2. Find the Tool - Search for "PubMed Citation Scraper" in the Apify Store and set up your search parameters
  3. Run It - Click "Start" and watch your citations appear

That's it: no coding, no setup, no complicated configuration. When the run finishes, export your data in CSV, Excel, or JSON format.
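
The steps above need no code, but the same run can also be scripted. The sketch below assumes the third-party `apify-client` package is installed and that you supply your own API token. The Actor ID in the usage comment is a hypothetical placeholder; copy the real one from the Actor's store page.

```python
def build_run_input(search_term, date_from=None, date_to=None, sort=None, max_items=50):
    """Assemble a run input using the keys from the example input."""
    run_input = {"searchTerm": search_term, "maxItems": max_items}
    if date_from:
        run_input["dateFrom"] = date_from
    if date_to:
        run_input["dateTo"] = date_to
    if sort:
        run_input["sort"] = sort
    return run_input


def fetch_citations(token, actor_id, run_input):
    """Start the Actor, wait for it to finish, and return its dataset items."""
    # Imported here so the helper above works without the package installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return any run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (placeholder values, not executed here):
# items = fetch_citations(
#     token="<YOUR_APIFY_TOKEN>",
#     actor_id="<username>/<actor-name>",  # hypothetical; take it from the store page
#     run_input=build_run_input("machine learning AND diagnosis", "2022", "2024", "pub_date"),
# )
```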

🎯 Business Use Cases

  • 📊 Researchers - Collect 500+ citations on a specific disease treatment to identify research trends and discover emerging methodologies before publishing a systematic review
  • 💼 Pharmaceutical Companies - Monitor competitor research on new drug compounds across a 5-year period to track development timelines and inform R&D strategy
  • 🏥 Medical Libraries - Build searchable citation databases by discipline to help clinicians quickly find evidence-based treatment recommendations for patient cases

❓ FAQ

🔍 How does it work? The scraper connects directly to PubMed's database, searches using your criteria, and extracts structured citation metadata including abstracts, authors, and publication details. No manual work required.

📊 How accurate is the data? All data comes directly from NCBI's PubMed, the official US National Library of Medicine database. You receive the same data visible on pubmed.ncbi.nlm.nih.gov.

📅 Can I schedule runs automatically? Yes. Set up a schedule in the Apify platform to run your search weekly, monthly, or on any interval you choose. This is useful for monitoring new publications in your research area.

⚖️ Is web scraping PubMed allowed? PubMed is public data from the US government, and scraping is permitted for research and non-commercial use. Always review PubMed's terms of service and comply with your local regulations.

🛡️ Will PubMed block me? PubMed does not typically block legitimate automated requests. The scraper uses responsible request patterns. For high-volume runs, residential proxies are recommended.

⚡ How long does a run take? It depends on your search scope: typically 1-5 minutes for 50-100 citations, 5-15 minutes for 500 citations, and 30+ minutes for large datasets (1,000+). Broader searches may take longer.

⚠️ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run on the Apify platform.

🔗 Integrate PubMed Citation Scraper with any app

💡 More ParseForge Actors

Browse our complete collection of data extraction tools for more.

🚀 Ready to Start?

Create a free account with $5 credit and collect your first 100 citations for free. No coding, no setup.

🆘 Need Help?

  • Check the FAQ section above for common questions
  • Visit the Apify support page for documentation and tutorials
  • Contact us through the Tally contact form to request a new scraper, propose a custom project, or report an issue

⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the National Library of Medicine, NIH, NCBI, or PubMed. All trademarks mentioned are the property of their respective owners.