Pubmed Citation Scraper
Pricing
Pay per event
Pubmed Citation Scraper
Automate collection of detailed citation information from the world's largest biomedical literature database. Extract complete citation data including titles, authors, abstracts, publication dates, journals, DOIs, MeSH terms, and more from NCBI's PubMed database.
Pricing
Pay per event
Rating
5.0
(1)
Developer
ParseForge
Actor stats
0
Bookmarked
10
Total users
0
Monthly active users
3 days ago
Last modified
Categories
Share

π PubMed Citation Scraper
π Last updated: 2026-05-05
Automate your biomedical literature research with our PubMed Citation Scraper. Extract comprehensive citation data from the world's largest biomedical database without manual work. Whether you're conducting systematic reviews, tracking research trends, or building a custom publication database, this tool helps you collect structured research data in minutes. Perfect for researchers needing PubMed data CSV export, literature collection for meta-analysis, or monitoring academic citations.
The PubMed Citation Scraper collects detailed publication metadata from NCBI's PubMed database, up to 1,000,000 records per run, with no coding required.
β¨ What Does It Do
- π Title - Extract full publication titles for accurate literature cataloging and citation management
- π€ Authors - Collect complete author names and affiliations to identify key researchers and track collaboration networks
- π Abstract - Download abstracts for content analysis, topic modeling, and research methodology review
- π Publication Date - Retrieve exact publication dates to filter research by time period and track publication trends
- π Journal Name - Extract journal information for impact factor analysis and publication venue assessment
- π DOI and Links - Capture persistent identifiers and PMC/PMID links for direct access to full articles
π§ Input
- Search Term - Use PubMed's advanced search syntax to query the database. Examples: 'cancer AND therapy', 'Smith J[Author]', 'Nature[Journal]'. Leave blank if using a direct URL instead.
- Start URL - Paste a pre-built PubMed search URL directly (e.g. https://pubmed.ncbi.nlm.nih.gov/?term=cancer+AND+therapy). If provided, all other filters are ignored.
- Date From - Filter results to publications from this date onward. Format: YYYY/MM/DD or just YYYY (example: 2020 or 2020/01/01)
- Date To - Filter results to publications up to this date. Format: YYYY/MM/DD or just YYYY (example: 2023 or 2023/12/31)
- Publication Type - Narrow results to specific types like Review, Clinical Trial, Meta-Analysis, or Case Reports
- Journal - Filter by specific journal name (example: Nature, Science, The Lancet)
- Author - Search by author surname and first initial (example: Smith J)
- Sort Order - Choose how results are ranked: relevance (default), publication date, first author, or journal name
- Max Items - Limit the number of citations to collect. Free users: up to 100. Paid users: up to 1,000,000
Example input:
{"searchTerm": "machine learning AND diagnosis","dateFrom": "2022","dateTo": "2024","sort": "pub_date","maxItems": 50}
π Output
Each citation includes up to 13 data fields. Download as JSON, CSV, or Excel.
| π Publication ID | π Title | π€ Authors |
|---|---|---|
| π Publication Date | π Journal Name | π Volume |
| π’ Issue Number | π Page Range | π Abstract |
| π DOI | π PMID | π― PMC ID |
| β±οΈ Scraped Timestamp | β οΈ Error Status |
π Why Choose the PubMed Citation Scraper?
| Feature | Our Actor | Similar Tools |
|---|---|---|
| Direct PubMed integration | βοΈ | β |
| Advanced search syntax support | βοΈ | Partial |
| No authentication setup required | βοΈ | β |
| CSV, JSON, Excel export | βοΈ | βοΈ |
| Date range filtering | βοΈ | Partial |
| Journal and publication type filters | βοΈ | β |
| Author name search | βοΈ | Partial |
| Up to 1,000,000 results per run | βοΈ | β |
| Automatic pagination handling | βοΈ | βοΈ |
| Detailed metadata (DOI, PMC ID, abstracts) | βοΈ | Partial |
| Free tier support (up to 100 results) | βοΈ | β |
| Flexible billing options | βοΈ | β |
π How to Use
No technical skills required. Follow these simple steps:
- Sign Up - Create a free account with $5 credit
- Find the Tool - Search for "PubMed Citation Scraper" in the Apify Store and set up your search parameters
- Run It - Click "Start" and watch your citations appear
That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.
π― Business Use Cases
- π Researchers - Collect 500+ citations on a specific disease treatment to identify research trends and discover emerging methodologies before publishing a systematic review
- πΌ Pharmaceutical Companies - Monitor competitor research on new drug compounds across a 5 year period to track development timelines and inform R&D strategy
- π₯ Medical Libraries - Build searchable citation databases by discipline to help clinicians quickly find evidence-based treatment recommendations for patient cases
β¨ Why choose this Actor
| Capability | |
|---|---|
| π― | Built for the job. Scoped specifically to this data source so you skip the parser engineering entirely. |
| π | Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines. |
| β‘ | Fast. Optimized request patterns return results in seconds, not minutes. |
| π | Always fresh. Every run pulls live data, so the dataset reflects the source as of run time. |
| π | No infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage. |
| π‘οΈ | Reliable. Battle-tested across many runs and edge cases, with graceful error handling. |
| π« | No code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK. |
π Production-grade structured data without the engineering overhead of building and maintaining your own scraper.
π How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| β PubMed Citation Scraper (this Actor) | $5 free credit, then pay-per-use | Full source coverage | Live per run | Source-native filters supported | β‘ 2 min |
| Build your own scraper | Engineering hours | Full once built | Whenever you maintain it | Custom code | π’ Days to weeks |
| Paid managed APIs | $$$ monthly | Vendor-defined | Live | Vendor-defined | β³ Hours |
| Third-party data dumps | Varies | Subset, often stale | Periodic | None | π Variable |
Pick this Actor when you want broad coverage, server-side filtering, and no pipeline maintenance.
π How to use
- π Sign up. Create a free account with $5 credit (takes 2 minutes).
- π Open the Actor. Go to the PubMed Citation Scraper page on the Apify Store.
- π― Set input. Configure the input fields in the form (or paste a JSON), then set
maxItems. - π Run it. Click Start and let the Actor collect your data.
- π₯ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.
β±οΈ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.
πΌ Business use cases
π Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
β FAQ
π How does it work? The scraper connects directly to PubMed's database, searches using your criteria, and extracts structured citation metadata including abstracts, authors, and publication details. No manual work required.
π How accurate is the data? All data comes directly from NCBI's PubMed, the official US National Library of Medicine database. You receive the same data visible on PubMed.ncbi.nlm.nih.gov.
π Can I schedule runs automatically? Yes. Set up a schedule in the Apify platform to run your search weekly, monthly, or on any interval you choose. Perfect for monitoring new publications in your research area.
βοΈ Is web scraping PubMed allowed? PubMed is public data from the US government, and scraping is permitted for research and non-commercial use. Always review PubMed's terms of service and comply with your local regulations.
π‘οΈ Will PubMed block me? PubMed does not typically block legitimate automated requests. The scraper uses responsible request patterns. For high-volume runs, residential proxies are recommended.
β‘ How long does a run take? Depends on your search scope. Typically 1-5 minutes for 50-100 citations, 5-15 minutes for 500 citations, and 30+ minutes for large datasets (1,000+). Broader searches may take longer.
β οΈ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run on the Apify platform.
π Integrate PubMed Citation Scraper with any app
- Make - Automate workflows
- Zapier - Connect 5000+ apps
- GitHub - Version control integration
- Slack - Get notifications
- Airbyte - Data pipelines
- Google Drive - Export to spreadsheets
π€ Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- π¬ ChatGPT
- π§ Claude
- π Perplexity
- π Copilot
π Integrate with any app
PubMed Citation Scraper connects to any cloud service via Apify integrations:
- Make - Automate multi-step workflows
- Zapier - Connect with 5,000+ apps
- Slack - Get run notifications in your channels
- Airbyte - Pipe results into your warehouse
- GitHub - Trigger runs from commits and releases
- Google Drive - Export datasets straight to Sheets
You can also use webhooks to trigger downstream actions when a run finishes. Push fresh data into your product backend, or alert your team in Slack.
π‘ More ParseForge Actors
- Open Citations Scraper - Extract citation metadata from Open Citations
- DataCite Metadata Scraper - Collect research metadata from DataCite
- medRxiv Scraper - Scrape preprints from medRxiv
- FRED Economic Data Scraper - Extract economic indicators and time series data
- arXiv Scraper - Collect research papers from arXiv
Browse our complete collection of data extraction tools for more.
π Ready to Start?
Create a free account with $5 credit and collect your first 100 citations for free. No coding, no setup.
π Need Help?
- Check the FAQ section above for common questions
- Visit the Apify support page for documentation and tutorials
- Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form
β οΈ Disclaimer
This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the National Library of Medicine, NIH, NCBI, or PubMed. All trademarks mentioned are the property of their respective owners.
π Recommended Actors
- π Google Search Scraper - Multi-engine SERP results with country and language targeting
- πΊοΈ Nominatim OSM Scraper - Geocode addresses via OpenStreetMap
- π Indexmundi Scraper - Global demographic and economic indicators
- π° RAG Web Browser - Crawl and extract clean text from any URL for AI retrieval
- π Website Content Crawler - Crawl entire sites and export structured content
π‘ Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.