Open Citations Scraper
Pricing
Pay per event
Open Citations Scraper
Comprehensive OpenCitations scraper for extracting citation and reference data from OpenCitations API. Perfect for researchers, academics, and data scientists who need automated access to citation networks, bibliographic metadata, and citation analysis data.
Pricing
Pay per event
Rating
0.0
(0)
Developer

ParseForge
Actor stats
0
Bookmarked
3
Total users
2
Monthly active users
9 days ago
Last modified
Categories
Share

๐ OpenCitations Scraper
Collect bibliographic metadata and citation networks from OpenCitations in minutes without coding. Perfect for researchers, academics, and data scientists who need to download citation data as CSV, access citation counts, or track research impact across publication networks. Extract citation relationships, author metadata, and journal information without building custom code.
The OpenCitations Scraper collects bibliographic metadata and citation counts from OpenCitations, up to 1,000,000 records per run, with no setup required.
โจ What Does It Do
- ๐ Citation OCI - Unique identifier for each citation relationship, so you can deduplicate and reference citations across databases
- ๐ค Citing and Cited Entities - Identifiers for the publications creating and receiving citations, so you can map research influence networks
- ๐ Creation Date - When the citation relationship was first recorded, so you can track how citations accumulate over time
- ๐ Self-Citation Flags - Boolean indicators for author and journal self-citations, so you can filter for independent research citations only
- ๐ Bibliographic Metadata - Author names, publication dates, titles, and venue information for complete publication context
- ๐ข Publisher and Type Information - Publisher name and publication type, so you can segment research by publication channel
๐ง Input
- DOI - Digital Object Identifier (e.g., 10.1016/j.jmb.2005.08.075), the primary identifier for academic publications on OpenCitations
- PMID - PubMed ID for biomedical publications if you don't have a DOI available
- OMID - OpenCitations Meta Identifier (e.g., omid:br/06140242082) to search using OpenCitations internal identifiers
- Search Type - Choose between citations to find incoming citations or references to find outgoing references
- Include Metadata - Enable to collect detailed metadata like authors, publication date, and venue for each citation
- Max Items - Limit the number of results to collect. Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.
Example input:
{"doi": "10.1016/j.jmb.2005.08.075","searchType": "citations","includeMetadata": true,"maxItems": 500}
๐ Output
Each citation includes up to 20 data fields. Download as JSON, CSV, or Excel file.
| ๐ OCI | ๐ค Citing Entity | ๐ค Cited Entity |
|---|---|---|
| ๐ Creation Date | โฑ๏ธ Timespan | ๐ Journal Self-Citation |
| ๐ Author Self-Citation | ๐ Title | ๐ฅ Authors |
| ๐ Publication Date | ๐ Volume | ๐ Issue |
| ๐ Venue | ๐ท๏ธ Publication Type | ๐ Page |
| ๐ข Publisher | โ๏ธ Editor | โฐ Timestamp |
| โ Error | ๐ Work ID |
๐ Why Choose the OpenCitations Scraper?
| Feature | Our Actor | Similar Tools |
|---|---|---|
| Citation collection from OpenCitations | โ๏ธ | โ |
| Support for multiple identifier types (DOI, PMID, OMID) | โ๏ธ | โ |
| Metadata enrichment (authors, dates, venue) | โ๏ธ | Partial |
| Self-citation detection (author and journal) | โ๏ธ | โ |
| Incoming and outgoing citation search | โ๏ธ | โ |
| Up to 1,000,000 results per run | โ๏ธ | โ |
| CSV, JSON, and Excel export | โ๏ธ | โ๏ธ |
| No coding required | โ๏ธ | โ |
| Deduplication of citation relationships | โ๏ธ | โ |
| Batch identifier support | โ๏ธ | โ |
| Real-time progress monitoring | โ๏ธ | Partial |
| Free tier with 100 results | โ๏ธ | โ๏ธ |
๐ How to Use
No technical skills required. Follow these simple steps:
- Sign Up: Create a free account with $5 credit
- Find the Tool: Search for "OpenCitations Scraper" in the Apify Store and configure your input
- Run It: Click "Start" and watch your results appear
That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.
๐ฏ Business Use Cases
- ๐ Academic Researchers - Track citation networks for a published paper to understand research impact, identify influential citing works, and map how your research influences the field
- ๐ผ Literature Review Teams - Collect all citations and references for a topic to accelerate systematic reviews, identify research gaps, and build comprehensive citation networks without manual searching
- ๐ฌ Institutional Analysts - Analyze citation patterns across your institution's publications to measure research output, benchmark impact against competitors, and identify high-impact research areas for future funding
โ FAQ
๐ How does it work? The OpenCitations Scraper queries OpenCitations using a DOI, PMID, or OMID and retrieves citations (incoming) or references (outgoing). Optionally, it enriches each result with detailed bibliographic metadata like author names, publication dates, and venue information.
๐ How accurate is the citation data? OpenCitations aggregates data from Crossref, PubMed, and other authoritative sources. The accuracy depends on the completeness of data in these sources. Metadata enrichment is optional and fetched in real-time from OpenCitations.
๐ Can I schedule this to run automatically? Yes. Once you've configured your input parameters, you can schedule the actor to run daily, weekly, or on a custom schedule using Apify's automation features.
โ๏ธ Is scraping OpenCitations legal? Yes. OpenCitations is a freely available, non-profit, open-access service. The data collected is public and the service explicitly allows automated access. Always verify compliance with local laws and OpenCitations' terms of service.
๐ก๏ธ Will OpenCitations block me? No. OpenCitations explicitly supports automated access and does not implement blocking measures. The actor uses standard, respectful requests that comply with their usage guidelines.
โก How long does a typical run take? A run typically takes 1-3 minutes for 100 results, 5-15 minutes for 500 results, and 20-60 minutes for 1,000+ results. Speed depends on whether metadata enrichment is enabled and OpenCitations response times.
โ ๏ธ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.
๐ Integrate OpenCitations Scraper with any app
- Make - Automate workflows
- Zapier - Connect 5000+ apps
- GitHub - Version control integration
- Slack - Get notifications
- Airbyte - Data pipelines
- Google Drive - Export to spreadsheets
๐ก More ParseForge Actors
- Crunchbase Scraper - Extract company, investor, and deal data from Crunchbase
- Redfin Scraper - Collect property listings and real estate market data
- Etsy Scraper - Gather product listings, prices, and seller information from Etsy
- PropertyShark Commercial Property Transactions Scraper - Download commercial real estate transaction data
- NY Business Entity Scraper - Extract New York business registration and filing data
Browse our complete collection of data extraction tools for more.
๐ Ready to Start?
Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.
๐ Need Help?
- Check the FAQ section above for common questions
- Visit the Apify support page for documentation and tutorials
- Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form
โ ๏ธ Disclaimer
This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by OpenCitations or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.
