Open Citations Scraper avatar

Open Citations Scraper

Pricing

Pay per event

Go to Apify Store
Open Citations Scraper

Open Citations Scraper

Comprehensive OpenCitations scraper for extracting citation and reference data from OpenCitations API. Perfect for researchers, academics, and data scientists who need automated access to citation networks, bibliographic metadata, and citation analysis data.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

2

Monthly active users

9 days ago

Last modified

Share

ParseForge Banner

๐Ÿ“š OpenCitations Scraper

Collect bibliographic metadata and citation networks from OpenCitations in minutes without coding. Perfect for researchers, academics, and data scientists who need to download citation data as CSV, access citation counts, or track research impact across publication networks. Extract citation relationships, author metadata, and journal information without building custom code.

The OpenCitations Scraper collects bibliographic metadata and citation counts from OpenCitations, up to 1,000,000 records per run, with no setup required.

โœจ What Does It Do

  • ๐Ÿ“ Citation OCI - Unique identifier for each citation relationship, so you can deduplicate and reference citations across databases
  • ๐Ÿ‘ค Citing and Cited Entities - Identifiers for the publications creating and receiving citations, so you can map research influence networks
  • ๐Ÿ“… Creation Date - When the citation relationship was first recorded, so you can track how citations accumulate over time
  • ๐Ÿ“Š Self-Citation Flags - Boolean indicators for author and journal self-citations, so you can filter for independent research citations only
  • ๐Ÿ“š Bibliographic Metadata - Author names, publication dates, titles, and venue information for complete publication context
  • ๐Ÿข Publisher and Type Information - Publisher name and publication type, so you can segment research by publication channel

๐Ÿ”ง Input

  • DOI - Digital Object Identifier (e.g., 10.1016/j.jmb.2005.08.075), the primary identifier for academic publications on OpenCitations
  • PMID - PubMed ID for biomedical publications if you don't have a DOI available
  • OMID - OpenCitations Meta Identifier (e.g., omid:br/06140242082) to search using OpenCitations internal identifiers
  • Search Type - Choose between citations to find incoming citations or references to find outgoing references
  • Include Metadata - Enable to collect detailed metadata like authors, publication date, and venue for each citation
  • Max Items - Limit the number of results to collect. Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.

Example input:

{
"doi": "10.1016/j.jmb.2005.08.075",
"searchType": "citations",
"includeMetadata": true,
"maxItems": 500
}

๐Ÿ“Š Output

Each citation includes up to 20 data fields. Download as JSON, CSV, or Excel file.

๐Ÿ“ OCI๐Ÿ‘ค Citing Entity๐Ÿ‘ค Cited Entity
๐Ÿ“… Creation Dateโฑ๏ธ Timespan๐Ÿ“Š Journal Self-Citation
๐Ÿ“Š Author Self-Citation๐Ÿ“š Title๐Ÿ‘ฅ Authors
๐Ÿ“… Publication Date๐Ÿ“– Volume๐Ÿ“„ Issue
๐Ÿ“ Venue๐Ÿท๏ธ Publication Type๐Ÿ“„ Page
๐Ÿข Publisherโœ๏ธ Editorโฐ Timestamp
โŒ Error๐Ÿ”— Work ID

๐Ÿ’Ž Why Choose the OpenCitations Scraper?

FeatureOur ActorSimilar Tools
Citation collection from OpenCitationsโœ”๏ธโŒ
Support for multiple identifier types (DOI, PMID, OMID)โœ”๏ธโŒ
Metadata enrichment (authors, dates, venue)โœ”๏ธPartial
Self-citation detection (author and journal)โœ”๏ธโŒ
Incoming and outgoing citation searchโœ”๏ธโŒ
Up to 1,000,000 results per runโœ”๏ธโŒ
CSV, JSON, and Excel exportโœ”๏ธโœ”๏ธ
No coding requiredโœ”๏ธโŒ
Deduplication of citation relationshipsโœ”๏ธโŒ
Batch identifier supportโœ”๏ธโŒ
Real-time progress monitoringโœ”๏ธPartial
Free tier with 100 resultsโœ”๏ธโœ”๏ธ

๐Ÿ“‹ How to Use

No technical skills required. Follow these simple steps:

  1. Sign Up: Create a free account with $5 credit
  2. Find the Tool: Search for "OpenCitations Scraper" in the Apify Store and configure your input
  3. Run It: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

๐ŸŽฏ Business Use Cases

  • ๐Ÿ“Š Academic Researchers - Track citation networks for a published paper to understand research impact, identify influential citing works, and map how your research influences the field
  • ๐Ÿ’ผ Literature Review Teams - Collect all citations and references for a topic to accelerate systematic reviews, identify research gaps, and build comprehensive citation networks without manual searching
  • ๐Ÿ”ฌ Institutional Analysts - Analyze citation patterns across your institution's publications to measure research output, benchmark impact against competitors, and identify high-impact research areas for future funding

โ“ FAQ

๐Ÿ” How does it work? The OpenCitations Scraper queries OpenCitations using a DOI, PMID, or OMID and retrieves citations (incoming) or references (outgoing). Optionally, it enriches each result with detailed bibliographic metadata like author names, publication dates, and venue information.

๐Ÿ“Š How accurate is the citation data? OpenCitations aggregates data from Crossref, PubMed, and other authoritative sources. The accuracy depends on the completeness of data in these sources. Metadata enrichment is optional and fetched in real-time from OpenCitations.

๐Ÿ“… Can I schedule this to run automatically? Yes. Once you've configured your input parameters, you can schedule the actor to run daily, weekly, or on a custom schedule using Apify's automation features.

โš–๏ธ Is scraping OpenCitations legal? Yes. OpenCitations is a freely available, non-profit, open-access service. The data collected is public and the service explicitly allows automated access. Always verify compliance with local laws and OpenCitations' terms of service.

๐Ÿ›ก๏ธ Will OpenCitations block me? No. OpenCitations explicitly supports automated access and does not implement blocking measures. The actor uses standard, respectful requests that comply with their usage guidelines.

โšก How long does a typical run take? A run typically takes 1-3 minutes for 100 results, 5-15 minutes for 500 results, and 20-60 minutes for 1,000+ results. Speed depends on whether metadata enrichment is enabled and OpenCitations response times.

โš ๏ธ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.

๐Ÿ”— Integrate OpenCitations Scraper with any app

๐Ÿ’ก More ParseForge Actors

Browse our complete collection of data extraction tools for more.

๐Ÿš€ Ready to Start?

Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.

๐Ÿ†˜ Need Help?

  • Check the FAQ section above for common questions
  • Visit the Apify support page for documentation and tutorials
  • Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form

โš ๏ธ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by OpenCitations or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.