Pricing

Pay per event

Open Citations Scraper

Comprehensive OpenCitations scraper for extracting citation and reference data from OpenCitations API. Perfect for researchers, academics, and data scientists who need automated access to citation networks, bibliographic metadata, and citation analysis data.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

2 days ago

Last modified

📚 OpenCitations Scraper

🚀 Extract citation networks and bibliographic metadata from OpenCitations in seconds. Search by DOI, PMID, or OMID. No coding, no API keys required.

OpenCitations is an open scholarly infrastructure providing free access to citation data from millions of academic publications. This scraper collects citation relationships, self-citation flags, and optional bibliographic metadata (authors, titles, venues, publication dates) for any publication identified by DOI, PubMed ID, or OpenCitations Meta ID. Choose between citations mode (who cited this work) and references mode (what this work cites) to map research influence in either direction.

Researchers, bibliometric analysts, and data scientists use this actor to build citation networks, track research impact, identify self-citations, and analyze how knowledge flows between publications. Instead of querying the OpenCitations API manually and parsing responses, you get clean, structured data exported as JSON, CSV, or Excel. With metadata enabled, every record includes the citing and cited entity IDs, creation date, timespan, self-citation flags, plus the full title, authors, publication date, venue, and publisher.

🎯 Target Audience	💡 Use Cases
Bibliometric analysts	Map citation networks and measure impact
Academic researchers	Track who cites your publications
University administrators	Evaluate research impact for departments
Science policy makers	Analyze knowledge flow between institutions
Data scientists	Build citation graph datasets for analysis
Librarians	Enrich catalog records with citation data

📋 What the OpenCitations Scraper does

🔍 DOI-based search to find citations or references for any published work
🆔 PMID support for biomedical publications indexed in PubMed
📋 OMID support for OpenCitations internal identifier lookups
🔄 Bidirectional search with citations (incoming) and references (outgoing) modes
📊 Self-citation detection with flags for author and journal self-citations
📝 Optional metadata including titles, authors, venues, and publication dates

The scraper queries the OpenCitations API with your identifier and search type, retrieves all matching citation relationships, and extracts structured data for each record. When metadata is enabled, it also fetches detailed bibliographic information for each citing or cited work. Results include unique citation identifiers (OCI), entity IDs, creation dates, timespans, self-citation flags, and full publication metadata.

💡 Why it matters: Manually collecting citation data from OpenCitations involves API queries, pagination, and metadata enrichment. This scraper handles everything automatically, delivering structured citation networks ready for analysis, visualization, or integration with other research tools.

📊 Data fields

Each record includes: authorSelfCitation, authors, cited, citing, creation, editor, id, issue, journalSelfCitation, oci, page, publicationDate, publisher, scrapedTimestamp, timespan, title, type, venue, volume. All 19 field names come from a real production run, so what you see here is what lands in your dataset.

⚠️ Good to Know: Provide one identifier (DOI, PMID, or OMID), not multiple. Enabling metadata makes the scraper slower but provides full bibliographic details for each citation. The default search type is "citations" (incoming citations).

🚀 How to use

Create an Apify account - Sign up free with $5 credit
Open the OpenCitations Scraper - Navigate to the actor page on Apify
Enter a DOI, PMID, or OMID - Provide the identifier for the publication you want to analyze
Choose search type and options - Select citations or references mode and enable metadata if needed
Click Start - The actor collects citation relationships and delivers structured data

⏱️ A typical run with 50 citations completes in under 1 minute.

🔗 Recommended Actors

Actor	Description
Crossref Scraper	Extract DOI metadata for 155M+ research publications
PubMed Citation Scraper	Extract publication metadata from PubMed for biomedical research
Open Library Scraper	Search and download book data from the Internet Archive
ROR Scraper	Collect research organization data from ROR
US Census Bureau Scraper	Extract demographic and economic data from the Census Bureau

💡 Pro Tip: Combine the OpenCitations Scraper with the Crossref Scraper to get both citation networks and full publication metadata for each cited work.

Disclaimer: This actor is not affiliated with, endorsed by, or connected to OpenCitations. It accesses publicly available data through the OpenCitations API. Use responsibly and in accordance with applicable terms of service.

🆘 Need Help?

If you hit a bug, have questions about setup, or need a scraper we haven't built yet, open our contact form or write to parseforge@protonmail.com. We also take on paid custom data projects.

For faster answers, join our Discord. It's the best place to get support and suggest new actors.

OpenCitations Scraper: Citation Graph

themineworks/opencitations-citation-graph

Scrape the OpenCitations INDEX (1.6B links) by DOI. Get all citing papers, cited references, dates and self citation flags. No API key. Free tier. Works in Claude, ChatGPT and any MCP agent for citation graphs and literature reviews.

The Mine Works

Ai Citation Auditor

dbott23/ai-citation-auditor

Darren S

Citation Generator API

dev00/citation-generator-api

Generate citation metadata for any book, article, URL, DOI, PMID, or arXiv ID.

dev00

📚 OpenAlex Scraper - Academic Papers & Citation Data

benthepythondev/openalex-scraper

OpenAlex Scraper to search 250M+ academic papers via the free OpenAlex API. Extract title, authors, institutions, year, venue, DOI, citation count, open-access status, concepts and PDF links. Filter by year and open access. For literature reviews, citation analysis and AI/RAG datasets.

Ben

Research Corpus & Citation Graph Builder

zentrafoundry/openalex-research-graph-builder

Build research corpora and citation graph datasets from public metadata APIs.

Zentra

Pubmed Citation Scraper

parseforge/pubmed-citation-scraper

Automate collection of detailed citation information from the world's largest biomedical literature database. Extract complete citation data including titles, authors, abstracts, publication dates, journals, DOIs, MeSH terms, and more from NCBI's PubMed database.

ParseForge

5.0

Academic Paper Scraper

labrat011/academic-paper-scraper

Search MILLIONS of academic papers from Semantic Scholar and arXiv by keyword, DOI, or citation graph. Returns titles, authors, abstracts, citation counts, and open access PDFs as clean JSON. Works as an MCP tool for AI agents.

mick_

Semantic Scholar Paper Scraper

agenscrape/semantic-scholar-paper-scraper

Scrape academic papers from Semantic Scholar. Search by keyword and extract paper titles, abstracts, authors, citation counts, publication dates, DOIs, open access PDFs... Perfect for literature reviews, citation analysis, and research databases. Real time data output with pagination support.

Agenscrape

Google Scholar Scraper

moving_beacon-owner1/google-scholar-scraper

Scrapes Google Scholar search results, including paper titles, authors, publication years, citation counts, article URLs, and PDF links. Supports multiple queries and year filters for research, literature reviews, and citation analysis.

Jamshaid Arif

Academic Research & Citation Tracker

second_coming/academic-research-tracker

Searches academic databases (arXiv, PubMed, Crossref) for research papers matching keywords. Returns structured citation data including title, authors, journal, DOI, abstract, and URL.