Pricing

Pay per usage

Go to Apify Store

Crossref Doi Scraper

Try for free

Scrapes scholarly article metadata from the Crossref API by DOI or keyword search.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Donny

Actor stats

Bookmarked

Total users

Monthly active users

12 hours ago

Last modified

Crossref DOI Metadata Scraper

What it does

Scrapes scholarly article metadata from the Crossref API by DOI or keyword search.

This actor connects to a public API, fetches structured data based on your search criteria, and stores the results in a clean, normalized dataset on the Apify platform. It handles pagination automatically so you can collect large volumes of results without worrying about API limits or offsets. The actor is designed to be robust with built-in error handling, request timeouts, and input validation to ensure reliable data collection every time you run it.

Why use this actor

Manually querying APIs and handling pagination, rate limits, and data normalization is tedious and error-prone. This actor automates the entire process. Simply provide your search parameters, set the maximum number of results you want, and let the actor handle the rest. The data is stored in a structured dataset that you can export as JSON, CSV, or Excel. You can integrate this actor into larger workflows using the Apify API, schedule it for recurring data collection, or trigger it from your own applications via webhooks.

Input parameters

searchQuery (string, required): The search term to query. Default: "test".
maxResults (integer, optional): Maximum number of results to return. Default: 100. Range: 1-1000.

All inputs are validated at startup with sensible defaults applied when values are missing. The actor will log warnings for any misconfigured options and continue with safe defaults rather than failing outright.

Output data

Each result in the dataset contains the following fields:

doi: The doi of the result
title: The title of the result
authors: The authors of the result
publisher: The publisher of the result
publishedDate: The published date of the result
journal: The journal of the result
url: The url of the result

All string fields are null-checked to ensure consistent data quality. Missing or undefined values are stored as null rather than empty strings or undefined values.

Example output

{
    "doi": "Example DOI",
    "title": "Example Title",
    "authors": "Example Authors",
    "publisher": "Example Publisher",
    "publishedDate": "2025-01-15T00:00:00Z",
    "journal": "Example Journal",
    "url": "https://example.com/item"
}

Pricing

This actor is available on the Apify platform with transparent usage-based pricing. Each run incurs a small startup cost of approximately $0.005 per start, plus roughly $0.01 per result collected. Actual costs depend on the number of results, API response times, and memory allocation. You can control costs by setting the maxResults parameter to limit the number of results collected per run. For high-volume use cases, consider running the actor on a schedule during off-peak hours to optimize platform resource usage.

More scrapers from brave_paradise

Check out other data collection actors by brave_paradise on the Apify Store. We offer a wide range of specialized data scrapers and automation tools covering research databases, package registries, job boards, and many more public data sources. Each actor is designed with the same high-quality standards: robust error handling, automatic pagination, clean structured output, and transparent pricing.

Visit the brave_paradise profile on Apify to explore the full collection.

Crossref Scraper

parseforge/crossref-scraper

Transform how you access scholarly data with our Crossref scraper. This intelligent automation tool gathers titles, authors, abstracts, and citations in seconds, giving researchers, librarians, and academics the accurate, up-to-date publication information they need without lifting a finger.

ParseForge

5.0

(2)

Crossref Academic Citation Scraper

cloud9_ai/crossref-scraper

Search and extract scholarly publication metadata from Crossref. Get DOIs, citations, authors, journals for 140M+ works.

cloud9

Researchgpt Deep Research Agent

wheat_tourist/researchgpt-deep-research-agent

🔬 Transform any topic into a comprehensive research report in minutes! Scrapes Wikipedia, arXiv, Semantic Scholar, news & web sources. Outputs professional JSON, HTML & PDF reports. Perfect for students, researchers, content creators & businesses. No API keys needed.

Varun Chopra

CrossRef Academic Metadata Scraper

fortuitous_pirate/crossref-scraper

Search CrossRef for academic paper metadata. Get DOIs, authors, journals, citations, and publication dates. Essential for research and bibliography building.

Fortuitous Pirate

Unpaywall Scraper

parseforge/unpaywall-scraper

Discover open access research articles with our powerful Unpaywall scraper! Search through millions of articles in the Unpaywall database to find free-to-read scholarly publications. Perfect for researchers, librarians, and academics who need to find and access open access articles efficiently.

ParseForge

5.0

(1)

Open Citations Scraper

parseforge/open-citations-scraper

Comprehensive OpenCitations scraper for extracting citation and reference data from OpenCitations API. Perfect for researchers, academics, and data scientists who need automated access to citation networks, bibliographic metadata, and citation analysis data.

ParseForge

5.0

(1)