OpenAlex Scraper avatar

OpenAlex Scraper

Pricing

from $2.00 / 1,000 record scrapeds

Go to Apify Store
OpenAlex Scraper

OpenAlex Scraper

Scrape OpenAlex - the free open catalog of scholarly works (250M+ papers, 100M+ authors, 100K institutions). Search across works, authors, institutions, concepts, journals. Returns title, abstract, authors, citations, DOI, OA status, and more.

Pricing

from $2.00 / 1,000 record scrapeds

Rating

0.0

(0)

Developer

Gio

Gio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Scrape OpenAlex - the free open catalog of scholarly works. 250M+ papers, 100M+ authors, 100K+ institutions, 65K+ concepts, 250K+ journals.

Five entity types:

  • works - papers, books, datasets (title, abstract, authors, citations, DOI, OA status)
  • authors - researchers (ORCID, h-index, i10 index, institution)
  • institutions - universities/labs (country, works count, ROR ID)
  • concepts - research topics
  • sources - journals, repositories, conferences

Useful for academic research, citation analysis, bibliometrics, trend detection, researcher discovery.

Free vs. paid

  • Free plan: 10 clearly-labelled mock records.
  • Paid plan: real, live OpenAlex data.

Input

FieldTypeDescription
entityString (required)works, authors, institutions, concepts, sources.
searchTermStringFree text search.
filterStringOpenAlex filter syntax (e.g. publication_year:2024, is_oa:true, cited_by_count:>100).
maxItemsIntegerDefault 100, max 10,000.
debugBooleanVerbose logs.

Output (works example)

{
"entityType": "work",
"id": "https://openalex.org/W4406789012",
"displayName": "Attention Is All You Need",
"title": "Attention Is All You Need",
"doi": "https://doi.org/10.48550/arxiv.1706.03762",
"publicationYear": 2017,
"publicationDate": "2017-06-12",
"type": "article",
"citedByCount": 95432,
"isOpenAccess": true,
"oaUrl": "https://arxiv.org/pdf/1706.03762",
"authors": ["Ashish Vaswani", "Noam Shazeer", "..."],
"institutions": ["Google Brain"],
"primarySourceTitle": "arXiv (Cornell University)",
"language": "en",
"abstract": "The dominant sequence transduction models...",
"concepts": ["Artificial neural network", "Transformer", "Attention"],
"referenceCount": 28,
"url": "https://openalex.org/W4406789012"
}

Pricing

$0.002/record. 1,000 records = $2.

Limitations

  • OpenAlex API has a polite rate limit (~10 req/sec); the actor adds ~250ms between paginated calls.
  • For best citation/identifier coverage, include an email in the User-Agent header (already set by the actor).
  • abstract is reconstructed from OpenAlex's inverted index. Some old works don't have one.

If this actor helped you, please leave a review on the Apify Store.