OpenAlex Research Intelligence avatar

OpenAlex Research Intelligence

Pricing

Pay per usage

Go to Apify Store
OpenAlex Research Intelligence

OpenAlex Research Intelligence

Extract publication, author, institution, source, citation, topic, DOI, and open-access signals from the official OpenAlex API.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

太郎 山田

太郎 山田

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

18 hours ago

Last modified

Share

Extract research intelligence from the official OpenAlex API without browser scraping. Use search terms, OpenAlex work IDs, author IDs, institution IDs, or concept IDs to produce flattened dataset rows for works, authors, and publication sources.

Inputs

FieldDefaultNotes
searchTerms[]Research queries for OpenAlex works.
workIds[]OpenAlex work IDs or URLs.
authorIds[]OpenAlex author IDs.
institutionIds[]OpenAlex institution IDs.
conceptIds[]OpenAlex concept IDs.
fromDate / toDateemptyPublication date filters in YYYY-MM-DD.
sortcited_by_count:descCitation, publication date, or relevance sort.
limitPerSource25Works fetched per search/filter source.
maxWorks100Global unique work cap.
includeAbstractfalseReconstruct abstract text from OpenAlex inverted index when available.
mailtoemptyOptional polite-pool email parameter for OpenAlex.
deliverydatasetdataset or webhook.
dryRunfalseSkip dataset/webhook delivery.

At least one of searchTerms, workIds, authorIds, institutionIds, or conceptIds is required.

Dataset Rows

work_summary

  • title, DOI, OpenAlex ID, publication year/date, type
  • citation count, open-access status, retraction flag
  • primary source, publisher, landing page, PDF URL
  • topics, concepts, countries, institutions

author_signal

  • work ID/title, author name and OpenAlex ID
  • author order/position, corresponding author flag
  • affiliated institutions and countries

source_summary

  • work ID/title, journal or source name
  • source type, ISSN, host organization, open-access source flag

Example Input

{
"searchTerms": ["large language models", "retrieval augmented generation"],
"fromDate": "2024-01-01",
"sort": "cited_by_count:desc",
"limitPerSource": 10,
"maxWorks": 20,
"includeAbstract": false,
"delivery": "dataset",
"dryRun": false
}

Local Development

npm install
npm test
node src/index.js

output/result.json contains the full payload. Apify dataset delivery writes flattened rows.

Limitations

  • OpenAlex coverage is broad but not identical to Crossref, PubMed, Semantic Scholar, or publisher APIs.
  • Citation counts and metadata can lag source publications.
  • Relevance sort is used only for search sources; non-search filters fall back to citation sort.
  • includeAbstract can increase payload size substantially.