OpenAlex Works Scraper

Pricing

$19.99/month + usage

Collect scholarly works from OpenAlex search results by URL, with automatic pagination and structured data (title, authors, venue, citations, PDF link).

Rating

5.0

(1)

Developer

PowerAI

Maintained by Community

2 days ago

Last modified

This actor collects scholarly work listings from OpenAlex, starting from a search results URL that you provide. It follows pagination automatically and extracts key information for each work: title, publication year, authors, venue, citation count, and a PDF link when one is available.

Features

  • URL-based collection: Paste any OpenAlex works search URL to collect results from that query.
  • Automatic pagination: Moves through result pages until the end or your item limit is reached.
  • Structured output: Each result includes:
    • Work title and link to the detail page
    • Publication year
    • Author names
    • Venue or source (journal, conference, etc.)
    • Citation count
    • PDF link when available
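
The automatic pagination described above could be approximated as follows. This is a minimal sketch, assuming result pages are addressed by the `page` query parameter that appears in OpenAlex search URLs; the actor's actual mechanism is not documented here, and `with_page` is a hypothetical helper, not part of the actor.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_page(search_url: str, page: int) -> str:
    """Return search_url with its `page` query parameter set to `page`.

    Assumption: OpenAlex result pages are selected via `page=N`.
    """
    parts = urlsplit(search_url)
    # Keep every parameter except any existing `page` value.
    params = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != "page"
    ]
    params.insert(0, ("page", str(page)))
    # Leave ":" unescaped so filters such as `field.search:term` stay readable.
    query = urlencode(params, safe=":")
    return urlunsplit(parts._replace(query=query))


if __name__ == "__main__":
    url = "https://openalex.org/works?page=1&filter=title_and_abstract.search:ai"
    print(with_page(url, 3))
```

A collector would call such a helper with increasing page numbers until a page returns no results or the item limit is reached.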

Input Parameters

Field      Type     Required  Description
searchUrl  string   Yes       The full OpenAlex works search URL to collect from
maxItems   integer  No        Maximum number of works to collect (default: 100)
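
As an illustration of the two input fields, the sketch below builds an input object and serializes it to JSON. The validation rules (host `openalex.org`, path starting with `/works`, positive `maxItems`) are assumptions made for this example, not documented behaviour of the actor, and `build_input` is a hypothetical helper.

```python
import json
from urllib.parse import urlsplit


def build_input(search_url: str, max_items: int = 100) -> dict:
    """Build an actor input dict with the documented field names.

    The checks below are illustrative assumptions, not the actor's own rules.
    """
    parts = urlsplit(search_url)
    if parts.scheme not in ("http", "https") or parts.hostname != "openalex.org":
        raise ValueError("searchUrl should point at openalex.org")
    if not parts.path.startswith("/works"):
        raise ValueError("searchUrl should be a works search URL")
    if max_items < 1:
        raise ValueError("maxItems should be a positive integer")
    return {"searchUrl": search_url, "maxItems": max_items}


if __name__ == "__main__":
    actor_input = build_input(
        "https://openalex.org/works?page=1&filter=title_and_abstract.search:ai",
        max_items=50,
    )
    print(json.dumps(actor_input, indent=2))
```

The resulting JSON can be pasted into the actor's input editor or passed to whichever client you use to start the run.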

Output

Each output item includes:

  • searchUrl: The search URL you provided
  • workId: OpenAlex work identifier
  • detailPageUrl: Link to the work’s page on OpenAlex
  • title: Title of the work
  • year: Publication year
  • authors: Author names as shown
  • venue: Publication venue or source
  • citedByCount: Number of citations
  • pdfUrl: Link to a PDF when available, otherwise null
  • scrapedAt: When the item was collected

Example output:

[
  {
    "searchUrl": "https://openalex.org/works?page=1&filter=title_and_abstract.search:ai",
    "workId": "w2896457183",
    "detailPageUrl": "https://openalex.org/works?page=1&filter=title_and_abstract.search:ai&zoom=w2896457183",
    "title": "AI-Assisted Pipeline for Dynamic Generation of Trustworthy Health Supplement Content at Scale",
    "year": "2025",
    "authors": "Manjunath, Sampritha H., McCrae, John P.",
    "venue": "Leibniz-Zentrum für Informatik (Schloss Dagstuhl)",
    "citedByCount": 45160,
    "pdfUrl": null,
    "scrapedAt": "2026-01-30T09:28:15.655Z"
  }
]
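
Note that in the example `year` is a string while `citedByCount` is a number. The sketch below shows one way to load an item into a typed record, converting `year` to an integer and `scrapedAt` to a timezone-aware datetime. The `Work` class and `parse_work` function are hypothetical names introduced for illustration, and the handling of missing values is an assumption.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class Work:
    work_id: str
    title: str
    year: Optional[int]
    authors: str
    venue: Optional[str]
    cited_by_count: int
    pdf_url: Optional[str]
    scraped_at: datetime


def parse_work(item: dict) -> Work:
    """Convert one output item (as in the example above) into a Work."""
    year = item.get("year")
    return Work(
        work_id=item["workId"],
        title=item["title"],
        # The example emits the year as a string, so convert it here.
        year=int(year) if year else None,
        authors=item.get("authors", ""),
        venue=item.get("venue"),
        cited_by_count=int(item.get("citedByCount") or 0),
        pdf_url=item.get("pdfUrl"),
        # Replace the trailing "Z" so older Python versions can parse it.
        scraped_at=datetime.fromisoformat(item["scrapedAt"].replace("Z", "+00:00")),
    )


if __name__ == "__main__":
    sample = {
        "workId": "w2896457183",
        "title": "AI-Assisted Pipeline for Dynamic Generation of Trustworthy "
                 "Health Supplement Content at Scale",
        "year": "2025",
        "authors": "Manjunath, Sampritha H., McCrae, John P.",
        "venue": "Leibniz-Zentrum für Informatik (Schloss Dagstuhl)",
        "citedByCount": 45160,
        "pdfUrl": None,
        "scrapedAt": "2026-01-30T09:28:15.655Z",
    }
    print(parse_work(sample))
```

The `authors` field is kept as a single string because names in it are themselves comma-separated, so splitting on commas would not reliably recover individual authors.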

Use Cases

  • Building datasets of papers by topic or filter
  • Tracking citation counts and venues for research
  • Finding PDF links for works listed on OpenAlex
  • Literature surveys and reference list collection
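
For the citation-tracking and PDF-finding use cases, a downloaded dataset can be post-processed with a few lines of code. The sketch below assumes the items have been loaded as a list of dicts with the output fields listed earlier; `top_cited` and `with_pdf` are hypothetical helpers, and the sample records are made-up placeholders, not real scrape results.

```python
from typing import Dict, List


def top_cited(items: List[Dict], n: int = 10) -> List[Dict]:
    """Return the n items with the highest citedByCount, highest first."""
    return sorted(items, key=lambda it: it.get("citedByCount") or 0, reverse=True)[:n]


def with_pdf(items: List[Dict]) -> List[Dict]:
    """Keep only items whose pdfUrl is set (it is null when unavailable)."""
    return [it for it in items if it.get("pdfUrl")]


if __name__ == "__main__":
    # Placeholder records for illustration only.
    items = [
        {"workId": "w1", "title": "A", "citedByCount": 12, "pdfUrl": None},
        {"workId": "w2", "title": "B", "citedByCount": 340, "pdfUrl": "https://example.org/b.pdf"},
        {"workId": "w3", "title": "C", "citedByCount": 57, "pdfUrl": None},
    ]
    for work in top_cited(items, n=2):
        print(work["workId"], work["citedByCount"])
    print([work["workId"] for work in with_pdf(items)])
```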

Notes

  • Use the same URL you see in your browser after running a search on OpenAlex.
  • You can limit how many works are collected with the maxItems input (default: 100).
  • Results are deduplicated by work ID.
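
The item limit and deduplication noted above can be pictured with the following sketch. It assumes pages arrive as lists of items carrying a `workId` field; `collect` is a hypothetical function written for illustration and is not the actor's source code.

```python
from typing import Dict, Iterable, Iterator, List


def collect(pages: Iterable[List[Dict]], max_items: int = 100) -> Iterator[Dict]:
    """Yield unique works across pages, stopping once max_items is reached."""
    if max_items <= 0:
        return
    seen = set()
    for page in pages:
        for item in page:
            work_id = item["workId"]
            if work_id in seen:
                # Same work listed again (for example on a later page): skip it.
                continue
            seen.add(work_id)
            yield item
            if len(seen) >= max_items:
                # Limit reached: stop before requesting any further page.
                return


if __name__ == "__main__":
    pages = [
        [{"workId": "w1"}, {"workId": "w2"}],
        [{"workId": "w2"}, {"workId": "w3"}],
    ]
    print([item["workId"] for item in collect(pages, max_items=100)])
    print([item["workId"] for item in collect(pages, max_items=2)])
```

Because `collect` is a generator, passing it a lazy page source means later pages are never requested once the limit is hit.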