OpenAlex Scraper

Pricing

Pay per event

Optimize your academic research with our comprehensive OpenAlex scraper! Obtain complete academic information, including publication dates, DOI links, open access status, and citation metrics. Ideal for researchers, academic institutions, and data analysts who need accurate data without manual work.

Rating: 5.0 (1)

Developer: ParseForge (Maintained by Community)

Actor stats: 0 bookmarked, 5 total users, 0 monthly active users, last modified 6 days ago

📚 OpenAlex Scraper

Collect scholarly publications, research papers, and academic metadata from OpenAlex without coding. This scraper automates the collection of publication metadata, author affiliations, citation metrics, open access status, and research concepts from OpenAlex's free catalog of 250+ million works. Whether you're conducting literature reviews, tracking research trends, building citation networks, or gathering publication metrics, you can extract the data in CSV, JSON, or Excel format in minutes. It suits researchers, librarians, academic institutions, and data analysts who need scholarly data without manual collection or working with APIs directly.

The OpenAlex Scraper collects scholarly works with up to 23 data fields including citations, open access status, author information, and research concepts from OpenAlex's open catalog of 250+ million publications.

✨ What Does It Do

  • πŸ“ Work Title and Abstract - Extract publication titles and abstracts to understand research content and identify matching papers
  • πŸ”— DOI Links and Work URLs - Get persistent identifiers and direct links for citations and full text access
  • πŸ‘₯ Author Names and Affiliations - Collect author lists and institutional affiliations to map collaboration networks
  • 🏒 Institution Names - Gather affiliated organizations to analyze institutional research output
  • πŸ“Š Citation Metrics - Access citation counts and FWCI scores to measure research impact
  • πŸ“… Publication Dates - Filter and organize works by publication date to track research trends
  • πŸ”“ Open Access Status - Identify freely available publications and access open access URLs
  • 🎯 Research Concepts and Topics - Extract research domains and keywords by academic field

🎬 Demo Video

🔧 Input

  • Start URL - Direct OpenAlex URL to scrape from. Use either this or the search filters below. Example: https://openalex.org/works?page=1&sort=cited_by_count:desc
  • Max Items - Maximum results per run: up to 100 for free users and up to 1,000,000 for paid users
  • Author ID - Filter works by author IDs; multiple values are combined with logical OR
  • Funder ID - Filter works by funding organization IDs
  • Institution ID - Filter by institutional affiliations
  • Open Access - Toggle to collect only freely available publications
  • Title and Abstract - Search by keyword or phrase to find matching works
  • Topic ID - Filter by topic or concept IDs to narrow results by research domain
  • Keyword ID - Filter by keyword IDs to find works matching specific terms
  • Type - Filter by publication type (article, book, chapter, dataset, etc.)
  • From Publication Date - Collect works published on or after a date (format: YYYY-MM-DD)
  • To Publication Date - Collect works published on or before a date (format: YYYY-MM-DD)
  • Sort - Order results by Citation Count, Citation Percentile, FWCI, Title, or Year
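If you prefer to assemble the Start URL in code rather than copy it from the browser, a minimal Python sketch could look like the following. It only uses the two query parameters visible in the example URL above (`page` and `sort`); the helper name `build_start_url` is made up for illustration and is not part of the Actor.

```python
from urllib.parse import urlencode

BASE_URL = "https://openalex.org/works"


def build_start_url(sort: str = "cited_by_count:desc", page: int = 1) -> str:
    """Assemble a Start URL shaped like the example in the input list."""
    # safe=":" leaves the colon in "cited_by_count:desc" unescaped,
    # so the result matches the example URL character for character.
    query = urlencode({"page": page, "sort": sort}, safe=":")
    return f"{BASE_URL}?{query}"


print(build_start_url())
```

Other sort values or query parameters are not documented on this page, so treat anything beyond the example as something to verify against OpenAlex itself.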

Example input configuration:

{
  "startUrl": "https://openalex.org/works?page=1&sort=cited_by_count:desc",
  "maxItems": 10
}

Or using search filters:

{
  "search": "machine learning",
  "fromPublicationDate": "2020-01-01",
  "toPublicationDate": "2023-12-31",
  "isOA": true,
  "sort": "cited_by_count:desc",
  "maxItems": 10
}
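When the input is generated by a script, it can help to check it before starting a run. The sketch below is one possible approach, not the Actor's own validation: it reuses only the key names that appear in the two examples above, and the function name `build_input` and its checks are assumptions made for illustration.

```python
from datetime import datetime

DATE_KEYS = ("fromPublicationDate", "toPublicationDate")


def build_input(start_url=None, max_items=10, **filters):
    """Return a run-input dict using the key names from the examples above."""
    # The input list says to use either the Start URL or the search filters.
    if start_url and filters:
        raise ValueError("provide either startUrl or search filters, not both")

    # strptime raises ValueError when a date does not parse as year-month-day.
    parsed = {}
    for key in DATE_KEYS:
        if key in filters:
            parsed[key] = datetime.strptime(filters[key], "%Y-%m-%d")
    if len(parsed) == 2 and parsed[DATE_KEYS[0]] > parsed[DATE_KEYS[1]]:
        raise ValueError("fromPublicationDate must not be after toPublicationDate")

    run_input = {"maxItems": max_items, **filters}
    if start_url:
        run_input["startUrl"] = start_url
    return run_input


print(build_input(search="machine learning", isOA=True,
                  fromPublicationDate="2020-01-01"))
```

The resulting dict can be serialized with `json.dumps` and pasted into the Actor's JSON input editor.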

📊 Output

Each work includes up to 23 data fields. Download as JSON, CSV, or Excel.

📝 Title | 🔗 Work URL | 🔗 DOI Link
📄 Abstract | 👥 Author Names | 🏢 Institution Names
📅 Publication Date | 📊 Publication Year | 🎯 Work Type
💬 Citation Count | 📈 FWCI | 📊 Citation Percentile
🔓 Open Access Status | 🌐 Open Access URL | 🗣️ Language
📍 Primary Location | 🎓 Research Concepts | 🎯 Primary Topic
🏷️ Keywords | 📋 Bibliographic Info | ⏰ Scraped Timestamp
⚠️ Error

💎 Why Choose the OpenAlex Scraper?

Feature | Our Actor | Similar Tools
Direct OpenAlex access without credentials required | ✔️ | ❌
Search by author, institution, funder, or topic | ✔️ | Partial
Extract up to 23 fields per work including citations | ✔️ | ❌
Filter by publication date range | ✔️ | ❌
Collect up to 1,000,000 results for paid users | ✔️ | ❌
Export to CSV, JSON, and Excel formats | ✔️ | ✔️
Automatic pagination handling | ✔️ | ✔️
Research concept and topic classification | ✔️ | ❌
FWCI and citation percentile metrics included | ✔️ | ❌
Free to use, no subscription required | ✔️ | ❌
Works with any OpenAlex query syntax | ✔️ | ❌
Real-time scraped timestamp for data freshness | ✔️ | ❌

📋 How to Use

No technical skills required. Follow these simple steps:

  1. Sign Up: Create a free account with $5 credit
  2. Find the Tool: Search for "OpenAlex Scraper" in the Apify Store and configure your input
  3. Run It: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

🎯 Business Use Cases

  • πŸ“Š Research Analyst - Monitor publications in your field to identify emerging trends and influential papers before your competitors discover them
  • πŸŽ“ Academic Librarian - Collect institutional publication records and citation metrics to compile annual research impact reports and benchmark departmental performance
  • πŸ’Ό Data Researcher - Build comprehensive citation networks from multiple searches to map research evolution, identify collaboration patterns, and discover gaps in published knowledge

❓ FAQ

πŸ” How does it work? The scraper connects to OpenAlex to retrieve scholarly publication data. You can either provide a direct URL or use simple filter fields like author ID, institution, or keywords. No credentials needed.

📊 How accurate is the data? OpenAlex is a comprehensive open catalog maintained by the nonprofit OurResearch. Data quality depends on the upstream sources that OpenAlex aggregates, but its coverage of publication metadata, citations, and open access information is generally good.

📅 Can I schedule regular runs? Yes. Use the scheduler to run the scraper daily, weekly, or monthly to keep your research databases current and track new publications automatically.

⚖️ Is scraping OpenAlex data legal? Yes. OpenAlex is a public, open access database designed for research and analytics. All data is freely available. You are responsible for complying with local laws when processing data.

🛡️ Will OpenAlex block me? No. OpenAlex welcomes automated access. The scraper uses standard calls with appropriate rate limiting.

⚡ How long does a run take? Collecting 100 works typically takes 30 seconds to 2 minutes. Larger runs of 1,000+ works may take 10 to 30 minutes depending on network speed.

⚠️ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.

💡 More ParseForge Actors

Browse our complete collection of data extraction tools for more.

🚀 Ready to Start?

Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.

🆘 Need Help?

  • Check the FAQ section above for common questions
  • Visit the Apify support page for documentation and tutorials
  • Contact us through the Tally contact form to request a new scraper, propose a custom project, or report an issue

⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by OpenAlex or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.