Pricing

Pay per usage

Try for free

Go to Apify Store

OpenAlex Scraper

Try for free

Extract scholarly data from OpenAlex—titles, authors, institutions, venues, concepts—using this fast Apify actor. Get academic research in bulk via API, and export results as CSV, Excel, or HTML datasets for research, analytics, or discovery.

Pricing

Pay per usage

Rating

5.0

(1)

Developer

Shahid Irfan

Actor stats

Bookmarked

Total users

Monthly active users

4 months ago

Last modified

🚀 Key Features

Multi-Entity Support: Scrape works, authors, institutions, venues, and concepts from OpenAlex
Advanced Search & Filtering: Use powerful search queries with custom filters and sorting options
High-Volume Data Collection: Retrieve thousands of records with automatic pagination
Rate Limit Optimization: Polite pool access for maximum API throughput (up to 100,000 requests/day)
Automatic Error Handling: Built-in retries and rate limit management
Structured Data Output: Clean, consistent JSON output ready for analysis

📊 What You Can Scrape

Works: Research papers, articles, books with full metadata, abstracts, and citations
Authors: Researcher profiles with publication counts and institutional affiliations
Institutions: University and research organization data with country information
Venues: Journals, conferences, and publishers with impact metrics
Concepts: Research topics and keywords with hierarchical relationships

🔧 Input Configuration

Configure your scraping job with these parameters:

Parameter	Type	Description	Default
`search`	string	Search query (title, author, institution name, etc.)	""
`entity`	select	Entity type to scrape	"works"
`results_wanted`	integer	Maximum results to collect	100
`max_pages`	integer	Maximum API pages to fetch	10
`email`	string	Email for polite pool (higher rate limits)	""
`filters`	object	Additional API filters	{}
`sort`	string	Sort order	"relevance_score:desc"

Entity Options

works - Scholarly publications
authors - Researcher profiles
institutions - Academic organizations
venues - Publication outlets
concepts - Research topics

Example Filters

{
  "publication_year": "2023",
  "cited_by_count": ">100",
  "country_code": "US"
}

📤 Output Data Structure

Works Entity Example

{
  "id": "https://openalex.org/W123456789",
  "title": "Machine Learning in Healthcare: A Comprehensive Review",
  "authors": ["Dr. Jane Smith", "Prof. John Doe"],
  "institutions": ["Harvard University", "MIT"],
  "publication_year": 2023,
  "doi": "10.1234/health-ml-2023",
  "url": "https://openalex.org/W123456789",
  "abstract": "This paper explores the applications of machine learning...",
  "concepts": ["Machine Learning", "Healthcare", "Artificial Intelligence"],
  "cited_by_count": 245,
  "type": "journal-article",
  "source": "openalex.org"
}

Authors Entity Example

{
  "id": "https://openalex.org/A123456789",
  "display_name": "Dr. Jane Smith",
  "works_count": 87,
  "cited_by_count": 1250,
  "last_known_institution": "Harvard University",
  "orcid": "0000-0001-2345-6789",
  "source": "openalex.org"
}

🎯 Usage Examples

Basic Research Paper Search

{
  "search": "machine learning healthcare",
  "entity": "works",
  "results_wanted": 500,
  "email": "your-email@example.com"
}

Top Cited Authors in AI

{
  "entity": "authors",
  "search": "artificial intelligence",
  "sort": "cited_by_count:desc",
  "results_wanted": 100,
  "filters": {
    "works_count": ">50"
  }
}

University Research Output

{
  "entity": "institutions",
  "search": "Stanford University",
  "results_wanted": 1,
  "email": "researcher@university.edu"
}

{
  "entity": "concepts",
  "sort": "works_count:desc",
  "results_wanted": 50,
  "filters": {
    "level": "1"
  }
}

⚙️ Advanced Configuration

Optimizing for Large Datasets

Use email parameter for polite pool access
Set appropriate max_pages to control API usage
Apply filters to narrow results before pagination

Rate Limiting

Free tier: 100,000 requests/day
Polite pool (with email): Higher priority access
Automatic handling of rate limits with retry logic

Data Filtering Tips

Use publication_year for time-based analysis
Filter by cited_by_count for impact studies
Country codes for geographical research
Concept IDs for topic-specific queries

📈 Use Cases

Bibliometric Analysis: Track citation patterns and research impact
Literature Reviews: Systematic collection of papers on specific topics
Researcher Profiling: Build comprehensive author databases
Institutional Rankings: Compare research output across organizations
Trend Analysis: Identify emerging research areas and concepts
Academic Network Mapping: Discover collaborations and affiliations

🔍 API Integration

This scraper uses the official OpenAlex REST API:

Base URL: https://api.openalex.org
Documentation: OpenAlex API Guide
Rate Limits: 100,000 requests/day per user
No authentication required (email optional for polite pool)

📋 Limits & Considerations

Rate Limits: 100,000 API calls per day (higher with polite pool)
Result Limits: Up to 10,000 results per entity type
Data Freshness: OpenAlex updates data regularly
Data Coverage: Over 200 million works, 15 million authors, 100,000 institutions

🤝 Contributing

Found a bug or have a feature request? Open an issue on our GitHub repository.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Keywords: OpenAlex scraper, academic data extraction, scholarly works API, bibliometric data, research papers scraper, author profiles, institution data, academic analytics, citation analysis, research trends

Openalex Scraper

automation-lab/openalex-scraper

Extract research papers from OpenAlex — titles, authors, citations, institutions, and open access links.

Stas Persiianenko

OpenAlex Academic Research Scraper - Scholarly Papers

cloud9_ai/openalex-scraper

Search and extract academic papers, authors, institutions, and research topics from OpenAlex. Free open API covering 250M+ scholarly works. Get citations, abstracts, open access URLs.

cloud9

OpenAlex Scraper – Cheap 📚🪄✨

scrapestorm/openalex-scraper---cheap

🔍 Easily extract OpenAlex research Collect structured academic data from OpenAlex, including publication titles, authors, institutions, sources, years, citations, funding details, & entity URLs 📚📊 Ideal for bibliometric analysis, research intelligence, funding analysis & academic insights 🌍🧠

Storm_Scraper

OpenAlex Works Scraper

powerai/openalex-works-scraper

Collect scholarly works from OpenAlex search results by URL, with automatic pagination and structured data (title, authors, venue, citations, PDF link).

PowerAI

OpenAlex Research Paper Search

ryanclinton/openalex-research-search

Search 250M+ academic papers via OpenAlex API. Filter by keyword, year, citations, and open access status. Returns authors, affiliations, journal, DOI, and concepts. Free, no API key needed.

ryan clinton

OpenAlex Scraper

parseforge/openalex-scraper

Optimize your academic research with our comprehensive OpenAlex scraper! Obtain complete academic information, including publication dates, DOI links, open access status, and citation metrics. Ideal for researchers, academic institutions, and data analysts who need accurate data without manual work.

ParseForge

5.0

Facebook Events Scraper

scraper-engine/facebook-events-scraper

Scrape Facebook events effortlessly using this Apify actor. It collects event titles, dates, venues, organizers, and links from Facebook pages or search results. Ideal for market research, event aggregation, or trend analysis with structured, exportable data in JSON, CSV, or Excel formats.

Scraper Engine

5.0

Semantic Scholar Scraper

parseforge/semantic-scholar-scraper

Extract detailed academic paper data from Semantic Scholar, including abstracts, citations, authors, and publication details. Ideal for researchers, academics, and analysts who need structured scholarly data for literature reviews, research workflows, and large-scale academic analysis.

ParseForge

5.0

Crunchbase Companies Scraper-Extract Company Data & Export

davidsharadbhatt/crunchbase-companies-bulk-scraper

Scrape Crunchbase company data in bulk — including names, industries, locations, funding details, and more. Export all data to CSV, JSON, or Excel for analysis, lead generation, or market research.

David Bhatt

Youtube Video Search ⚡⚡: Data, Details & Analytics

thedoor/youtube-video-search

YouTube Search Extraction: $0.7/1k Results 📊 Automate research with our high-speed search scraper. Support for bulk keywords and all URL formats. Get video titles, views, and channel info in CSV, Excel, or JSON. Fast, clean data for your spreadsheets and apps. 🚀