OpenAlex Scraper
Pricing
from $2.00 / 1,000 record scrapeds
OpenAlex Scraper
Scrape OpenAlex - the free open catalog of scholarly works (250M+ papers, 100M+ authors, 100K institutions). Search across works, authors, institutions, concepts, journals. Returns title, abstract, authors, citations, DOI, OA status, and more.
Pricing
from $2.00 / 1,000 record scrapeds
Rating
0.0
(0)
Developer
Gio
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Scrape OpenAlex - the free open catalog of scholarly works. 250M+ papers, 100M+ authors, 100K+ institutions, 65K+ concepts, 250K+ journals.
Five entity types:
- works - papers, books, datasets (title, abstract, authors, citations, DOI, OA status)
- authors - researchers (ORCID, h-index, i10 index, institution)
- institutions - universities/labs (country, works count, ROR ID)
- concepts - research topics
- sources - journals, repositories, conferences
Useful for academic research, citation analysis, bibliometrics, trend detection, researcher discovery.
Free vs. paid
- Free plan: 10 clearly-labelled mock records.
- Paid plan: real, live OpenAlex data.
Input
| Field | Type | Description |
|---|---|---|
entity | String (required) | works, authors, institutions, concepts, sources. |
searchTerm | String | Free text search. |
filter | String | OpenAlex filter syntax (e.g. publication_year:2024, is_oa:true, cited_by_count:>100). |
maxItems | Integer | Default 100, max 10,000. |
debug | Boolean | Verbose logs. |
Output (works example)
{"entityType": "work","id": "https://openalex.org/W4406789012","displayName": "Attention Is All You Need","title": "Attention Is All You Need","doi": "https://doi.org/10.48550/arxiv.1706.03762","publicationYear": 2017,"publicationDate": "2017-06-12","type": "article","citedByCount": 95432,"isOpenAccess": true,"oaUrl": "https://arxiv.org/pdf/1706.03762","authors": ["Ashish Vaswani", "Noam Shazeer", "..."],"institutions": ["Google Brain"],"primarySourceTitle": "arXiv (Cornell University)","language": "en","abstract": "The dominant sequence transduction models...","concepts": ["Artificial neural network", "Transformer", "Attention"],"referenceCount": 28,"url": "https://openalex.org/W4406789012"}
Pricing
$0.002/record. 1,000 records = $2.
Limitations
- OpenAlex API has a polite rate limit (~10 req/sec); the actor adds ~250ms between paginated calls.
- For best citation/identifier coverage, include an email in the
User-Agentheader (already set by the actor). abstractis reconstructed from OpenAlex's inverted index. Some old works don't have one.
If this actor helped you, please leave a review on the Apify Store.