ArXiv Academic Paper Scraper
Pricing
from $1.00 / 1,000 results
Scrape academic papers from ArXiv. Extract titles, authors, abstracts, categories, and PDF links. Essential for research and literature reviews.
Developer

Fortuitous Pirate
Maintained by Community
6 days ago
Last modified
arXiv Papers Scraper
Overview
Search and extract academic papers from arXiv.org. Access 2.
Features
- Search by keywords to find specific results
- Filter results by category or type
- Export data in JSON, CSV, or Excel formats
- Includes ratings and review data
Use Cases
- Aggregate academic papers and research data
- Build course catalogs and educational resource databases
- Track educational institution data and rankings
- Monitor academic publishing trends
Input Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
| searchQuery | string | Search query (e.g., 'machine learning', 'quantum computing', 'neural networks') | machine learning |
| category | string | arXiv category (e.g., cs.AI, cs.LG, physics, math, q-bio) | |
| author | string | Filter by author name | |
| sortBy | string | Sort order for results | relevance |
| sortOrder | string | Sort direction (descending or ascending) | descending |
| limit | integer | Maximum number of papers to return | 100 |
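
As a minimal sketch of how the parameters above fit together, the snippet below assembles a run input dictionary and checks the two constraints the table states (the sort direction values and a positive limit). The helper name `build_run_input` is hypothetical and not part of the actor or the Apify SDK; the keys and defaults are taken from the table.

```python
from typing import Any, Dict, Optional

# Sort directions listed in the parameter table.
ALLOWED_SORT_ORDERS = {"descending", "ascending"}


def build_run_input(
    search_query: str = "machine learning",
    category: Optional[str] = None,
    author: Optional[str] = None,
    sort_by: str = "relevance",
    sort_order: str = "descending",
    limit: int = 100,
) -> Dict[str, Any]:
    """Hypothetical helper: build an input dict using the documented keys."""
    if sort_order not in ALLOWED_SORT_ORDERS:
        raise ValueError(f"sortOrder must be one of {sorted(ALLOWED_SORT_ORDERS)}")
    if limit < 1:
        raise ValueError("limit must be a positive integer")

    run_input: Dict[str, Any] = {
        "searchQuery": search_query,
        "sortBy": sort_by,
        "sortOrder": sort_order,
        "limit": limit,
    }
    # Optional filters are only sent when they are set.
    if category:
        run_input["category"] = category
    if author:
        run_input["author"] = author
    return run_input
```

The resulting dictionary can be passed as the request body or as `run_input` when starting a run.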
Output Example
Each result contains structured data like this:

    {
      "arxivId": "ABC-12345",
      "title": "Sample Education/Academic Result",
      "authors": ["J. Smith", "A. Johnson"],
      "primaryCategory": "Sample primaryCategory",
      "published": "Sample published",
      "pdfUrl": "https://example.com/item",
      "abstractUrl": "https://example.com/item",
      "summary": "Detailed description of the item..."
    }

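
For downstream processing it can help to load each result into a typed record. The sketch below assumes every item carries the fields shown in the sample above; the `Paper` class and the `papers_to_csv` helper are illustrative names, not part of the actor. The sample values are the placeholder data from the output example.

```python
import csv
import io
from dataclasses import dataclass
from typing import Any, Dict, Iterable, List

# Placeholder item copied from the output example.
SAMPLE_ITEM: Dict[str, Any] = {
    "arxivId": "ABC-12345",
    "title": "Sample Education/Academic Result",
    "authors": ["J. Smith", "A. Johnson"],
    "primaryCategory": "Sample primaryCategory",
    "published": "Sample published",
    "pdfUrl": "https://example.com/item",
    "abstractUrl": "https://example.com/item",
    "summary": "Detailed description of the item...",
}


@dataclass
class Paper:
    arxiv_id: str
    title: str
    authors: List[str]
    primary_category: str
    published: str
    pdf_url: str
    abstract_url: str
    summary: str

    @classmethod
    def from_item(cls, item: Dict[str, Any]) -> "Paper":
        """Map one dataset item to a Paper; missing keys become empty values."""
        return cls(
            arxiv_id=item.get("arxivId", ""),
            title=item.get("title", ""),
            authors=list(item.get("authors") or []),
            primary_category=item.get("primaryCategory", ""),
            published=item.get("published", ""),
            pdf_url=item.get("pdfUrl", ""),
            abstract_url=item.get("abstractUrl", ""),
            summary=item.get("summary", ""),
        )


def papers_to_csv(papers: Iterable[Paper]) -> str:
    """Write a compact CSV (id, title, authors, PDF link) and return it as text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["arxiv_id", "title", "authors", "pdf_url"])
    for paper in papers:
        writer.writerow(
            [paper.arxiv_id, paper.title, "; ".join(paper.authors), paper.pdf_url]
        )
    return buffer.getvalue()
```

Calling `papers_to_csv([Paper.from_item(SAMPLE_ITEM)])` yields a header row followed by one row for the sample paper.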
Pricing
This actor uses pay-per-result pricing:
- $0.001 per result
- $1.00 per 1,000 results
No monthly fees. You only pay for what you scrape. Apify Free plan includes $5/month in platform credits.
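
The pricing arithmetic can be sketched as follows. The function names are illustrative, and the credit calculation assumes the $5 monthly credit is spent only on this actor's per-result charge, which may not hold if other platform usage draws on the same credits.

```python
from decimal import Decimal

# Rates quoted in the Pricing section.
PRICE_PER_RESULT_USD = Decimal("0.001")
FREE_PLAN_MONTHLY_CREDIT_USD = Decimal("5")


def estimate_cost_usd(result_count: int) -> Decimal:
    """Cost of a run that returns `result_count` papers."""
    if result_count < 0:
        raise ValueError("result_count must not be negative")
    return PRICE_PER_RESULT_USD * result_count


def results_covered_by_credit(
    credit_usd: Decimal = FREE_PLAN_MONTHLY_CREDIT_USD,
) -> int:
    """Number of results a given credit pays for (assumption: no other charges)."""
    return int(credit_usd / PRICE_PER_RESULT_USD)
```

Under that assumption, `estimate_cost_usd(1000)` corresponds to the quoted $1.00 per 1,000 results, and the free credit covers 5,000 results per month.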
How to Run
Apify Console
- Go to the arXiv Papers Scraper actor page
- Configure your input parameters
- Click Start and wait for the results
- Download data in JSON, CSV, or Excel format
API

    curl -X POST "https://api.apify.com/v2/acts/fortuitous_pirate~arxiv-scraper/runs?token=YOUR_API_TOKEN" \
      -H "Content-Type: application/json" \
      -d '{"limit": 10}'

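
The request above only starts a run; the results land in the run's default dataset. The sketch below builds the URL for downloading those items through the Apify dataset items endpoint. It assumes you have read the `defaultDatasetId` from the run object (the same field the Python SDK example uses); the helper name `dataset_items_url` is hypothetical.

```python
from typing import Optional
from urllib.parse import urlencode

APIFY_API_BASE = "https://api.apify.com/v2"


def dataset_items_url(
    dataset_id: str, fmt: str = "json", token: Optional[str] = None
) -> str:
    """Build the download URL for a dataset's items in the given format."""
    params = {"format": fmt}
    # The token is only needed when the dataset is not publicly readable.
    if token:
        params["token"] = token
    return f"{APIFY_API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"
```

For example, `dataset_items_url("YOUR_DATASET_ID", fmt="csv")` gives a link that can be fetched with curl or opened in a browser once the run has finished.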
Python SDK

    from apify_client import ApifyClient

    client = ApifyClient("YOUR_API_TOKEN")

    # Start the actor and wait for the run to finish
    run = client.actor("fortuitous_pirate/arxiv-scraper").call(run_input={"limit": 10})

    # Print every paper stored in the run's default dataset
    for item in client.dataset(run["defaultDatasetId"]).iterate_items():
        print(item)

Integration
Connect arXiv Papers Scraper with your existing tools and workflows:
- API access - Programmatic access via Apify API
- Webhooks - Get notified when scraping completes
- Scheduling - Set up recurring runs on any schedule
- Zapier / Make - Connect with 5,000+ apps via Apify integrations
- Python / Node.js SDKs - Native client libraries for easy integration
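
As a rough sketch of the webhook integration, the function below picks the dataset ID out of a notification body. It assumes the webhook uses Apify's default payload layout, with an `eventType` field and the run object under `resource`; a custom payload template would need different keys. The function name is illustrative.

```python
from typing import Any, Dict, Optional

# Event type assumed to be sent when a run finishes successfully.
SUCCESS_EVENT = "ACTOR.RUN.SUCCEEDED"


def dataset_id_from_webhook(payload: Dict[str, Any]) -> Optional[str]:
    """Return the run's default dataset ID, or None for other event types."""
    if payload.get("eventType") != SUCCESS_EVENT:
        return None
    # Assumption: the default payload embeds the run object as "resource".
    resource = payload.get("resource") or {}
    return resource.get("defaultDatasetId")
```

A small HTTP endpoint can call this on each incoming request body and then fetch the dataset items for successful runs.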