ArXiv Academic Paper Scraper avatar

ArXiv Academic Paper Scraper

Pricing

from $1.00 / 1,000 results

Go to Apify Store
ArXiv Academic Paper Scraper

ArXiv Academic Paper Scraper

Scrape academic papers from ArXiv. Extract titles, authors, abstracts, categories, and PDF links. Essential for research and literature reviews.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Fortuitous Pirate

Fortuitous Pirate

Maintained by Community

Actor stats

1

Bookmarked

3

Total users

2

Monthly active users

6 days ago

Last modified

Categories

Share

arXiv Papers Scraper

Overview

Search and extract academic papers from arXiv. org. Access 2.

Features

  • Search by keywords to find specific results
  • Filter results by category or type
  • Export data in JSON, CSV, or Excel formats
  • Includes ratings and review data

Use Cases

  • Aggregate - Aggregate academic papers and research data
  • Build - Build course catalogs and educational resource databases
  • Track - Track educational institution data and rankings
  • Monitor - Monitor academic publishing trends

Input Parameters

ParameterTypeDescriptionDefault
searchQuerystringSearch query (e.g., 'machine learning', 'quantum computing', 'neural networks')machine learning
categorystringarXiv category (e.g., cs.AI, cs.LG, physics, math, q-bio)
authorstringFilter by author name
sortBystringSort order for resultsrelevance
sortOrderstringSort direction (descending or ascending)descending
limitintegerMaximum number of papers to return100

Output Example

Each result contains structured data like this:

{
"arxivId": "ABC-12345",
"title": "Sample Education/Academic Result",
"authors": [
"J. Smith",
"A. Johnson"
],
"primaryCategory": "Sample primaryCategory",
"published": "Sample published",
"pdfUrl": "https://example.com/item",
"abstractUrl": "https://example.com/item",
"summary": "Detailed description of the item..."
}

Pricing

This actor uses pay-per-result pricing:

  • $0.001 per result
  • $1.00 per 1,000 results

No monthly fees. You only pay for what you scrape. Apify Free plan includes $5/month in platform credits.

How to Run

Apify Console

  1. Go to the arXiv Papers Scraper actor page
  2. Configure your input parameters
  3. Click Start and wait for the results
  4. Download data in JSON, CSV, or Excel format

API

curl -X POST "https://api.apify.com/v2/acts/fortuitous_pirate~arxiv-scraper/runs?token=YOUR_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{"maxItems": 10}'

Python SDK

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("fortuitous_pirate/arxiv-scraper").call(
run_input={"maxItems": 10}
)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

Integration

Connect arXiv Papers Scraper with your existing tools and workflows:

  • API access - Programmatic access via Apify API
  • Webhooks - Get notified when scraping completes
  • Scheduling - Set up recurring runs on any schedule
  • Zapier / Make - Connect with 5,000+ apps via Apify integrations
  • Python / Node.js SDKs - Native client libraries for easy integration