Pricing

Pay per usage

Arxiv Paper Scraper

Developer

Donny Nguyen

Last modified

3 days ago

Overview

arXiv Paper Scraper extracts academic paper metadata from arXiv.org using the official public API. It collects paper titles, complete author lists, abstracts, arXiv categories, publication dates, PDF download links, and DOI identifiers. This actor is designed for researchers, data scientists, and academic professionals who need to systematically collect and analyze research paper data from one of the largest open-access preprint repositories in the world.

Features

Search arXiv papers by multiple keywords or topics simultaneously
Filter results by arXiv categories (cs.AI, cs.CL, math.CO, etc.)
Extract comprehensive paper metadata including abstracts and author lists
Direct links to PDF downloads and abstract pages
Configurable result limits per search term
Uses the official arXiv API for reliable and structured data extraction
Built-in rate limiting to respect arXiv API guidelines
Fallback data ensures results are always returned
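As a rough illustration of keyword search combined with category filtering, the sketch below builds a request URL for the public arXiv API query endpoint. How this actor actually combines search terms and categories is not documented here, so the query shape (a phrase search across all fields, AND-ed with an OR group of categories) and the helper name build_query_url are assumptions for illustration only.

```python
from urllib.parse import urlencode

# Public arXiv API query endpoint.
ARXIV_API = "http://export.arxiv.org/api/query"


def build_query_url(term, categories=(), start=0, max_results=100):
    """Build an arXiv API URL for one search term (hypothetical helper).

    Assumption: the term is searched as a quoted phrase in all fields and
    must match at least one of the given categories.
    """
    query = f'all:"{term}"'
    if categories:
        category_filter = " OR ".join(f"cat:{c}" for c in categories)
        query = f"{query} AND ({category_filter})"
    params = {"search_query": query, "start": start, "max_results": max_results}
    return f"{ARXIV_API}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_query_url("large language models", ["cs.AI", "cs.CL"], max_results=10))
```

The API responds with an Atom feed, so a real client would still need to parse the XML entries into records.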

Input Parameters

Parameter	Type	Default	Description
searchTerms	array	["large language models", "transformer architecture"]	Keywords to search
categories	array	["cs.AI", "cs.CL"]	arXiv category filters
maxResults	integer	200	Maximum number of papers to extract
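A minimal sketch of preparing an input object from the table above. The defaults are copied from the table; the helper name build_input and the validation rules (arrays of strings, a positive integer for maxResults) are assumptions for illustration, not the actor's own schema checks.

```python
import json

# Defaults copied from the parameter table above.
DEFAULT_INPUT = {
    "searchTerms": ["large language models", "transformer architecture"],
    "categories": ["cs.AI", "cs.CL"],
    "maxResults": 200,
}


def build_input(**overrides):
    """Merge overrides into the defaults and check basic types (hypothetical helper)."""
    unknown = set(overrides) - set(DEFAULT_INPUT)
    if unknown:
        raise ValueError(f"Unknown parameters: {sorted(unknown)}")
    run_input = {**DEFAULT_INPUT, **overrides}
    for key in ("searchTerms", "categories"):
        value = run_input[key]
        if not isinstance(value, list) or not all(isinstance(v, str) for v in value):
            raise TypeError(f"{key} must be an array of strings")
    max_results = run_input["maxResults"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_results, bool) or not isinstance(max_results, int) or max_results < 1:
        raise ValueError("maxResults must be a positive integer")
    return run_input


if __name__ == "__main__":
    run_input = build_input(searchTerms=["graph neural networks"], maxResults=50)
    print(json.dumps(run_input, indent=2))
```

The printed JSON can be pasted into the actor's input editor or passed to whichever client is used to start the run.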

Output Format

Each paper in the dataset includes:

title - Paper title
authors - Comma-separated list of authors
abstract - Paper abstract
categories - arXiv categories assigned to the paper
publishedDate - Original publication date
updatedDate - Last update date
pdfUrl - Direct link to PDF download
doi - Digital Object Identifier (if available)
arxivId - arXiv paper identifier
absUrl - Link to abstract page
searchTerm - The search term that found this paper
scrapedAt - Timestamp of data extraction
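Because authors arrives as a single comma-separated string and doi is only present when available, downstream code usually needs a little normalization. The sketch below shows one way to do that; the helper names and the sample record are made-up placeholders, not real output from the actor.

```python
from typing import List, TypedDict


class PaperRecord(TypedDict, total=False):
    """Field names follow the output list above; every field is optional here."""
    title: str
    authors: str  # comma-separated, per the field list above
    abstract: str
    doi: str
    arxivId: str
    pdfUrl: str
    absUrl: str


def author_list(record: PaperRecord) -> List[str]:
    """Split the comma-separated authors field into trimmed names."""
    return [name.strip() for name in record.get("authors", "").split(",") if name.strip()]


def has_doi(record: PaperRecord) -> bool:
    """True when the record carries a non-empty DOI."""
    return bool((record.get("doi") or "").strip())


if __name__ == "__main__":
    # Placeholder record for illustration only.
    sample: PaperRecord = {"title": "Example Paper Title", "authors": "First Author, Second Author"}
    print(author_list(sample), has_doi(sample))
```

Splitting on commas is a simplification: a name that itself contains a comma would be split in two.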

Use Cases

This scraper is suited to conducting systematic literature reviews across research domains, tracking publication trends in AI and machine learning, building citation databases for academic projects, monitoring new research in specific arXiv categories, creating reading lists for research groups, and aggregating paper metadata for bibliometric analysis. The structured output can be imported into reference managers and research databases.

Pricing

This actor uses pay-per-event pricing at $0.30 per 1,000 papers scraped. Because it relies on the free arXiv API, costs stay low, and there are no subscription fees or minimum commitments. At that rate, a typical run extracting 200 papers costs about $0.06 in data delivery charges.
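To estimate the cost of a run from the per-1,000 rate quoted above, the arithmetic can be sketched as follows. The function name estimated_cost is hypothetical, and the figure covers only the per-paper charge stated here, not any other platform usage.

```python
from decimal import Decimal

# USD per 1,000 papers, taken from the pricing text above.
PRICE_PER_1000_PAPERS = Decimal("0.30")


def estimated_cost(papers: int) -> Decimal:
    """Return the estimated charge in USD for a given number of papers."""
    if papers < 0:
        raise ValueError("papers must be non-negative")
    cost = PRICE_PER_1000_PAPERS * papers / 1000
    return cost.quantize(Decimal("0.0001"))


if __name__ == "__main__":
    for count in (200, 1000, 5000):
        print(count, "papers:", estimated_cost(count), "USD")
```

For the default maxResults of 200, this works out to 200 * 0.30 / 1,000 = 0.06 USD.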

Limitations

arXiv API has rate limits requiring 3-second delays between requests
Search results are limited by the arXiv API capabilities
Full-text content is not extracted, only metadata and abstracts
Some older papers may have incomplete metadata or missing DOIs
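The 3-second delay between requests can be enforced with a small throttle like the one sketched below. This is an illustration of the general technique, not the actor's actual implementation; the class name Throttle and its parameters are hypothetical.

```python
import time


class Throttle:
    """Ensure at least min_interval seconds pass between consecutive calls to wait()."""

    def __init__(self, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock  # injectable so the logic can be tested without real waiting
        self._sleep = sleep
        self._last = None  # time of the previous call, None before the first one

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


if __name__ == "__main__":
    throttle = Throttle(min_interval=3.0)
    for page in range(3):
        throttle.wait()  # the first call returns immediately, later calls pause as needed
        print("would request page", page)
```

Calling wait() immediately before each API request keeps the spacing regardless of how long response parsing takes.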

Built by consummate_mandala on Apify.

Arxiv Paper Scraper

technicaldost/arxiv-paper-scraper

Technical Dost Solutions

arXiv Search Scraper 📚

easyapi/arxiv-search-scraper

Extract comprehensive research paper data from arXiv search results. Get detailed metadata including titles, authors, abstracts, categories and more. Perfect for academic research monitoring, trend analysis and building paper databases. 🎓📚

EasyApi

5.0

Scrape Arxiv Paper — Data, Details & Metadata

tropical_quince/arxiv-paper-scraper

Scrape arxiv paper data at scale with this powerful Apify actor. Extracts data, details & metadata with automatic pagination and proxy rotation. Perfect for market research, competitive intelligence, and data-driven decision making.

Donny Nguyen

arXiv Paper Scraper

cloud9_ai/arxiv-paper-scraper

Scrape academic papers from arXiv.org. Search by keyword, browse categories, or get latest papers. Extract titles, abstracts, authors, PDF links, and citation data via arXiv API.

cloud9

ArXiv Paper Scraper

nexgendata/arxiv-scraper

Extract research papers, abstracts, authors, and citations from arXiv.org. Perfect for academic research monitoring, literature reviews, and scientific trend analysis.

Stephan Corbeil

ArXiv Academic Paper Scraper

fortuitous_pirate/arxiv-scraper

Scrape academic papers from ArXiv. Extract titles, authors, abstracts, categories, and PDF links. Essential for research and literature reviews.

Fortuitous Pirate

arXiv Scraper

artificially/arxiv-scraper

Search and extract academic papers from arXiv.org. Get paper titles, authors, abstracts, categories, and PDF links for AI/ML, physics, math, and more.

Artificially

Arxiv Citation Network Scraper

codepoetry/arxiv-citation-network-scraper

A professional Apify Actor that scrapes academic papers from arXiv and builds citation networks. Extract paper metadata, analyze author collaborations, track research trends, and discover emerging topics in science and technology.

CodePoetry

Academic Paper Scraper

constant_quadruped/academic-paper-scraper

Search arXiv and PubMed in one request. Returns unified paper data: titles, authors, abstracts, DOIs, and PDF links. Filter by keywords, authors, categories, and date range. Built-in rate limiting and cross-source deduplication. Export to JSON, CSV, or Excel.