Pricing

from $1.00 / 1,000 results

arXiv Search & Paper Scraper

Search arXiv and get clean structured JSON for each paper: title, authors, abstract, categories, DOI, PDF link, and dates. Built for research, datasets, and AI pipelines.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Nicolas van Arkens

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

arXiv Search & Paper Scraper 📚

Search arXiv and get clean, structured JSON for every paper — title, authors, abstract, categories, DOI, journal reference, PDF link, and dates. The arXiv API returns awkward Atom XML; this actor does the parsing for you and hands back tidy records ready for analysis, datasets, citation management, or feeding papers to an LLM.

Why use it

🔎 Flexible search — by keywords, author, arXiv category, or title
👥 Authors as a clean list — not a blob of XML
🏷️ Categories split out — primary category plus all cross-listed ones
🔗 Direct PDF + abstract links — and DOI / journal reference when available
📅 Parsed dates — published and last-updated
🧹 Normalized text — abstracts cleaned of the API's messy whitespace
↕️ Sort by relevance, last updated, or submission date

Use cases

Literature reviews & research — pull every recent paper in a field
Building datasets — assemble structured corpora of papers and abstracts
LLM / RAG pipelines — feed clean abstracts and metadata to models
Trend monitoring — track new submissions in a category over time
Citation & reference tooling — grab DOIs and journal refs at scale

Input

Field	Description
Search query	Free-text keywords across all fields.
Author	Restrict to an author (phrase match).
Category	arXiv code, e.g. `cs.LG`, `cs.CL`, `stat.ML`.
Title contains	Restrict by title phrase.
Sort by / order	Relevance, last updated, or submitted; asc/desc.
Maximum papers	How many to return.

Output

{
  "arxivId": "1706.03762v7",
  "version": 7,
  "title": "Attention Is All You Need",
  "summary": "The dominant sequence transduction models are based on...",
  "authors": ["Ashish Vaswani", "Noam Shazeer", "Niki Parmar"],
  "authorCount": 3,
  "primaryCategory": "cs.CL",
  "categories": ["cs.CL", "cs.LG"],
  "published": "2017-06-12T17:57:34Z",
  "updated": "2023-08-02T00:41:18Z",
  "doi": "10.5555/3295222.3295349",
  "journalRef": "NeurIPS 2017",
  "pdfUrl": "http://arxiv.org/pdf/1706.03762v7",
  "absUrl": "http://arxiv.org/abs/1706.03762v7"
}

Export to JSON, CSV, or Excel, or pull via the Apify API. Connect to Sheets, Notion, Slack, Zapier, or Make.

Notes

Uses the official public arXiv API. Independent tool, not affiliated with arXiv or Cornell University.
Please be considerate with large jobs; the actor paces requests to respect arXiv's API guidelines.
arXiv category reference: see arxiv.org/category_taxonomy for the full list of codes.

arXiv Scraper

dami_studio/arxiv-scraper

Search arXiv via the official API and return structured paper metadata as JSON: title, abstract, authors, categories, DOI, dates, and abstract + PDF links. Best for literature reviews.

Dami's Studio

5.0

ArXiv Research Paper Scraper

datapilot/arxiv-research-paper-scraper

arXiv Research Paper Scraper retrieves academic paper metadata from the arXiv API based on a keyword. It extracts titles, abstracts, authors with affiliations, DOI, categories, submission dates, and PDF links. Supports proxy usage and outputs structured JSON results for research and data analysis.

Data Pilot

arXiv Research Paper Scraper

codingfrontend/arxiv-search-scraper

Extract comprehensive research paper data from arXiv search results including titles, authors, abstracts, categories, and more.

Coding Frontned

arXiv Papers Scraper: AI & Science Research Tracker

scrapemint/arxiv-papers-scraper

Track new research papers on arXiv by keyword, category, or author. One clean JSON row per paper: title, abstract, authors, categories, dates, PDF link, and DOI. Official open API, no key, no browser. Pay per paper.

Ken M

ArXiv Paper Search MCP

reverberant_equality/mcp-arxiv-search

Search ArXiv papers and retrieve paper details. AI agents can discover academic research, abstracts, authors, categories, and PDF links.

Jordan C

arXiv Research Paper Scraper

crawlerbros/arxiv-research-paper-scraper

Scrape research papers from arXiv.org - search by query, category, or author; lookup by arXiv ID. Returns title, authors, abstract, PDF URL, DOI, categories, and more. Uses the public arXiv Atom API. No login or proxy required.

Crawler Bros

arXiv Paper Search Scraper

fetch_cat/arxiv-paper-search-scraper

Search arXiv papers by keyword, author, category, and date using public paper metadata.

Hanna Nosova

AI Paper / arXiv Monitor

civicdataworks/ai-paper-arxiv-monitor

Search arXiv for AI/LLM/agent papers and export normalized paper metadata.

Rowan Mercer

arXiv Paper Scraper

plantane/arxiv-scraper

Scrape research papers from arXiv by search query or category. Get titles, abstracts, authors, categories, and PDF links via the public arXiv API.

Daniel

arXiv Papers Scraper

maximedupre/arxiv-papers-scraper

Search public arXiv paper records by query, category, author, title, abstract, date range, or advanced syntax. Export paper metadata, URLs, DOI values, and monitoring-ready results.