arXiv Paper Tracker - Free Academic API

Track new arXiv papers. Filter by category (cs.AI, cs.LG, q-bio.NC...), author, keyword, date. Returns title, authors, abstract, PDF link, DOI. Free official arXiv API. For AI/ML researchers, biotech analysts, journalists.

Pricing

Pay per event

Developer

Mohieldin Mohamed

arXiv Paper Tracker

Track new academic papers on arXiv. Filter by category, author, keyword, or date. Free official arXiv API. No key required.

This actor uses the official arXiv Atom-format public API to extract paper metadata with full structured fields. Tested on real data: 172,328 cs.AI papers, 307,287 cs.LG diffusion model papers, 759 papers with "Hinton" co-authors.

What you actually get

Real papers returned in test runs:

ID	Title	Authors
2604.13029v1	Visual Preference Optimization with Rubric Rewards	18 authors led by Ya-Qi Yu
2401.05779v4	Erasing Undesirable Influence in Diffusion Models	Jing Wu, Trung Le, Munawar Hayat
2603.24741v1	The Diffuse Gamma-ray Sky of a Milky Way Analogue	11 authors incl. Jim Hinton

Each row includes:

arXiv ID (e.g. 2604.13029v1)
Title (cleaned of XML escaping)
Full author list as array
Primary category + all categories (cs.AI, cs.CV, hep-ph, etc.)
Published date and last-updated date
Full abstract (optional)
Direct links: HTML abstract page + PDF
DOI (when paper is linked to a published version)
Author comment (e.g. "Accepted at NeurIPS 2024", page count)
Journal reference (when published)

Why use this

arXiv hosts 2.5 million+ research papers across CS, math, physics, biology, economics, and more. ~17,000 new papers are submitted every month. Tracking new releases in your domain is essential for:

AI/ML researchers — be the first to read new SOTA papers in your area
Tech investors and VCs — predict which research areas will become commercial in 12-24 months
Science journalists — break stories on new papers from labs you watch
Biotech analysts — track competitive research from labs at Big Pharma + universities
Indie hackers building AI tools — find new papers for your RAG dataset
Academic literature reviewers — automate paper discovery for systematic reviews
Patent attorneys and IP firms — track prior art in real time
Newsletters and blogs — automate "this week in AI" content sourcing

Alternatives that do similar things (free and paid):

Semantic Scholar API — free, but rate-limited and missing some fields
Connected Papers — $5-15/month, beautiful UI but no bulk export
Paperscape — free, visual, but no API
Google Scholar Alerts — free, email only, no structured export
Scite.ai — $99-499/month, citation-focused

This actor delivers structured JSON output with pay-per-event pricing — perfect for low-volume use that doesn't justify a $99/month subscription.

Honest limitations

arXiv asks API clients to wait 3 seconds between requests. We comply, so large pulls (10,000+ papers) take a while.
arXiv author search is fuzzy. Searching "Hinton" matches any author with "Hinton" in their name, not just Geoffrey Hinton. Use "Geoffrey Hinton" or include co-authors for disambiguation.
The boolean query syntax is arXiv's own, not standard SQL. Use AND, OR, ANDNOT, parentheses. Field prefixes: ti: (title), au: (author), abs: (abstract), cat: (category), all: (anywhere).
arXiv only covers preprints, not peer-reviewed final versions. Many papers eventually get DOIs, but the abstract on arXiv may differ slightly from the published version.
arXiv ID versions matter. 2604.13029v1 and 2604.13029v2 are different revisions. We return whatever version was current at fetch time.
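
To make the query syntax above concrete, here is a minimal sketch that composes a search string from categories, author names, and free-form terms. The helper name `build_search_query` and its parameters are illustrative and are not part of this actor's input schema. It also assumes that quoting a full author name is treated as a phrase match for `au:` the same way it is for other fields.

```python
from urllib.parse import urlencode


def build_search_query(categories=(), authors=(), terms=None):
    """Compose an arXiv search_query string (hypothetical helper)."""
    parts = []
    if categories:
        # Categories are OR'd: a paper in any listed category matches.
        parts.append("(" + " OR ".join(f"cat:{c}" for c in categories) + ")")
    for name in authors:
        # Quote the full name to narrow the fuzzy author match.
        parts.append(f'au:"{name}"')
    if terms:
        # Free-form terms may already contain AND / OR / ANDNOT.
        parts.append(f"({terms})")
    return " AND ".join(parts)


query = build_search_query(
    categories=["cs.AI", "cs.LG"],
    authors=["Geoffrey Hinton"],
    terms="ti:transformer ANDNOT ti:survey",
)
print(query)

# The same string, URL-encoded for a direct call to the public endpoint.
print("http://export.arxiv.org/api/query?" + urlencode({"search_query": query}))
```

Wrapping each group in parentheses keeps the OR between categories from mixing with the AND that joins the other filters.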

How to use

Click Try for free (or Start)
Pick categories to filter (e.g. ["cs.AI", "cs.LG"])
Optionally add search query with arXiv syntax (e.g. "transformer ANDNOT survey")
Optionally add author names for disambiguation
Pick sort order (newest first / recently updated / relevance)
Set max results (default 50, max 50,000)
Click Start

Common arXiv categories

Code	Field
`cs.AI`	Artificial Intelligence
`cs.LG`	Machine Learning
`cs.CL`	Computation and Language (NLP)
`cs.CV`	Computer Vision
`cs.RO`	Robotics
`cs.NE`	Neural and Evolutionary Computing
`cs.IR`	Information Retrieval
`cs.CR`	Cryptography and Security
`cs.DC`	Distributed Computing
`cs.SE`	Software Engineering
`cs.HC`	Human-Computer Interaction
`stat.ML`	Statistics - Machine Learning
`math.OC`	Math - Optimization and Control
`q-bio.NC`	Quantitative Biology - Neurons and Cognition
`q-fin.TR`	Quantitative Finance - Trading and Microstructure
`econ.EM`	Economics - Econometrics
`physics.bio-ph`	Physics - Biological Physics
`hep-ph`	High Energy Physics - Phenomenology

Full list: https://arxiv.org/category_taxonomy

Output

{
    "arxivId": "2604.13029v1",
    "title": "Visual Preference Optimization with Rubric Rewards",
    "authors": ["Ya-Qi Yu", "Fangyu Hong", "Xiangyang Qu", "Hao Wang", "..."],
    "primaryCategory": "cs.CV",
    "categories": ["cs.CV", "cs.AI"],
    "publishedAt": "2026-04-14T17:58:22Z",
    "updatedAt": "2026-04-14T17:58:22Z",
    "abstract": "The effectiveness of Direct Preference Optimization (DPO) depends on preference data...",
    "absUrl": "https://arxiv.org/abs/2604.13029v1",
    "pdfUrl": "https://arxiv.org/pdf/2604.13029v1",
    "doi": null,
    "comment": "12 pages, 5 figures",
    "journalRef": null,
    "extractedAt": "2026-04-15T21:35:00.000Z"
}
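
A record like the one above can be consumed downstream with a few lines of Python. The sketch below is one possible approach, not code shipped with the actor: it splits the versioned ID into a base ID and a revision number and parses the timestamps. The helper names are illustrative.

```python
import re
from datetime import datetime

# Matches a trailing "v<digits>" revision suffix, e.g. "2604.13029v1".
ARXIV_ID = re.compile(r"^(?P<base>.+?)v(?P<version>\d+)$")


def split_arxiv_id(arxiv_id):
    """Return (base_id, version); version is None if no suffix is present."""
    match = ARXIV_ID.match(arxiv_id)
    if match is None:
        return arxiv_id, None
    return match.group("base"), int(match.group("version"))


def parse_timestamp(value):
    """Parse an ISO 8601 UTC timestamp such as '2026-04-14T17:58:22Z'."""
    # Older Python versions reject a trailing "Z", so spell out the offset.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


record = {
    "arxivId": "2604.13029v1",
    "publishedAt": "2026-04-14T17:58:22Z",
    "updatedAt": "2026-04-14T17:58:22Z",
}

base_id, version = split_arxiv_id(record["arxivId"])
published = parse_timestamp(record["publishedAt"])
print(base_id, version, published.date())
```

Storing the base ID separately from the version makes it easier to notice when a later run returns a newer revision of a paper you already have.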

Pricing

This actor uses pay-per-event pricing, which stays low because academic metadata is cheap to extract:

Actor start: $0.05 per run
Per paper extracted: $0.005 per paper

Example costs:

100 latest cs.AI papers → $0.55
1,000 papers for a literature review → $5.05
Daily check on cs.LG (50 new papers/day) → ~$9/month
Bulk pull of all 50,000 cs.CL papers ever → $250.05
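
The arithmetic behind these figures is a fixed start fee plus a per-paper fee. A small sketch, using the two prices listed above and assuming a 30-run month for the daily case (function names are illustrative):

```python
from decimal import Decimal

START_FEE = Decimal("0.05")   # charged once per run
PER_PAPER = Decimal("0.005")  # charged per paper extracted


def run_cost(papers):
    """Cost in USD of a single run that extracts `papers` papers."""
    return START_FEE + PER_PAPER * papers


def monthly_cost(papers_per_run, runs):
    """Cost in USD of `runs` identical runs."""
    return run_cost(papers_per_run) * runs


for papers in (100, 1_000, 50_000):
    print(f"{papers} papers in one run: ${run_cost(papers):.2f}")

# Assumption: one run per day over a 30-day month.
print(f"50 papers/day for 30 days: ${monthly_cost(50, 30):.2f}")
```

`Decimal` is used instead of floats so that the half-cent per-paper fee adds up exactly.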

For comparison: Connected Papers Pro is $5/month with limited filters. Scite.ai is $99/month base. Most academic researchers can use this actor for under $5/month.

Free Apify tier members get $5/month in platform credits, which covers just under 1,000 papers per month (990 in a single run, after the $0.05 start fee).

Tips

Use sortBy: submittedDate for new-paper alerts (default)
Use sortBy: relevance when you have a search query and want the best matches first
To get the union of several categories, pass them together in the input array (we OR them automatically). If you call the arXiv API yourself, join them with OR in the query (written +OR+ in a raw URL).
Schedule daily runs with date filters in your downstream code to build a fresh-papers queue
Pipe abstracts into an LLM for summarization — each paper is ~200-400 tokens of abstract, perfect for batch summarization
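
The fresh-papers queue from the tips above could be built along these lines. This is a sketch under assumptions: it relies only on the `arxivId` and `publishedAt` fields from the output format, the function name and the 24-hour window are illustrative, and the second sample record is made-up data.

```python
import re
from datetime import datetime, timedelta, timezone


def fresh_papers(records, now, window=timedelta(days=1), seen_ids=()):
    """Keep records published within `window` of `now`, one per base ID."""
    cutoff = now - window
    seen = set(seen_ids)
    fresh = []
    for rec in records:
        published = datetime.fromisoformat(rec["publishedAt"].replace("Z", "+00:00"))
        # Drop the trailing revision suffix so v1 and v2 count as one paper.
        base_id = re.sub(r"v\d+$", "", rec["arxivId"])
        if published < cutoff or base_id in seen:
            continue
        seen.add(base_id)
        fresh.append(rec)
    return fresh


records = [
    {"arxivId": "2604.13029v1", "publishedAt": "2026-04-14T17:58:22Z"},
    # Made-up older record, present only to show the date filter.
    {"arxivId": "0000.00001v2", "publishedAt": "2026-04-01T09:00:00Z"},
]
now = datetime(2026, 4, 15, 12, 0, tzinfo=timezone.utc)

for rec in fresh_papers(records, now):
    print(rec["arxivId"])
```

Passing the base IDs collected on earlier days as `seen_ids` keeps a paper from re-entering the queue when a new revision appears.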

Source

This actor uses the official arXiv Atom-format query API at http://export.arxiv.org/api/query. The API is documented at https://info.arxiv.org/help/api/index.html. It's free, public, requires no authentication, and is maintained by Cornell University as a community service.
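
For readers who want to see what the raw API returns, here is a minimal standard-library sketch that queries the endpoint and reads a few Atom fields. It is not this actor's implementation. The embedded sample feed is trimmed and hand-written for illustration (a real response carries many more elements), and `fetch` is defined but not called here so the example runs offline.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

API_URL = "http://export.arxiv.org/api/query"
ATOM = {"atom": "http://www.w3.org/2005/Atom"}


def parse_feed(xml_data):
    """Extract id, title, and author names from an Atom feed."""
    root = ET.fromstring(xml_data)
    papers = []
    for entry in root.findall("atom:entry", ATOM):
        title = entry.findtext("atom:title", default="", namespaces=ATOM)
        papers.append({
            "id": entry.findtext("atom:id", default="", namespaces=ATOM),
            # Titles can wrap across lines; collapse the whitespace.
            "title": " ".join(title.split()),
            "authors": [
                author.findtext("atom:name", default="", namespaces=ATOM)
                for author in entry.findall("atom:author", ATOM)
            ],
        })
    return papers


def fetch(search_query, max_results=5):
    """Query the public API, newest submissions first."""
    params = urlencode({
        "search_query": search_query,
        "start": 0,
        "max_results": max_results,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
    })
    with urlopen(f"{API_URL}?{params}", timeout=30) as response:
        return parse_feed(response.read())


# Trimmed, hand-written sample in the shape of an API response.
SAMPLE = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <id>http://arxiv.org/abs/2401.05779v4</id>
    <title>Erasing Undesirable Influence in
      Diffusion Models</title>
    <author><name>Jing Wu</name></author>
    <author><name>Trung Le</name></author>
    <author><name>Munawar Hayat</name></author>
  </entry>
</feed>"""

for paper in parse_feed(SAMPLE):
    print(paper["title"], "|", ", ".join(paper["authors"]))
```

When calling `fetch` in a loop, leave 3 seconds between requests as arXiv asks.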

License

This actor is MIT licensed. arXiv papers themselves are licensed individually by their authors — most use Creative Commons or arXiv's nonexclusive license. Always cite the original paper when using extracted abstracts in your work.
