arXiv Paper Tracker - Free Academic API
Pricing
Pay per event
Track new arXiv papers. Filter by category (cs.AI, cs.LG, q-bio.NC...), author, keyword, date. Returns title, authors, abstract, PDF link, DOI. Free official arXiv API. For AI/ML researchers, biotech analysts, journalists.
Developer
Mohieldin Mohamed
arXiv Paper Tracker
Track new academic papers on arXiv. Filter by category, author, keyword, or date. Free official arXiv API. No key required.
This actor uses the official arXiv Atom-format public API to extract paper metadata with full structured fields. Tested on real data: 172,328 cs.AI papers, 307,287 cs.LG diffusion model papers, 759 papers with "Hinton" co-authors.
What you actually get
Real papers returned in test runs:
| ID | Title | Authors |
|---|---|---|
| 2604.13029v1 | Visual Preference Optimization with Rubric Rewards | 18 authors led by Ya-Qi Yu |
| 2401.05779v4 | Erasing Undesirable Influence in Diffusion Models | Jing Wu, Trung Le, Munawar Hayat |
| 2603.24741v1 | The Diffuse Gamma-ray Sky of a Milky Way Analogue | 11 authors incl. Jim Hinton |
Each row includes:
- arXiv ID (e.g. `2604.13029v1`)
- Title (cleaned of XML escaping)
- Full author list as array
- Primary category + all categories (cs.AI, cs.CV, hep-ph, etc.)
- Published date and last-updated date
- Full abstract (optional)
- Direct links: HTML abstract page + PDF
- DOI (when paper is linked to a published version)
- Author comment (e.g. "Accepted at NeurIPS 2024", page count)
- Journal reference (when published)
Why use this
arXiv hosts 2.5 million+ research papers across CS, math, physics, biology, economics, and more. Roughly 17,000 new papers are submitted every month. Tracking new releases in your domain is essential for:
- AI/ML researchers — be the first to read new SOTA papers in your area
- Tech investors and VCs — predict which research areas will become commercial in 12-24 months
- Science journalists — break stories on new papers from labs you watch
- Biotech analysts — track competitive research from labs at Big Pharma + universities
- Indie hackers building AI tools — find new papers for your RAG dataset
- Academic literature reviewers — automate paper discovery for systematic reviews
- Patent attorneys and IP firms — track prior art in real time
- Newsletters and blogs — automate "this week in AI" content sourcing
Alternatives (free and paid) that do similar things:
- Semantic Scholar API — free, but rate-limited and missing some fields
- Connected Papers — $5-15/month, beautiful UI but no bulk export
- Paperscape — free, visual, but no API
- Google Scholar Alerts — free, email only, no structured export
- Scite.ai — $99-499/month, citation-focused
This actor delivers structured JSON output with pay-per-event pricing — perfect for low-volume use that doesn't justify a $99/month subscription.
Honest limitations
- arXiv has a self-imposed 3-second rate limit between requests. We comply, so large pulls (10,000+ papers) take a while.
- arXiv author search is fuzzy. Searching `"Hinton"` matches any author with "Hinton" in their name, not just Geoffrey Hinton. Use `"Geoffrey Hinton"` or include co-authors for disambiguation.
- The boolean query syntax is arXiv's own, not standard SQL. Use `AND`, `OR`, `ANDNOT`, and parentheses. Field prefixes: `ti:` (title), `au:` (author), `abs:` (abstract), `cat:` (category), `all:` (anywhere).
- arXiv only covers preprints, not peer-reviewed final versions. Many papers eventually get DOIs, but the abstract on arXiv may differ slightly from the published version.
- arXiv ID versions matter. `2604.13029v1` and `2604.13029v2` are different revisions. We return whatever version was current at fetch time.
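The query syntax described above can be assembled programmatically. The following is a minimal sketch, assuming you build the query string yourself before passing it to the actor or to arXiv. `build_query` and `_term` are hypothetical helper names, not part of this actor, and quoting multi-word values as phrases is an assumption about how you want them matched.

```python
from typing import Sequence


def _term(prefix: str, value: str) -> str:
    """Format one field-prefixed term, quoting multi-word values as a phrase."""
    value = value.strip()
    return f'{prefix}:"{value}"' if " " in value else f"{prefix}:{value}"


def build_query(
    categories: Sequence[str] = (),
    keywords: Sequence[str] = (),
    authors: Sequence[str] = (),
    exclude: Sequence[str] = (),
) -> str:
    """Assemble an arXiv-style boolean query (hypothetical helper)."""
    groups = []
    if categories:
        # Any of the listed categories may match.
        groups.append(" OR ".join(_term("cat", c) for c in categories))
    if authors:
        # Any of the listed authors may match.
        groups.append(" OR ".join(_term("au", a) for a in authors))
    if keywords:
        # Every keyword must appear somewhere in the record.
        groups.append(" AND ".join(_term("all", k) for k in keywords))
    if not groups:
        raise ValueError("at least one category, author, or keyword is required")
    query = " AND ".join(f"({g})" for g in groups)
    for word in exclude:
        # ANDNOT removes records that mention the excluded term.
        query += f" ANDNOT {_term('all', word)}"
    return query


print(build_query(categories=["cs.AI", "cs.LG"], keywords=["transformer"], exclude=["survey"]))
```

Passing a full name such as `"Geoffrey Hinton"` in `authors` produces a quoted phrase, which narrows the fuzzy author matching noted above.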
How to use
- Click Try for free (or Start)
- Pick categories to filter (e.g. `["cs.AI", "cs.LG"]`)
- Optionally add search query with arXiv syntax (e.g. `"transformer ANDNOT survey"`)
- Optionally add author names for disambiguation
- Pick sort order (newest first / recently updated / relevance)
- Set max results (default 50, max 50,000)
- Click Start
Common arXiv categories
| Code | Field |
|---|---|
| cs.AI | Artificial Intelligence |
| cs.LG | Machine Learning |
| cs.CL | Computation and Language (NLP) |
| cs.CV | Computer Vision |
| cs.RO | Robotics |
| cs.NE | Neural and Evolutionary Computing |
| cs.IR | Information Retrieval |
| cs.CR | Cryptography and Security |
| cs.DC | Distributed Computing |
| cs.SE | Software Engineering |
| cs.HC | Human-Computer Interaction |
| stat.ML | Statistics - Machine Learning |
| math.OC | Math - Optimization and Control |
| q-bio.NC | Quantitative Biology - Neurons and Cognition |
| q-fin.TR | Quantitative Finance - Trading and Microstructure |
| econ.EM | Economics - Econometrics |
| physics.bio-ph | Physics - Biological Physics |
| hep-ph | High Energy Physics - Phenomenology |
Full list: https://arxiv.org/category_taxonomy
Output
{
  "arxivId": "2604.13029v1",
  "title": "Visual Preference Optimization with Rubric Rewards",
  "authors": ["Ya-Qi Yu", "Fangyu Hong", "Xiangyang Qu", "Hao Wang", "..."],
  "primaryCategory": "cs.CV",
  "categories": ["cs.CV", "cs.AI"],
  "publishedAt": "2026-04-14T17:58:22Z",
  "updatedAt": "2026-04-14T17:58:22Z",
  "abstract": "The effectiveness of Direct Preference Optimization (DPO) depends on preference data...",
  "absUrl": "https://arxiv.org/abs/2604.13029v1",
  "pdfUrl": "https://arxiv.org/pdf/2604.13029v1",
  "doi": null,
  "comment": "12 pages, 5 figures",
  "journalRef": null,
  "extractedAt": "2026-04-15T21:35:00.000Z"
}
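Because `arxivId` carries a version suffix, repeated runs can return several revisions of the same paper. The sketch below shows one way to keep only the highest version per paper. It assumes records shaped like the output above; `split_arxiv_id` and `latest_versions` are hypothetical helper names and are not provided by the actor.

```python
import re
from typing import Dict, Iterable, List, Tuple

# Matches a trailing version suffix such as "v1" or "v12".
_VERSION_RE = re.compile(r"^(?P<base>.+?)v(?P<version>\d+)$")


def split_arxiv_id(arxiv_id: str) -> Tuple[str, int]:
    """Split '2604.13029v1' into ('2604.13029', 1); version 0 means no suffix."""
    match = _VERSION_RE.match(arxiv_id)
    if match is None:
        return arxiv_id, 0
    return match.group("base"), int(match.group("version"))


def latest_versions(records: Iterable[dict]) -> List[dict]:
    """Keep one record per paper, preferring the highest version number."""
    best: Dict[str, Tuple[int, dict]] = {}
    for record in records:
        base, version = split_arxiv_id(record["arxivId"])
        if base not in best or version > best[base][0]:
            best[base] = (version, record)
    return [record for _, record in best.values()]


records = [
    {"arxivId": "2604.13029v1", "title": "first revision"},
    {"arxivId": "2604.13029v2", "title": "second revision"},
]
print([r["arxivId"] for r in latest_versions(records)])
```

Keying on the base ID rather than the full ID is a design choice: it treats a revised abstract as an update to an existing paper instead of a new one.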
Pricing
This actor uses pay-per-event pricing, which stays cheap because academic metadata is inexpensive to extract:
- Actor start: $0.05 per run
- Per paper extracted: $0.005 per paper
Example costs:
- 100 latest cs.AI papers → $0.55
- 1,000 papers for a literature review → $5.05
- Daily check on cs.LG (50 new papers/day) → ~$9/month (30 runs at $0.30 each)
- Bulk pull of all 50,000 cs.CL papers ever → $250.05
For comparison: Connected Papers Pro is $5/month with limited filters. Scite.ai is $99/month base. Most academic researchers can use this actor for under $5/month.
Free Apify tier members get $5/month in platform credits, which covers just under 1,000 papers per month once the $0.05 per-run start fee is counted.
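The pricing arithmetic can be checked with a short calculation. This is a sketch using the two fees listed above; `estimate_cost` and `max_papers_for_budget` are hypothetical helper names, and `Decimal` is used only to avoid floating-point rounding in currency amounts.

```python
from decimal import Decimal

START_FEE = Decimal("0.05")       # charged once per run
PER_PAPER_FEE = Decimal("0.005")  # charged per paper extracted


def estimate_cost(papers_per_run: int, runs: int = 1) -> Decimal:
    """Total cost in USD for `runs` runs that each extract `papers_per_run` papers."""
    if papers_per_run < 0 or runs < 0:
        raise ValueError("papers_per_run and runs must be non-negative")
    return runs * (START_FEE + papers_per_run * PER_PAPER_FEE)


def max_papers_for_budget(budget) -> int:
    """Largest number of papers a single run can extract within `budget` USD."""
    remaining = Decimal(str(budget)) - START_FEE
    if remaining < 0:
        return 0
    return int(remaining // PER_PAPER_FEE)


print(estimate_cost(100))            # one run of 100 papers
print(estimate_cost(1000))           # one run of 1,000 papers
print(estimate_cost(50, runs=30))    # a daily 50-paper check for 30 days
print(max_papers_for_budget(5))      # papers that fit in a $5 credit
```

Splitting the same number of papers across more runs raises the total, because the start fee is charged each time.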
Tips
- Use `sortBy: submittedDate` for new-paper alerts (default)
- Use `sortBy: relevance` when you have a search query and want the best matches first
- Combine categories with `+OR+` in your downstream pipeline if you want union, or pass multiple categories in the input array (we OR them automatically)
- Schedule daily runs with date filters in your downstream code to build a fresh-papers queue
- Pipe abstracts into an LLM for summarization — each paper is ~200-400 tokens of abstract, perfect for batch summarization
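The downstream date filter mentioned in the tips could look like the sketch below. It assumes records carry the `publishedAt` field shown in the Output section; `parse_timestamp` and `fresh_papers` are hypothetical helper names, and the 24-hour window is an arbitrary default.

```python
from datetime import datetime, timedelta, timezone
from typing import Iterable, List, Optional


def parse_timestamp(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as '2026-04-14T17:58:22Z' as UTC."""
    # Replace the trailing "Z" with an explicit offset so that
    # datetime.fromisoformat accepts it on Python versions before 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def fresh_papers(
    records: Iterable[dict],
    hours: int = 24,
    now: Optional[datetime] = None,
) -> List[dict]:
    """Return records published within the last `hours` hours."""
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(hours=hours)
    return [r for r in records if parse_timestamp(r["publishedAt"]) >= cutoff]


records = [
    {"arxivId": "2604.13029v1", "publishedAt": "2026-04-14T17:58:22Z"},
]
reference = datetime(2026, 4, 15, 0, 0, tzinfo=timezone.utc)
print(len(fresh_papers(records, hours=24, now=reference)))
```

Passing `now` explicitly keeps the filter deterministic, which helps when a scheduled run is retried later than planned.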
Source
This actor uses the official arXiv Atom-format query API at http://export.arxiv.org/api/query. The API is documented at https://info.arxiv.org/help/api/index.html. It's free, public, requires no authentication, and is maintained by Cornell University as a community service.
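For readers curious what a direct call to that endpoint involves, here is a minimal standard-library sketch. It is not the actor's source code: `build_url`, `parse_feed`, and `fetch_pages` are hypothetical names, only a few Atom fields are extracted, and the 3-second pause mirrors the rate limit mentioned under limitations.

```python
import time
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

API_URL = "http://export.arxiv.org/api/query"
ATOM = "{http://www.w3.org/2005/Atom}"  # XML namespace used by Atom feeds


def build_url(search_query: str, start: int = 0, max_results: int = 50) -> str:
    """Build a query URL sorted by submission date, newest first."""
    params = {
        "search_query": search_query,
        "start": start,
        "max_results": max_results,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
    }
    return API_URL + "?" + urllib.parse.urlencode(params)


def parse_feed(xml_data) -> list:
    """Extract ID, title, authors, and published date from an Atom feed."""
    root = ET.fromstring(xml_data)
    papers = []
    for entry in root.iter(ATOM + "entry"):
        entry_url = entry.findtext(ATOM + "id", default="")
        title = entry.findtext(ATOM + "title", default="")
        papers.append({
            # The entry ID is a URL ending in the arXiv ID.
            "id": entry_url.rsplit("/abs/", 1)[-1],
            # Titles may wrap across lines; collapse the whitespace.
            "title": " ".join(title.split()),
            "authors": [
                a.findtext(ATOM + "name", default="")
                for a in entry.findall(ATOM + "author")
            ],
            "published": entry.findtext(ATOM + "published", default=""),
        })
    return papers


def fetch_pages(search_query: str, pages: int = 1, page_size: int = 50,
                delay: float = 3.0) -> list:
    """Fetch several result pages, pausing between requests."""
    results = []
    for page in range(pages):
        if page:
            time.sleep(delay)  # stay within the requested request spacing
        url = build_url(search_query, start=page * page_size, max_results=page_size)
        with urllib.request.urlopen(url, timeout=30) as response:
            results.extend(parse_feed(response.read()))
    return results
```

A call such as `fetch_pages("cat:cs.AI", pages=2)` would make two requests and return up to 100 parsed entries, network permitting.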
License
This actor is MIT licensed. arXiv papers themselves are licensed individually by their authors — most use Creative Commons or arXiv's nonexclusive license. Always cite the original paper when using extracted abstracts in your work.