arXiv Paper Tracker - Free Academic API avatar

arXiv Paper Tracker - Free Academic API

Pricing

Pay per event

Go to Apify Store
arXiv Paper Tracker - Free Academic API

arXiv Paper Tracker - Free Academic API

Track new arXiv papers. Filter by category (cs.AI, cs.LG, q-bio.NC...), author, keyword, date. Returns title, authors, abstract, PDF link, DOI. Free official arXiv API. For AI/ML researchers, biotech analysts, journalists.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Mohieldin Mohamed

Mohieldin Mohamed

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

11 days ago

Last modified

Share

arXiv Paper Tracker

Track new academic papers on arXiv. Filter by category, author, keyword, or date. Free official arXiv API. No key required.

This actor uses the official arXiv Atom-format public API to extract paper metadata with full structured fields. Tested on real data: 172,328 cs.AI papers, 307,287 cs.LG diffusion model papers, 759 papers with "Hinton" co-authors.

What you actually get

Real papers returned in test runs:

IDTitleAuthors
2604.13029v1Visual Preference Optimization with Rubric Rewards18 authors led by Ya-Qi Yu
2401.05779v4Erasing Undesirable Influence in Diffusion ModelsJing Wu, Trung Le, Munawar Hayat
2603.24741v1The Diffuse Gamma-ray Sky of a Milky Way Analogue11 authors incl. Jim Hinton

Each row includes:

  • arXiv ID (e.g. 2604.13029v1)
  • Title (cleaned of XML escaping)
  • Full author list as array
  • Primary category + all categories (cs.AI, cs.CV, hep-ph, etc.)
  • Published date and last-updated date
  • Full abstract (optional)
  • Direct links: HTML abstract page + PDF
  • DOI (when paper is linked to a published version)
  • Author comment (e.g. "Accepted at NeurIPS 2024", page count)
  • Journal reference (when published)

Why use this

arXiv hosts 2.5 million+ research papers across CS, math, physics, biology, economics, and more. ~17,000 new papers are submitted every week. Tracking new releases in your domain is essential for:

  • AI/ML researchers — be the first to read new SOTA papers in your area
  • Tech investors and VCs — predict which research areas will become commercial in 12-24 months
  • Science journalists — break stories on new papers from labs you watch
  • Biotech analysts — track competitive research from labs at Big Pharma + universities
  • Indie hackers building AI tools — find new papers for your RAG dataset
  • Academic literature reviewers — automate paper discovery for systematic reviews
  • Patent attorneys and IP firms — track prior art in real time
  • Newsletters and blogs — automate "this week in AI" content sourcing

Commercial alternatives that do similar things:

  • Semantic Scholar API — free, but rate-limited and missing some fields
  • Connected Papers — $5-15/month, beautiful UI but no bulk export
  • Paperscape — free, visual, but no API
  • Google Scholar Alerts — free, email only, no structured export
  • Scite.ai — $99-499/month, citation-focused

This actor delivers structured JSON output with pay-per-event pricing — perfect for low-volume use that doesn't justify a $99/month subscription.

Honest limitations

  • arXiv has a self-imposed 3-second rate limit between requests. We comply, so large pulls (10,000+ papers) take a while.
  • arXiv author search is fuzzy. Searching "Hinton" matches any author with "Hinton" in their name, not just Geoffrey Hinton. Use "Geoffrey Hinton" or include co-authors for disambiguation.
  • The boolean query syntax is arXiv's own, not standard SQL. Use AND, OR, ANDNOT, parentheses. Field prefixes: ti: (title), au: (author), abs: (abstract), cat: (category), all: (anywhere).
  • arXiv only covers preprints, not peer-reviewed final versions. Many papers eventually get DOIs, but the abstract on arXiv may differ slightly from the published version.
  • arXiv ID versions matter. 2604.13029v1 and 2604.13029v2 are different revisions. We return whatever version was current at fetch time.

How to use

  1. Click Try for free (or Start)
  2. Pick categories to filter (e.g. ["cs.AI", "cs.LG"])
  3. Optionally add search query with arXiv syntax (e.g. "transformer ANDNOT survey")
  4. Optionally add author names for disambiguation
  5. Pick sort order (newest first / recently updated / relevance)
  6. Set max results (default 50, max 50,000)
  7. Click Start

Common arXiv categories

CodeField
cs.AIArtificial Intelligence
cs.LGMachine Learning
cs.CLComputation and Language (NLP)
cs.CVComputer Vision
cs.RORobotics
cs.NENeural and Evolutionary Computing
cs.IRInformation Retrieval
cs.CRCryptography and Security
cs.DCDistributed Computing
cs.SESoftware Engineering
cs.HCHuman-Computer Interaction
stat.MLStatistics - Machine Learning
math.OCMath - Optimization and Control
q-bio.NCQuantitative Biology - Neurons and Cognition
q-fin.TRQuantitative Finance - Trading and Microstructure
econ.EMEconomics - Econometrics
physics.bio-phPhysics - Biological Physics
hep-phHigh Energy Physics - Phenomenology

Full list: https://arxiv.org/category_taxonomy

Output

{
"arxivId": "2604.13029v1",
"title": "Visual Preference Optimization with Rubric Rewards",
"authors": ["Ya-Qi Yu", "Fangyu Hong", "Xiangyang Qu", "Hao Wang", "..."],
"primaryCategory": "cs.CV",
"categories": ["cs.CV", "cs.AI"],
"publishedAt": "2026-04-14T17:58:22Z",
"updatedAt": "2026-04-14T17:58:22Z",
"abstract": "The effectiveness of Direct Preference Optimization (DPO) depends on preference data...",
"absUrl": "https://arxiv.org/abs/2604.13029v1",
"pdfUrl": "https://arxiv.org/pdf/2604.13029v1",
"doi": null,
"comment": "12 pages, 5 figures",
"journalRef": null,
"extractedAt": "2026-04-15T21:35:00.000Z"
}

Pricing

This actor uses pay-per-event pricing — extremely cheap given how cheap academic data is to extract:

  • Actor start: $0.05 per run
  • Per paper extracted: $0.005 per paper

Example costs:

  • 100 latest cs.AI papers → $0.55
  • 1,000 papers for a literature review → $5.05
  • Daily check on cs.LG (50 new papers/day) → ~$8/month
  • Bulk pull of all 50,000 cs.CL papers ever → $250.05

For comparison: Connected Papers Pro is $5/month with limited filters. Scite.ai is $99/month base. Most academic researchers can use this actor for under $5/month.

Free Apify tier members get $5/month in platform credits, which covers ~1,000 papers per month.

Tips

  • Use sortBy: submittedDate for new-paper alerts (default)
  • Use sortBy: relevance when you have a search query and want the best matches first
  • Combine categories with +OR+ in your downstream pipeline if you want union, or pass multiple categories in the input array (we OR them automatically)
  • Schedule daily runs with date filters in your downstream code to build a fresh-papers queue
  • Pipe abstracts into an LLM for summarization — each paper is ~200-400 tokens of abstract, perfect for batch summarization

Source

This actor uses the official arXiv Atom-format query API at http://export.arxiv.org/api/query. The API is documented at https://info.arxiv.org/help/api/index.html. It's free, public, requires no authentication, and is maintained by Cornell University as a community service.

License

This actor is MIT licensed. arXiv papers themselves are licensed individually by their authors — most use Creative Commons or arXiv's nonexclusive license. Always cite the original paper when using extracted abstracts in your work.