arXiv Daily New Papers Tracker — by Query
Pricing
$1.00 / 1,000 arxiv paper extracteds
arXiv Daily New Papers Tracker — by Query
Pull newly-submitted arXiv preprints by query. One row per paper — id, title, abstract, authors, categories, DOI, journal ref, author comment (often the accepted conference), PDF link. Built for cron with sinceDays:1. Source: arXiv Atom API (free, no auth).
Pull newly-submitted arXiv preprints matching any query. One clean row per paper — arXiv id, title, abstract, authors, primary + cross-listed categories, DOI, journal ref, author comment (often the accepted conference name — gold for "what made it into ICML/NeurIPS"), and PDF link.
Built for cron: pass sinceDays: 1 and get yesterday's new papers in your topic. No auth, no scraping — pure arXiv Atom API.
Inputs
{"queries": ["cat:cs.LG", "cat:cs.AI AND all:agent"],"sinceDays": 1,"maxResultsPerQuery": 500}
| input | default | notes |
|---|---|---|
queries | — | Required. arXiv query syntax. Examples: cat:cs.LG, cat:cs.AI AND all:agent, cat:q-bio.GN AND ti:"single-cell", au:hinton. Multi = union. |
sinceDays | 7 | submittedDate >= now-sinceDays. Use 1 for daily cron. |
maxResultsPerQuery | 500 | Cap per query. arXiv max is 2000 per call; we paginate. |
includeAbstract | true | Adds ~2-4 KB of summary text per row. Set false for lightweight rows. |
Output
One row per preprint:
| field | example |
|---|---|
| arxivId | 2606.02578 |
| version | v1 |
| title | Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge… |
| authors | ["Seojeong Park", "Jiho Choi", …] |
| primaryCategory | cs.CV |
| categories | ["cs.CV", "cs.AI"] |
| published | 2026-06-01T17:59:46Z |
| updated | 2026-06-01T17:59:46Z |
| doi | 10.1145/... (if assigned) |
| journalRef | Nature Machine Intelligence, 2026 (if known) |
| comment | ICML 2026 ← this is the conference acceptance signal |
| pdfUrl | https://arxiv.org/pdf/2606.02578v1 |
| absUrl | https://arxiv.org/abs/2606.02578v1 |
| query | the originating query |
| abstract | full summary text (when includeAbstract is true) |
Daily cron pattern
{ "queries": ["cat:cs.LG", "cat:cs.CL"], "sinceDays": 1 }
Real-world questions this answers
- AI/ML researcher. What's new in cs.LG today that I should skim?
- VC scouting deep tech. Which papers landed at ICML/NeurIPS this week? (
commentfield) - Corporate R&D. New papers from competitor labs (
au:) or topic (all:)? - Journal-club organiser. A weekly digest of cs.AI + cs.CL with abstracts.
- Bibliometrics. A clean feed for downstream analytics (LangChain, citation graphs).
arXiv query syntax cheatsheet
| operator | meaning |
|---|---|
cat:cs.AI | category filter (cs.AI, cs.LG, q-bio.GN, math.PR, …) |
ti:"foo bar" | title contains exact phrase |
abs:foo | abstract contains word |
au:hinton | author surname |
all:foo | any field |
AND / OR / ANDNOT | boolean combinators |
Full list at https://arxiv.org/help/api/user-manual#query_details.
What it doesn't do
- No full-text extraction — we return only the metadata + abstract. Download the PDF yourself if you need the body.
- No citation graph / references — separate problem, see Crossref / Semantic Scholar.
- No author de-duplication / ORCID resolution — names as-published.
Maintained by
vøiddo studio — small independent shop.