arXiv Paper Scraper - AI Research Tracker
Pricing
from $1.00 / 1,000 results
Track new arXiv papers by category, keyword and author. Clean output with authors, abstract and direct PDF links. Public API, no key.
Developer
Arjun Annamalai
Maintained by Community
arXiv Paper Scraper 📄 — AI Research Tracker
Track new arXiv research programmatically. Filter by category, keyword, and author, sorted newest-first. Clean output with authors, abstract, and direct PDF links — export to JSON/CSV/Excel.
For AI researchers, RAG/dataset builders, R&D teams, and newsletter writers.
What makes it better
- Query builder: combine multiple categories (OR), a keyword (title + abstract), and an author in one clean query.
- Ready-to-use fields: parsed authors list, abstract, abstract URL, and direct pdfUrl (no manual link digging).
- Sort by newest submitted, recently updated, or relevance.
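The query-builder behaviour described in the list above can be sketched against the public arXiv API. This is a minimal illustration, not the Actor's actual code: the helper names `build_search_query` and `build_url` are hypothetical, and the mapping of the keyword to the `ti:` and `abs:` field prefixes is an assumption about how "title + abstract" is implemented.

```python
from urllib.parse import urlencode

ARXIV_API = "http://export.arxiv.org/api/query"  # public arXiv API endpoint


def build_search_query(categories=(), keyword=None, author=None):
    """Combine the filters into one arXiv search_query string (hypothetical helper)."""
    parts = []
    if categories:
        # Categories are alternatives, so join them with OR.
        parts.append("(" + " OR ".join(f"cat:{c}" for c in categories) + ")")
    if keyword:
        # Assumed mapping: match the keyword in the title or the abstract.
        parts.append(f'(ti:"{keyword}" OR abs:"{keyword}")')
    if author:
        parts.append(f'au:"{author}"')
    # Different filter types must all hold, so join them with AND.
    return " AND ".join(parts)


def build_url(categories=(), keyword=None, author=None,
              sort_by="submittedDate", max_items=100):
    """Build a full request URL; sort_by is submittedDate, lastUpdatedDate, or relevance."""
    params = {
        "search_query": build_search_query(categories, keyword, author),
        "sortBy": sort_by,
        "sortOrder": "descending",  # newest first
        "start": 0,
        "max_results": max_items,
    }
    return ARXIV_API + "?" + urlencode(params)


query = build_search_query(["cs.AI", "cs.CL", "cs.LG"], keyword="agent")
print(query)
print(build_url(["cs.AI", "cs.CL", "cs.LG"], keyword="agent", max_items=200))
```

Grouping each filter type in parentheses keeps the OR between categories from leaking into the AND between filter types.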
Example input
{
  "categories": ["cs.AI", "cs.CL", "cs.LG"],
  "searchQuery": "agent",
  "sortBy": "submittedDate",
  "maxItems": 200
}
Example output
{
  "arxivId": "2606.01234v1",
  "title": "Self-Guided Skill Acquisition via Steerable Agents",
  "authors": ["Jane Doe", "John Smith"],
  "primaryCategory": "cs.AI",
  "url": "http://arxiv.org/abs/2606.01234v1",
  "pdfUrl": "http://arxiv.org/pdf/2606.01234v1",
  "published": "2026-06-23T10:00:00Z"
}
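To show how a record of this shape could be derived from the Atom feed the arXiv API returns, here is a rough sketch using only the standard library. The sample entry is hand-written to mirror the example output and is not real API data. The `parse_entry` helper is hypothetical, the `abstract` key name is assumed from the feature list, and deriving `pdfUrl` by swapping `/abs/` for `/pdf/` is an assumption rather than the Actor's confirmed method.

```python
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"
ARXIV = "{http://arxiv.org/schemas/atom}"

# Hand-written sample mirroring the example output (not real API data).
SAMPLE_ENTRY = """<entry xmlns="http://www.w3.org/2005/Atom"
       xmlns:arxiv="http://arxiv.org/schemas/atom">
  <id>http://arxiv.org/abs/2606.01234v1</id>
  <published>2026-06-23T10:00:00Z</published>
  <title>Self-Guided Skill Acquisition
    via Steerable Agents</title>
  <summary>  A placeholder abstract.  </summary>
  <author><name>Jane Doe</name></author>
  <author><name>John Smith</name></author>
  <arxiv:primary_category term="cs.AI"/>
</entry>"""


def parse_entry(entry):
    """Turn one Atom <entry> element into a flat record (hypothetical helper)."""
    abs_url = entry.findtext(f"{ATOM}id")
    return {
        "arxivId": abs_url.rsplit("/abs/", 1)[-1],
        # Feed titles and summaries can wrap across lines; collapse whitespace.
        "title": " ".join(entry.findtext(f"{ATOM}title").split()),
        "authors": [a.findtext(f"{ATOM}name")
                    for a in entry.findall(f"{ATOM}author")],
        "abstract": " ".join(entry.findtext(f"{ATOM}summary").split()),
        "primaryCategory": entry.find(f"{ARXIV}primary_category").get("term"),
        "url": abs_url,
        # Assumption: the PDF link follows the /abs/ -> /pdf/ pattern.
        "pdfUrl": abs_url.replace("/abs/", "/pdf/", 1),
        "published": entry.findtext(f"{ATOM}published"),
    }


record = parse_entry(ET.fromstring(SAMPLE_ENTRY))
print(record)
```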
Use cases
- Daily digest of new papers in your subfield.
- Feed abstracts into a RAG system or literature-review pipeline.
- Track a research group's latest output by author.
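As an illustration of the daily-digest use case, the sketch below filters exported records to those published in the last 24 hours and formats them as plain text. The helpers `recent_papers` and `format_digest` are hypothetical, and the second record is a made-up placeholder included only to show the filter dropping an older paper.

```python
from datetime import datetime, timedelta, timezone


def parse_published(value):
    # Older Python versions reject a trailing "Z" in fromisoformat, so normalize it.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def recent_papers(records, now, window=timedelta(days=1)):
    """Keep records published within `window` before `now` (hypothetical helper)."""
    cutoff = now - window
    return [r for r in records
            if cutoff <= parse_published(r["published"]) <= now]


def format_digest(records):
    """Render one title/authors line and one indented PDF link per paper."""
    lines = []
    for r in records:
        lines.append(f"{r['title']} ({', '.join(r['authors'])})")
        lines.append(f"  {r['pdfUrl']}")
    return "\n".join(lines)


records = [
    {"title": "Self-Guided Skill Acquisition via Steerable Agents",
     "authors": ["Jane Doe", "John Smith"],
     "pdfUrl": "http://arxiv.org/pdf/2606.01234v1",
     "published": "2026-06-23T10:00:00Z"},
    # Made-up placeholder record, older than the digest window.
    {"title": "An Older Placeholder Paper",
     "authors": ["A. Author"],
     "pdfUrl": "http://arxiv.org/pdf/0000.00000v1",
     "published": "2026-06-01T09:00:00Z"},
]
now = datetime(2026, 6, 24, 8, 0, tzinfo=timezone.utc)
digest = format_digest(recent_papers(records, now))
print(digest)
```

Passing `now` explicitly, rather than reading the clock inside the helper, keeps the filter easy to test and to re-run for a past date.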
Run locally
pip install -r requirements.txt
python3 local_test.py
Then deploy the Actor to Apify with apify push.
Uses the public arXiv API. Please respect arXiv's rate guidance, which asks clients to leave roughly three seconds between requests. Not affiliated with arXiv.
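One way to respect that guidance in your own scripts is a small throttle that enforces a minimum gap between calls. This is a sketch under stated assumptions: the `Throttle` class is hypothetical, and the 3-second default reflects arXiv's published API guidance as commonly cited, so check the current terms before relying on it. The demo injects a fake clock so it runs instantly.

```python
import time


class Throttle:
    """Enforce a minimum gap between calls (hypothetical helper)."""

    def __init__(self, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # assumed 3 s; verify against arXiv's terms
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous call, if any

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


# Demo with a fake clock so the example finishes without real waiting.
fake_now = [0.0]
slept = []


def fake_sleep(seconds):
    slept.append(seconds)
    fake_now[0] += seconds


throttle = Throttle(min_interval=3.0, clock=lambda: fake_now[0], sleep=fake_sleep)
throttle.wait()      # first call: nothing to wait for
fake_now[0] += 1.0   # pretend one second of work happened
throttle.wait()      # second call: sleeps for the remaining time
print(slept)
```

In real use, construct `Throttle()` with the defaults and call `wait()` immediately before each request to the API.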