arXiv Paper Scraper - AI Research Tracker avatar

arXiv Paper Scraper - AI Research Tracker

Pricing

from $1.00 / 1,000 results

Go to Apify Store
arXiv Paper Scraper - AI Research Tracker

arXiv Paper Scraper - AI Research Tracker

Track new arXiv papers by category, keyword and author. Clean output with authors, abstract and direct PDF links. Public API, no key.

Pricing

from $1.00 / 1,000 results

Rating

0.0

(0)

Developer

Arjun Annamalai

Arjun Annamalai

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

arXiv Paper Scraper 📄 — AI Research Tracker

Track new arXiv research programmatically. Filter by category, keyword, and author, sorted newest-first. Clean output with authors, abstract, and direct PDF links — export to JSON/CSV/Excel.

For AI researchers, RAG/dataset builders, R&D teams, and newsletter writers.

What makes it better

  • Query builder — combine multiple categories (OR), a keyword (title + abstract), and an author in one clean query.
  • Ready-to-use fields — parsed authors list, abstract, abstract URL and direct pdfUrl (no manual link digging).
  • Sort by newest submitted, recently updated, or relevance.

Example input

{
"categories": ["cs.AI", "cs.CL", "cs.LG"],
"searchQuery": "agent",
"sortBy": "submittedDate",
"maxItems": 200
}

Example output

{
"arxivId": "2606.01234v1",
"title": "Self-Guided Skill Acquisition via Steerable Agents",
"authors": ["Jane Doe", "John Smith"],
"primaryCategory": "cs.AI",
"url": "http://arxiv.org/abs/2606.01234v1",
"pdfUrl": "http://arxiv.org/pdf/2606.01234v1",
"published": "2026-06-23T10:00:00Z"
}

Use cases

  • Daily digest of new papers in your subfield.
  • Feed abstracts into a RAG system or literature-review pipeline.
  • Track a research group's latest output by author.

Run locally

pip install -r requirements.txt
python3 local_test.py

Then apify push.


Uses the public arXiv API. Please respect arXiv's rate guidance. Not affiliated with arXiv.