Pricing

Pay per usage

arXiv Daily Digest Scraper

Scrape arXiv papers by search query or category. Extract titles, authors, abstracts, and PDF links from recent submissions.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Donny Nguyen

Actor stats

Bookmarked

Total users

Monthly active users

6 hours ago

Last modified

arXiv Daily Digest

Monitor arXiv for new papers by topic, author, or keyword. This actor extracts comprehensive paper metadata from arXiv.org including titles, authors, abstracts, categories, and PDF links.

Features

Search by Keywords: Query arXiv using any search terms
Monitor Categories: Track specific arXiv categories (cs.AI, cs.LG, physics, etc.)
Custom URLs: Provide your own arXiv search or list page URLs
Date Filtering: Only include papers from the last N days
Comprehensive Metadata: Extract title, authors, abstract, categories, arXiv ID, PDF URL, and publication date

Input Parameters

Search Queries (stringList): Keywords or phrases to search on arXiv (e.g., "machine learning", "quantum computing")
arXiv Categories (stringList): Category codes to monitor (e.g., "cs.AI", "cs.LG", "physics.gen-ph")
Start URLs (requestListSources): Custom arXiv URLs to scrape
Max Results (integer, default: 50): Maximum number of papers to extract per URL
Days Back (integer, default: 7): Only include papers from the last N days (0 = no filter)
Use Residential Proxy (boolean, default: false): Use residential proxies for better reliability

Output

Each paper includes:

{
  "title": "Paper Title",
  "authors": ["Author 1", "Author 2"],
  "abstract": "Paper abstract text...",
  "categories": ["cs.AI", "cs.LG"],
  "arxivId": "2401.12345",
  "pdfUrl": "https://arxiv.org/pdf/2401.12345.pdf",
  "publishedDate": "1 Jan 2024",
  "url": "https://arxiv.org/abs/2401.12345",
  "scrapedAt": "2024-01-15T10:30:00.000Z"
}

Use Cases

Stay updated on research in your field
Track specific research topics or authors
Build research paper databases
Monitor new publications in specific categories
Automated literature review workflows

Example Configuration

{
  "searchQueries": ["machine learning", "deep learning"],
  "categories": ["cs.AI", "cs.LG"],
  "maxResults": 50,
  "daysBack": 7
}

Notes

arXiv is an open-access repository, so this actor respects their terms of use
Date filtering is based on the submission/announcement date
The actor handles both search result pages and category list pages
Results are limited per URL to avoid excessive scraping

Performance

Speed: Fast (Cheerio-based, no browser overhead)
Cost: Low (datacenter proxies sufficient for arXiv)
Memory: 256-512 MB recommended

Built with Apify SDK and Crawlee using CheerioCrawler for efficient scraping.

ArXiv Paper Scraper

nexgendata/arxiv-scraper

Extract research papers from ArXiv including titles, abstracts, authors, categories, and submission dates. Track cutting-edge research in AI, physics, math, and more.

Stephan Corbeil

arXiv Paper Scraper

cloud9_ai/arxiv-paper-scraper

Scrape academic papers from arXiv.org. Search by keyword, browse categories, or get latest papers. Extract titles, abstracts, authors, PDF links, and citation data via arXiv API.

cloud9

ArXiv Academic Paper Scraper

fortuitous_pirate/arxiv-scraper

Scrape academic papers from ArXiv. Extract titles, authors, abstracts, categories, and PDF links. Essential for research and literature reviews.

Fortuitous Pirate

arXiv Scraper

artificially/arxiv-scraper

Search and extract academic papers from arXiv.org. Get paper titles, authors, abstracts, categories, and PDF links for AI/ML, physics, math, and more.

Artificially

📚 arXiv Article Metadata Scraper - Cheap

scrapestorm/arxiv-article-metadata-scraper---cheap

Discover top arXiv papers with ⚡fast metadata extraction! Sort by 🔥 relevance 🕒 submission date or 📚 subject area. Get key info like titles, abstracts, authors, PDF links & more. Perfect for 📊 literature reviews, trend tracking, academic research & building high-quality AI training datasets!

Storm_Scraper

5.0

📚 arXiv Article Metadata Scraper - Pay per results

scrapestorm/arxiv-article-metadata-scraper---pay-per-results

Storm_Scraper

5.0

arXiv Search Scraper 📚

easyapi/arxiv-search-scraper

Extract comprehensive research paper data from arXiv search results. Get detailed metadata including titles, authors, abstracts, categories and more. Perfect for academic research monitoring, trend analysis and building paper databases. 🎓📚

EasyApi

5.0

Arxiv Semantic Search

draouadmohamed/arxiv-semantic-search

Scrape arXiv papers by category and find relevant research using AI-powered semantic search. Get papers from any field (AI, physics, biology, economics, etc.) with embeddings for RAG systems. Find your categories at: https://arxiv.org/category_taxonomy

Mohamed Aouad

Arxiv Paper Scraper

technicaldost/arxiv-paper-scraper

Technical Dost Solutions

ArXiv Preprint Paper Search

ryanclinton/arxiv-paper-search

Search 2.4M+ preprint papers on ArXiv. Filter by keyword, author, category (cs.AI, cs.CL, math, physics, etc.), sort by relevance or date. Returns titles, abstracts, authors, categories, PDF links, DOIs. Free API, no key needed.