📚 arXiv Article Metadata Scraper - Pay per results

Developed by

Storm_Scraper

Maintained by Community

Discover top arXiv papers with ⚡ fast metadata extraction! Sort by 🔥 relevance, 🕒 submission date, or 📚 subject area. Get key info like titles, abstracts, authors, PDF links & more. Perfect for 📊 literature reviews, trend tracking, academic research & building high-quality AI training datasets!

📚 arXiv Article Metadata Scraper

The arXiv Article Metadata Scraper is a fast and reliable tool designed to extract clean, structured metadata from arxiv.org based on any keyword. Whether you're a researcher, student, journalist, or data scientist, this scraper helps you quickly build datasets of scientific articles for analysis, research, or machine learning training.

🔍 What It Does

Just enter a keyword (e.g., "LLM", "climate change", "quantum computing") and get back a dataset containing:

✅ Full metadata for each article, including:

Field Descriptions:

| 🏷️ Field | 🧾 Description |
|---|---|
| 🆔 arxiv_id | Unique arXiv article identifier |
| 🔗 arxiv_link | Link to the abstract page on arXiv |
| 📄 pdf_link | Direct link to the article PDF |
| 🧠 domain | Subject classification (e.g., cs.CL, physics.optics) |
| 📰 title | Full title of the article |
| ✍️ authors | List of authors |
| 🧾 abstract | Full article abstract |
| 📅 submitted_info | Submission date and original announcement info |
| 💬 comments | Additional comments from the authors or arXiv moderators (e.g., publication info) |
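
For downstream code it can help to give each record a typed shape. The sketch below mirrors the fields listed above as a Python dataclass. The class name `ArxivRecord` and the helper `from_dict` are illustrative and not part of the scraper; `pdf_link` is included because it appears in the output example further down, and treating `pdf_link`, `authors`, and `comments` as optional is an assumption.

```python
from dataclasses import dataclass, field, fields
from typing import Any


@dataclass
class ArxivRecord:
    """One scraped article; attribute names follow the scraper's output keys."""
    arxiv_id: str
    arxiv_link: str
    domain: str
    title: str
    abstract: str
    submitted_info: str
    authors: list[str] = field(default_factory=list)
    pdf_link: str = ""   # assumption: treated as optional here
    comments: str = ""   # assumption: may be missing when no comment exists

    @classmethod
    def from_dict(cls, item: dict[str, Any]) -> "ArxivRecord":
        # Drop unknown keys so extra output fields do not break loading.
        known = {f.name for f in fields(cls)}
        return cls(**{k: v for k, v in item.items() if k in known})
```

A record missing one of the required keys raises `TypeError`, which surfaces malformed items early.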

🛠 How to Use the Scraper

This scraper is plug-and-play. No technical skills required:

1. Clone the repo or deploy via Apify.
2. Enter your keyword (e.g., "transformers").
3. Set the maximum number of articles to fetch.
4. Run the scraper.
5. Export the data in your preferred format:
   • JSON
   • CSV
   • Excel
   • XML
   • HTML
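
The export step covers these formats, but if you start from the JSON export and want to flatten it to CSV yourself, a minimal standard-library sketch follows. The function name `records_to_csv`, the column order, and joining authors with `; ` are assumptions made for illustration.

```python
import csv
import io

# Column order is an illustrative choice based on the output example.
FIELDS = ["arxiv_id", "arxiv_link", "pdf_link", "domain", "title",
          "authors", "abstract", "submitted_info", "comments"]


def records_to_csv(records: list[dict]) -> str:
    """Serialize scraped records to CSV text, one row per article."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    for record in records:
        # Missing keys become empty cells instead of raising.
        row = {name: record.get(name, "") for name in FIELDS}
        # CSV cells are flat, so the author list is joined into one string.
        row["authors"] = "; ".join(record.get("authors", []))
        writer.writerow(row)
    return buffer.getvalue()
```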

💰 Pricing

This scraper uses pay-per-result pricing: $9.99 per 1,000 results.
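
As a rough budgeting aid, the rate above works out to just under one cent per article. The sketch below assumes the charge scales linearly with the number of results, which the listing does not spell out; `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal, ROUND_HALF_UP

PRICE_PER_1000 = Decimal("9.99")  # USD, taken from the pricing line above


def estimate_cost(num_results: int) -> Decimal:
    """Estimate the charge in USD, assuming linear per-result billing."""
    if num_results < 0:
        raise ValueError("num_results must be non-negative")
    cost = PRICE_PER_1000 * num_results / 1000
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)
```

Under that assumption, a run of 60 articles comes to about $0.60.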

💡 Why Use This Scraper?

  • 📖 Academic Research: Quickly access recent or relevant articles by topic

  • 📊 Data Collection: Build datasets for NLP, LLMs, or scientific knowledge graphs

  • 🔍 Literature Review: Discover new papers in your domain without manual searching

  • ⏱️ Save Time: Skip repetitive browsing, get results in seconds

📥 Input Schema

Here is how to structure the input JSON:

{
  "keyword": "AI",
  "maxitems": 60
}

Input Fields:

  • 🔍 keyword (string, required): Any keyword to search for on arXiv

  • 🔢 maxitems (integer, optional): Maximum number of articles to fetch (default = 60)
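
When inputs are generated programmatically, a small helper can enforce the two field rules above before a run is started. This is only a sketch: `build_input` is a hypothetical name, and rejecting non-positive `maxitems` values is an assumption about what the Actor accepts.

```python
import json


def build_input(keyword: str, maxitems: int = 60) -> str:
    """Return the scraper input as a JSON string."""
    if not isinstance(keyword, str) or not keyword.strip():
        raise ValueError("keyword is required and must be a non-empty string")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(maxitems, bool) or not isinstance(maxitems, int) or maxitems <= 0:
        raise ValueError("maxitems must be a positive integer")
    payload = {"keyword": keyword.strip(), "maxitems": maxitems}
    return json.dumps(payload, indent=2)
```

Calling `build_input("AI")` yields the same keys and values as the example input shown earlier.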

📤 Output Example

{
  "arxiv_id": "arXiv:2508.06484",
  "arxiv_link": "https://arxiv.org/abs/2508.06484",
  "pdf_link": "https://arxiv.org/pdf/2508.06484.pdf",
  "domain": "cs.HC",
  "title": "Non-programmers Assessing AI-Generated Code: A Case Study...",
  "authors": ["Yuvraj Virk", "Dongyu Liu"],
  "abstract": "Non-technical end-users increasingly rely on AI code generation...",
  "submitted_info": "Submitted 8 August, 2025; originally announced August 2025.",
  "comments": "Accepted by VL/HCC 2025"
}
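
Two of these fields pack structured values into strings. The sketch below pulls the bare identifier out of `arxiv_id` and the submission date out of `submitted_info`. It assumes both always follow the patterns shown in the example above, and the function names are illustrative.

```python
import re
from datetime import date, datetime
from typing import Optional


def bare_arxiv_id(arxiv_id: str) -> str:
    """Strip the leading 'arXiv:' prefix, if present (Python 3.9+)."""
    return arxiv_id.removeprefix("arXiv:")


def submitted_date(submitted_info: str) -> Optional[date]:
    """Parse the date from text like 'Submitted 8 August, 2025; ...'."""
    match = re.search(r"Submitted (\d{1,2} [A-Za-z]+, \d{4})", submitted_info)
    if match is None:
        return None  # format differs from the example; caller decides what to do
    # %B expects English month names under the default C/English locale.
    return datetime.strptime(match.group(1), "%d %B, %Y").date()
```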

📫 Support

😊 Leave us a star ⭐⭐⭐⭐⭐ if you are satisfied with the product! 🌍 For any questions, specific needs, or issues, please reach out through Apify's platform or by email. Storm_Scraper 🌪️🌩️