Pricing

from $2.00 / 1,000 paper fetcheds

ArXiv Preprint Paper Search

Search and extract preprint research papers from the ArXiv open-access repository. Query over 2.4 million academic papers across physics, mathematics, computer science, biology, economics, and more with structured JSON output, no API key required.

Pricing

from $2.00 / 1,000 paper fetcheds

Rating

0.0

(0)

Developer

Ryan Clinton

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

categories

Optional

All ArXiv categories assigned to the paper

Type:

authors

Optional

Comma-separated author names

Type:string | null

authorList

Optional

List of author names

Type:

pdfUrl

Optional

Direct URL to the PDF

Type:string | null

absUrl

Optional

URL to the abstract page

Type:string | null

doi

Optional

Digital Object Identifier (an arXiv-minted 10.48550/arXiv DOI is not external publication)

Type:string | null

journalRef

Optional

Journal reference — present when the preprint was published in a venue

Type:string | null

comment

Optional

Author-provided comment (venue acceptance, page count, code links)

Type:string | null

extractedAt

Optional

ISO timestamp when the record was extracted

Type:string | null

versionCount

Optional

Number of arXiv versions posted (from the vN suffix)

Type:integer | null

publicationStatus

Optional

Stable enum: published (journal_ref or external DOI) | accepted (venue acceptance in the comment) | preprint

Type:string | null

peerReviewStatus

Optional

Stable enum: published | accepted | preprint-only

Type:string | null

venue

Optional

Detected publication or acceptance venue, when present

Type:string | null

hasCode

Optional

A code/repository link was found in the comment or abstract

Type:boolean | null

codeUrl

Optional

First detected code repository URL, if any

Type:string | null

withdrawn

Optional

Author comment marks the paper as withdrawn

Type:boolean | null

revisionActivity

Optional

Stable enum: single-version | revised | heavily-revised

Type:string | null

recencyDays

Optional

Days since the paper was first posted

Type:integer | null

freshness

Optional

Stable enum: cutting-edge (<90d) | recent (<1y) | established (<3y) | older

Type:string | null

crossListed

Optional

Paper carries more than one ArXiv category

Type:boolean | null

interdisciplinary

Optional

Categories span more than one top-level archive (e.g. cs + math)

Type:boolean | null

relevanceScore

Optional

0-100 search-relevance axis (rank-derived) — how well the paper matched the query

Type:integer | null

maturityScore

Optional

0-100 trust/maturity axis — peer-review status, code, revisions, collaboration. Distinct from relevanceScore.

Type:integer | null

maturityFactors

Optional

Breakdown of the maturity score: [{ factor, points }]

Type:

maturityTier

Optional

Stable enum: peer-reviewed | venue-accepted | established-preprint | fresh-preprint

Type:string | null

priorityScore

Optional

0-100 mode-weighted ordering scalar (the field to sort by)

Type:integer | null

citationRisk

Optional

Stable enum: low | medium | high — risk of citing this paper as-is

Type:string | null

citationRiskReasons

Optional

Plain-English reasons behind the citation-risk level

Type:

ragSafe

Optional

Safe to index into a RAG/LLM corpus (has a substantive abstract, not withdrawn)

Type:boolean | null

ragSafeReason

Optional

Why the paper is or is not RAG-safe

Type:string | null

recommendedAction

Optional

Type:string | null

why

Optional

Plain-English reasons for the recommendation

Type:

signalReason

Optional

Reasoning chain behind the classification (publication status, peer review, versions, code, recency, maturity)

Type:

isLandmark

Optional

Earliest paper of its top-level archive within this result set

Type:boolean | null

landmarkReason

Optional

Why the paper was tagged a landmark

Type:string | null

summary

Optional

LLM-quotable one-line summary (≤280 chars)

Type:string | null

canonicalArxivId

Optional

ArXiv ID without the version suffix (stable identity across versions)

Type:string | null

version

Optional

Version number of this record (from the vN suffix)

Type:integer | null

statusConfidence

Optional

Confidence in the publication-status classification: high | medium | low

Type:string | null

venueNormalized

Optional

Parsed venue: { raw, venueName, venueYear, venueType (conference|journal|workshop|unknown), confidence }

Type:

categoryNames

Optional

Human-readable names for the ArXiv category codes (null-safe; code echoed when unknown)

Type:

codeUrls

Optional

All detected code/repository URLs

Type:

codeHost

Optional

Host of the first detected code URL (e.g. github.com)

Type:string | null

paperLifecycle

Optional

Lifecycle flags: { withdrawn, superseded, replacementHint, errataHint, statusConfidence }

Type:

citation

Optional

Deterministic citation companion: { preferredCitationTarget, citationWarning, versionAwareCitationNote, bibtexKey, bibtex } (when includeCitationFields). BibTeX only — style-formatted strings are intentionally not generated.

Type:

evidence

Optional

Inspectable evidence ledger: { statusSignals[], riskSignals[], scoreTrace[] } (when includeEvidenceLedger)

Type:

paperType

Optional

Type:string | null

isSurvey

Optional

True when the title/abstract identify a survey/review/overview

Type:boolean | null

surveyConfidence

Optional

Confidence the paper is a survey: high | medium | low

Type:string | null

foundationalCandidate

Optional

Earliest ESTABLISHED (published/accepted or revised) paper of its archive WITHIN THIS RESULT SET — a metadata-only candidate, not a citation-based seminal/importance claim

Type:boolean | null

researchRole

Optional

Type:string | null

foundationalReason

Optional

Why the paper was tagged foundational

Type:string | null

authorCount

Optional

Number of authors

Type:integer | null

firstAuthor

Optional

First author name

Type:string | null

lastAuthor

Optional

Last author name (often the senior/PI author)

Type:string | null

largeCollaboration

Optional

True when 10+ authors (large collaboration)

Type:boolean | null

role

Optional

On role-coverage records: the research role (survey / foundational / benchmark / dataset / methodology / state-of-art / reproducible)

Type:string | null

status

Optional

On role-coverage records: covered | missing

Type:string | null

count

Optional

On role-coverage records: number of papers in the result set with this role

Type:integer | null

Other properties may be included.

arXiv Preprint Scraper

parseforge/arxiv-scraper

Export preprints from arXiv.org. Search 2.5M+ open-access papers across physics, mathematics, computer science, biology, economics, and quantitative finance. Query by keyword, author, category, or date range. Pull titles, authors, abstracts, categories, DOIs, journal refs, and PDF links.

ParseForge

5.0

arXiv Scraper

jungle_synthesizer/arxiv-scraper

Export preprints from arXiv.org. Search 2.5M+ open-access papers across physics, mathematics, computer science, biology, economics, and quantitative finance. Query by keyword, author, category, or date range. Returns titles, authors, abstracts, categories, and PDF links.

BowTiedRaccoon

arXiv Paper Scraper — Abstracts, Authors & Metadata

logiover/arxiv-paper-scraper

Scrape research paper metadata from arXiv.org the worlds largest open-access repository. Search by keyword across computer science physics mathematics biology. Returns titles abstracts authors categories PDF links and DOIs. No API key required.

Logiover

ArXiv Preprint Paper Search

scrupulous_waterbird_m4w/arxiv-papers

Search and extract arXiv preprint papers by category, author, title, and date range. Returns title, authors, abstract, PDF URL, categories, primary category, and submission date as structured records.

Mori

arXiv Preprint Scraper

chrisp1211/arxiv-scraper-max

arXiv Preprint Scraper. No API key required. Pay only per result; empty or failed runs cost nothing.

Christian Pichichero

ArXiv Papers Scraper

leftwinglautus/arxiv-papers-scraper

Search and scrape academic papers from the arXiv API by keyword, category, or author.

Moeeze Hassan

arXiv Recent CS Papers Scraper | JSON Export

trisert/arxiv-recent-cs-papers

Extract the latest Computer Science papers from arXiv into structured JSON (title, authors, arXiv ID, abstract). Pay per result.

Nicola Destro

ArXiv Paper Search

gentle_cloud/arxiv-paper-search

Search and extract academic papers from ArXiv. Find papers by keyword, author, or category with full metadata including title, authors, abstract, categories, and PDF links.

Monkey Coder

arXiv Paper Scraper

cloud9_ai/arxiv-paper-scraper

Scrape academic papers from arXiv.org. Search by keyword, browse categories, or get latest papers. Extract titles, abstracts, authors, PDF links, and citation data via arXiv API.

cloud9

arXiv Paper Scraper — Search Academic Papers & Abstracts

puskin/arxiv-scraper

Search and retrieve academic papers from arXiv by keyword, author, or category. Extracts titles, authors, abstracts, and download links via the free arXiv API — no authentication needed.