Pricing

from $7.49 / 1,000 result items

HAL Open Science Scraper

Export research papers, theses, and preprints from HAL, France's national open science archive. 3M+ full-text records across every scientific discipline. Filter by domain, author, lab, journal, or year. Pull titles, abstracts, authors, DOIs, PDFs, citations.

Pricing

from $7.49 / 1,000 result items

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

2 hours ago

Last modified

🚀 HAL Open Science Scraper

🚀 Export French open-access research from HAL. 3M+ papers and theses by domain, author, lab, journal.

Export research papers, theses, and preprints from HAL, Frances national open science archive. 3M+ full-text records across every scientific discipline. Filter by domain, author, lab, journal, or year.

Pull titles, abstracts, authors, DOIs, PDFs, citations.

📋 What the HAL Open Science Scraper does

🎯 Targeted filtering. Use the input schema to narrow results to what you need.
📦 Structured output. Clean, typed records with every field documented.
🔄 Live data. Every run fetches fresh data at runtime, no cached responses.
🔌 Easy integration. Consume via Apify API, webhooks, or direct dataset export.
📊 Scale on demand. Run once or run on a schedule, the same way.

💡 Why it matters: teams that rely on this source no longer need to babysit a custom crawler. Set up your filters once, get updated data on demand.

📊 Data fields

Each record includes: abstract, abstractEnglish, audience, authors, bookTitle, citationFull, citationRef, conferenceTitle, docId, documentType, doi, domainLabels, domains, edition, halId, halUrl, institutions, isbn, issue, journalIssn, journalTitle, keywords, keywordsEnglish, labs, labStructures, language, openAccess, pages, pdfUrl, peerReviewed, popularLevel, primaryDomain, publicationDate, publisher, scrapedAt, title, titleEnglish, volume, year. These field names come straight from the actor's dataset schema, so what you see here is what lands in your dataset.

⚠️ Good to Know: free users are limited to 10 items per run for preview purposes. Upgrade to Apify paid plans for higher limits.

🚀 How to use

📝 Create a free account. Sign up at console.apify.com to get $5 in credits.
🔍 Open the actor. Paste your filters into the input schema in the Apify console.
▶️ Click Start. Wait a few seconds for the first records to land.
📤 Export the data. Download JSON/CSV or pipe to webhooks, Google Sheets, or Zapier.
🔄 Schedule it. Apify Schedules let you rerun on a cron cadence for free.

⏱️ Total time to first data: about 60 seconds.

🔗 Recommended Actors

Pair the HAL Open Science Scraper with related actors:

🌐 Website Content Crawler - crawl any page at scale
🔍 Google Search Scraper - harvest SERPs
📄 Article Extractor - extract clean article text
📊 Google Trends Scraper - capture demand signals
📸 Screenshot URL - render any page to image

💡 Pro Tip: browse the complete ParseForge collection for more niche actors.

⚠️ Disclaimer: This actor retrieves data from publicly available sources. You are responsible for complying with the source website's terms of service and applicable laws in your jurisdiction. ParseForge is not affiliated with the data source.

🆘 Need Help?

If you hit a bug, have questions about setup, or need a scraper we haven't built yet, open our contact form or write to parseforge@protonmail.com. We also take on paid custom data projects.

For faster answers, join our Discord. It's the best place to get support and suggest new actors.

HAL Open Archives Scraper - Research Publications

benthepythondev/hal-open-archives-scraper

Search HAL Open Science and export document IDs, labels and canonical publication URLs for research workflows.

Ben

OSF Open Science Framework Scraper

parseforge/osf-scraper

Export public research projects, preprints, and registrations from the Open Science Framework (OSF). Search across 1M+ open science records. Filter by keyword, subject, or provider. Pull titles, descriptions, tags, DOIs, authors, institutions, dates, and full metadata.

ParseForge

arXiv Preprint Scraper

parseforge/arxiv-scraper

Export preprints from arXiv.org. Search 2.5M+ open-access papers across physics, mathematics, computer science, biology, economics, and quantitative finance. Query by keyword, author, category, or date range. Pull titles, authors, abstracts, categories, DOIs, journal refs, and PDF links.

ParseForge

5.0

arXiv Scraper

jungle_synthesizer/arxiv-scraper

Export preprints from arXiv.org. Search 2.5M+ open-access papers across physics, mathematics, computer science, biology, economics, and quantitative finance. Query by keyword, author, category, or date range. Returns titles, authors, abstracts, categories, and PDF links.

BowTiedRaccoon

medRxiv Scraper

parseforge/medrxiv-scraper

Extract comprehensive preprint data from medRxiv, including titles, authors, abstracts, full text, DOIs, citations, and metadata. Automate access to health-science preprints with structured outputs, ideal for researchers and analysts who need reliable, large-scale article data without manual work.

ParseForge

arXiv Paper Scraper - Research Papers & Abstracts

viralanalyzer/arxiv-paper-intelligence

Search and extract ArXiv papers, abstracts, authors, and citations. Track research trends across any scientific field. AI-powered analysis.

viralanalyzer

5.0

OSF Open Science Framework Projects Scraper

parseforge/osf-projects-scraper

Search the Open Science Framework for public research projects by keyword or category. Returns project IDs, titles, descriptions, contributors, public flags, date created, date modified, and tag lists. Useful for meta science, scholarly discovery, and tracking research outputs across labs.

ParseForge

arXiv Paper Scraper — Abstracts, Authors & Metadata

logiover/arxiv-paper-scraper

Scrape research paper metadata from arXiv.org the worlds largest open-access repository. Search by keyword across computer science physics mathematics biology. Returns titles abstracts authors categories PDF links and DOIs. No API key required.

Logiover

Academic Research & Papers Scraper (OpenAlex)

rupom888/academic-research-scraper

Search 200M+ academic papers, researchers, and institutions via OpenAlex API. Completely free, no API key needed. Get paper titles, abstracts, DOIs, citations, authors, open access links, and concepts. Filter by year, paper type, open access, and field of study.