Unpaywall Scraper avatar

Unpaywall Scraper

Pricing

Pay per event

Go to Apify Store
Unpaywall Scraper

Unpaywall Scraper

Discover open access research articles with our powerful Unpaywall scraper! Search through millions of articles in the Unpaywall database to find free-to-read scholarly publications. Perfect for researchers, librarians, and academics who need to find and access open access articles efficiently.

Pricing

Pay per event

Rating

5.0

(1)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

1

Bookmarked

3

Total users

1

Monthly active users

2 days ago

Last modified

Share

ParseForge Banner

πŸ“š Unpaywall Scraper

Find open access research articles instantly with our powerful Unpaywall scraper. Search the Unpaywall database to discover millions of free, legal open-access versions of scholarly publications, download research data as CSV or JSON, access complete publication metadata without paywalls, and monitor citation metrics across your research topics. Whether you're an academic institution looking for an unpaywall scraper free, a librarian needing to bulk download research papers CSV, or a researcher who wants an unpaywall data service, this tool eliminates the barrier to discovering and accessing peer-reviewed content.

The Unpaywall Scraper finds free, legal open-access research articles across millions of publications - up to 22 data fields per article - with filtering by journal type, publication year, and open-access status.

✨ What Does It Do

  • πŸ“– Article Title - Extract full publication titles for literature reviews and reference management
  • πŸ”— DOI and DOI URL - Access direct Digital Object Identifiers and persistent links to each article for citation purposes
  • βœ… Open Access Status - Identify which articles are freely available and their specific OA classification (gold, green, hybrid, bronze)
  • πŸ“Š Citation Metrics - Track how many times each article has been cited to gauge research impact and relevance
  • πŸ‘₯ Author Information - Collect author names and affiliations to build researcher networks and collaboration maps
  • πŸ“° Journal Metadata - Extract journal names, ISSNs, publisher details, and whether journals themselves are open access
  • πŸ—“οΈ Publication Year and Dates - Filter research by era and track when articles were updated in the database
  • πŸ›οΈ Repository Information - Find whether articles have copies in institutional repositories for backup access
  • πŸ”— PDF and Landing Page URLs - Direct links to both the open-access PDF and the article's official webpage
  • πŸ“ˆ Relevance Scoring - Understand search result ranking to prioritize the most relevant articles for your query

πŸ”§ Input

  • Search Query - enter one or more keywords to search article titles in the Unpaywall database. The system finds articles containing all search terms. Examples: machine learning, COVID-19 vaccines, climate change impacts
  • Max Items - limit the number of articles to collect per run. Free users can collect up to 100 results, paid users can collect up to 1,000,000. Leave blank for free users to default to 100
  • Open Access Filter - choose to view all articles, only open access articles, or only closed access articles. This narrows results without needing to search again
{
"query": "software",
"maxItems": 10,
"is_oa": "any"
}

πŸ“Š Output

Each article includes up to 22 data fields. Download as JSON, CSV, or Excel.

πŸ“– Article TitleπŸ”— DOIπŸ”— DOI URL
βœ… Open Access StatusπŸ“Š OA ClassificationπŸ“… Publication Year
πŸ‘₯ AuthorsπŸ“° Journal NameπŸ“° Journal ISSN
πŸ“° Publisher NameπŸ›οΈ Journal is OAπŸ›οΈ In DOAJ Directory
πŸ›οΈ Repository Copy AvailableπŸ”— Best OA LocationπŸ”— First OA Location
πŸ’¬ Citation CountπŸ“ˆ Relevance Score🎯 Content Genre
πŸ“… Last UpdatedπŸ”— Citation LinksπŸ• Scraped Timestamp

πŸ’Ž Why Choose the Unpaywall Scraper?

FeatureOur ActorSimilar Scrapers
Search by keyword across entire databaseβœ”οΈPartial
Open access status filter (gold/green/hybrid/bronze)βœ”οΈβŒ
Citation count and impact metricsβœ”οΈβŒ
Repository copy availability detectionβœ”οΈβŒ
Journal-level OA and DOAJ directory dataβœ”οΈβŒ
Author and affiliation extractionβœ”οΈPartial
Paid tier (up to 1,000,000 articles)βœ”οΈβŒ
Real-time progress logging during collectionβœ”οΈβŒ
Multi-filter searches in one runβœ”οΈβŒ
Complete author ORCID data retrievalβœ”οΈβŒ
Relevance scoring for search resultsβœ”οΈβŒ
Support for complex query filtersβœ”οΈβŒ

πŸ“‹ How to Use

No technical skills required. Follow these simple steps:

  1. Sign Up: Create a free account with $5 credit
  2. Find the Tool: Search for "Unpaywall Scraper" in the Apify Store and configure your input
  3. Run It: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

🎯 Business Use Cases

  • πŸ“Š Academic Librarians - Search for open access articles by topic to identify which journals and publishers offer free content, then negotiate better licensing agreements based on actual institution usage patterns
  • πŸ”¬ Researchers and PhD Students - Collect all open access papers in your field of study to build a comprehensive literature review dataset, filter by publication year and citation count, and export directly into your reference management tool
  • πŸ’Ό Publishing Analytics Teams - Monitor competitor publishers' open access prevalence and citation metrics across research domains to identify market gaps and adjust publishing strategy before launching new journal titles

❓ FAQ

πŸ” How does this actor work? It searches the Unpaywall database using your keywords and retrieves article metadata, open-access status, and direct links to free PDF versions instantly.

πŸ“Š How accurate is the data? Data comes directly from Unpaywall (unpaywall.org), a vetted, publicly-available database of millions of open-access research articles maintained by the Scholarly Kitchen and Crossref.

πŸ“… Can I schedule this to run automatically? Yes. Use the Apify platform's scheduler or integrate with Make, Zapier, or GitHub to run the scraper on a daily, weekly, or monthly basis.

βš–οΈ Is it legal to scrape Unpaywall? Yes. Unpaywall data is public and open access by design. You are collecting publicly-available information about free scholarly content. Always verify that you comply with local laws and terms of service.

πŸ›‘οΈ Will Unpaywall block my requests? Unpaywall is designed to be accessed by researchers and institutions. The scraper respects rate limits and follows best practices. No special proxy configuration is required.

⚑ How long does a run take? Search time depends on your query and max items. Typical runs collect 100 articles in 2-5 seconds. Larger batches (10000+ articles) take 1-3 minutes.

⚠️ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.

πŸ”— Integrate Unpaywall Scraper with any app

πŸ’‘ More ParseForge Actors

  • Etsy Scraper - Extract product listings, prices, and seller information from Etsy
  • Crunchbase Scraper - Collect company profiles, funding data, and investment information
  • Indeed Scraper - Extract job listings, salaries, and applicant details from Indeed
  • Redfin Scraper - Gather real estate listings, prices, and market data from Redfin

Browse our complete collection of data extraction tools for more.

πŸš€ Ready to Start?

Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.

πŸ†˜ Need Help?

  • Check the FAQ section above for common questions
  • Visit the Apify support page for documentation and tutorials
  • Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form

⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Unpaywall or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.