Pricing

Pay per usage

Try for free

Go to Apify Store

Google Scholar Scraper

Try for free

Scrape publication details from scholar.google.com. Add your query, time range, and optionally document type (PDF or HTML only). Extract information about articles such as titles, authors, links, related articles, and more.

Pricing

Pay per usage

Rating

5.0

(6)

Developer

Marco Gullo

Actor stats

Bookmarked

1.9K

Total users

Monthly active users

a year ago

Last modified

🎓 What is Google Scholar Scraper?

Google Scholar Scraper is a web scraping tool that enables you to quickly extract publication data from scholar.google.com. Just enter your search query and scrape publication details such as authors, article titles, citations, dates, and more.

📖 What can this Google Scholar Scraper do?

Google Scholar Scraper is a data extraction tool created to serve as an alternative to Google Scholar API. With this scraping tool, you can:

🔍 Extract publications metadata by search query

⌛️ Specify the time range for your search

📄 Filter out articles by document type: PDFs only or HTMLs only, All documents or Reviews only

📒 Set up sorting by date or relevance

⬇️ Export data in formats such as Excel, CSV, JSON, HTML

🦾 Use the API in Python and Node.js, API Endpoints, webhooks, and integrations with other apps

📕 What data can this Google Scholar Scraper extract?

Google Scholar Scraper is capable of extracting publication details such as:

📚 Document type	📝 Title
🔗 Document link	📄 Additional document link
🔍 Full attribution	👥 Authors
📅 Publication	📆 Publication year
🔍 Source	🔎 Search match
📖 Citations	🔗 Link to citations
🔗 Link to related articles	🥉 Versions

💸 How much does it cost to scrape articles from Google Scholar?

When it comes to scraping, it can be challenging to estimate the resources needed to extract data, as use cases may vary significantly. That's why the best course of action is to run a test scrape with a small sample of input data and limited output. You’ll get your price per scrape, which you’ll then multiply by the number of scrapes you intend to do.

Apify provides you with $5 free usage credits to use every month on the Apify Free plan. That should be enough to give this scraper a test drive.

Watch this video for a few helpful tips. And don't forget that choosing a higher plan will save you money in the long run.

👨🏻‍🏫 How do I use Google Scholar Scraper to extract data?

This Google Scholar Scraper was designed for an easy start even if you've never extracted article data from the web before. Here's how you can scrape data from Google Scholar search with this tool:

Create a free Apify account using your email.
Open Google Scholar Scraper.
Enter your search queries.
Customize your search parameters, such as time range or document type.
Click "Start" and wait for the data to be extracted.
Export your Google Scholar data in Excel, CSV, JSON, or other formats.

You can also follow this guide on scraping Google Scholar.

⬇️ Input

The input for Google Scholar Scraper should be one search query. You can also specify additional parameters such as the time range, document type (PDFs or HTML only), sorting type, or scraping article reviews specifically.

Here's a simple input example of scraping research papers about COVID published after 2020 and sorted by date:

{
  "articleType": "any",
  "enableDebugDumps": false,
  "filter": "all",
  "keyword": "COVID-19",
  "maxItems": 100,
  "newerThan": 2020,
  "proxyOptions": {
    "useApifyProxy": true
  },
  "sortBy": "date"
}

Click on the input tab for a full explanation of input parameters.

⬆️ Output sample

The extracted Google Scholar data will be shown as a dataset which you can find in the Output tab. Note that the output will first be organized as a table for viewing convenience.

You can preview all the fields in the Storage and Output tabs and choose the format in which to export the Google Scholar data you've extracted: JSON, CSV, Excel, or HTML table. Here below is a sample dataset in JSON:

{
    "cidCode": "rvTRXmkWdSIJ",
    "didCode": "rvTRXmkWdSIJ",
    "lidCode": "",
    "aidCode": "rvTRXmkWdSIJ",
    "resultIndex": 3,
    "type": "ARTICLE",
    "title": "… OF THREE-TIME POINT ESTIMATION OF INFLAMMATORY MARKERS WITH THE SEVERITY AND OUTCOME IN PATIENTS OF COVID-19 IN A TERTIARY CARE …",
    "link": "https://www.jpmi.org.pk/index.php/jpmi/article/view/3251",
    "documentLink": "N/A",
    "documentType": "N/A",
    "fullAttribution": "M Hussain, S Orakzai, MM Dawood, A Ijaz… - Journal of Postgraduate …, 2024 - jpmi.org.pk",
    "authors": "M Hussain, S Orakzai, MM Dawood, A Ijaz…",
    "publication": "Journal of Postgraduate …",
    "year": 2024,
    "source": "jpmi.org.pk",
    "searchMatch": "2 days ago - … COVID-19 Quality & Clinical Research Collaborative. C-reactive protein as a \nprognostic indicator in hospitalized patients with COVID-19. … fatalities caused by COVID-19: a …",
    "citations": 0,
    "citationsLink": "N/A",
    "relatedArticlesLink": "https://scholar.google.com/scholar?q=related:rvTRXmkWdSIJ:scholar.google.com/&scioq=COVID-19&hl=en&scisbd=1&as_sdt=0,33",
    "versions": 2,
    "versionsLink": "https://scholar.google.com/scholar?cluster=2482915411382891694&hl=en&scisbd=1&as_sdt=0,33"
  },
  {
    "cidCode": "UZ71Uw_IxggJ",
    "didCode": "UZ71Uw_IxggJ",
    "lidCode": "",
    "aidCode": "UZ71Uw_IxggJ",
    "resultIndex": 4,
    "type": "ARTICLE",
    "title": "Environmental Impact of Covid-19 Pandemic in Owerri Metropolis, Imo State of Nigeria",
    "link": "https://hspublishing.org/GRES/article/view/363",
    "documentLink": "N/A",
    "documentType": "N/A",
    "fullAttribution": "CV Amadi, RF Njoku-Tony - Global Research in Environment and …, 2024 - hspublishing.org",
    "authors": "CV Amadi, RF Njoku-Tony",
    "publication": "Global Research in Environment and …",
    "year": 2024,
    "source": "hspublishing.org",
    "searchMatch": "2 days ago - … environmental impact of COVID-19 in Owerri metropolis … environmental impacts \nof COVID-19 pandemic in Owerri … environmental impact of COVID-19 pandemic in Owerri …",
    "citations": 0,
    "citationsLink": "N/A",
    "relatedArticlesLink": "https://scholar.google.com/scholar?q=related:UZ71Uw_IxggJ:scholar.google.com/&scioq=COVID-19&hl=en&scisbd=1&as_sdt=0,33",
    "versions": 0,
    "versionsLink": "N/A"
  },
  {
    "cidCode": "M3C8n-b4NGsJ",
    "didCode": "M3C8n-b4NGsJ",
    "lidCode": "",
    "aidCode": "M3C8n-b4NGsJ",
    "resultIndex": 5,
    "type": "HTML",
    "title": "Identification of factors affecting student academic burnout in online education during the COVID-19 pandemic using grey Delphi and grey-DEMATEL …",
    "link": "https://www.nature.com/articles/s41598-024-53233-7",
    "documentLink": "https://www.nature.com/articles/s41598-024-53233-7",
    "documentType": "HTML",
    "fullAttribution": "A Aria, P Jafari, M Behifar - Scientific Reports, 2024 - nature.com",
    "authors": "A Aria, P Jafari, M Behifar",
    "publication": "Scientific Reports",
    "year": 2024,
    "source": "nature.com",
    "searchMatch": "2 days ago - … Although after the end of Covid-19, most educational institutions have returned \nto the … online education in the post-Covid-19 era by gaining valuable experience during the …",
    "citations": 0,
    "citationsLink": "N/A",
    "relatedArticlesLink": "https://scholar.google.com/scholar?q=related:M3C8n-b4NGsJ:scholar.google.com/&scioq=COVID-19&hl=en&scisbd=1&as_sdt=0,33",
    "versions": 0,
    "versionsLink": "N/A"
  },
  {
    "cidCode": "X68f7LOXWUoJ",
    "didCode": "X68f7LOXWUoJ",
    "lidCode": "",
    "aidCode": "X68f7LOXWUoJ",
    "resultIndex": 6,
    "type": "ARTICLE",
    "title": "Reframing the Service Environment in Collegiate Sport: A Transformative Sport Service Research Approach",
    "link": "https://journals.ku.edu/jis/article/view/19739",
    "documentLink": "N/A",
    "documentType": "N/A",
    "fullAttribution": "Y Yang, E Gray, K Kinoshita… - Journal of Intercollegiate …, 2024 - journals.ku.edu",
    "authors": "Y Yang, E Gray, K Kinoshita…",
    "publication": "Journal of Intercollegiate …",
    "year": 2024,
    "source": "journals.ku.edu",
    "searchMatch": "2 days ago - This study applies a transformative sport service research approach to \nexamine student-athletes’ wellness within a collegiate sport setting. Sixteen semi-structured …",
    "citations": 0,
    "citationsLink": "N/A",
    "relatedArticlesLink": "https://scholar.google.com/scholar?q=related:X68f7LOXWUoJ:scholar.google.com/&scioq=COVID-19&hl=en&scisbd=1&as_sdt=0,33",
    "versions": 0,
    "versionsLink": "N/A"
  },
  {
    "cidCode": "1fxjSu8kPT4J",
    "didCode": "1fxjSu8kPT4J",
    "lidCode": "",
    "aidCode": "1fxjSu8kPT4J",
    "resultIndex": 7,
    "type": "ARTICLE",
    "title": "THE LEADERSHIP OF THE MADRASA PRINCIPAL IN ENHANCING LEARNING QUALITY AMIDST COVID-19 PANDEMIC IN CENTRAL ACEH REGENCY",
    "link": "https://jurnal-assalam.org/index.php/JAS/article/view/703",
    "documentLink": "N/A",
    "documentType": "N/A",
    "fullAttribution": "B Mizal, T Tathahira, RI Basith - Jurnal As-Salam, 2024 - jurnal-assalam.org",
    "authors": "B Mizal, T Tathahira, RI Basith",
    "publication": "Jurnal As-Salam",
    "year": 2024,
    "source": "jurnal-assalam.org",
    "searchMatch": "2 days ago - … The COVID-19 pandemic is a scourge for education actors, especially school \nand … the quality of learning during the COVID-19 pandemic. This research is classified as …",
    "citations": 0,
    "citationsLink": "N/A",
    "relatedArticlesLink": "https://scholar.google.com/scholar?q=related:1fxjSu8kPT4J:scholar.google.com/&scioq=COVID-19&hl=en&scisbd=1&as_sdt=0,33",
    "versions": 2,
    "versionsLink": "https://scholar.google.com/scholar?cluster=4484781414094732501&hl=en&scisbd=1&as_sdt=0,33"
  },
...

📚 What are other tools for scraping Google?

If you need to scrape specific data from Google Scholar, you can try these tools:

📍 Google Maps Extractor	🔍 Google Search Scraper
📉 Google Trending Searches	📈 Google Trends Scraper
👁 Google Lens Actor	🎑 Google Image Scraper
📩 Google Maps Email Extractor	🤟 Google Datasets Translator

❓FAQ

Is there an official Google Scholar API?

No, which makes researchers unable to directly access Google Scholar data using Google's APIs. Since there isn't an official way to get data from Google Scholar, people use other ways like web scraping or open-source APIs. Much like the API, web scraping tools like Google Scholar Scraper can visit the Google Scholar website, conduct a search, and extract article and author information from the pages they find.

Can I integrate Google Scholar Scraper with other apps?

Yes. This Google Scholar Scraper can be connected with almost any cloud service or web app thanks to integrations on the Apify platform. You can integrate with Make, Zapier, Slack, Airbyte, GitHub, Google Sheets, Google Drive, LangChain and more.

Or you can use webhooks to carry out an action whenever an event occurs, e.g. get a notification whenever Google Scholar Scraper successfully finishes a run.

Can I use Google Scholar Scraper as its own API?

Yes, you can use the Apify API to access Google Scholar Scraper programmatically. The API allows you to manage, schedule, and run Apify Actors, access datasets, monitor performance, get results, create and update Actor versions, and more.

To access the API using Node.js, you can use the apify-client NPM package. To access the API using Python, you can use the apify-client PyPI package.

For detailed information and code examples, refer to the Apify API documentation.

Can I use this Google Scholar API in Python?

Yes, you can use the Apify API with Python. To access the Google Scholar API with Python, use the apify-client PyPI package. You can find more details about the client in our Python Client documentation.

Not your cup of tea? Build your own Google Scholar scraper.

Google Scholar Scraper doesn’t exactly do what you need? You can always build one of your own! We have various web scraping templates in Python, JavaScript, and TypeScript to get you started. Alternatively, you can write it from scratch using our open-source library Crawlee. You can keep the scraper to yourself or make it public by adding it to Apify Store (and find users for it).

Your feedback

We’re always working on improving the performance of our Actors. So if you’ve got any technical feedback for Google Scholar Scraper or simply found a bug, please create an issue on the Actor’s Issues tab in Apify Console.

Google Scholar Scraper

crawlerbros/google-scholar-scraper

Scrape academic papers, articles, and citations from Google Scholar. Search by keywords with filters for year range, document type, sort order, and article type. Extract titles, authors, citations, links, and more.

Crawler Bros

5.0

Google Scholar Scraper

george.the.developer/google-scholar-scraper

Scrape Google Scholar for academic papers, citations, author profiles. No API key needed. Extract titles, authors, abstracts, citation counts, PDF links, h-index, i10-index. Export JSON, CSV, Excel. Anti-bot protection with residential proxies, UA rotation, CAPTCHA detection.

George Kioko

102

5.0

Google Scholar Scraper

easyapi/google-scholar-scraper

Powerful Google Scholar scraper collect up to 5000 scholarly results per run with flexible search options, citation filtering. Perfect for academic research, bibliometric analysis, and scientific trend tracking. 🎓🔍

EasyApi

405

2.5

Semantic Scholar Scraper

parseforge/semantic-scholar-scraper

Extract detailed academic paper data from Semantic Scholar, including abstracts, citations, authors, and publication details. Ideal for researchers, academics, and analysts who need structured scholarly data for literature reviews, research workflows, and large-scale academic analysis.

ParseForge

1.1

California State Licensed Contractor CSLB Scraper

parseforge/cslb-california-scraper

Boost your contractor research with our comprehensive California State Licensed Contractor Scraper! Perfect for companies, project managers, and compliance professionals who need complete contractor information including license numbers, business names, addresses, phone numbers, and license types.

ParseForge

5.0

Etsy Scraper Pro

webdatalabs/etsy-scraper-pro

Fast and reliable Etsy scraper that extracts product listings from search results. Get product titles, prices, ratings, reviews, shop names, and images - perfect for market research, price monitoring, competitor analysis, and e-commerce automation.

WebDataLabs

200

3.8

Google Scholar | Research Papers, Citations & Author Profiles

johnvc/google-scholar-api

Scrape Google Scholar at scale. Search research papers, get citation formats (MLA, APA, Chicago, BibTeX), author profiles with h-index and i10-index, list an author's publications, view per-article citation history, & map co-author networks. Six modes in one for lit reviews, bibliometrics, & agents.

John

5.0

Google Scholar Scraper — Papers & Citations

muhammadafzal/google-scholar-scraper

Scrape Google Scholar results with paper titles, authors, publication details, citation counts, related links, and research metadata.

Muhammad Afzal

Google Scholar Article Scraper

agenscrape/google-scholar-article-scraper

Extract academic articles, citations, authors, and publication data from Google Scholar. Perfect for research analysis and literature reviews with fast, reliable scraping.

Agenscrape

Google Scholar Scraper

automation-lab/google-scholar-scraper

Search Google Scholar and extract academic papers. Get titles, authors, citation counts, abstracts, PDF links, and publication details. Supports year filtering.