Pricing

from $5.00 / 1,000 results

Google Scholar Scraper — Papers & Citations

Scrape Google Scholar results with paper titles, authors, publication details, citation counts, related links, and research metadata.

Developer

Muhammad Afzal

Last modified

15 days ago

Features

50 papers in ~10 seconds — lightning-fast API-based extraction
Rich academic metadata — title, authors, abstract, venue, year, citation count, PDF link
Citation tracking — exact citation counts for research impact measurement
Year filtering — narrow results to a specific publication year range
Author search — find all papers by a specific researcher (e.g., "Geoffrey Hinton")
Multi-query support — search multiple keywords or authors in a single run
Up to 500 results per query — deep search coverage with automatic pagination
PDF direct links — when available, extract direct PDF download URLs
Journal and conference data — venue name extracted from publication info
Source type detection — classify results as PDF, book, or HTML

Use Cases

Use Case	Description
Literature reviews	Collect papers methodically for academic research and systematic reviews
Bibliometric analysis	Measure research impact, track citation trends, map collaboration networks
Competitor research	Monitor competitor publications and R&D directions
Grant writing	Find related work, citation context, and research gaps for proposals
AI knowledge graphs	Feed structured academic data to LLMs for summarization and classification
Content creation	Generate research-backed articles, newsletters, and educational materials

Input

Field	Type	Default	Description
`searchQueries`	`string[]`	`["machine learning"]`	Keywords or topics to search
`authorUrls`	`string[]`	`[]`	Author names (e.g., `"Geoffrey Hinton"`) or Scholar profile URLs to search
`maxResults`	`integer`	`20`	Max papers per query (1–500)
`yearFrom`	`integer`	—	Filter papers from this year onward
`yearTo`	`integer`	—	Filter papers up to this year
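
The limits in the table can be checked before a run starts. The following is a minimal sketch, assuming the field names and ranges listed above; `buildInput` is a hypothetical helper, not part of the Actor or of `apify-client`.

```javascript
// Hypothetical helper: validates a run input against the Input table
// (maxResults between 1 and 500, yearFrom not later than yearTo).
function buildInput({ searchQueries = [], authorUrls = [], maxResults = 20, yearFrom, yearTo } = {}) {
  if (!Number.isInteger(maxResults) || maxResults < 1 || maxResults > 500) {
    throw new RangeError('maxResults must be an integer from 1 to 500');
  }
  if (yearFrom !== undefined && yearTo !== undefined && yearFrom > yearTo) {
    throw new RangeError('yearFrom must not be later than yearTo');
  }
  const input = { searchQueries, authorUrls, maxResults };
  // Only include the year filters when they were actually supplied.
  if (yearFrom !== undefined) input.yearFrom = yearFrom;
  if (yearTo !== undefined) input.yearTo = yearTo;
  return input;
}

const input = buildInput({
  searchQueries: ['graph neural networks'],
  maxResults: 50,
  yearFrom: 2020,
  yearTo: 2024,
});
console.log(input);
```

The returned object can be passed as the input of a run. Failing early on an out-of-range value avoids paying for a run that would be rejected or return nothing.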

Output

Each record represents one academic paper:

{
  "title": "Attention Is All You Need",
  "authors": ["Ashish Vaswani", "Noam Shazeer", "Niki Parmar"],
  "publicationInfo": "Advances in neural information processing systems, 2017",
  "abstract": "The dominant sequence transduction models are based on complex recurrent or convolutional neural networks...",
  "citationCount": 98432,
  "paperUrl": "https://proceedings.neurips.cc/paper/2017/hash/...",
  "pdfUrl": "https://arxiv.org/pdf/1706.03762.pdf",
  "sourceType": "PDF",
  "year": 2017,
  "citationsUrl": "https://scholar.google.com/scholar?cites=...",
  "relatedUrl": "https://scholar.google.com/scholar?q=related:...",
  "scrapedAt": "2025-08-01T12:00:00.000Z",
  "searchQuery": "transformer attention mechanism"
}
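
As an illustration of working with these records, the sketch below ranks papers by `citationCount`. It assumes records shaped like the example above; `topCited` and the sample data are hypothetical, and a missing count is treated as zero.

```javascript
// Hypothetical helper: returns the n most-cited records, highest first.
function topCited(records, n) {
  return [...records] // copy so the caller's array is not reordered
    .sort((a, b) => (b.citationCount ?? 0) - (a.citationCount ?? 0))
    .slice(0, n)
    .map(({ title, year, citationCount }) => ({
      title,
      year,
      citationCount: citationCount ?? 0,
    }));
}

// Made-up sample records, for illustration only.
const sampleRecords = [
  { title: 'Paper A', year: 2019, citationCount: 120 },
  { title: 'Paper B', year: 2021 },
  { title: 'Paper C', year: 2017, citationCount: 4500 },
];

const ranked = topCited(sampleRecords, 2);
console.log(ranked.map((r) => r.title));
```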

API Usage

import { ApifyClient } from 'apify-client';

// Read the API token from the environment instead of hard-coding it.
const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

// Start the Actor and wait for the run to finish.
const run = await client.actor('muhammadafzal/google-scholar-scraper').call({
  searchQueries: ['large language models', 'RLHF reinforcement learning'],
  maxResults: 50,
  yearFrom: 2022,
});

// Fetch the scraped papers from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Found ${items.length} papers`);
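
Because the run above sends two queries, all papers land in one dataset. A minimal sketch of splitting them apart again, assuming each record carries the `searchQuery` field shown in the Output section (`groupByQuery` is a hypothetical helper):

```javascript
// Hypothetical helper: groups dataset items by the query that produced them.
function groupByQuery(items) {
  const groups = new Map();
  for (const item of items) {
    const key = item.searchQuery ?? '(unknown query)';
    if (!groups.has(key)) groups.set(key, []);
    groups.get(key).push(item);
  }
  return groups;
}

// Made-up sample items, for illustration only.
const sampleItems = [
  { title: 'Paper A', searchQuery: 'large language models' },
  { title: 'Paper B', searchQuery: 'RLHF reinforcement learning' },
  { title: 'Paper C', searchQuery: 'large language models' },
];

for (const [query, papers] of groupByQuery(sampleItems)) {
  console.log(`${query}: ${papers.length} papers`);
}
```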

Pricing

This actor charges per paper returned.

Volume	Estimated cost (at $5.00 / 1,000 results)
100 papers	~$0.50
1,000 papers	~$5.00
5,000 papers	~$25.00
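
Since the charge is per paper returned, an estimate is a simple linear calculation. The sketch below takes the rate as a parameter because the rate that applies to your account may differ; `estimateCost` is a hypothetical helper, and the $5.00 figure is the "from $5.00 / 1,000 results" price listed for this Actor.

```javascript
// Hypothetical helper: estimated cost in dollars, rounded to cents.
function estimateCost(paperCount, pricePerThousand) {
  if (!Number.isFinite(paperCount) || paperCount < 0) {
    throw new RangeError('paperCount must be a non-negative number');
  }
  return Math.round((paperCount / 1000) * pricePerThousand * 100) / 100;
}

const listedRate = 5.0; // dollars per 1,000 results, as listed for this Actor
console.log(estimateCost(250, listedRate));
```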

FAQ

Q: Does this require a Google account or API key? No — the scraper uses Google Scholar's public data. No credentials needed.

Q: How accurate are the citation counts? Citation counts are taken directly from the counts Google Scholar displays, so they match what you see on the website at the time of the run.

Q: Can I search for papers by a specific author? Yes — use the authorUrls field with the author's name (e.g., "Yann LeCun") or their Scholar profile URL.

Q: Does it extract full paper text? No — it extracts the abstract/snippet shown on Google Scholar. For full text, use the pdfUrl field when available.
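
For the full-text case, a first step is to pick out the records that actually have a PDF link. This is a minimal sketch assuming the `pdfUrl` field from the Output section; `pdfLinks` is a hypothetical helper and downloading the files is left to your own tooling.

```javascript
// Hypothetical helper: keeps only records with a usable pdfUrl.
function pdfLinks(records) {
  return records
    .filter((r) => typeof r.pdfUrl === 'string' && r.pdfUrl.length > 0)
    .map((r) => ({ title: r.title, pdfUrl: r.pdfUrl }));
}

// Made-up sample records, for illustration only.
const scraped = [
  { title: 'Paper A', pdfUrl: 'https://example.org/a.pdf' },
  { title: 'Paper B', pdfUrl: null },
  { title: 'Paper C' },
];

console.log(pdfLinks(scraped));
```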

What is Google Scholar Scraper?

Google Scholar Scraper turns Google Scholar search results into structured, reusable records on Apify. Use it when researchers, analysts, or education and recruiting teams need repeatable collection without maintaining a custom scraper or one-off integration. Run it manually, schedule recurring jobs, call it through the Apify API, or connect it to an AI agent through the Apify MCP server.

The Actor stores results in an Apify dataset, where they can be previewed and exported as JSON, CSV, Excel, XML, or RSS. Availability and completeness depend on the source, supplied inputs, public visibility, authentication requirements, and upstream rate limits.
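
Exports can also be requested over HTTP from the Apify dataset items endpoint, which takes a `format` query parameter. The sketch below only builds the URL; `datasetExportUrl` is a hypothetical helper, the dataset ID is a placeholder, and depending on your dataset's access settings the request may also need your API token.

```javascript
// Hypothetical helper: builds an export URL for a dataset in a given format.
function datasetExportUrl(datasetId, format) {
  // Format values for the export types named above (Excel is "xlsx").
  const allowed = ['json', 'csv', 'xlsx', 'xml', 'rss'];
  if (!allowed.includes(format)) {
    throw new RangeError(`format must be one of: ${allowed.join(', ')}`);
  }
  const url = new URL(`https://api.apify.com/v2/datasets/${encodeURIComponent(datasetId)}/items`);
  url.searchParams.set('format', format);
  return url.toString();
}

// "abc123" is a placeholder, not a real dataset ID.
console.log(datasetExportUrl('abc123', 'csv'));
```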

Use cases for Google Scholar Scraper

Build structured datasets for research, reporting, enrichment, or monitoring.
Automate repetitive collection with schedules, webhooks, and API calls.
Feed clean records into spreadsheets, databases, CRMs, BI tools, AI agents, or RAG pipelines.
Track changes over time by running the same validated input on a schedule.
Replace fragile manual copy-and-paste work with a reproducible Apify workflow.
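
Tracking changes over time usually means comparing two runs of the same input. A minimal sketch, assuming `paperUrl` is stable enough between runs to serve as a key (an assumption, not something the Actor guarantees); `citationDeltas` is a hypothetical helper.

```javascript
// Hypothetical helper: reports papers whose citation count changed between runs.
function citationDeltas(previous, current) {
  const before = new Map(previous.map((r) => [r.paperUrl, r.citationCount ?? 0]));
  return current
    .filter((r) => before.has(r.paperUrl)) // ignore papers new to this run
    .map((r) => ({
      title: r.title,
      delta: (r.citationCount ?? 0) - before.get(r.paperUrl),
    }))
    .filter((d) => d.delta !== 0);
}

// Made-up sample data, for illustration only.
const lastRun = [
  { title: 'Paper A', paperUrl: 'https://example.org/a', citationCount: 100 },
  { title: 'Paper B', paperUrl: 'https://example.org/b', citationCount: 40 },
];
const thisRun = [
  { title: 'Paper A', paperUrl: 'https://example.org/a', citationCount: 112 },
  { title: 'Paper B', paperUrl: 'https://example.org/b', citationCount: 40 },
  { title: 'Paper C', paperUrl: 'https://example.org/c', citationCount: 3 },
];

console.log(citationDeltas(lastRun, thisRun));
```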

How to use Google Scholar Scraper

Open the Actor input page and enter a focused search query or author.
Set a conservative result limit for the first run.
Start the Actor and inspect the dataset for coverage and field availability.
Export the results or connect the dataset to your downstream system.
Scale gradually and use scheduling, pagination, or proxies when supported.
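
The inspection step can be made concrete by measuring how often each field is filled in a small test run. This is a sketch under the assumption that records look like the Output example; `fieldCoverage` is a hypothetical helper.

```javascript
// Hypothetical helper: fraction of items (0 to 1) with a non-empty value per field.
function fieldCoverage(items, fields) {
  const coverage = {};
  for (const field of fields) {
    const filled = items.filter((item) => {
      const value = item[field];
      const isEmptyArray = Array.isArray(value) && value.length === 0;
      return value !== undefined && value !== null && value !== '' && !isEmptyArray;
    }).length;
    coverage[field] = items.length === 0 ? 0 : filled / items.length;
  }
  return coverage;
}

// Made-up sample records, for illustration only.
const testRun = [
  { title: 'Paper A', authors: ['X'], pdfUrl: 'https://example.org/a.pdf' },
  { title: 'Paper B', authors: [], pdfUrl: null },
];

console.log(fieldCoverage(testRun, ['title', 'authors', 'pdfUrl']));
```

Low coverage on a field you depend on is a reason to adjust the query or your downstream expectations before raising the result limit.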

Important input options

searchQueries — Enter one or more keywords, topics, or research areas to search on Google Scholar (e.g., 'machine learning', 'cancer immunotherapy'). Each query returns up to maxResults papers. Use authorUr
authorUrls — Enter author names to find their most cited and relevant papers on Google Scholar (e.g., 'Geoffrey Hinton', 'Yann LeCun'). Returns papers authored by or citing these researchers.
maxResults — Maximum number of papers to extract per search query. Default value is 50. Set lower for faster test runs, higher for comprehensive searches (up to 500 per query).
yearLow — Only include papers published in or after this year. Set to 2020 to focus on recent research, or leave as 2000 for a wider range.
yearHigh — Only include papers published in or before this year. Set to 2025 to exclude upcoming papers.
sortBy — Sort order for search results. Use 'relevance' for most relevant papers first. Use 'date' for newest papers first.
articlesOnly — When enabled, only includes actual articles — excludes patents and citations from results.

API and automation example

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: process.env.APIFY_TOKEN });
const run = await client.actor('muhammadafzal/google-scholar-scraper').call({
  // Add the same input fields you use in the Apify Console.
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Frequently asked questions

How many results can I scrape with Google Scholar Scraper?

The practical total depends on the source, input limits, pagination, available records, run timeout, and upstream restrictions. Start with a small run, verify the output, and increase the limit gradually.

Can I integrate Google Scholar Scraper with other apps?

Yes. Use Apify integrations, webhooks, schedules, dataset exports, Make, Zapier, Google Sheets, cloud storage, or your own application.

Can I use Google Scholar Scraper with the Apify API?

Yes. Start runs with the Apify REST API or an official Apify client, then retrieve records from the run's default dataset. Keep your API token in a secret or environment variable.
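
For the REST route, a run is started with a POST to the Actor's `runs` endpoint, where the Actor is addressed as `username~actor-name`. The sketch below only assembles the request; `buildRunRequest` is a hypothetical helper, and the input fields are taken from the Input table above.

```javascript
// Hypothetical helper: assembles the URL and fetch options for starting a run.
function buildRunRequest(actorName, input, token) {
  const actorId = actorName.replace('/', '~'); // the REST API uses "~" as separator
  return {
    url: `https://api.apify.com/v2/acts/${actorId}/runs`,
    options: {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json',
        Authorization: `Bearer ${token}`, // keep the token out of source code
      },
      body: JSON.stringify(input),
    },
  };
}

const request = buildRunRequest(
  'muhammadafzal/google-scholar-scraper',
  { searchQueries: ['machine learning'], maxResults: 20 },
  process.env.APIFY_TOKEN,
);
console.log(request.url);
```

Passing `request.url` and `request.options` to `fetch` starts the run. The response describes the run, including the `defaultDatasetId` to read results from once the run has finished.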

Can I use Google Scholar Scraper through an MCP Server?

Yes. The Apify MCP server can expose the Actor to compatible AI clients and agents. Review the input and expected cost before allowing an autonomous workflow to run it at scale.

Do I need proxies?

It depends on the source and volume. Use the default configuration first. For larger or geographically sensitive jobs, select an appropriate proxy configuration only when the Actor supports it.

Is it legal to scrape this data?

Scraping rules vary by source, jurisdiction, data type, and intended use. Collect only data you are authorized to access, respect applicable terms and privacy laws, and avoid restricted or personal data misuse. This documentation is not legal advice.

Your feedback

If a field is missing, a source layout has changed, or you need a supported use case documented, open an issue on the Actor page with a reproducible input and run ID.
