Pricing

from $10.00 / 1,000 results

Google Scholar Scraper

Scrape Google Scholar search results with titles, authors, citations, abstracts, and PDF links. Also supports author profile mode to extract h-index, i10-index, and publication lists.

Rating: 0.0 (0)

Developer: lulz bot

Last modified: 2 months ago

Features

Search mode: Search papers by keyword with title, authors, journal, year, abstract, citation count, and PDF links
Author profile mode: Fetch an author's complete profile including h-index, i10-index, total citations, and full publication list
Year range filtering (yearFrom / yearTo)
Sort by relevance or date
Automatic pagination (10 results per page, up to 1000)
CAPTCHA detection with automatic retry (30–60s wait, up to 3 attempts)
Residential proxy support (required — Google blocks datacenter IPs)
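The CAPTCHA retry behaviour listed above can be sketched roughly as follows. This is not the actor's actual code: `CaptchaDetected`, `fetch_with_retry`, and the `fetch_page` callback are hypothetical names, and the sketch assumes that "up to 3 attempts" counts the first request.

```python
import random
import time
from typing import Callable, Optional


class CaptchaDetected(Exception):
    """Hypothetical signal raised by fetch_page when a CAPTCHA page is returned."""


def fetch_with_retry(
    fetch_page: Callable[[], str],
    max_attempts: int = 3,
    wait_range: tuple = (30.0, 60.0),
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call fetch_page, waiting a random 30-60 s after each CAPTCHA.

    Returns the page HTML, or None if every attempt hit a CAPTCHA
    (the caller is then expected to skip that page).
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fetch_page()
        except CaptchaDetected:
            if attempt == max_attempts:
                break  # no point waiting after the final attempt
            sleep(random.uniform(*wait_range))
    return None
```

Passing `sleep` as a parameter keeps the waiting logic easy to exercise without actually pausing for a minute.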

Input

Field	Type	Default	Description
`mode`	select	`search`	`search` or `authorProfile`
`query`	string	`machine learning`	Search keywords (search mode)
`authorId`	string	—	Google Scholar user ID (author profile mode)
`yearFrom`	integer	—	Filter papers from this year
`yearTo`	integer	—	Filter papers up to this year
`sortBy`	select	`relevance`	`relevance` or `date`
`limit`	integer	`10`	Max results (1–1000)
`proxyConfiguration`	object	Apify Residential	Proxy settings
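Before starting a run, an input object can be checked against the table above. The sketch below is a hypothetical client-side helper, not part of the actor. The defaults and the 1–1000 range come from the table, while the `authorId` requirement and the year-order check are assumptions.

```python
from typing import Any

# Defaults copied from the input table; fields with no default are left out.
DEFAULTS = {
    "mode": "search",
    "query": "machine learning",
    "sortBy": "relevance",
    "limit": 10,
}


def find_input_problems(run_input: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the input looks valid."""
    cfg = {**DEFAULTS, **run_input}
    problems = []
    if cfg["mode"] not in ("search", "authorProfile"):
        problems.append("mode must be 'search' or 'authorProfile'")
    if cfg["sortBy"] not in ("relevance", "date"):
        problems.append("sortBy must be 'relevance' or 'date'")
    if not isinstance(cfg["limit"], int) or not 1 <= cfg["limit"] <= 1000:
        problems.append("limit must be an integer from 1 to 1000")
    # Assumption: author profile mode cannot run without an authorId.
    if cfg["mode"] == "authorProfile" and not cfg.get("authorId"):
        problems.append("authorId is required in authorProfile mode")
    # Assumption: an inverted year range is treated as a mistake.
    year_from, year_to = cfg.get("yearFrom"), cfg.get("yearTo")
    if year_from is not None and year_to is not None and year_from > year_to:
        problems.append("yearFrom must not be greater than yearTo")
    return problems
```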

Finding an Author ID

Go to any Google Scholar author profile page. The URL will look like:

https://scholar.google.com/citations?user=XXXXXXXXXXX&hl=en

Copy the value of the user parameter, meaning the text between user= and the next & (XXXXXXXXXXX in the URL above). That is the author ID.
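If you have many profile URLs, the ID can also be pulled out with Python's standard library. A minimal sketch (the function name `extract_author_id` is made up for this example):

```python
from urllib.parse import parse_qs, urlparse


def extract_author_id(profile_url: str) -> str:
    """Return the value of the `user` query parameter of a Scholar profile URL."""
    params = parse_qs(urlparse(profile_url).query)
    values = params.get("user")
    if not values:
        raise ValueError(f"no 'user' parameter found in {profile_url!r}")
    return values[0]


print(extract_author_id(
    "https://scholar.google.com/citations?user=JicYPdAAAAAJ&hl=en"
))
```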

Output

Search Mode

{
  "title": "Attention Is All You Need",
  "url": "https://arxiv.org/abs/1706.03762",
  "authors": ["A Vaswani", "N Shazeer", "N Parmar"],
  "source": "Advances in neural information processing systems",
  "year": 2017,
  "abstract": "The dominant sequence transduction models are based on complex recurrent...",
  "citationCount": 98432,
  "pdfUrl": "https://arxiv.org/pdf/1706.03762",
  "allVersionsUrl": "https://scholar.google.com/scholar?cluster=...",
  "relatedArticlesUrl": "https://scholar.google.com/scholar?q=related:...",
  "scholarUrl": "https://arxiv.org/abs/1706.03762",
  "searchQuery": "transformer attention mechanism",
  "scrapedAt": "2024-01-15T10:23:45.123Z"
}

Author Profile Mode

{
  "name": "Geoffrey Hinton",
  "affiliation": "University of Toronto",
  "hIndex": 155,
  "i10Index": 322,
  "totalCitations": 783210,
  "publications": [
    {
      "title": "Deep learning",
      "url": "https://scholar.google.com/citations?view_op=view_citation&...",
      "authorsSource": "Y LeCun, Y Bengio, G Hinton - nature, 2015",
      "year": 2015,
      "citationCount": 62450
    }
  ],
  "scholarUrl": "https://scholar.google.com/citations?user=JicYPdAAAAAJ&hl=en",
  "scrapedAt": "2024-01-15T10:23:45.123Z"
}
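The hIndex and i10Index fields can be cross-checked against the publications list using the standard definitions: the h-index is the largest h such that h papers have at least h citations each, and the i10-index is the number of papers with at least 10 citations. The sketch below assumes the publications list is complete. If it was truncated by limit, the recomputed values may be lower than the reported ones.

```python
def h_index(citation_counts: list[int]) -> int:
    """Largest h such that h papers have at least h citations each."""
    h = 0
    for rank, count in enumerate(sorted(citation_counts, reverse=True), start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def i10_index(citation_counts: list[int]) -> int:
    """Number of papers with at least 10 citations."""
    return sum(1 for count in citation_counts if count >= 10)


# `profile` stands in for one author-profile item from the dataset.
profile = {"publications": [{"title": "Deep learning", "citationCount": 62450}]}
counts = [pub["citationCount"] for pub in profile["publications"]]
print(h_index(counts), i10_index(counts))
```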

Use Cases

Literature reviews: Collect papers on a topic with citation counts to identify seminal works
Bibliometric analysis: Track citation trends, prolific authors, top journals
Research monitoring: Watch for new papers on keywords you care about
Academic recruiting: Find researchers by h-index and publication record
Competitive intelligence: Monitor competitor research output and emerging tech areas
Grant writing: Find citation stats and related work for proposals

Example Inputs

Search for recent AI safety papers

{
  "mode": "search",
  "query": "AI alignment safety",
  "yearFrom": 2022,
  "sortBy": "date",
  "limit": 50
}

Get Geoffrey Hinton's author profile

{
  "mode": "authorProfile",
  "authorId": "JicYPdAAAAAJ",
  "limit": 100
}

Most relevant NLP papers across all years

{
  "mode": "search",
  "query": "natural language processing BERT transformer",
  "sortBy": "relevance",
  "limit": 100
}
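The example inputs above can also be submitted from code. A minimal sketch, assuming the `apify-client` Python package. The token and actor ID are placeholders, because this page does not state the actor's ID, and `run_scraper` is a hypothetical helper name.

```python
from typing import Any


def run_scraper(client: Any, actor_id: str, run_input: dict) -> list[dict]:
    """Start the actor, wait for it to finish, and return its dataset items.

    `client` is expected to behave like apify_client.ApifyClient.
    """
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (requires `pip install apify-client` and an Apify API token):
#
#   from apify_client import ApifyClient
#   client = ApifyClient("<YOUR_APIFY_TOKEN>")
#   papers = run_scraper(
#       client,
#       "<username>/<actor-name>",  # placeholder: copy the ID from the actor's page
#       {"mode": "search", "query": "AI alignment safety", "limit": 50},
#   )
```

Taking the client as an argument keeps the helper independent of how the token is stored and makes it easy to try out with a stub.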

Pricing

This actor uses Pay Per Event (PPE) pricing — $0.005 per result scraped.

100 papers = $0.50
500 papers = $2.50
1,000 papers = $5.00
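For other volumes, the same arithmetic can be scripted. The sketch below uses only the $0.005 per-result rate stated in this section and ignores any other platform charges. `estimate_cost` is a made-up helper name.

```python
from decimal import Decimal

PRICE_PER_RESULT = Decimal("0.005")  # USD per result, as stated in this section


def estimate_cost(results: int) -> Decimal:
    """Estimated charge in USD for a given number of scraped results."""
    if results < 0:
        raise ValueError("results must not be negative")
    return PRICE_PER_RESULT * results


for n in (100, 500, 1000):
    print(f"{n} papers = ${estimate_cost(n):.2f}")
```

`Decimal` is used instead of `float` so that the cents come out exact.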

Notes

Google Scholar enforces rate limits. The scraper adds 2–6 second delays between pages to stay within limits.
If a CAPTCHA is detected, the scraper waits 30–60 seconds and retries up to 3 times before skipping.
Residential proxies are mandatory. The default configuration uses Apify's residential proxy pool.
Google Scholar shows a maximum of ~1,000 results per search query through normal pagination, which is why limit is capped at 1000. For larger datasets, split your query into multiple narrower searches.
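One way to split a broad query into narrower searches is to run it once per publication year, using the yearFrom and yearTo fields. This is a sketch of that idea, not a built-in feature of the actor. `split_by_year` is a hypothetical helper.

```python
def split_by_year(base_input: dict, first_year: int, last_year: int) -> list[dict]:
    """Return one input object per year, each restricted to that single year."""
    return [
        {**base_input, "yearFrom": year, "yearTo": year}
        for year in range(first_year, last_year + 1)
    ]


inputs = split_by_year(
    {"mode": "search", "query": "AI alignment safety", "limit": 100},
    2022,
    2024,
)
for run_input in inputs:
    print(run_input)
```

Each resulting object can be submitted as a separate run. The same paper can appear in more than one result set, so deduplicating on a field such as url before merging is advisable.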
