Pricing

from $1.00 / 1,000 results

Try for free

Go to Apify Store

Google Scholar Scraper

Try for free

Scrape academic papers, articles, and citations from Google Scholar. Search by keywords with filters for year range, document type, sort order, and article type. Extract titles, authors, citations, links, and more.

Pricing

from $1.00 / 1,000 results

Rating

5.0

(1)

Developer

Crawler Bros

Actor stats

Bookmarked

Total users

Monthly active users

4 months ago

Last modified

What can this scraper do?

Search by keywords — Enter any research topic and get structured data for each result
Filter by year range — Limit results to specific publication years
Sort by relevance or date — Choose how results are ordered
Filter by document type — Get only PDF or HTML documents
Filter by article type — Search for review articles specifically
Extract citation data — Get citation counts with links to citing articles
Pagination support — Automatically fetch multiple pages of results

Input

Field	Type	Required	Default	Description
Search Queries	string[]	Yes	—	Keywords to search on Google Scholar
Max Results	integer	No	100	Maximum articles per query (1–1,000)
Sort By	enum	No	Relevance	Sort by relevance or publication date
Document Format	enum	No	All	Filter: all formats, PDF only, or HTML only
Article Type	enum	No	Any	Filter: all types or review articles only
Published After	integer	No	—	Only articles from this year onward
Published Before	integer	No	—	Only articles up to this year
Proxy Configuration	object	No	—	Proxy settings (often not needed)

Example input

{
    "queries": ["machine learning", "deep learning"],
    "maxItems": 50,
    "sortBy": "relevance",
    "newerThan": 2020
}

{
    "queries": ["cancer treatment review"],
    "maxItems": 30,
    "articleType": "review",
    "filter": "pdfOnly"
}

Output

Each row in the dataset represents one academic article or paper found in search results.

Output fields

Field	Type	Example
`title`	string	`"Deep Learning"`
`link`	string	`"https://link.springer.com/article/..."`
`documentLink`	string	`"https://example.com/paper.pdf"`
`documentType`	string	`"PDF"`, `"HTML"`, or empty
`authors`	string	`"Y LeCun, Y Bengio, G Hinton"`
`publication`	string	`"Nature"`
`year`	integer	`2015`
`source`	string	`"springer.com"`
`fullAttribution`	string	`"Y LeCun, Y Bengio, G Hinton - Nature, 2015 - springer.com"`
`searchMatch`	string	Snippet or excerpt from the article
`citations`	integer	`65432`
`citationsLink`	string	Link to view all citing articles
`relatedArticlesLink`	string	Link to related articles on Scholar
`versions`	integer	`12`
`versionsLink`	string	Link to all versions of this article
`type`	string	`"ARTICLE"` or `"CITATION"`
`resultIndex`	integer	`0` (position in results)
`searchQuery`	string	`"deep learning"`
`scrapeTimestamp`	string	`"2026-03-09T12:00:00+00:00"`

Sample output

{
    "title": "Deep Learning",
    "link": "https://www.nature.com/articles/nature14539",
    "documentLink": "https://creativecoding.soe.ucsc.edu/courses/cs523/slides/week3/DeepLearning_LeCun.pdf",
    "documentType": "PDF",
    "authors": "Y LeCun, Y Bengio, G Hinton",
    "publication": "Nature",
    "year": 2015,
    "source": "nature.com",
    "fullAttribution": "Y LeCun, Y Bengio, G Hinton - Nature, 2015 - nature.com",
    "searchMatch": "Deep learning allows computational models composed of multiple processing layers to learn representations of data...",
    "citations": 65432,
    "citationsLink": "https://scholar.google.com/scholar?cites=...",
    "relatedArticlesLink": "https://scholar.google.com/scholar?q=related:...",
    "versions": 12,
    "versionsLink": "https://scholar.google.com/scholar?cluster=...",
    "type": "ARTICLE",
    "resultIndex": 0,
    "searchQuery": "deep learning",
    "scrapeTimestamp": "2026-03-09T12:00:00+00:00"
}

FAQs

Do I need a Google Scholar account?

No. Google Scholar is publicly accessible and the scraper works without any authentication.

Do I need a proxy?

Often not. Google Scholar is more accessible than regular Google Search from datacenter IPs. Try running without a proxy first. If you get blocked (CAPTCHA), enable Apify proxy.

How many results can I get?

Up to 1,000 results per search query. Google Scholar shows 10 results per page, and the scraper automatically paginates through multiple pages.

Can I filter by publication year?

Yes. Use the Published After and Published Before fields to limit results to a specific year range. For example, set "Published After" to 2020 to get only recent articles.

What is the difference between "link" and "documentLink"?

link is the main article URL (journal page, abstract, etc.)
documentLink is a direct link to the document file (PDF or HTML) when available

What does the "citations" field contain?

The number of times this article has been cited by other papers, as reported by Google Scholar. The citationsLink field provides a direct link to see all citing articles.

Can I search for review articles only?

Yes. Set the Article Type to "Review articles only" to filter results to review papers.

What is the "type" field?

Results are either "ARTICLE" (full papers with links) or "CITATION" (references without direct links, typically older works only available as citations).

Limitations

Google Scholar may show CAPTCHA for high-volume requests from datacenter IPs — use proxy if this happens
Maximum 1,000 results per query (Google Scholar pagination limit)
Year filters are not effective when sorting by date
Citation counts and version numbers are as reported by Google Scholar and may not be perfectly up-to-date
The scraper extracts publicly visible search results only

Google Scholar Scraper - Academic Papers & Citations

klondikeking/google-scholar-scraper-v2

Extract academic papers, citations, authors, and PDF links from Google Scholar.

Pierrick McD0nald

Google Scholar Article Scraper

agenscrape/google-scholar-article-scraper

Extract academic articles, citations, authors, and publication data from Google Scholar. Perfect for research analysis and literature reviews with fast, reliable scraping.

Agenscrape

Free Google Scholar Scraper — Papers + Citations

s-r/free-google-scholar-scraper

Google Scholar Scraper - Academic Papers Search

gio21/google-scholar-scraper

Search Google Scholar for academic papers. Get title, authors, year, publication, snippet, cited-by count, PDF links. Filter by year range, language.

Gio

Semantic Scholar Scraper - Papers, Authors, Citations

gio21/semantic-scholar-scraper

Search and fetch academic papers, authors, citations, and references via the Semantic Scholar Graph API.

Gio

Google Scholar Scraper

automation-lab/google-scholar-scraper

Search Google Scholar and extract academic papers. Get titles, authors, citation counts, abstracts, PDF links, and publication details. Supports year filtering.

Stas Persiianenko

Google Scholar Scraper

kawsar/google-scholar-scraper

Google Scholar scraper that collects paper titles, authors, citations, and PDF links from search results, so you get structured academic data without the manual work.

Kawsar

Google Scholar Scraper: Articles, Citations & PDFs

primeparse/google-scholar-scraper

Extract academic data from Google Scholar: titles, authors, years, citations, abstracts, PDF links. Supports queries, year filters (1900-2100), pagination (up to 5 pages). Rate-limited for safety. Ideal for research, citations, datasets, AI. Clean JSON output. Run on Apify with proxies.

PrimeParse

Google Scholar Scraper

marco.gullo/google-scholar-scraper

Scrape publication details from scholar.google.com. Add your query, time range, and optionally document type (PDF or HTML only). Extract information about articles such as titles, authors, links, related articles, and more.

Marco Gullo

1.9K

5.0

🎓 Google Scholar Scraper — Papers & Citations

nexgendata/google-scholar-scraper

Scrape Google Scholar for papers, citations, authors & h-index data. Semantic Scholar, Scopus & Web of Science alternative for literature reviews, citation analysis, author clustering and research analytics. Pay per paper.