Google Scholar Scraper — Academic Papers & Citations avatar

Google Scholar Scraper — Academic Papers & Citations

Pricing

from $5.00 / 1,000 results

Go to Apify Store
Google Scholar Scraper — Academic Papers & Citations

Google Scholar Scraper — Academic Papers & Citations

Extract academic paper titles, authors, abstracts, citation counts, publication details, and PDF links from Google Scholar. Fast, reliable, no browser overhead. Search by keyword, topic, or author name. MCP-optimized for AI agents.

Pricing

from $5.00 / 1,000 results

Rating

0.0

(0)

Developer

Muhammad Afzal

Muhammad Afzal

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

2 days ago

Last modified

Categories

Share

Google Scholar Scraper — Extract Academic Papers, Citations & Research Data

A powerful Google Scholar scraper that extracts academic paper metadata including titles, authors, abstracts, citation counts, publication details, and PDF links. Search by keyword, topic, or author name. Ideal for literature reviews, bibliometric analysis, and research data collection.

Features

  • Lightning fast results — 50 papers scraped in ~10 seconds
  • Rich academic metadata — titles, authors, abstracts, citation counts, publication venues, PDF links, publication years
  • Citation tracking — extract citation counts for impact analysis and research benchmarking
  • Year filtering — narrow results by publication year range to focus on recent or historical research
  • Multi-query support — search multiple keywords or topics in a single run
  • Author search — find papers by specific researchers (e.g., "Geoffrey Hinton", "Yann LeCun")
  • Automatic pagination — fetches up to 500 results per query with intelligent page handling
  • Structured JSON output — clean, well-formatted data ready for analysis, databases, or AI pipelines

Use Cases

  • Literature reviews — collect papers systematically for academic research and systematic reviews
  • Bibliometric analysis — measure research impact, track citation trends, analyze collaboration networks
  • Competitor intelligence — monitor competitor research output and publication patterns
  • Grant writing — find related work, citation context, and research gaps for proposals
  • AI & machine learning — feed structured academic data into LLMs for summarization, classification, or knowledge graphs
  • Content creation — generate research-backed articles, newsletters, and educational materials

Input

FieldTypeDefaultDescription
searchQueriesstring[]["machine learning"]Keywords or topics to search on Google Scholar
authorUrlsstring[][]Author names to search (e.g., "Geoffrey Hinton")
maxResultsinteger50Max papers per query (1–500)
yearLowinteger2000Minimum publication year for filtering
yearHighinteger2026Maximum publication year for filtering
sortBystring"relevance"Sort by "relevance" or "date" (newest first)
articlesOnlybooleantrueExclude patents and non-article results

Output

Each paper record includes 13 fields of structured metadata:

FieldTypeDescription
titlestringFull academic paper title
authorsstring[]Author names parsed from publication metadata
publicationInfostringJournal, venue, year, and publisher details
abstractstringPaper abstract or snippet from Google Scholar
citationCountintegerNumber of citations (from Google Scholar)
paperUrlstringDirect link to the paper or landing page
pdfUrlstring|nullDirect PDF download link when available
sourceTypestringSource type: HTML, PDF, or BOOK
yearintegerPublication year extracted from metadata
citationsUrlstring|nullLink to papers citing this paper
relatedUrlstring|nullLink to related articles on Google Scholar
scrapedAtstringISO 8601 timestamp of when data was scraped
searchQuerystringThe original search query that produced this result

Example Usage

Search by Topic

{
"searchQueries": ["deep learning cancer detection", "transformer architecture"],
"maxResults": 100,
"yearLow": 2020,
"yearHigh": 2025,
"sortBy": "relevance"
}

Search by Author Name

{
"authorUrls": ["Geoffrey Hinton", "Yann LeCun"],
"maxResults": 50,
"yearLow": 2015
}

Quick Test Run

{
"searchQueries": ["reinforcement learning"],
"maxResults": 10
}

Pricing

This actor uses a pay-per-result pricing model at $0.005 per paper scraped.

ResultsCost
10 papers$0.05
50 papers$0.25
100 papers$0.50
500 papers$2.50

Fast, reliable, and cost-effective academic data extraction. No additional infrastructure or API keys required.

Example Output

{
"title": "Deep learning",
"authors": ["Y LeCun", "Y Bengio", "G Hinton"],
"publicationInfo": "Nature, 2015 - nature.com",
"abstract": "Deep learning allows computational models that are composed of multiple processing layers...",
"citationCount": 86734,
"paperUrl": "https://www.nature.com/articles/nature14539",
"pdfUrl": null,
"sourceType": "HTML",
"year": 2015,
"citationsUrl": null,
"relatedUrl": null,
"scrapedAt": "2026-05-03T08:53:26.141Z",
"searchQuery": "deep learning"
}

Why Use This Google Scholar Scraper?

  • No setup required — works out of the box with zero configuration
  • No browser or proxy needed — pure API-based extraction is faster and more reliable
  • Consistent structured data — every record follows the same schema for easy processing
  • Built for scale — handle hundreds of queries with automatic rate limiting and retries
  • AI-ready output — clean JSON format perfect for feeding into LLMs, RAG pipelines, or data warehouses