ResearchGate Academic Scraper

Pricing

from $3.00 / 1,000 results

Scrape ResearchGate for academic publications and researcher profiles. Search papers by query, browse researcher profiles, and fetch detailed publication metadata including citations, reads, DOI, and PDF availability.

Rating: 5.0 (7)

Developer: Crawler Bros

Maintained by Community

Actor stats: 7 bookmarked, 1 total user, 0 monthly active users, last modified 4 days ago

Scrape academic publications and researcher profiles from ResearchGate, one of the world's largest academic social networks. No login required.

Features

  • Search Publications — find academic papers by keyword with title, abstract, authors, DOI, citations, and PDF availability
  • Search Researchers — find researcher profiles by name or field with institution, h-index, and research interests
  • By Profile — fetch detailed data for a specific ResearchGate researcher profile
  • By Publication ID — fetch detailed metadata for a single publication by its ID
  • Filter by keyword, minimum citations, and publication year
  • Parses JSON-LD structured data (ScholarlyArticle, Person) embedded in ResearchGate HTML
  • Supplemental extraction from HTML text for reads, citations, h-index, and follower counts
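
To illustrate the JSON-LD step in the list above, here is a minimal sketch of pulling ScholarlyArticle and Person objects out of a page using only the Python standard library. The class and function names and the sample HTML are made up for this example; the actor's actual parser is not published here and may handle more cases (for example nested or list-valued JSON-LD).

```python
import json
from html.parser import HTMLParser


class JsonLdParser(HTMLParser):
    """Collects the decoded contents of <script type="application/ld+json"> tags."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks instead of failing the whole page


def extract_jsonld(html, wanted_types=("ScholarlyArticle", "Person")):
    """Return top-level JSON-LD objects whose @type is one of wanted_types."""
    parser = JsonLdParser()
    parser.feed(html)
    return [
        item for item in parser.items
        if isinstance(item, dict) and item.get("@type") in wanted_types
    ]


# Hypothetical page fragment, not real ResearchGate markup.
SAMPLE_HTML = """<html><head>
<script type="application/ld+json">{"@type": "ScholarlyArticle", "headline": "An Example Paper"}</script>
<script type="text/javascript">var unrelated = 1;</script>
</head><body></body></html>"""

articles = extract_jsonld(SAMPLE_HTML)
```

Fields that are not present in the JSON-LD would still need the supplemental text extraction mentioned in the last bullet.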

Output Fields

Publication Records (recordType: "publication")

| Field | Type | Description |
| --- | --- | --- |
| publicationId | string | ResearchGate publication ID |
| title | string | Publication title |
| abstract | string | Abstract or description (up to 2000 chars) |
| authors | array | Full list of author names |
| primaryAuthor | string | First/primary author |
| journal | string | Journal or conference name |
| publishedDate | string | Publication year or full date |
| doi | string | Digital Object Identifier |
| reads | integer | Number of reads on ResearchGate |
| citations | integer | Number of citations |
| recommendations | integer | ResearchGate recommendations count |
| publicationUrl | string | Full ResearchGate URL |
| pdfAvailable | boolean | Whether a PDF download link was found |
| keywords | array | Keywords from the publication |
| recordType | string | Always "publication" |
| siteName | string | Always "ResearchGate" |
| scrapedAt | string | ISO-8601 scrape timestamp |

Researcher Records (recordType: "researcher")

| Field | Type | Description |
| --- | --- | --- |
| researcherId | string | ResearchGate profile username/ID |
| name | string | Full name |
| username | string | Profile URL slug |
| institution | string | Affiliated institution |
| department | string | Department within institution |
| researchInterests | array | Research interests/expertise areas |
| publicationsCount | integer | Number of publications on ResearchGate |
| citationsCount | integer | Total citation count |
| hIndex | integer | h-index |
| followersCount | integer | Number of followers |
| profileUrl | string | Full ResearchGate profile URL |
| avatarUrl | string | Profile photo URL |
| recordType | string | Always "researcher" |
| siteName | string | Always "ResearchGate" |
| scrapedAt | string | ISO-8601 scrape timestamp |
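
Because both record shapes carry a recordType field, results merged from several runs can be separated again with a few lines of Python. This is a small sketch; the function name and the sample records are invented for illustration.

```python
from collections import defaultdict


def group_by_record_type(items):
    """Split scraped records into lists keyed by their recordType value."""
    groups = defaultdict(list)
    for item in items:
        # Records without the field are kept under "unknown" rather than dropped.
        groups[item.get("recordType", "unknown")].append(item)
    return dict(groups)


# Made-up records using field names from the tables above.
mixed = [
    {"recordType": "publication", "title": "An Example Paper", "citations": 12},
    {"recordType": "researcher", "name": "Alice Smith", "hIndex": 7},
    {"recordType": "publication", "title": "Another Paper", "citations": 0},
]

groups = group_by_record_type(mixed)
```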

Input Options

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| mode | select | searchPublications | searchPublications, searchResearchers, byProfile, byPublicationId |
| query | string | deep learning | Search query for searchPublications or searchResearchers |
| profileUsername | string | | Profile slug for byProfile mode (e.g. Alice-Smith-42) |
| publicationId | string | | Publication ID for byPublicationId mode (e.g. 379417897) |
| containsKeyword | string | | Case-insensitive filter on title/abstract |
| minCitations | integer | | Minimum citation count (0–9,999,999) |
| fromYear | integer | | Only include publications from this year onwards (1900–2030) |
| maxItems | integer | 20 | Maximum records to return (1–1000) |
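
The ranges and mode names above can be checked on the client side before starting a run. The sketch below is one way to do that; validate_input is a hypothetical helper, and the rule that byProfile and byPublicationId need their matching field is an assumption drawn from the descriptions, not documented actor behavior.

```python
MODES = {"searchPublications", "searchResearchers", "byProfile", "byPublicationId"}

# Inclusive ranges copied from the input table.
RANGES = {
    "minCitations": (0, 9_999_999),
    "fromYear": (1900, 2030),
    "maxItems": (1, 1000),
}


def validate_input(run_input):
    """Return a list of problems found in an input dict (empty if none)."""
    errors = []
    mode = run_input.get("mode", "searchPublications")
    if mode not in MODES:
        errors.append(f"unknown mode: {mode!r}")
    # Assumption: each lookup mode needs its own identifier field.
    if mode == "byProfile" and not run_input.get("profileUsername"):
        errors.append("byProfile requires profileUsername")
    if mode == "byPublicationId" and not run_input.get("publicationId"):
        errors.append("byPublicationId requires publicationId")
    for field, (low, high) in RANGES.items():
        value = run_input.get(field)
        if value is not None and not low <= value <= high:
            errors.append(f"{field} must be between {low} and {high}")
    return errors
```

An empty list means the input passed these checks; the actor may still apply its own validation.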

Example Use Cases

Search for papers on deep learning:

{"mode": "searchPublications", "query": "deep learning", "maxItems": 20}

Find highly-cited papers on cancer treatment from 2020+:

{"mode": "searchPublications", "query": "cancer treatment", "minCitations": 100, "fromYear": 2020}

Search for machine learning researchers:

{"mode": "searchResearchers", "query": "machine learning", "maxItems": 20}

Get a specific researcher's profile:

{"mode": "byProfile", "profileUsername": "Yann-LeCun"}

Get a single publication's details:

{"mode": "byPublicationId", "publicationId": "379417897"}
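
The containsKeyword, minCitations, and fromYear filters used in the examples above can also be reproduced locally on records you have already scraped. The following is a sketch under assumptions: the helper names are invented, the sample records are made up, and how the actor itself treats records with a missing citation count or an unparseable date is not documented (here they fail the corresponding filter).

```python
import re


def publication_year(published_date):
    """Return the first four-digit year found in a date string, or None."""
    match = re.search(r"\b(?:19|20)\d{2}\b", published_date or "")
    return int(match.group(0)) if match else None


def matches_filters(record, contains_keyword=None, min_citations=None, from_year=None):
    """Apply keyword, citation, and year filters to one publication record."""
    if contains_keyword:
        text = f"{record.get('title') or ''} {record.get('abstract') or ''}".lower()
        if contains_keyword.lower() not in text:
            return False
    if min_citations is not None and (record.get("citations") or 0) < min_citations:
        return False
    if from_year is not None:
        year = publication_year(record.get("publishedDate"))
        if year is None or year < from_year:
            return False
    return True


# Made-up records for illustration only.
papers = [
    {"title": "Immunotherapy outcomes", "abstract": "A cancer treatment review.",
     "citations": 250, "publishedDate": "2021-03-15"},
    {"title": "Older trial", "abstract": "Cancer treatment cohort.",
     "citations": 900, "publishedDate": "2015"},
    {"title": "Recent note", "abstract": "", "citations": 3, "publishedDate": "2023"},
]

kept = [p["title"] for p in papers if matches_filters(p, min_citations=100, from_year=2020)]
```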

FAQs

Do I need a ResearchGate account? No authentication is required. The actor uses public HTML pages and JSON-LD structured data.

Why might some fields be missing? ResearchGate doesn't always embed all data in machine-readable JSON-LD. Fields like reads, citations, and h-index are extracted from page text using pattern matching and may vary by page.
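
As a rough illustration of that pattern matching, the sketch below pulls read and citation counts out of visible page text. The phrasing it looks for ("1,204 Reads", "37 Citations") is an assumption about how the numbers appear; the actor's real patterns are not documented here.

```python
import re

_COUNT = r"(\d[\d,]*)"  # digits with optional thousands separators

# Assumed label wording; adjust if the page text differs.
PATTERNS = {
    "reads": re.compile(_COUNT + r"\s+Reads?\b", re.IGNORECASE),
    "citations": re.compile(_COUNT + r"\s+Citations?\b", re.IGNORECASE),
}


def extract_counts(text):
    """Return whichever counts could be found; missing ones are simply absent."""
    result = {}
    for field, pattern in PATTERNS.items():
        match = pattern.search(text)
        if match:
            result[field] = int(match.group(1).replace(",", ""))
    return result
```

A field that is absent from the returned dict corresponds to a field that would be missing in the output record.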

Are full paper PDFs downloaded? No. The pdfAvailable field indicates whether ResearchGate shows a download link. The actor does not download or store PDF files.

How reliable is this scraper? ResearchGate uses Cloudflare but serves most pages without JavaScript challenges for standard requests. The actor retries on 429/5xx responses with exponential backoff (10/20/40s delays, 3 attempts).
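
The retry policy described above could look roughly like the sketch below. It assumes three attempts, each followed by its delay (10, 20, then 40 seconds) when the response is a 429 or 5xx; the FAQ does not say exactly how attempts and delays line up, so treat that as an interpretation. The fetch callable and the function names are hypothetical and are injected so the logic can be exercised without network access.

```python
import time

RETRY_DELAYS = (10, 20, 40)  # seconds, as listed in the FAQ


def is_retryable(status):
    """429 (rate limited) and any 5xx status are retried."""
    return status == 429 or 500 <= status < 600


def fetch_with_retry(fetch, delays=RETRY_DELAYS, sleep=time.sleep):
    """Call fetch() until it returns a non-retryable status.

    fetch must return a (status_code, body) tuple.
    """
    last_status = None
    for delay in delays:
        status, body = fetch()
        if not is_retryable(status):
            return status, body
        last_status = status
        sleep(delay)  # wait longer after each failure
    raise RuntimeError(
        f"giving up after {len(delays)} attempts (last status {last_status})"
    )
```

Passing a custom sleep function (for example a list's append method) makes it easy to check which delays were used in a test.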

What is the publicationId? It is the numeric ID in the ResearchGate publication URL. In researchgate.net/publication/379417897_... it is 379417897, the digits before the underscore.
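
If you have publication URLs rather than IDs, a short helper can extract the digits. This is a sketch; the function name and the title suffix in the usage example are made up.

```python
import re


def publication_id_from_url(url):
    """Return the numeric publication ID from a ResearchGate URL, or None."""
    match = re.search(r"researchgate\.net/publication/(\d+)", url)
    return match.group(1) if match else None


# The part after the underscore is a placeholder title, not a real page.
pub_id = publication_id_from_url(
    "https://www.researchgate.net/publication/379417897_Example_Title"
)
```

The ID is returned as a string, matching the type of the publicationId input field.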

What is the profileUsername? It is the slug in the ResearchGate profile URL. In researchgate.net/profile/Alice-Smith-42 it is Alice-Smith-42.
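
The same idea works for profile URLs. The helper below is a hypothetical sketch that takes the path segment following /profile/ and ignores any query string.

```python
from urllib.parse import urlparse


def profile_username_from_url(url):
    """Return the profile slug from a ResearchGate profile URL, or None."""
    segments = [part for part in urlparse(url).path.split("/") if part]
    if len(segments) >= 2 and segments[0] == "profile":
        return segments[1]
    return None


slug = profile_username_from_url("https://www.researchgate.net/profile/Alice-Smith-42")
```

The result can be passed directly as profileUsername in byProfile mode.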