Pricing

Pay per event

Semantic Scholar Author Profiles Scraper

Collect researcher profiles from Semantic Scholar. Extract h-index, citation counts, publication history, affiliations, and external IDs for any academic author. Search by name or author ID. Download structured data as CSV, JSON, or Excel for research evaluation, talent scouting, and grant reviews.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

🔬 Semantic Scholar Author Profiles Scraper

🕒 Last updated: 2026-05-05

Collect academic researcher profiles from Semantic Scholar with h-index, citation counts, affiliations, and publication data - without coding. Perfect for research teams, librarians, and competitive intelligence analysts monitoring researcher trends and building researcher databases.

The Semantic Scholar Author Profiles Scraper collects up to 1,000,000 researcher profiles with h-index, citations, and affiliations - simple to use, no setup required.

✨ What Does It Do

📊 H-Index - Understand researcher impact with the h-index metric
📚 Citation Count - Track total citations across a researcher's entire body of work
📝 Paper Count - See how many publications each researcher has published
🏢 Affiliations - Discover which universities or institutions researchers are associated with
📖 Papers List - Get full publication data including titles, years, venues, and citation counts
🔗 External IDs - Capture identifiers like ORCID, MAG ID, and other research databases
🌐 Researcher Homepage - Extract links to personal websites and research profiles

🔧 Input

Author Search Query - Search for researchers by name (e.g. "Yoshua Bengio"). Use this or Author IDs, not both
Author IDs - Direct lookup using Semantic Scholar author IDs for precise targeting
Max Items - Free users limited to 100, paid users up to 1,000,000 results
Include Papers - Enable to collect each researcher's complete publication list (increases runtime)

Example input:

{
  "query": "Yoshua Bengio",
  "maxItems": 50,
  "includePapers": true
}

📊 Output

Each researcher profile includes up to 12 data fields. Download as JSON, CSV, or Excel.

👤 Researcher Name	🔗 Profile URL	🎓 Author ID
📊 H-Index	📚 Citation Count	📈 Paper Count
🏢 Affiliations	🌐 Homepage	📖 Publication List
🔗 External IDs	📅 Scraped Date	⚠️ Error Message

💎 Why Choose the Semantic Scholar Author Profiles Scraper?

Feature	Our Actor	Similar Tools
Search researchers by name	✔️	❌
Direct lookup by author ID	✔️	❌
H-index and citation metrics	✔️	Partial
Complete affiliation data	✔️	❌
Full publication list included	✔️	❌
Automatic pagination	✔️	❌
No authentication setup required	✔️	❌
Free tier supports 100 results	✔️	❌
Paid users up to 1,000,000 results	✔️	❌
Retry logic for rate limits	✔️	❌
Export as JSON, CSV, Excel	✔️	✔️

📋 How to Use

No technical skills required. Follow these simple steps:

Sign Up: Create a free account with $5 credit
Find the Tool: Search for "Semantic Scholar Author Profiles Scraper" in the Apify Store and configure your input
Run It: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

🎯 Business Use Cases

📊 Research Librarian - Monitor h-index trends for faculty members to identify top performers for promotion decisions
💼 Grant Program Manager - Build researcher databases by institution to target high-impact scientists for funding campaigns
🔬 Competitive Intelligence Analyst - Track citation metrics for competitor research teams to benchmark innovation capacity

✨ Why choose this Actor

	Capability
🎯	Built for the job. Scoped specifically to this data source so you skip the parser engineering entirely.
🔖	Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
⚡	Fast. Optimized request patterns return results in seconds, not minutes.
🔁	Always fresh. Every run pulls live data, so the dataset reflects the source as of run time.
🌐	No infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
🛡️	Reliable. Battle-tested across many runs and edge cases, with graceful error handling.
🚫	No code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

📊 Production-grade structured data without the engineering overhead of building and maintaining your own scraper.

📈 How it compares to alternatives

Approach	Cost	Coverage	Refresh	Filters	Setup
⭐ Semantic Scholar Author Profiles Scraper (this Actor)	$5 free credit, then pay-per-use	Full source coverage	Live per run	Source-native filters supported	⚡ 2 min
Build your own scraper	Engineering hours	Full once built	Whenever you maintain it	Custom code	🐢 Days to weeks
Paid managed APIs	$$$ monthly	Vendor-defined	Live	Vendor-defined	⏳ Hours
Third-party data dumps	Varies	Subset, often stale	Periodic	None	🕒 Variable

Pick this Actor when you want broad coverage, server-side filtering, and no pipeline maintenance.

🚀 How to use

📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
🌐 Open the Actor. Go to the Semantic Scholar Author Profiles Scraper page on the Apify Store.
🎯 Set input. Configure the input fields in the form (or paste a JSON), then set maxItems.
🚀 Run it. Click Start and let the Actor collect your data.
📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.

💼 Business use cases

📊 Data & Analytics

Build trend reports and dashboards from live source data
Feed BI tools, warehouses, and ML pipelines with structured records
Run periodic snapshots to track changes over time
Compare segments, regions, or categories with consistent fields

🏢 Operations & Strategy

Monitor competitor moves, pricing, and inventory shifts
Build internal directories and lookup tools backed by current data
Power workflows that depend on fresh source records
Cut manual data-gathering time from hours to minutes

🎯 Marketing & Growth

Identify market opportunities and trending topics
Research target audiences and customer personas at scale
Power lead-generation pipelines with verified records
Track sentiment, reviews, or social signals over time

🛠️ Engineering & Product

Prototype features that need real-world data without owning a crawler
Replace fragile in-house scrapers with a managed Actor
Wire datasets into your apps via the Apify API or webhooks
Skip the proxy, retry, and parsing maintenance entirely

🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Empirical datasets for papers, thesis work, and coursework
Longitudinal studies tracking changes across snapshots
Reproducible research with cited, versioned data pulls
Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

Side projects, portfolio demos, and indie app launches
Data visualizations, dashboards, and infographics
Content research for bloggers, YouTubers, and podcasters
Hobbyist collections and personal trackers

🤝 Non-profit and civic

Transparency reporting and accountability projects
Advocacy campaigns backed by public-interest data
Community-run databases for local issues
Investigative journalism on public records

🧪 Experimentation

Prototype AI and machine-learning pipelines with real data
Validate product-market hypotheses before engineering spend
Train small domain-specific models on niche corpora
Test dashboard concepts with live input

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions

🔌 Integrate with any app

Semantic Scholar Author Profiles Scraper connects to any cloud service via Apify integrations:

Make - Automate multi-step workflows
Zapier - Connect with 5,000+ apps
Slack - Get run notifications in your channels
Airbyte - Pipe results into your warehouse
GitHub - Trigger runs from commits and releases
Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh data into your product backend, or alert your team in Slack.

💡 More ParseForge Actors

Crunchbase Scraper - Collect company profiles, founders, and funding data
NY Business Entity Scraper - Extract New York business registration and filing information
SEC 13F Holdings Scraper - Scrape institutional investment holdings from SEC filings
Trade Me Property Scraper - Collect real estate listing data from New Zealand

Browse our complete collection of data extraction tools for more.

🚀 Ready to Start?

Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.

🆘 Need Help?

Check the FAQ section above for common questions
Visit the Apify support page for documentation and tutorials
Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form

⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Semantic Scholar or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.

🔗 Recommended Actors

🔍 Google Search Scraper - Multi-engine SERP results with country and language targeting
🗺️ Nominatim OSM Scraper - Geocode addresses via OpenStreetMap
📊 Indexmundi Scraper - Global demographic and economic indicators
📰 RAG Web Browser - Crawl and extract clean text from any URL for AI retrieval
🌐 Website Content Crawler - Crawl entire sites and export structured content

💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.

Semantic Scholar Scraper

openclawmara/semantic-scholar-scraper

Scrape Semantic Scholar for academic papers, citations, abstracts, and author profiles. Search by topic, author, or venue. Extract citation graphs, reference lists, and research trends. Essential for literature reviews, academic research, and AI/ML paper discovery.

OpenClaw Mara

Semantic Scholar Search Scraper

powerai/semantic-scholar-search-scraper

Scrape academic papers from Semantic Scholar by keyword search, with automatic pagination and comprehensive research data extraction.

PowerAI

Semantic Scholar Scraper

crawlerbros/semanticscholar-scraper

Scrape Semantic Scholar with 200M+ academic papers and authors with full citation graph. Search, fetch by paper/author ID, get citations / references / recommendations, with abstracts, TLDRs, fields-of-study, open-access PDFs, h-index, affiliations, and more

Crawler Bros

Google Scholar Scraper

solidcode/google-scholar-scraper

[💰 $2.0 / 1K] Extract academic papers, author profiles, h-index, i10-index, citation counts, abstracts, and PDF links from Google Scholar. Batch search queries and author IDs, filter by year range, sort by relevance or date.

SolidCode

Semantic Scholar Scraper - Cheap 📚🔎🤖

scrapestorm/semantic-scholar-scraper---cheap

🔎 Easily collect research papers from Semantic Scholar Provide one or multiple search keywords, paper URLs or author profiles and extract structured academic data such as 📄 Paper Title👨‍🔬 Authors 📅 Publication Year 🔗 Paper URL & more Perfect for academic research & AI research monitoring 📚

Storm_Scraper

5.0

Semantic Scholar Paper Scraper

agenscrape/semantic-scholar-paper-scraper

Scrape academic papers from Semantic Scholar. Search by keyword and extract paper titles, abstracts, authors, citation counts, publication dates, DOIs, open access PDFs... Perfect for literature reviews, citation analysis, and research databases. Real time data output with pagination support.

Agenscrape

Semantic Scholar Paper Search

ryanclinton/semantic-scholar-search

Search and extract academic research papers from Semantic Scholar's database of over 200 million publications.

Ryan Clinton

Semantic Scholar Scraper

parseforge/semantic-scholar-scraper

Extract detailed academic paper data from Semantic Scholar, including abstracts, citations, authors, and publication details. Ideal for researchers, academics, and analysts who need structured scholarly data for literature reviews, research workflows, and large-scale academic analysis.

ParseForge

1.1

Google Scholar Profiles Scraper

fetch_cat/google-scholar-profiles-scraper

Extract Google Scholar author profiles, citation metrics, interests, and publications from public profile URLs or user IDs.

Hanna Nosova

Semantic Scholar Scraper

fortuitous_pirate/semantic-scholar-scraper

Search 200M+ academic papers from Semantic Scholar: titles, abstracts, authors, citations, open-access PDFs, and fields of study. Filter by year, venue, or citation count. Free API.