Semantic Scholar Author Profiles Scraper avatar

Semantic Scholar Author Profiles Scraper

Pricing

Pay per event

Go to Apify Store
Semantic Scholar Author Profiles Scraper

Semantic Scholar Author Profiles Scraper

Collect researcher profiles from Semantic Scholar. Extract h-index, citation counts, publication history, affiliations, and external IDs for any academic author. Search by name or author ID. Download structured data as CSV, JSON, or Excel for research evaluation, talent scouting, and grant reviews.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

0

Monthly active users

10 days ago

Last modified

Share

ParseForge Banner

πŸ”¬ Semantic Scholar Author Profiles Scraper

πŸ•’ Last updated: 2026-05-05

Collect academic researcher profiles from Semantic Scholar with h-index, citation counts, affiliations, and publication data - without coding. Perfect for research teams, librarians, and competitive intelligence analysts monitoring researcher trends and building researcher databases.

The Semantic Scholar Author Profiles Scraper collects up to 1,000,000 researcher profiles with h-index, citations, and affiliations - simple to use, no setup required.

✨ What Does It Do

  • πŸ“Š H-Index - Understand researcher impact with the h-index metric
  • πŸ“š Citation Count - Track total citations across a researcher's entire body of work
  • πŸ“ Paper Count - See how many publications each researcher has published
  • 🏒 Affiliations - Discover which universities or institutions researchers are associated with
  • πŸ“– Papers List - Get full publication data including titles, years, venues, and citation counts
  • πŸ”— External IDs - Capture identifiers like ORCID, MAG ID, and other research databases
  • 🌐 Researcher Homepage - Extract links to personal websites and research profiles

πŸ”§ Input

  • Author Search Query - Search for researchers by name (e.g. "Yoshua Bengio"). Use this or Author IDs, not both
  • Author IDs - Direct lookup using Semantic Scholar author IDs for precise targeting
  • Max Items - Free users limited to 100, paid users up to 1,000,000 results
  • Include Papers - Enable to collect each researcher's complete publication list (increases runtime)

Example input:

{
"query": "Yoshua Bengio",
"maxItems": 50,
"includePapers": true
}

πŸ“Š Output

Each researcher profile includes up to 12 data fields. Download as JSON, CSV, or Excel.

πŸ‘€ Researcher NameπŸ”— Profile URLπŸŽ“ Author ID
πŸ“Š H-IndexπŸ“š Citation CountπŸ“ˆ Paper Count
🏒 Affiliations🌐 HomepageπŸ“– Publication List
πŸ”— External IDsπŸ“… Scraped Date⚠️ Error Message

πŸ’Ž Why Choose the Semantic Scholar Author Profiles Scraper?

FeatureOur ActorSimilar Tools
Search researchers by nameβœ”οΈβŒ
Direct lookup by author IDβœ”οΈβŒ
H-index and citation metricsβœ”οΈPartial
Complete affiliation dataβœ”οΈβŒ
Full publication list includedβœ”οΈβŒ
Automatic paginationβœ”οΈβŒ
No authentication setup requiredβœ”οΈβŒ
Free tier supports 100 resultsβœ”οΈβŒ
Paid users up to 1,000,000 resultsβœ”οΈβŒ
Retry logic for rate limitsβœ”οΈβŒ
Export as JSON, CSV, Excelβœ”οΈβœ”οΈ

πŸ“‹ How to Use

No technical skills required. Follow these simple steps:

  1. Sign Up: Create a free account with $5 credit
  2. Find the Tool: Search for "Semantic Scholar Author Profiles Scraper" in the Apify Store and configure your input
  3. Run It: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

🎯 Business Use Cases

  • πŸ“Š Research Librarian - Monitor h-index trends for faculty members to identify top performers for promotion decisions
  • πŸ’Ό Grant Program Manager - Build researcher databases by institution to target high-impact scientists for funding campaigns
  • πŸ”¬ Competitive Intelligence Analyst - Track citation metrics for competitor research teams to benchmark innovation capacity


✨ Why choose this Actor

Capability
🎯Built for the job. Scoped specifically to this data source so you skip the parser engineering entirely.
πŸ”–Structured output. Clean, typed fields ready for analysis, dashboards, or downstream pipelines.
⚑Fast. Optimized request patterns return results in seconds, not minutes.
πŸ”Always fresh. Every run pulls live data, so the dataset reflects the source as of run time.
🌐No infra to manage. Apify handles proxies, retries, scaling, scheduling, and storage.
πŸ›‘οΈReliable. Battle-tested across many runs and edge cases, with graceful error handling.
🚫No code required. Configure in the UI, run from CLI, schedule via cron, or call from any language with the Apify SDK.

πŸ“Š Production-grade structured data without the engineering overhead of building and maintaining your own scraper.


πŸ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
⭐ Semantic Scholar Author Profiles Scraper (this Actor)$5 free credit, then pay-per-useFull source coverageLive per runSource-native filters supported⚑ 2 min
Build your own scraperEngineering hoursFull once builtWhenever you maintain itCustom code🐒 Days to weeks
Paid managed APIs$$$ monthlyVendor-definedLiveVendor-defined⏳ Hours
Third-party data dumpsVariesSubset, often stalePeriodicNoneπŸ•’ Variable

Pick this Actor when you want broad coverage, server-side filtering, and no pipeline maintenance.


πŸš€ How to use

  1. πŸ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. 🌐 Open the Actor. Go to the Semantic Scholar Author Profiles Scraper page on the Apify Store.
  3. 🎯 Set input. Configure the input fields in the form (or paste a JSON), then set maxItems.
  4. πŸš€ Run it. Click Start and let the Actor collect your data.
  5. πŸ“₯ Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.


πŸ’Ό Business use cases

πŸ“Š Data & Analytics

  • Build trend reports and dashboards from live source data
  • Feed BI tools, warehouses, and ML pipelines with structured records
  • Run periodic snapshots to track changes over time
  • Compare segments, regions, or categories with consistent fields

🏒 Operations & Strategy

  • Monitor competitor moves, pricing, and inventory shifts
  • Build internal directories and lookup tools backed by current data
  • Power workflows that depend on fresh source records
  • Cut manual data-gathering time from hours to minutes

🎯 Marketing & Growth

  • Identify market opportunities and trending topics
  • Research target audiences and customer personas at scale
  • Power lead-generation pipelines with verified records
  • Track sentiment, reviews, or social signals over time

πŸ› οΈ Engineering & Product

  • Prototype features that need real-world data without owning a crawler
  • Replace fragile in-house scrapers with a managed Actor
  • Wire datasets into your apps via the Apify API or webhooks
  • Skip the proxy, retry, and parsing maintenance entirely

🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

πŸŽ“ Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

🀝 Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

πŸ§ͺ Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

πŸ€– Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

πŸ’° How much does it cost?

Apify gives you $5 in free monthly credits on the Apify Free plan, enough to test Semantic Scholar Author Profiles Scraper and pull a real sample dataset. For ongoing usage:

  • Starter plan ($49/month) β€” Recommended for individuals running Semantic Scholar Author Profiles Scraper regularly. Includes higher concurrency and larger datasets.
  • Scale plan ($499/month) β€” Recommended for teams running Semantic Scholar Author Profiles Scraper at production scale.

Pay-Per-Event pricing means you only pay for what you actually use. Failed runs are never charged. See the Pricing tab on this Actor's page for exact event prices.

πŸ’‘ Tips for using Semantic Scholar Author Profiles Scraper

  • Start with a small maxItems (3-10) to validate output format before running larger jobs.
  • Use Apify Schedules to run Semantic Scholar Author Profiles Scraper on a recurring basis and keep your dataset fresh.
  • Export via Integrations: Apify connects to Google Sheets, Airbyte, Make, Zapier, and direct webhooks β€” pipe your data anywhere.
  • Monitor with webhooks: trigger downstream workflows the moment a run finishes.
  • Re-run failed items: if any individual records error out, re-run with their inputs only. Failed events are not charged.

Yes. Semantic Scholar Author Profiles Scraper only collects publicly available data. Web scraping public data has been confirmed as legal by US courts (see hiQ Labs v. LinkedIn) and is widely used for research, market analysis, and business intelligence.

However, you are responsible for:

  • Respecting the source website's Terms of Service.
  • Complying with GDPR, CCPA, and other applicable data-protection laws when personal data is involved.
  • Not republishing copyrighted content without permission.

If you have specific compliance concerns, consult your legal team. See the Apify legal docs for more.

❓ Frequently Asked Questions

πŸ”Œ Integrate with any app

Semantic Scholar Author Profiles Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications in your channels
  • Airbyte - Pipe results into your warehouse
  • GitHub - Trigger runs from commits and releases
  • Google Drive - Export datasets straight to Sheets

You can also use webhooks to trigger downstream actions when a run finishes. Push fresh data into your product backend, or alert your team in Slack.


πŸ’‘ More ParseForge Actors

Browse our complete collection of data extraction tools for more.

πŸš€ Ready to Start?

Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.

πŸ†˜ Need Help?

  • Check the FAQ section above for common questions
  • Visit the Apify support page for documentation and tutorials
  • Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form

⚠️ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Semantic Scholar or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.


πŸ’‘ Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.