Pricing

from $7.50 / 1,000 results

Harvard Dataverse Datasets Scraper

Search the Harvard Dataverse repository for open research datasets and dataverse collections. Filter by record type and free text query to pull global IDs, persistent URLs, titles, authors, descriptions, and publication dates. Useful for social science research and reproducibility audits.

Pricing

from $7.50 / 1,000 results

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

an hour ago

Last modified

🔬 Harvard Dataverse Datasets Scraper

🚀 Export Harvard Dataverse data in seconds. Structured records ready for spreadsheet, database, and BI tooling.

The Harvard Dataverse Datasets Scraper turns the public Harvard Dataverse endpoint into a clean, structured dataset. It calls the upstream API, parses the response, and flattens each record into one row, ready for downstream tooling.

🎯 Target Audience	💡 Primary Use Cases
📊 Analysts and researchers	Build clean datasets from public Harvard Dataverse data.
🏢 Operations teams	Monitor key indicators on a schedule.
🤖 ML engineers	Construct training and evaluation sets.
📰 Journalists	Verify figures and pull sources with one click.
👩‍💻 Developers	Mirror upstream data into your warehouse without writing client code.

📋 What the Harvard Dataverse Datasets Scraper does

Calls the public Harvard Dataverse endpoint with the filters you supply.
Parses the response and normalizes each record.
Casts numeric fields where possible so the data drops cleanly into BI tools.
Surfaces upstream errors and rate limits as a single record with the error field set.
Exports to any downstream format supported by Apify.

💡 Why it matters. Public data is great, but raw API responses rarely map cleanly to a spreadsheet. This actor handles the plumbing so you can focus on analysis.

📊 Data fields

Each record includes: authors, citation, dataverse, description, global_id, name, published_at, results, scrapedAt, subjects, type, url. These field names come straight from the actor's dataset schema, so what you see here is what lands in your dataset.

⚠️ Good to Know. This actor talks to the public Harvard Dataverse endpoint. Upstream rate limits and availability apply.

🚀 How to use

Click Try for free.
Adjust the filters or leave defaults.
Click Start. Within seconds, your dataset is ready.

🔗 Recommended Actors

Actor	What it does
ParseForge Alpha Vantage Scraper	Daily, intraday, FX, crypto, and indicators.
ParseForge OurAirports Scraper	Global airport database.
ParseForge NBA Stats Scraper	Player and team stats.
ParseForge CurseForge Mods Scraper	Public mod metadata.

💡 Pro Tip. Browse the complete ParseForge collection for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.

Disclaimer. This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by any of the third-party services referenced. Users are responsible for complying with the target site's terms of service and applicable law. Create a free account w/ $5 credit.

🆘 Need Help?

If you hit a bug, have questions about setup, or need a scraper we haven't built yet, open our contact form or write to parseforge@protonmail.com. We also take on paid custom data projects.

For faster answers, join our Discord. It's the best place to get support and suggest new actors.

Harvard Dataverse Scraper

fortuitous_pirate/harvard-dataverse-scraper

Scrape research datasets from Harvard Dataverse: search datasets, files, and dataverses across all academic disciplines. Free public API.

Fortuitous Pirate

Research Repository Harvester (Dataverse / DSpace / InvenioRDM)

datamule/repository-platform-extractor

Point at ANY Dataverse, DSpace 7/8, or InvenioRDM/Zenodo install and harvest every dataset's metadata via one unified API — title, authors, DOI/handle, subjects, date, type, license, files, landing page + raw. One actor spans thousands of institutional repositories. Pay per record.

Datamule

Harvard Catalyst Profiles Scraper

futurizerush/harvard-catalyst-profiles-scraper

Extracts researcher contact details from the Harvard Catalyst Profiles directory.

Rush

5.0

Harvard University Scraper

fatihtahta/harvard-university-scraper

Scrapes Harvard University Profiles directory listings with pagination to gather profile URLs, then extracts detailed data; name, email, departments, affiliations, education, honors, bio, and more. Ideal for academic research, lead generation.

Fatih Tahta

Figshare Research Articles Scraper

parseforge/figshare-articles-scraper

Search Figshare for shared research articles, datasets, posters, theses, and code. Filter by item type and free text query to retrieve article IDs, DOIs, titles, authors, descriptions, license info, and publication dates. Useful for scholarly discovery and open research tracking.

ParseForge

Dryad Research Datasets Scraper

parseforge/dryad-datasets-scraper

Search the Dryad Digital Repository for open research datasets. Pass a free text query or an institutional ROR ID to retrieve dataset DOIs, titles, abstracts, authors, publication dates, license info, and related publications. Useful for life sciences research and data citation tracking.

ParseForge

Zenodo Research Records Scraper

parseforge/zenodo-records-scraper

Search the CERN Zenodo repository for research outputs by keyword and resource type. Returns record IDs, DOIs, titles, creators, descriptions, publication dates, license info, and access right flags. Useful for scholarly discovery, citation tracking, open access audits, and meta research.

ParseForge

Harvard Art Museums Collection Scraper

parseforge/harvardart-museums-scraper

Explore objects in the Harvard Art Museums collection across departments. Each object returns the title, classification, date, culture, medium, department, image, and record link. Great for art research, curating references, and studying holdings by culture or period.

ParseForge

OSF Open Science Framework Scraper

parseforge/osf-scraper

Export public research projects, preprints, and registrations from the Open Science Framework (OSF). Search across 1M+ open science records. Filter by keyword, subject, or provider. Pull titles, descriptions, tags, DOIs, authors, institutions, dates, and full metadata.

ParseForge

⚖️ Harvard Caselaw Access Project Scraper

parseforge/caselaw-access-scraper

Search the Harvard Caselaw Access Project for historical US court opinions. Export case name, citation, court, decision date, jurisdiction, judges, full opinion text, and headnotes as CSV, Excel, JSON, JSONL, XML, or HTML. Public-data export with no login required.

ParseForge