ClinicalTrials.gov Scraper
Pricing
from $3.00 / 1,000 review extracteds
ClinicalTrials.gov Scraper
Scrape clinical trials from the official ClinicalTrials.gov API โ no login, no API key, no blocking. 585K+ studies with status, phase, sponsor, conditions, interventions, enrollment, eligibility and locations. Filter by condition, sponsor and status. Tens of thousands per run.
Pricing
from $3.00 / 1,000 review extracteds
Rating
0.0
(0)
Developer
Logiover
Maintained by CommunityActor stats
0
Bookmarked
16
Total users
3
Monthly active users
16 hours ago
Last modified
Categories
Share
๐ฌ ClinicalTrials.gov Scraper โ Clinical Trial Data from the Official API
Scrape clinical trials straight from the official ClinicalTrials.gov API โ no login, no API key, no blocking. This Apify Actor covers the full ClinicalTrials.gov database of 585,000+ studies worldwide, returning deep structured data: trial status, phase, sponsor, conditions, interventions, enrollment, eligibility, locations, and key dates. One run can return tens of thousands of trials.
If you need bulk clinical trial data for pharma intelligence, patient recruitment, or research, this Actor turns the world's largest trials registry into a clean, exportable dataset (JSON, CSV, or Excel).
โจ What this Actor does / Key features
- Official NIH / ClinicalTrials.gov API source โ reliable, stable, and never gets blocked.
- No API key or login required โ just set your filters and run.
- Full database coverage โ 585,000+ studies, fully paginated at 1,000 per page.
- Deep structured data โ status, study type, phase, lead sponsor, sponsor class, collaborators, conditions, and interventions.
- Eligibility details โ sex, minimum and maximum age, and healthy-volunteer eligibility.
- Timeline data โ start date, completion date, primary completion date, first-posted and last-update-posted dates.
- Location data โ array of trial sites per study, plus a results-available flag.
- Powerful filters โ by condition/disease, free-text search term, lead sponsor, and trial status.
- Built for scheduling โ trial statuses update constantly, so run it on a schedule for an always-current feed.
๐ Input
| Field | Type | Description |
|---|---|---|
condition | string | Filter by condition or disease, e.g. diabetes, breast cancer, alzheimer. Leave empty for all. |
searchTerm | string | Free-text search across the whole study record (intervention, sponsor, keyword). Leave empty for all. |
sponsor | string | Filter by lead sponsor, e.g. pfizer, moderna, national cancer institute. Leave empty for all. |
overallStatus | array | Filter by trial status, e.g. RECRUITING, COMPLETED, ACTIVE_NOT_RECRUITING. Leave empty for all. |
maxStudies | integer | Maximum studies to save. 0 = all matching (up to ~585,000 across the whole database). |
๐ Example input
{"condition": "breast cancer","searchTerm": "","sponsor": "","overallStatus": ["RECRUITING"],"maxStudies": 10000}
๐ฆ Output
Each dataset item is one clinical trial study. Output fields:
| Field | Description |
|---|---|
nctId | ClinicalTrials.gov registration number (NCT ID). |
briefTitle | Short study title. |
officialTitle | Full official study title. |
acronym | Study acronym, if any. |
organization | Organization that submitted the record. |
overallStatus | Current trial status (e.g. RECRUITING, COMPLETED). |
studyType | Study type (e.g. interventional, observational). |
phases | Array of trial phases. |
enrollmentCount | Number of participants enrolled or planned. |
leadSponsor | Lead sponsor of the trial. |
sponsorClass | Sponsor classification (e.g. industry, NIH, other). |
collaborators | Array of collaborating organizations. |
conditions | Array of conditions/diseases studied. |
interventions | Array of interventions tested. |
briefSummary | Short summary of the study. |
sex | Eligible sex. |
minimumAge / maximumAge | Age eligibility range. |
healthyVolunteers | Whether healthy volunteers are accepted. |
startDate | Trial start date. |
completionDate | Overall completion date. |
primaryCompletionDate | Primary completion date. |
firstPostedDate | When the study was first posted. |
lastUpdatePostedDate | When the study was last updated. |
locations | Array of trial site locations. |
hasResults | Whether study results have been posted. |
url | Direct link to the study on ClinicalTrials.gov. |
scrapedAt | Timestamp of when the record was scraped. |
๐ก Use cases
- Pharma & biotech competitive intelligence โ track competitor trials by phase, sponsor, and condition.
- CROs & patient recruitment โ find recruiting trials by condition and location at scale.
- Investors & analysts โ monitor the clinical pipeline across the industry.
- Researchers & academics โ build datasets from the world's largest trials registry.
- Journalists & policy teams โ analyze trial trends, sponsor activity, and enrollment data.
โ Frequently Asked Questions
Is this an official ClinicalTrials.gov API or a scraper? It pulls data from the official ClinicalTrials.gov API, so the data is authoritative and stable. There is no affiliation with ClinicalTrials.gov or the NIH.
Do I need an API key or login? No. The Actor handles everything โ you only need an Apify account. There is no ClinicalTrials.gov API key to register.
Is it legal to use this data? ClinicalTrials.gov study records are public information published by the US National Library of Medicine. Use the data responsibly and in line with ClinicalTrials.gov terms.
How much data can I get in one run?
A single run can return tens of thousands of studies. Set maxStudies to 0 to capture everything matching your filters, up to the full ~585,000-study database.
Can I filter the results? Yes โ filter by condition/disease, free-text search term, lead sponsor, and trial status, in any combination.
How fresh is the data?
It reflects what is currently published on ClinicalTrials.gov, including each study's lastUpdatePostedDate. Schedule the Actor to keep a continuously updated dataset.
What output formats are supported? JSON, CSV, Excel (XLSX), HTML, and JSONLines, available through the Apify dataset and API.
โฐ Scheduling & integration
Schedule the Actor on Apify to keep an always-fresh clinical trials database. Export results to JSON, CSV, or Excel, call it from the Apify API, or connect it to Google Sheets, Airtable, databases, or Slack with webhooks and no-code tools like Make, n8n, and Zapier.
Keywords: clinicaltrials.gov scraper, clinical trial data api, clinical trials scraper, clinical trial database, pharma competitive intelligence, patient recruitment data, clinical pipeline data, nct id scraper, clinical research dataset, clinical trials export
