ClinicalTrials.gov Scraper avatar

ClinicalTrials.gov Scraper

Pricing

Pay per usage

Go to Apify Store
ClinicalTrials.gov Scraper

ClinicalTrials.gov Scraper

Scrape clinical trials from ClinicalTrials.gov. Search by condition, drug, sponsor, phase, status. Extract NCT IDs, sponsors, enrollment, eligibility, outcomes, locations. No API key needed. For pharma, biotech, VCs, researchers.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Ugen Dorji

Ugen Dorji

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

Extract structured clinical trial data from ClinicalTrials.gov — the U.S. National Library of Medicine's registry of 500,000+ clinical studies conducted around the world.

Search by disease, drug, sponsor, phase, or status and get clean, structured JSON output ready for analysis, dashboards, or integration into your data pipeline.

Who is this for?

  • Pharma & biotech companies — monitor competitor pipelines, track trials for specific drug classes, identify partnership opportunities
  • Venture capital & investment analysts — evaluate company pipelines, assess clinical risk for due diligence, track Phase 3 readouts
  • Clinical research organizations (CROs) — find recruiting trials, analyze site selection patterns, benchmark enrollment timelines
  • Academic researchers — systematic reviews, meta-analyses, landscape analyses of therapeutic areas
  • Regulatory professionals — track FDA-regulated studies, monitor status changes across therapeutic areas
  • Patient advocacy groups — find recruiting trials by condition and location for patient matching

Features

  • Search by condition/diseasebreast cancer, diabetes, Alzheimer's disease, NSCLC
  • Search by intervention/drugpembrolizumab, Ozempic, CAR-T, semaglutide
  • Search by sponsorPfizer, Novartis, NIH, Mayo Clinic, Merck
  • Filter by phase — Early Phase 1, Phase 1, Phase 2, Phase 3, Phase 4
  • Filter by status — Recruiting, Completed, Active, Suspended, Terminated, and more
  • Filter by date range — studies started after or before a given date
  • Automatic pagination — fetches all results, not just the first page
  • Enriched, structured output — clean JSON with all key fields extracted and normalized
  • No API key needed — uses the free ClinicalTrials.gov public API
  • No proxy needed — government API with no anti-bot measures
  • Fast — API-based extraction, no browser or rendering overhead

Input Parameters

ParameterTypeRequiredDescription
conditionstringNo*Disease or condition (e.g., "breast cancer")
interventionstringNo*Drug or intervention (e.g., "pembrolizumab")
sponsorstringNo*Sponsor organization (e.g., "Pfizer")
phasearrayNoClinical phase filter (Phase 1, 2, 3, 4)
statusarrayNoStudy status filter (Recruiting, Completed, etc.)
dateFromstringNoStudies started on/after this date (YYYY-MM-DD)
dateTostringNoStudies started on/before this date (YYYY-MM-DD)
maxResultsintegerNoMax trials to return (default: 100, 0 = unlimited)
includeEligibilitybooleanNoInclude full eligibility criteria text (default: true)
includeLocationsbooleanNoInclude study site locations (default: true)

*At least one of condition, intervention, or sponsor is required.

Output Format

Each clinical trial is returned as a structured JSON object:

{
"nctId": "NCT06083844",
"title": "Phase II Investigation of Pembrolizumab in Combination With Bevacizumab...",
"officialTitle": "Phase II Investigation of Pembrolizumab...",
"briefSummary": "To find out if combining pembrolizumab, bevacizumab...",
"status": "Recruiting",
"phase": "Phase 2",
"studyType": "Interventional",
"sponsor": "M.D. Anderson Cancer Center",
"sponsorClass": "OTHER",
"collaborators": [],
"conditions": ["Ovarian Cancer"],
"interventions": [
{"type": "Drug", "name": "Pembrolizumab"},
{"type": "Drug", "name": "Bevacizumab"},
{"type": "Drug", "name": "Cyclophosphamide"}
],
"enrollment": 20,
"startDate": "2023-12-12",
"completionDate": "2027-09-30",
"primaryCompletionDate": "2027-09-30",
"lastUpdateDate": "2026-03-04",
"studyFirstPostDate": "2023-10-16",
"locations": [
{
"facility": "Memorial Sloan Kettering Cancer Center",
"city": "New York",
"state": "New York",
"country": "United States"
}
],
"eligibility": {
"minAge": "18 Years",
"maxAge": "N/A",
"sex": "Female",
"healthyVolunteers": false
},
"primaryOutcomes": [
{
"measure": "Incidence of Adverse Events...",
"timeFrame": "Through study completion; an average of 1 year."
}
],
"studyUrl": "https://clinicaltrials.gov/study/NCT06083844",
"resultsAvailable": false,
"eligibilityCriteria": "Inclusion Criteria: ..."
}

Use Cases

Drug Pipeline Monitoring

Track all Phase 3 trials for a specific drug class across sponsors:

  • Set intervention to your drug class (e.g., "GLP-1 receptor agonist")
  • Set phase to Phase 3
  • Set status to Recruiting or Active
  • Schedule daily/weekly runs via Apify to detect new filings

Competitor Analysis

Monitor a competitor's clinical trial activity:

  • Set sponsor to the company name (e.g., "Eli Lilly")
  • Filter by status to see active pipeline vs. completed studies
  • Track enrollment numbers and completion timelines

Trial Recruitment Intelligence

Find recruiting trials for patient matching or site selection:

  • Set condition to target disease
  • Set status to Recruiting
  • Enable includeLocations to see trial sites
  • Filter by date to focus on recently started studies

Systematic Review / Meta-Analysis

Build comprehensive datasets for research:

  • Combine condition + intervention filters
  • Set maxResults to 0 (unlimited) to capture all relevant trials
  • Include eligibility criteria for protocol analysis
  • Export to CSV/Excel from Apify dataset

Investment Due Diligence

Evaluate a biotech company's pipeline:

  • Set sponsor to the company
  • Look at phase distribution, enrollment progress, completion timelines
  • Cross-reference with SEC filings and press releases

Cost Estimate

This actor uses only the ClinicalTrials.gov REST API — no browser rendering, no proxy.

  • 100 trials: ~5 seconds, minimal compute ($0.01 or less)
  • 1,000 trials: ~30 seconds, ~$0.01-0.02
  • 10,000 trials: ~5 minutes, ~$0.05-0.10
  • Full condition search (e.g., all cancer trials): depends on result count

The ClinicalTrials.gov API is free with no rate limits documented, though the actor includes respectful delays between pages.

Comparison to Commercial Alternatives

FeatureThis ActorGlobalDataCiteline (Pharma Intelligence)
PricePay-per-use (~$0.01/run)$30,000-80,000/year$50,000-100,000+/year
Data sourceClinicalTrials.gov (primary source)Aggregated (includes CT.gov)Aggregated (includes CT.gov)
API accessBuilt-in (Apify API)Extra costExtra cost
Custom filtersFull controlPre-built dashboardsPre-built dashboards
SchedulingApify scheduler (free)Manual or enterpriseManual or enterprise
Output formatJSON, CSV, ExcelProprietary reportsProprietary reports
Setup time2 minutesWeeks (enterprise sales)Weeks (enterprise sales)

This actor gives you direct access to the same ClinicalTrials.gov data that powers the trial sections of commercial platforms — at a fraction of the cost. For teams that need the primary source data in a structured, automatable format, this is the fastest path.

Data Source

All data comes from ClinicalTrials.gov, operated by the U.S. National Library of Medicine (NLM) at the National Institutes of Health (NIH). This is the world's largest clinical trial registry, required by U.S. law (FDAAA 801) for most interventional trials.

Limitations

  • Data reflects what sponsors report to ClinicalTrials.gov — some trials may have outdated statuses
  • The API does not return full results data (endpoints, statistics) for completed trials — only whether results are available
  • Very broad searches (e.g., all studies with no filters) may return hundreds of thousands of results — use filters to narrow scope
  • Some older trials have incomplete location or eligibility data

Support

If you encounter issues or have feature requests, open an issue on the actor's GitHub repository or contact the author through Apify.