ClinicalTrials.gov Scraper
Pricing
Pay per usage
ClinicalTrials.gov Scraper
Scrape clinical trials from ClinicalTrials.gov. Search by condition, drug, sponsor, phase, status. Extract NCT IDs, sponsors, enrollment, eligibility, outcomes, locations. No API key needed. For pharma, biotech, VCs, researchers.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Ugen Dorji
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Extract structured clinical trial data from ClinicalTrials.gov — the U.S. National Library of Medicine's registry of 500,000+ clinical studies conducted around the world.
Search by disease, drug, sponsor, phase, or status and get clean, structured JSON output ready for analysis, dashboards, or integration into your data pipeline.
Who is this for?
- Pharma & biotech companies — monitor competitor pipelines, track trials for specific drug classes, identify partnership opportunities
- Venture capital & investment analysts — evaluate company pipelines, assess clinical risk for due diligence, track Phase 3 readouts
- Clinical research organizations (CROs) — find recruiting trials, analyze site selection patterns, benchmark enrollment timelines
- Academic researchers — systematic reviews, meta-analyses, landscape analyses of therapeutic areas
- Regulatory professionals — track FDA-regulated studies, monitor status changes across therapeutic areas
- Patient advocacy groups — find recruiting trials by condition and location for patient matching
Features
- Search by condition/disease —
breast cancer,diabetes,Alzheimer's disease,NSCLC - Search by intervention/drug —
pembrolizumab,Ozempic,CAR-T,semaglutide - Search by sponsor —
Pfizer,Novartis,NIH,Mayo Clinic,Merck - Filter by phase — Early Phase 1, Phase 1, Phase 2, Phase 3, Phase 4
- Filter by status — Recruiting, Completed, Active, Suspended, Terminated, and more
- Filter by date range — studies started after or before a given date
- Automatic pagination — fetches all results, not just the first page
- Enriched, structured output — clean JSON with all key fields extracted and normalized
- No API key needed — uses the free ClinicalTrials.gov public API
- No proxy needed — government API with no anti-bot measures
- Fast — API-based extraction, no browser or rendering overhead
Input Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
condition | string | No* | Disease or condition (e.g., "breast cancer") |
intervention | string | No* | Drug or intervention (e.g., "pembrolizumab") |
sponsor | string | No* | Sponsor organization (e.g., "Pfizer") |
phase | array | No | Clinical phase filter (Phase 1, 2, 3, 4) |
status | array | No | Study status filter (Recruiting, Completed, etc.) |
dateFrom | string | No | Studies started on/after this date (YYYY-MM-DD) |
dateTo | string | No | Studies started on/before this date (YYYY-MM-DD) |
maxResults | integer | No | Max trials to return (default: 100, 0 = unlimited) |
includeEligibility | boolean | No | Include full eligibility criteria text (default: true) |
includeLocations | boolean | No | Include study site locations (default: true) |
*At least one of condition, intervention, or sponsor is required.
Output Format
Each clinical trial is returned as a structured JSON object:
{"nctId": "NCT06083844","title": "Phase II Investigation of Pembrolizumab in Combination With Bevacizumab...","officialTitle": "Phase II Investigation of Pembrolizumab...","briefSummary": "To find out if combining pembrolizumab, bevacizumab...","status": "Recruiting","phase": "Phase 2","studyType": "Interventional","sponsor": "M.D. Anderson Cancer Center","sponsorClass": "OTHER","collaborators": [],"conditions": ["Ovarian Cancer"],"interventions": [{"type": "Drug", "name": "Pembrolizumab"},{"type": "Drug", "name": "Bevacizumab"},{"type": "Drug", "name": "Cyclophosphamide"}],"enrollment": 20,"startDate": "2023-12-12","completionDate": "2027-09-30","primaryCompletionDate": "2027-09-30","lastUpdateDate": "2026-03-04","studyFirstPostDate": "2023-10-16","locations": [{"facility": "Memorial Sloan Kettering Cancer Center","city": "New York","state": "New York","country": "United States"}],"eligibility": {"minAge": "18 Years","maxAge": "N/A","sex": "Female","healthyVolunteers": false},"primaryOutcomes": [{"measure": "Incidence of Adverse Events...","timeFrame": "Through study completion; an average of 1 year."}],"studyUrl": "https://clinicaltrials.gov/study/NCT06083844","resultsAvailable": false,"eligibilityCriteria": "Inclusion Criteria: ..."}
Use Cases
Drug Pipeline Monitoring
Track all Phase 3 trials for a specific drug class across sponsors:
- Set
interventionto your drug class (e.g., "GLP-1 receptor agonist") - Set
phaseto Phase 3 - Set
statusto Recruiting or Active - Schedule daily/weekly runs via Apify to detect new filings
Competitor Analysis
Monitor a competitor's clinical trial activity:
- Set
sponsorto the company name (e.g., "Eli Lilly") - Filter by
statusto see active pipeline vs. completed studies - Track enrollment numbers and completion timelines
Trial Recruitment Intelligence
Find recruiting trials for patient matching or site selection:
- Set
conditionto target disease - Set
statusto Recruiting - Enable
includeLocationsto see trial sites - Filter by date to focus on recently started studies
Systematic Review / Meta-Analysis
Build comprehensive datasets for research:
- Combine
condition+interventionfilters - Set
maxResultsto 0 (unlimited) to capture all relevant trials - Include eligibility criteria for protocol analysis
- Export to CSV/Excel from Apify dataset
Investment Due Diligence
Evaluate a biotech company's pipeline:
- Set
sponsorto the company - Look at phase distribution, enrollment progress, completion timelines
- Cross-reference with SEC filings and press releases
Cost Estimate
This actor uses only the ClinicalTrials.gov REST API — no browser rendering, no proxy.
- 100 trials: ~5 seconds, minimal compute ($0.01 or less)
- 1,000 trials: ~30 seconds, ~$0.01-0.02
- 10,000 trials: ~5 minutes, ~$0.05-0.10
- Full condition search (e.g., all cancer trials): depends on result count
The ClinicalTrials.gov API is free with no rate limits documented, though the actor includes respectful delays between pages.
Comparison to Commercial Alternatives
| Feature | This Actor | GlobalData | Citeline (Pharma Intelligence) |
|---|---|---|---|
| Price | Pay-per-use (~$0.01/run) | $30,000-80,000/year | $50,000-100,000+/year |
| Data source | ClinicalTrials.gov (primary source) | Aggregated (includes CT.gov) | Aggregated (includes CT.gov) |
| API access | Built-in (Apify API) | Extra cost | Extra cost |
| Custom filters | Full control | Pre-built dashboards | Pre-built dashboards |
| Scheduling | Apify scheduler (free) | Manual or enterprise | Manual or enterprise |
| Output format | JSON, CSV, Excel | Proprietary reports | Proprietary reports |
| Setup time | 2 minutes | Weeks (enterprise sales) | Weeks (enterprise sales) |
This actor gives you direct access to the same ClinicalTrials.gov data that powers the trial sections of commercial platforms — at a fraction of the cost. For teams that need the primary source data in a structured, automatable format, this is the fastest path.
Data Source
All data comes from ClinicalTrials.gov, operated by the U.S. National Library of Medicine (NLM) at the National Institutes of Health (NIH). This is the world's largest clinical trial registry, required by U.S. law (FDAAA 801) for most interventional trials.
- Registry: https://clinicaltrials.gov
- API documentation: https://clinicaltrials.gov/data-api/api
- No authentication required
- No usage fees
Limitations
- Data reflects what sponsors report to ClinicalTrials.gov — some trials may have outdated statuses
- The API does not return full results data (endpoints, statistics) for completed trials — only whether results are available
- Very broad searches (e.g., all studies with no filters) may return hundreds of thousands of results — use filters to narrow scope
- Some older trials have incomplete location or eligibility data
Support
If you encounter issues or have feature requests, open an issue on the actor's GitHub repository or contact the author through Apify.