ClinicalTrials.gov Scraper avatar

ClinicalTrials.gov Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
ClinicalTrials.gov Scraper

ClinicalTrials.gov Scraper

Extract clinical trial data from ClinicalTrials.gov — the world's largest registry of clinical studies, maintained by the U.S. National Library of Medicine (NLM).

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Professional Edge

Professional Edge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Extract clinical trial data from ClinicalTrials.gov — the world's largest registry of clinical studies, maintained by the U.S. National Library of Medicine (NLM). This Actor wraps the official ClinicalTrials.gov v2 API to deliver structured data on over 575,000 clinical studies including drug trials, device studies, and behavioral research.

ClinicalTrials.gov data is essential for pharmaceutical competitive intelligence, biotech investment research, medical device market analysis, academic research, and patient advocacy. Every interventional study conducted in the US — and many international studies — must be registered here. This Actor makes that data instantly accessible in JSON, CSV, or Excel format.

Key Features

FeatureDescription
575,000+ studiesThe world's most comprehensive clinical trial registry
Multi-field searchSearch by keyword, condition, intervention, location, status, and phase
Study phase filteringEarly Phase 1 through Phase 4
Status filteringRecruiting, Completed, Active, Terminated, and more
19 fields per recordSponsor, phases, conditions, interventions, enrollment, eligibility, locations
No authenticationUses the free public ClinicalTrials.gov v2 API — no API key needed

What Data Can You Extract?

FieldDescription
nctIdClinicalTrials.gov identifier (e.g., NCT06123456)
briefTitleShort study title
officialTitleFull official study title
overallStatusCurrent status (Recruiting, Completed, etc.)
startDateStudy start date
completionDateExpected or actual completion date
leadSponsorLead sponsor organization
sponsorClassSponsor type (Industry, NIH, Other)
briefSummaryStudy description and objectives
conditionsConditions or diseases being studied
studyTypeInterventional, Observational, etc.
phasesClinical trial phase (Phase 1-4)
enrollmentCountTarget or actual enrollment number
interventionsDrug names, devices, or procedures being tested
sexEligible sex (All, Female, Male)
minimumAge, maximumAgeAge eligibility range
locationsUp to 5 study site locations
urlDirect link to the study on ClinicalTrials.gov

How to Scrape Clinical Trial Data

  1. Go to this Actor's page on the Apify Store
  2. Click "Start" to open the input form
  3. Set your filters:
    • Enter a Search Term (e.g., cancer immunotherapy, COVID vaccine)
    • Enter a Condition/Disease (e.g., breast cancer, diabetes)
    • Enter an Intervention/Treatment (e.g., pembrolizumab, aspirin)
    • Select a Study Status (Recruiting, Completed, etc.) — or leave blank for all
    • Select a Study Phase (Phase 1-4) — or leave blank for all
    • Enter a Location (e.g., United States, California)
    • Set Max Results (default: 100; max: 10,000)
  4. Click "Start" to run the Actor
  5. Download your data in JSON, CSV, or Excel format from the Dataset tab

Input Example

{
"condition": "breast cancer",
"intervention": "pembrolizumab",
"status": "RECRUITING",
"phase": "PHASE3",
"maxResults": 50
}

Output Example

{
"nctId": "NCT06123456",
"briefTitle": "Pembrolizumab Plus Chemotherapy in Triple-Negative Breast Cancer",
"officialTitle": "A Phase 3, Randomized, Double-Blind Study of Pembrolizumab...",
"overallStatus": "RECRUITING",
"startDate": "2024-03",
"completionDate": "2027-06",
"leadSponsor": "Merck Sharp & Dohme LLC",
"sponsorClass": "INDUSTRY",
"briefSummary": "This study evaluates the efficacy and safety of pembrolizumab...",
"conditions": "Triple Negative Breast Neoplasms",
"studyType": "INTERVENTIONAL",
"phases": "PHASE3",
"enrollmentCount": 1200,
"interventions": ["Pembrolizumab", "Placebo"],
"sex": "Female",
"minimumAge": "18 Years",
"maximumAge": null,
"locations": [
"Memorial Sloan Kettering Cancer Center, New York, New York, United States",
"MD Anderson Cancer Center, Houston, Texas, United States"
],
"url": "https://clinicaltrials.gov/study/NCT06123456"
}

Pricing

This Actor uses pay-per-result pricing:

EventPrice
Actor start$0.00005
Per result$0.002

The ClinicalTrials.gov API is free and public. You only pay for Apify compute resources plus the per-result fee above. A typical run of 100 studies costs approximately $0.20 in Actor fees plus minimal compute costs.

Use Cases

  • Pharmaceutical Competitive Intelligence: Track competitor pipelines, monitor new trial registrations, and analyze therapeutic area trends
  • Biotech Investment Research: Identify companies with promising Phase 2/3 trials for investment analysis
  • Medical Device Market Analysis: Find device trials by condition, sponsor, and development stage
  • Academic Research: Build datasets of clinical studies for systematic reviews and meta-analyses
  • Patient Advocacy: Help patients find recruiting trials by condition, location, and eligibility criteria
  • Regulatory Intelligence: Monitor trial completions and status changes for regulatory filing timelines

Integrations

Connect this Actor to your existing workflows:

  • Export to Google Sheets for collaborative analysis
  • Send results to Slack or email for automated alerts
  • Feed into Zapier, Make, or n8n for custom automation
  • Use the Apify API to integrate directly with your application

FAQ

Q: How current is the data? A: ClinicalTrials.gov is updated continuously as sponsors register new studies and post results. The API reflects the latest submitted data.

Q: Can I search for studies by drug name? A: Yes. Use the Intervention/Treatment field to search by drug name (e.g., "pembrolizumab"), device name, or procedure type.

Q: What's the difference between study phases? A: Phase 1 tests safety in small groups. Phase 2 tests efficacy. Phase 3 confirms effectiveness in large populations. Phase 4 is post-market surveillance.

Q: Can I get results data (outcomes)? A: This Actor extracts study metadata and design information. Detailed results data posted by sponsors requires additional API endpoints.

This Actor accesses publicly available data from the ClinicalTrials.gov API, a free public service provided by the U.S. National Library of Medicine. The data is in the public domain and freely available for any use.

This Actor does not bypass any authentication, does not violate any terms of service, and respects rate limits on the ClinicalTrials.gov API. The Actor is provided as-is without warranty. Users are responsible for ensuring their use of the data complies with applicable laws and regulations.

For questions or support, please open an issue on this Actor's page.