Clinical Trials Scraper avatar

Clinical Trials Scraper

Pricing

from $0.50 / 1,000 results

Go to Apify Store
Clinical Trials Scraper

Clinical Trials Scraper

Search ClinicalTrials.gov for studies by condition, intervention, sponsor, or keyword. Look up individual trials by NCT ID. No API key required.

Pricing

from $0.50 / 1,000 results

Rating

0.0

(0)

Developer

mick_

mick_

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

25 days ago

Last modified

Share

Search ClinicalTrials.gov for studies by condition, intervention, sponsor, or keyword. Look up individual trials by NCT ID. No API key required. MCP-ready for AI agent integration.

What does it do?

Clinical Trials Scraper queries the ClinicalTrials.gov API v2 and returns structured data about clinical studies worldwide. It extracts trial metadata, eligibility criteria, interventions, outcomes, sponsors, and locations into clean, normalized JSON. Returns consistent output -- ready for analysis, pharma pipelines, or consumption by AI agents via MCP.

๐Ÿ‘ฅ Who Uses This

๐Ÿ’Š Pharma and Biotech Pipeline Teams

You need to track what competitors are running โ€” which conditions, which phases, which sponsors. ClinicalTrials.gov has 500,000+ studies. This actor lets you query by sponsor name, intervention, or therapeutic area and get structured output ready for pipeline dashboards without manual site navigation.

{
"mode": "search_studies",
"sponsor": "Pfizer",
"phase": "PHASE3",
"status": "RECRUITING",
"maxResults": 200
}

Run on a schedule to catch new trial registrations. Cross-reference with FDA Drug Labels and FDA Adverse Events for a full compound dossier.


๐Ÿฅ Patient Recruitment and Clinical Operations Teams

You're identifying recruiting trials for specific conditions and locations to support patient referrals, site feasibility assessments, or registry building. The structured eligibility and location fields let you filter without reading each trial page.

{
"mode": "search_by_condition",
"condition": "non-small cell lung cancer",
"status": "RECRUITING",
"maxResults": 100
}

Filter by phase, studyType (interventional vs observational), and date range to narrow to relevant protocols. Pull nct_id lists for downstream processing with get_study mode to retrieve full eligibility text.


๐Ÿ”ฌ Academic Researchers and Systematic Reviewers

You need a structured trial inventory for a systematic review or meta-analysis โ€” not hand-copied PDFs, but normalized JSON with consistent field names across thousands of trials. Batch search across multiple condition terms, deduplicate by NCT ID, and export to CSV.

{
"mode": "search_studies",
"query": "GLP-1 receptor agonist type 2 diabetes cardiovascular",
"studyType": "INTERVENTIONAL",
"maxResults": 500
}

Combine with PubMed Scraper to match trials to their published results. The nct_id links directly to the ClinicalTrials.gov record; PMID cross-references appear in the references field where available.


๐Ÿ“‹ Regulatory Affairs and Compliance Teams

You need to monitor trial status changes โ€” completions, terminations, protocol amendments โ€” for your own programs or competitor benchmarking. Track when trials enter Phase 3, when they complete, or when they're terminated early (often a safety signal).

{
"mode": "search_studies",
"sponsor": "YourCompany",
"status": "COMPLETED",
"startDateFrom": "2023-01-01",
"maxResults": 50
}

Use get_study mode with specific NCT IDs to pull the full record including primary and secondary outcomes, protocol amendments, and results references.


๐Ÿค– AI/LLM Engineers and Agent Builders

You're building biomedical AI pipelines โ€” RAG systems grounded in trial data, agents that can answer questions about active trials, or research assistants that cross-reference literature with ongoing studies.

MCP tool config:

{
"mcpServers": {
"clinical-trials-scraper": {
"url": "https://mcp.apify.com?tools=labrat011/clinical-trials-scraper",
"headers": {
"Authorization": "Bearer <APIFY_TOKEN>"
}
}
}
}

Combine with PubMed Scraper and FDA Drug Labels in the same MCP config to give your agent access to the full biomedical research stack โ€” literature, trials, and drug labeling โ€” as callable tools.


Features

  • 3 modes: search_studies, get_study, search_by_condition
  • No API key required -- ClinicalTrials.gov API v2 is public
  • No proxies needed -- direct API access to government infrastructure
  • Rich filters -- condition, intervention, sponsor, phase, status, study type, date range
  • Pagination -- automatically pages through large result sets
  • Polite rate limiting -- default 0.2s between requests; retry with exponential backoff on failures
  • State persistence -- survives Apify actor migrations mid-run
  • Batch push -- outputs in batches of 25 for efficiency
  • Free tier -- 25 results per run without a subscription
  • MCP-ready -- stable JSON schema with schema_version, no missing keys

What data does it extract?

Studies

FieldDescription
schema_versionSchema version (currently "1.0")
typeAlways "study"
nct_idClinicalTrials.gov identifier (e.g., NCT05678901)
titleOfficial study title
acronymStudy acronym (if any)
overall_statusCurrent status (e.g., RECRUITING, COMPLETED)
start_dateStudy start date
completion_dateEstimated or actual completion date
last_update_dateLast update posted date
brief_summaryBrief study description
conditionsMedical conditions studied
interventionsDrugs, devices, or procedures being tested
phasesTrial phases (e.g., PHASE1, PHASE2, PHASE3)
study_typeStudy type (INTERVENTIONAL, OBSERVATIONAL)
enrollmentNumber of participants
enrollment_typeEnrollment type (ACTUAL or ESTIMATED)
sponsorLead sponsor organization
collaboratorsCollaborating organizations
sexEligible sex (ALL, MALE, FEMALE)
min_ageMinimum eligible age
max_ageMaximum eligible age
healthy_volunteersWhether healthy volunteers accepted
primary_outcomesPrimary outcome measures
secondary_outcomesSecondary outcome measures
locationsStudy sites (facility, city, state, country)
study_urlLink to study on ClinicalTrials.gov

Input

Mode 1: Search Studies

Search for clinical trials by keyword, condition, intervention, or sponsor.

{
"mode": "search_studies",
"query": "lung cancer",
"phase": ["PHASE3"],
"status": ["RECRUITING"],
"maxResults": 100
}

Search by intervention and sponsor:

{
"mode": "search_studies",
"intervention": "pembrolizumab",
"sponsor": "Merck",
"maxResults": 50
}

Mode 2: Get Study

Look up a specific study by NCT ID.

{
"mode": "get_study",
"nctId": "NCT04280705"
}

Mode 3: Search by Condition

Condition-focused search for clinical trials.

{
"mode": "search_by_condition",
"condition": "type 2 diabetes",
"studyType": "INTERVENTIONAL",
"phase": ["PHASE2", "PHASE3"],
"maxResults": 200
}

Input Reference

Common fields:

ParameterDefaultDescription
modesearch_studiessearch_studies, get_study, or search_by_condition
queryGeneral search term across all fields
nctIdNCT ID for get_study mode
conditionDisease or condition name
interventionDrug, device, or procedure
sponsorSponsor organization
maxResults100Max results (1-1000; free tier capped at 25)
requestIntervalSecs0.2Seconds between requests
timeoutSecs30HTTP timeout
maxRetries5Retries on failure

Filter fields:

ParameterDefaultDescription
statusStudy status filter (e.g., RECRUITING, COMPLETED)
phaseTrial phase (e.g., PHASE1, PHASE2, PHASE3)
studyTypeStudy type (INTERVENTIONAL, OBSERVATIONAL)
dateFromStudies starting on or after (YYYY-MM-DD)
dateToStudies starting on or before (YYYY-MM-DD)

Output

Results are saved to the default dataset. Download them in JSON, CSV, Excel, or XML from the Output tab.

Example output

{
"schema_version": "1.0",
"type": "study",
"nct_id": "NCT04280705",
"title": "A Study of Nivolumab Plus Ipilimumab in Participants With Advanced Non-Small Cell Lung Cancer",
"acronym": "",
"overall_status": "COMPLETED",
"start_date": "2020-03-15",
"completion_date": "2024-06-30",
"last_update_date": "2024-08-15",
"brief_summary": "This study evaluated the combination of nivolumab and ipilimumab...",
"conditions": ["Non-Small Cell Lung Cancer", "NSCLC"],
"interventions": ["DRUG: Nivolumab", "DRUG: Ipilimumab"],
"phases": ["PHASE3"],
"study_type": "INTERVENTIONAL",
"enrollment": 1200,
"enrollment_type": "ACTUAL",
"sponsor": "Bristol-Myers Squibb",
"collaborators": [],
"sex": "ALL",
"min_age": "18 Years",
"max_age": "",
"healthy_volunteers": "No",
"primary_outcomes": ["Overall Survival (OS)"],
"secondary_outcomes": ["Progression-Free Survival (PFS)", "Objective Response Rate (ORR)"],
"locations": ["Memorial Sloan Kettering Cancer Center, New York, New York, United States"],
"study_url": "https://clinicaltrials.gov/study/NCT04280705"
}

Cost

This actor uses pay-per-event (PPE) pricing. You pay only for the results you get.

  • $0.50 per 1,000 results ($0.0005 per result)
  • No proxy costs -- public government APIs
  • No API key costs -- ClinicalTrials.gov is free and public
  • Free tier: 25 results per run (no subscription required)

Technical details

  • ClinicalTrials.gov API v2 (clinicaltrials.gov/api/v2/studies) for study search and retrieval
  • No authentication required -- public government data
  • Automatic pagination via nextPageToken
  • Rate limited to 1 request per 0.2 seconds (configurable)
  • Automatic retry with exponential backoff and jitter on failures
  • Results pushed in batches of 25 for efficiency
  • Actor state persisted across migrations
  • No proxies, no browser, no cookies -- direct API access

MCP Integration

This actor works as an MCP tool through Apify's hosted MCP server. No custom server needed.

  • Endpoint: https://mcp.apify.com?tools=labrat011/clinical-trials-scraper
  • Auth: Authorization: Bearer <APIFY_TOKEN>
  • Transport: Streamable HTTP
  • Works with: Claude Desktop, Cursor, VS Code, Windsurf, Warp, Gemini CLI

Example MCP config (Claude Desktop / Cursor):

{
"mcpServers": {
"clinical-trials-scraper": {
"url": "https://mcp.apify.com?tools=labrat011/clinical-trials-scraper",
"headers": {
"Authorization": "Bearer <APIFY_TOKEN>"
}
}
}
}

AI agents can use this actor to search clinical trials, look up study details, track drug development pipelines, and monitor trial status changes -- all as a callable MCP tool.


FAQ

Do I need an API key?

No. ClinicalTrials.gov API v2 is public with no authentication required.

What status values can I filter by?

RECRUITING, NOT_YET_RECRUITING, ACTIVE_NOT_RECRUITING, COMPLETED, ENROLLING_BY_INVITATION, SUSPENDED, TERMINATED, WITHDRAWN.

What phase values can I filter by?

EARLY_PHASE1, PHASE1, PHASE2, PHASE3, PHASE4, NA.

How many studies are in ClinicalTrials.gov?

Over 500,000 studies from 200+ countries.

Can I combine filters?

Yes. All filters are AND-combined. For example, search for Phase 3 recruiting trials for lung cancer with a specific sponsor.


ActorWhat it doesPairs well when...
PubMed Scraper35M+ biomedical abstracts from NCBIMatch trials to published results by NCT ID or condition
FDA Drug Labels ScraperFDA-approved drug labeling dataLook up the label for a trial's intervention drug
FDA Adverse Events ScraperFAERS post-market safety reportsCross-reference trial drug with real-world adverse event signals
FDA Orange Book ScraperPatent, exclusivity, and generic equivalence dataCheck approval and patent status of trial interventions
NPI Provider Contact FinderHealthcare provider directoryFind principal investigators or recruiting site contacts

Feedback

Found a bug or have a feature request? Open an issue on the actor's Issues tab in Apify Console.