Clinicaltrials Scraper avatar
Clinicaltrials Scraper

Pricing

Pay per event

Go to Apify Store
Clinicaltrials Scraper

Clinicaltrials Scraper

Collect structured trial records on ClinicalTrials.gov. It supports keyword searches, country/location filters, built-in status/phase options, document availability indicators, and precise record limits so analysts can create medical research datasets in seconds.

Pricing

Pay per event

Rating

5.0

(1)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

2

Monthly active users

10 days ago

Last modified

Share

ClinicalTrials.gov Scraper

🚀 Instantly turn the ClinicalTrials.gov search experience into clean, structured datasets ready for analysis, due diligence, or portfolio monitoring.

This actor calls the same API that powers the new ClinicalTrials.gov interface, so you get up-to-the-minute trial data without fighting HTML pages or browser automation. Pick any combination of keywords, locations, study statuses, phases, funder types, or document availability filters and receive perfectly flattened JSON that is easy to drop into BI tools.

Target Audience: Healthcare investors, market access teams, clinical operations, medical affairs, data vendors.
Primary Use Cases: Competitive landscaping, site feasibility, results monitoring, KOL research, regulatory compliance tracking.


📊 What Does the ClinicalTrials.gov Scraper Do?

This tool collects every public data point exposed through the official Studies API:

  • Trial identifiers (NCT ID, acronyms, org study IDs) and shareable URLs
  • Current status, last-known status, verification dates, and results posting history
  • Design metadata (study type, phases, allocation, intervention model, masking, primary purpose, observational model, time perspective, target duration)
  • Eligibility details (sex, age brackets, healthy volunteer toggle, inclusion and exclusion criteria)
  • Lead sponsor, collaborator list, and funder category
  • Outcome measures (primary and secondary with descriptions), conditions, interventions, uploaded documents (protocols/SAP/ICF)
  • Central contacts, site-level contacts, site coordinates, and location status
  • Historical data including original enrollment info, sponsor changes, and version history

Business Value: Replace manual ClinicalTrials.gov browsing with a repeatable pipeline, feed downstream analytics with reliable identifiers, and document compliance sensitive artifacts (posted results, FDAAA violation flags, document uploads) automatically.


🔧 Input

To start collecting trials, fill in the input form. Choose either a start URL or the filters below:

  • startUrl – Paste a ClinicalTrials.gov search page URL (e.g., https://clinicaltrials.gov/search) or an API URL. We'll reuse its filters and paginate automatically. Prefill: https://clinicaltrials.gov/search
  • Condition/disease – Specific condition or disease to search for (e.g., diabetes, hypertension, cancer)
  • Other terms – Additional search terms (e.g., symptoms, treatments, keywords)
  • Intervention/treatment – Intervention or treatment to search for (e.g., drug name, procedure, device)
  • Location – Search by address, city, state, zip code, or country
  • Study Status – Filter by "All studies" or "Recruiting and not yet recruiting studies"
  • Sex – Filter by eligible sex (All, Female, Male)
  • Age – Use either predefined age groups (Child, Adult, Older adult) via ageGroups OR manually specify custom age values with units (Years, Months, Weeks, Days, Hours, Minutes) via ageRangeFrom/ageRangeTo fields. These options are mutually exclusive - you cannot use both at the same time.
  • Accepts healthy volunteers – Toggle to only show trials that accept healthy volunteers
  • Study Phase – Filter by Early Phase 1, Phase 1, Phase 2, Phase 3, Phase 4, or Not applicable
  • Study Type – Filter by Interventional, Observational, Patient registries, Expanded access, and variations
  • Study Results – Filter by "Any", "With results", or "Without results"
  • Study Documents – Require protocol, SAP, or informed consent form uploads
  • Funder Type – Filter by NIH, Other U.S. federal agency, Industry, or All others
  • Date Ranges – Filter by date ranges using separate "From" and "To" fields for: study start date, primary completion date, study completion date, first posted date, results first posted date, or last update posted date (e.g., studyStartDateFrom, studyStartDateTo)
  • More Ways to Search – Search by title/acronym, outcome measure, sponsor/collaborator, lead sponsor, study IDs, or facility name
  • FDAAA violation flag – Toggle to only return studies flagged for FDAAA results submission violations
  • Sort studies by – Sort by "Relevance" or "Newest First"
  • maxItems – Free users: Required, maximum 50. Paid users: Optional, maximum 1,000,000. Prefill: 10

Example JSON using predefined age groups:

{
"condition": "diabetes",
"otherTerms": "type 2",
"intervention": "metformin",
"location": "United States",
"studyStatusFilter": "recruitingAndNotYetRecruiting",
"ageGroups": ["adult", "older"],
"phases": ["phase2", "phase3"],
"documentTypes": ["protocol"],
"maxItems": 10
}

Example JSON using manual age range:

{
"condition": "diabetes",
"ageRangeFrom": 18,
"ageRangeFromUnit": "years",
"ageRangeTo": 65,
"ageRangeToUnit": "years",
"maxItems": 10
}

Note: You cannot use both ageGroups and ageRangeFrom/ageRangeTo at the same time. The system automatically detects which method you're using based on which fields are present.


📥 Output

When the actor finishes, your dataset contains one flat JSON per trial. Download as JSON, CSV, Excel, or feed directly into the Apify API/webhooks.

Sample record (trimmed):

{
"nctId": "NCT07246980",
"briefTitle": "Continuous Observational Monitoring Program for Allay™ Hydrogel Cap Safety and Success Study",
"studyUrl": "https://clinicaltrials.gov/study/NCT07246980",
"officialTitle": "Continuous Observational Monitoring Program for Allay™ Hydrogel Cap Safety and Success (COMPASS) Study...",
"acronym": "COMPASS",
"orgStudyId": "CLP-0002",
"secondaryIds": null,
"overallStatus": "NOT_YET_RECRUITING",
"hasResults": false,
"studyType": "OBSERVATIONAL",
"phase": null,
"enrollmentCount": 1000,
"enrollmentType": "ESTIMATED",
"targetDuration": "2 Years",
"conditions": ["Peripheral Nerve Injuries"],
"interventions": [{
"type": "DEVICE",
"name": "Nerve Cap",
"description": "Patients who have or will receive allay Hydrogel Cap",
"armGroupLabels": null,
"otherNames": null
}],
"leadSponsorName": "Tulavi Therapeutics,Inc.",
"leadSponsorClass": "INDUSTRY",
"collaborators": null,
"locations": null,
"centralContacts": null,
"briefSummary": "This is a multi-center, open-label, observational registry study...",
"gender": "ALL",
"standardAges": ["CHILD", "ADULT", "OLDER_ADULT"],
"inclusionCriteria": ["Patients who have or will receive allay™ Hydrogel Cap"],
"exclusionCriteria": ["Patients who have not received allay™ Hydrogel Cap"],
"primaryOutcomeMeasures": [{
"measure": "Assessment of change in Pain Scores from baseline using VAS",
"timeFrame": "Baseline (Pre Surgery ), 2 Weeks, 1 month, 4 month, 12 month and 24 month (Post Surgery)",
"description": "Pain scores will be collected using either of the following scales..."
}],
"secondaryOutcomeMeasures": [{
"measure": "Incidence of Device Failures surgical & medical complications...",
"timeFrame": "Upto 2 years Post Surgery",
"description": "Secondary endpoints of the study include safety assessment..."
}],
"scrapedAt": "2025-11-25T00:07:52.271Z"
}

What You Get: Clean identifiers, design metadata, contacts, eligibility criteria, outcome measures, and compliance evidence in one row.
Download Options: CSV, Excel, JSON, XML, or read directly through the Apify API.


⚡ Why Choose the ClinicalTrials.gov Scraper?

  • All filters, zero browsers – We hit the Studies API directly, so there's nothing to break when the UI changes
  • Comprehensive data extraction – Capture all available fields from both listing and detailed API responses, with intelligent deduplication
  • Compliance-ready evidence – Capture results posting dates, FDAAA violation flags, and document download URLs automatically
  • Location intelligence built-in – Every site includes status, city/state/country, ZIP, and coordinates for quick mapping
  • Perfectly flat schema – No nested surprises. Contacts, documents, outcomes, and interventions are tidy arrays that BI tools understand
  • Clean data formatting – Empty strings and arrays are normalized to null for cleaner datasets
  • Huge exports supported – Request up to one million records per run without worrying about throttled browsers

Time Savings: Minutes instead of days copying and pasting from web pages.
💰 Cost Efficiency: Automate repeatable monitoring instead of paying staff/consultants for manual screenshotting.


🎯 How to Use

  1. Sign Up: Create a free account w/ $5 credit – two minutes
  2. Open the Actor: Visit the ClinicalTrials.gov Scraper on Apify
  3. Pick Your Approach: Paste a start URL or configure the filters (condition, status, location, date ranges, etc.)
  4. Run It: Click "Start". Progress logs will show every API page fetched
  5. Download Data: Grab the dataset as CSV/Excel/JSON or connect via webhook/API

Total Time: ~3 minutes for the first run, seconds for repeats.
No Technical Skills Required: Everything is point and click.


💼 Business Use Cases

Clinical Operations

  • Monitor recruiting competition in target indications
  • Validate site availability and contact info before outreach
  • Track protocol/SAP/ICF availability for submission packs

Market Intelligence & BD

  • Size therapeutic landscapes by phase, sponsor class, and geography
  • Identify partners/co-developers by filtering collaborators or funder class
  • Flag terminated or withdrawn programs for acquisition scouting

Medical Affairs / HEOR

  • Build custom registries of outcome measures for meta-analyses
  • Follow expanded access activity for patient advocacy
  • Keep real-time lists of results posted vs. overdue

Compliance / Regulatory

  • Audit FDAAA violation flags and results submission states
  • Archive document download links for internal records
  • Automate evidence packages for periodic reporting

❓ Frequently Asked Questions

Q: Do I need a browser or proxy?
A: No. We call the official Studies API, so there is no headless browser, no proxy juggling, and no HTML parsing.

Q: How current is the data?
A: Exactly as current as ClinicalTrials.gov. Each run reads directly from their backend API.

Q: What if I exceed maxItems?
A: Free users can request up to 50 items. Paid users can set maxItems anywhere up to 1,000,000. The actor paginates until the limit or the dataset is exhausted.

Q: Can I reuse a ClinicalTrials.gov URL?
A: Yes. Paste the URL you see in your browser after applying filters. We convert it into API parameters automatically. When using a startUrl, all other filters are ignored.

Q: What happens if the API returns an error?
A: The actor retries three times with exponential backoff. If it still fails, we log the friendly error, push it to the dataset, and exit gracefully.

Q: Are empty fields included in the output?
A: Empty strings and arrays are automatically converted to null for cleaner datasets and easier analysis.


🔌 Integrate the ClinicalTrials.gov Scraper with any app

This actor plugs into the rest of your stack through Apify integrations:


Looking for more data collection tools? Check out these related actors:

ActorDescriptionLink
FINRA BrokerCheck ScraperCollect broker/advisor registrations and disclosures from FINRAhttps://apify.com/parseforge/finra-brokercheck-scraper
FAA Aircraft Registry ScraperMonitor US aircraft ownership and certification datahttps://apify.com/parseforge/faa-aircraft-registry-scraper
HubSpot Marketplace ScraperTrack go-to-market tools listed on HubSpot's app storehttps://apify.com/parseforge/hubspot-marketplace-scraper
Lead Formatter ToolClean and enrich contact records with AIhttps://apify.com/parseforge/lead-formatter
GSA eLibrary ScraperExtract government services and solutions data from GSA eLibraryhttps://apify.com/parseforge/gsa-elibrary-scraper

Pro Tip: 💡 Browse our complete collection of data collection actors to find the perfect tool for your business needs.


💬 Need Help? Our team is ready to help you connect this actor to your workflow.


⚠️ Disclaimer: This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by ClinicalTrials.gov or the U.S. National Library of Medicine. All trademarks are the property of their respective owners.