Clinical Trial Investigator and Site Intelligence avatar

Clinical Trial Investigator and Site Intelligence

Pricing

Pay per usage

Go to Apify Store
Clinical Trial Investigator and Site Intelligence

Clinical Trial Investigator and Site Intelligence

Find enriched clinical trial investigators and deterministic site-fit scores from ClinicalTrials.gov, NPI, OpenPayments, and PubMed data.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

George Kioko

George Kioko

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

1

Monthly active users

12 days ago

Last modified

Share

CROs pay six figures for investigator + site fit feeds. The raw data is public. The work is the join.

This actor turns ClinicalTrials.gov study records into enriched investigator profiles and scored trial site rosters. It joins CT.gov study, location, sponsor, phase, and condition data with NPI registry matches, OpenPayments payment summaries, and PubMed publication counts. The output is built for CRO feasibility teams, sponsor diligence, patient recruitment planning, and business development teams that need a clean feed instead of raw trial JSON.

Quick start

Find enriched investigators for a condition:

$curl "https://<standby-url>/investigators?condition=glioblastoma&phase=phase2&limit=3"

Score United States sites for a condition:

$curl "https://<standby-url>/sites?condition=breast+cancer&country=United+States&limit=3"

Batch mode also works from an Apify run input:

{
"mode": "investigators",
"condition": "glioblastoma",
"phase": "phase2",
"limit": 25
}

Standby endpoints

EndpointWhat it returns
GET / and GET /healthService info and endpoint list
GET /investigators?condition=&phase=&status=&limit=Enriched investigator profiles across matching studies
GET /investigator?npi=One NPI based investigator profile
GET /investigator?name=One name based investigator profile with trial history
GET /sites?condition=&country=&state=&limit=Scored facility roster
GET /study?nct=One expanded study with investigators and scored sites
POST /investigators/bulkUp to 100 NPI based profiles

Health probes using values such as test, ping, example.com, or URLs on known test hostnames return a mocked clinical trial shaped response and do not charge.

Investigator schema

FieldMeaning
investigator_idNPI when found, otherwise hash of name and affiliation
name, first_name, last_name, credentialsPublic investigator identity from CT.gov and NPI
npi, primary_taxonomyNPI registry match and primary specialty
affiliationsTrial facilities and sponsors seen in CT.gov
city, state, countryBest available NPI or CT.gov location
active_trial_count, completed_trial_count, total_trial_countTrial experience counters
phase_breakdownCounts for phase 1 through phase 4
therapeutic_areasTop condition terms from matched trials
open_payments_total_usdLatest OpenPayments general payment total when available
open_payments_top_companiesTop manufacturers or GPOs by payment amount
publications_pubmed_countPubMed author search count
first_trial_date, last_trial_dateEarliest and latest observed trial dates
trial_historyNCT level history used to build the profile
fetched_atISO timestamp

Site schema

FieldMeaning
facility_idHash of facility, city, state, and country
facility_name, city, state, countryCT.gov site location
trial_count_3yRecent trial proxy from CT.gov dates
active_trial_countActive, recruiting, or enrolling studies
condition_match_countStudies matching the requested condition
phase_3_4_shareShare of trials in phase 3 or phase 4
investigators_count_uniqueUnique public investigator names linked to the site
principal_investigatorsTop three names by trial count
site_fit_scoreDeterministic score from 0 to 100
score_bandlow, medium, high, or elite
score_rationaleShort explanation of the score
fetched_atISO timestamp

Data flow

flowchart LR
A[Input condition, phase, status, NPI, or NCT] --> B[ClinicalTrials.gov search]
B --> C[Extract investigators and facilities]
C --> D[NPI registry match]
C --> E[OpenPayments summary]
C --> F[PubMed paper count]
D --> G[Normalize investigator profiles]
E --> G
F --> G
C --> H[Score trial sites]
G --> I[Dataset and API response]
H --> I

Scoring

Site scoring is deterministic. Every site starts at 30 points. It receives 20 points for at least three active trials, 15 points for at least two condition matched trials, 15 points when phase 3 or phase 4 share is at least 0.4, 10 points for at least three unique investigators, and 10 points for at least 10 recent trials. Scores are capped at 100. Bands are low from 0 to 30, medium from 31 to 55, high from 56 to 80, and elite at 81 or above.

Pricing

EventPriceCharged when
Actor start$1.00Once per paid Standby request or batch run
Investigator profile$0.10Per enriched investigator profile returned
Site fit score$0.50Per scored site row returned

Charges fire only after data work succeeds and rows are pushed to the dataset.

Comparison

OptionBest forTradeoff
ClinicalTrials.gov directRaw study and location dataNo NPI join, no OpenPayments summary, no scoring
Veeva or MedidataEnterprise feasibility programsSaaS contracts, sales process, and less flexible API use
This APIFast investigator and site feedsPublic data only, deterministic scoring, no private contact scraping

Use cases

  1. CRO RFP response: build a quick evidence base for proposed investigators and sites.
  2. Patient recruitment site targeting: rank facilities before outreach spend.
  3. BD outreach to investigators: identify public trial experience and publication depth.
  4. Sponsor diligence: check whether a target investigator has relevant trial history.
  5. KOL mapping: combine trial count, therapeutic areas, and PubMed footprint.

FAQ

How complete is NPI coverage? NPI coverage is strongest for United States physicians. Non US investigators usually return npi: null.

Why can OpenPayments be null? CMS payment data has publication lag and applies to covered US recipients. No match is returned as null.

Are there rate limits? CT.gov is public fair use. NPI and PubMed are low rate public APIs, so the actor uses bounded requests and accepts partial enrichment.

Can I get a refund for mock probes? Health check payloads return mocked data and are not charged.

Does this include private emails? No. It uses public CT.gov, NPI, OpenPayments, and PubMed data only.

Who do I contact for custom fields? Use the Apify actor issue tab or contact the actor owner through the Apify Store profile.