Open Targets Platform Scraper
Pricing
from $3.75 / 1,000 result items
Export drug, target, and disease intelligence from the Open Targets Platform. Pull 63,000+ targets, 20,000+ diseases, and 14,000+ drugs with association scores, tractability, mechanisms of action, clinical phase, and cross-references for drug discovery research.
Developer
ParseForge

🧬 Open Targets Platform Scraper
🚀 Export drug, target, and disease intelligence in seconds. Pull 63,000+ targets, 20,000+ diseases, and 14,000+ drugs from the Open Targets Platform with evidence scores, tractability, mechanisms of action, and clinical phase. No registration, no manual extraction.
🕒 Last updated: 2026-05-20 · 📊 Up to 18 fields per record · 🧬 63k+ targets · 🦠 20k+ diseases · 💊 14k+ drugs
The Open Targets Platform Scraper exports drug discovery intelligence from the Open Targets Platform and returns up to 18 fields per record, including identifiers, synonyms, descriptions, tractability assessments, target-disease associations, mechanisms of action, and clinical-trial indications. The underlying dataset is the reference open knowledge graph for therapeutic target identification, jointly built by EMBL-EBI, GSK, Sanofi, Bristol Myers Squibb, and other contributors.
Coverage spans 63,000+ targets (Ensembl IDs), 20,000+ diseases (EFO/MONDO), and 14,000+ drugs (ChEMBL) with association scoring, druggability tiers, ubiquitination signals, pathway memberships, and disease-trait phenotype maps.
| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| Pharma R&D teams, bioinformatics groups, biotech analysts, academic researchers, drug-repurposing startups, clinical-trial intelligence | Target prioritization, disease-target evidence reports, drug-repurposing scans, tractability snapshots, mechanism-of-action lookups, indication exploration |
📋 What the Open Targets Platform Scraper does
Three entity-specific workflows in a single run:
- 🧬 Targets. Pull gene-level records with approved symbol, biotype, synonyms, protein IDs, tractability across small-molecule / antibody / PROTAC / oligo modalities, pathway memberships, and the top associated diseases with evidence scores.
- 🦠 Diseases. Pull EFO/MONDO records with descriptions, synonyms, therapeutic areas, cross-references, ancestors/descendants, and the top associated targets.
- 💊 Drugs. Pull ChEMBL drug records with type, maximum clinical phase, approval status, withdrawals, trade names, mechanisms of action, and indication lists.
Each record includes a direct link back to the entity's page on platform.opentargets.org for spot-checking.
💡 Why it matters: drug discovery teams spend significant time stitching together target-disease evidence from disparate genetic, expression, and clinical sources. Open Targets is the canonical pre-stitched graph. This Actor flattens the relevant subset into a CSV/JSON snapshot you can drop into a notebook, dashboard, or LLM workflow.
🎬 Full Demo
🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.
⚙️ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
| maxItems | integer | 10 | Records to return. Free plan caps at 10, paid plan at 1,000,000. |
| entityType | string | "target" | One of target, disease, or drug. |
| searchQuery | string | "" | Free-text search. Examples: BRAF, melanoma, pembrolizumab. |
| targetIds | string[] | [] | Ensembl gene IDs (used when entityType is target). |
| diseaseIds | string[] | [] | EFO or MONDO IDs (used when entityType is disease). |
| drugIds | string[] | [] | ChEMBL IDs (used when entityType is drug). |
Example: top 25 search hits for "BRAF" as targets.
{"maxItems": 25, "entityType": "target", "searchQuery": "BRAF"}
Example: enrich a list of known drugs.
{"maxItems": 5, "entityType": "drug", "drugIds": ["CHEMBL1201585", "CHEMBL1201218", "CHEMBL2108524"]}
⚠️ Good to Know: when a list of IDs is provided, the scraper skips the search step and pulls those entities directly. When the list is empty, the search query is used. Both modes coexist so you can build a targeted ID-batch run or an exploratory search run from the same input form.
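The ID-list-versus-search rule can be sketched in Python. This is a minimal illustration under stated assumptions, not the Actor's own code: `select_mode` is a hypothetical helper, only the input field names come from the table above, and the handling of an input with neither IDs nor a query is a guess.

```python
from __future__ import annotations

# Hypothetical helper mirroring the documented rule:
# a non-empty ID list takes priority over searchQuery.
ID_FIELD_BY_ENTITY = {
    "target": "targetIds",
    "disease": "diseaseIds",
    "drug": "drugIds",
}


def select_mode(run_input: dict) -> tuple[str, list[str] | str]:
    """Return ("ids", [...]) when the matching ID list is non-empty, else ("search", query)."""
    entity_type = run_input.get("entityType", "target")
    if entity_type not in ID_FIELD_BY_ENTITY:
        raise ValueError(f"entityType must be one of {sorted(ID_FIELD_BY_ENTITY)}")
    ids = run_input.get(ID_FIELD_BY_ENTITY[entity_type]) or []
    if ids:
        return "ids", list(ids)
    # Assumption: with no IDs, the free-text query (possibly empty) drives the run.
    return "search", run_input.get("searchQuery", "")
```

Only the ID field that matches entityType is read here; what the Actor does with an ID list for a different entity type is not documented.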
📊 Output
Each record contains up to 18 fields depending on entity type. Download the dataset as CSV, Excel, JSON, or XML.
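For scripted downloads, the same formats are available through the Apify dataset items endpoint. The sketch below only builds the URL; `dataset_export_url` is a hypothetical helper, and the dataset ID is whatever your run reports as its default dataset. Non-public datasets also need an API token, which is left out here.

```python
from urllib.parse import urlencode

# Formats named in this README mapped to the Apify dataset API `format` values.
EXPORT_FORMATS = {"CSV": "csv", "Excel": "xlsx", "JSON": "json", "XML": "xml"}


def dataset_export_url(dataset_id: str, fmt: str = "CSV") -> str:
    """Build a download URL for a run's dataset in the requested format."""
    # clean=true asks the API to omit empty items and hidden fields.
    query = urlencode({"format": EXPORT_FORMATS[fmt], "clean": "true"})
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"
```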
🧾 Schema (target)
| Field | Type | Example or description |
|---|---|---|
| 🧬 entityType | string | "target" |
| 🆔 id | string | "ENSG00000157764" |
| 🔤 approvedSymbol | string | "BRAF" |
| 📛 approvedName | string | "B-Raf proto-oncogene, serine/threonine kinase" |
| 🧪 biotype | string | "protein_coding" |
| 📜 description | string \| null | Protein function summary |
| 🔁 synonyms | string[] \| null | Symbol synonyms |
| 🧫 proteinIds | object[] \| null | UniProt and Ensembl protein IDs |
| 🔗 crossReferences | object[] \| null | DB x-refs |
| 🎯 tractability | object[] \| null | Modality x druggability matrix |
| 📍 subcellularLocations | object[] \| null | Cell compartment evidence |
| 🛣️ pathways | object[] \| null | Reactome pathway memberships |
| 🧬 geneticConstraint | object[] \| null | gnomAD constraint scores |
| 🧪 targetClass | object[] \| null | ChEMBL target-class taxonomy |
| 🦠 associatedDiseases | object \| null | Top disease evidence rows with scores |
| 🔗 sourceUrl | string | Platform page |
| 🕒 scrapedAt | ISO 8601 | Collection timestamp |
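Because many target fields are nullable and nested, a small flattening step helps before loading records into a spreadsheet or dataframe. The sketch below assumes records arrive as Python dicts keyed by the field names in the schema table; `flatten_target` is a hypothetical helper, and serializing nested values as JSON strings is one design choice among several.

```python
import json

# Field names taken from the target schema table; nullable fields may be None.
SCALAR_FIELDS = ["entityType", "id", "approvedSymbol", "approvedName", "biotype",
                 "description", "sourceUrl", "scrapedAt"]
NESTED_FIELDS = ["synonyms", "proteinIds", "crossReferences", "tractability",
                 "subcellularLocations", "pathways", "geneticConstraint",
                 "targetClass", "associatedDiseases"]


def flatten_target(record: dict) -> dict:
    """Turn one target record into a flat row of strings suitable for CSV output."""
    # Missing or null scalars become empty strings so every row has the same columns.
    row = {name: record.get(name) or "" for name in SCALAR_FIELDS}
    for name in NESTED_FIELDS:
        value = record.get(name)
        # Nested lists and objects are kept intact as JSON text rather than dropped.
        row[name] = "" if value is None else json.dumps(value, sort_keys=True)
    return row
```

Every row ends up with the same 17 columns, so the output can be passed to `csv.DictWriter` without per-record special cases.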
📦 Sample records
✨ Why choose this Actor
| | Capability |
|---|---|
| 🧬 | Full biomedical graph. 63k+ targets, 20k+ diseases, 14k+ drugs in a single Actor with consistent fields. |
| 🎯 | Multi-modality tractability. Small molecule, antibody, PROTAC, oligonucleotide, and other modality classes per target. |
| 📊 | Association scores. Top-N disease-target evidence with scores baked into the disease and target records. |
| 💊 | Clinical-phase data. Drug records include maximum clinical phase, approval status, withdrawal flags, and indications. |
| ⚡ | Fast. 25 entities in under 30 seconds, hundreds in a few minutes. |
| 🔁 | Always fresh. Every run pulls the latest Open Targets release through the official graph. |
| 🚫 | No authentication. Open scientific data. No login or token required. |
📊 Open Targets is the consensus reference graph for target identification, used by major pharma R&D teams worldwide.
📈 How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| ⭐ Open Targets Platform Scraper (this Actor) | $5 free credit, then pay-per-use | 63k targets · 20k diseases · 14k drugs | Live per run | search, ID list, entity type | ⚡ 2 min |
| Manual portal navigation | Free | Per-entity | Manual | None | 🐢 Tedious |
| Direct custom queries | Engineering time | Full | Live | Custom | 🛠️ Days |
| Commercial biomedical databases | $$$$/year | Variable | Variable | Many | ⏳ Long contract |
Pick this Actor when you want a clean, filtered slice of Open Targets without writing custom queries, handling pagination, or maintaining schema knowledge.
🚀 How to use
- 📝 Sign up. Create a free account with $5 credit (takes 2 minutes).
- 🌐 Open the Actor. Go to the Open Targets Platform Scraper page on the Apify Store.
- 🎯 Set input. Pick an entity type, then either supply a list of IDs or a free-text search.
- 🚀 Run it. Click Start and let the Actor collect your data.
- 📥 Download. Grab your results in the Dataset tab as CSV, Excel, JSON, or XML.
⏱️ Total time from signup to downloaded dataset: 3-5 minutes. No coding required.
💼 Business use cases
🔌 Automating Open Targets Platform Scraper
Control the scraper programmatically for scheduled runs and pipeline integrations:
- 🟢 Node.js. Install the `apify-client` NPM package.
- 🐍 Python. Use the `apify-client` PyPI package.
- 📚 See the Apify documentation for full details.
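A minimal Python sketch of a programmatic run follows. It assumes the `apify-client` package is installed and that an `APIFY_TOKEN` environment variable holds your API token. `build_run_input` and `run_scraper` are hypothetical helpers, and the `actor_id` default is a placeholder to replace with the ID shown on the Actor's page.

```python
import os


def build_run_input(entity_type: str, query: str = "", ids=None, max_items: int = 10) -> dict:
    """Assemble an input object using the field names from the Input table."""
    id_field = {"target": "targetIds", "disease": "diseaseIds", "drug": "drugIds"}[entity_type]
    run_input = {"maxItems": max_items, "entityType": entity_type, "searchQuery": query}
    run_input[id_field] = list(ids or [])
    return run_input


def run_scraper(run_input: dict, actor_id: str = "<ACTOR_ID>") -> list:
    """Start the Actor, wait for it to finish, and return the dataset items."""
    # Imported here so the input helper works without the package installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

For example, `run_scraper(build_run_input("target", query="BRAF", max_items=25))` would mirror the BRAF search example from the Input section.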
The Apify Schedules feature lets you trigger this Actor on any cron interval. Monthly refreshes keep downstream analytics aligned with new Open Targets releases.
🌟 Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
🤖 Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- 💬 ChatGPT
- 🧠 Claude
- 🔍 Perplexity
- 🅒 Copilot
❓ Frequently Asked Questions
🧩 How does it work?
Pick an entity type and either drop in a list of identifiers or run a free-text search. The Actor queries the Open Targets graph and returns one structured record per entity, including association rows and tractability matrices.
📏 How accurate is the data?
The records mirror the latest Open Targets release. Evidence scores and clinical phases are computed by the Open Targets pipeline from curated sources (ChEMBL, OpenFDA, ClinVar, Reactome, and many more). For clinical decisions, validate against primary sources.
🔁 How often is the dataset refreshed?
Open Targets ships full data releases on a regular cadence. Every run of this Actor pulls live from the graph, so your dataset reflects the latest published release at run time.
🧬 Which identifier systems are supported?
Ensembl gene IDs for targets, EFO and MONDO for diseases, and ChEMBL for drugs. Cross-references in the output map to UniProt, dbSNP, RefSeq, DrugBank, and other systems.
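When a mixed list of identifiers needs to be routed to the right entityType, a simple pattern check is usually enough. The patterns below are assumptions about the common shapes of these IDs (for example ENSG00000157764 and CHEMBL1201585 from this page), not a specification of what the Actor accepts; `classify_id` is a hypothetical helper.

```python
import re

# Assumed ID shapes: Ensembl gene IDs, underscore-style EFO/MONDO IDs, ChEMBL IDs.
ID_PATTERNS = {
    "target": re.compile(r"ENSG\d{11}"),
    "disease": re.compile(r"(EFO|MONDO)_\d+"),
    "drug": re.compile(r"CHEMBL\d+"),
}


def classify_id(identifier: str):
    """Return the entityType an identifier appears to belong to, or None if unrecognized."""
    cleaned = identifier.strip()
    for entity_type, pattern in ID_PATTERNS.items():
        if pattern.fullmatch(cleaned):
            return entity_type
    return None
```

Anything that returns None, such as a gene symbol like BRAF, is better passed as searchQuery than as an ID.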
⏰ Can I schedule regular runs?
Yes. Use Apify Schedules to run this Actor on any cron interval (daily, weekly, monthly) and keep a downstream database in sync with new Open Targets releases.
⚖️ Is this data legal to use?
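Open Targets data is published under CC0 1.0, a public-domain dedication. Attribution is not a legal requirement, although Open Targets asks users to cite the platform. Underlying source data may carry separate licenses; review the Open Targets documentation for your specific use case.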
💼 Can I use this data commercially?
Yes. Open Targets data is free for academic and commercial use. The CC0 license places it in the public domain.
💳 Do I need a paid Apify plan to use this Actor?
No. The free Apify plan is enough for testing and small runs (10 records per run). A paid plan lifts the limit and gives you access to scheduling and higher concurrency.
🔁 What happens if a run fails or gets interrupted?
Apify automatically retries transient errors. Single-entity failures are skipped with a warning and the run continues. Partial datasets are preserved.
🧪 What if I need full evidence trails?
This Actor returns the top-N association rows per target or disease. For deeper evidence and provenance, query individual entity pages on platform.opentargets.org or request a companion scraper through the contact form.
🆘 What if I need help?
Our support team is here to help. Contact us through the Apify platform or use the Tally form linked below.
🔌 Integrate with any app
Open Targets Platform Scraper connects to any cloud service via Apify integrations:
- Make - Automate multi-step workflows
- Zapier - Connect with 5,000+ apps
- Slack - Get run notifications in your channels
- Airbyte - Pipe biomedical data into your warehouse
- GitHub - Trigger runs from commits and releases
- Google Drive - Export datasets straight to Sheets
You can also use webhooks to trigger downstream actions when a run finishes. Push refreshed biomedical evidence into your product backend or notebook environment.
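A webhook receiver typically needs just one value from the notification: the ID of the dataset the finished run produced. The sketch below assumes Apify's default webhook payload, in which `eventType` names the event and `resource` holds the run object. `dataset_id_from_webhook` is a hypothetical helper, and a custom payload template would need different handling.

```python
def dataset_id_from_webhook(payload: dict):
    """Return the finished run's dataset ID, or None for events we do not act on."""
    # Only successful runs are worth forwarding downstream.
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    # Assumption: `resource` is the run object and carries defaultDatasetId.
    return (payload.get("resource") or {}).get("defaultDatasetId")
```

The returned ID can then be used to fetch the refreshed records and load them into a backend or notebook.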
🔗 Recommended Actors
- 🧪 ClinicalTrials.gov Scraper - Worldwide clinical-trial registry
- 📐 arXiv Preprint Scraper - Open-access research papers
- 🔬 Figshare Scraper - Open research artifacts
- 🧬 GBIF Biodiversity Scraper - Global biodiversity occurrence records
💡 Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.
🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.