Chembl Bioactivity Scraper avatar

Chembl Bioactivity Scraper

Pricing

from $10.00 / 1,000 results

Go to Apify Store
Chembl Bioactivity Scraper

Chembl Bioactivity Scraper

Scrapes molecule and bioactivity data from the ChEMBL database API.

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

Donny

Donny

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

2 days ago

Last modified

Categories

Share

ChEMBL Bioactivity Database Scraper

What it does

This actor scrapes molecule and bioactivity data from the ChEMBL database maintained by the European Bioinformatics Institute (EMBL-EBI). It searches for chemical compounds by name or keyword and retrieves detailed molecular information including ChEMBL identifiers, preferred names, molecule types, clinical trial phases, molecular properties, SMILES strings, and InChI keys. The actor handles offset-based pagination and provides clean, structured output.

Why use this actor

This actor provides a simple, reliable, and scalable way to extract data from public APIs without needing to write any code or manage infrastructure. It handles pagination, rate limiting, error recovery, and data normalization automatically. Whether you are a researcher, analyst, or developer, this actor saves you hours of manual data collection work. The structured JSON output integrates seamlessly with spreadsheets, databases, dashboards, and downstream data pipelines. Run it on Apify platform for automatic scheduling, monitoring, and proxy management.

Input parameters

  • searchQuery (string, required): Molecule name or keyword to search ChEMBL. Default: "aspirin".
  • maxResults (integer, optional): Maximum number of molecule records to return. Range: 1-1000. Default: 100.

Output data

The actor outputs a dataset with the following fields:

  • moleculeChemblId: ChEMBL molecule identifier
  • prefName: Preferred name of the molecule
  • moleculeType: Type of molecule (small molecule, protein, etc.)
  • maxPhase: Maximum clinical trial phase reached
  • oralFlag: Whether the molecule is orally available
  • molecularWeight: Full molecular weight
  • smiles: Canonical SMILES representation
  • inchiKey: Standard InChI key

Each record is validated and null-checked before being pushed to the dataset. Missing or unavailable fields are set to null rather than being omitted, ensuring consistent schema across all records.

Example output

[
{
"moleculeChemblId": "CHEMBL25",
"prefName": "ASPIRIN",
"moleculeType": "Small molecule",
"maxPhase": 4,
"oralFlag": true,
"molecularWeight": "180.16",
"smiles": "CC(=O)Oc1ccccc1C(=O)O",
"inchiKey": "BSYNRYMUTXBXSQ-UHFFFAOYSA-N"
}
]

Pricing

This actor is priced based on usage:

  • $0.01 per result returned in the dataset
  • $0.005 per actor start (flat fee per run)

These costs cover Apify platform compute and proxy resources. For large-scale scraping jobs, consider using the maxResults parameter to control costs and stay within your budget. Typical runs of 100 results cost approximately $1.01.

Explore these related actors from brave_paradise:

More scrapers from brave_paradise

Check out other actors published by brave_paradise on the Apify Store for more data extraction tools covering scientific databases, developer communities, news aggregators, and government open-data APIs. All actors follow the same high-quality patterns with robust error handling, automatic pagination, and clean structured output.