EMA Medicines Scraper 💊

Pricing

Pay per usage

Scrape European Medicines Agency data for drug approvals, clinical trials & pharmaceutical information. Extract EMA medicines, regulatory documents & authorization data at scale. Perfect for pharma research & compliance.

Developer

Shahid Irfan

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 4
  • Monthly active users: 3
  • Last modified: 4 days ago

European Medicines Agency Medicines Scraper

Extract comprehensive data from the European Medicines Agency (EMA) with ease. Collect critical information about human and veterinary medicines, including authorization status, therapeutic areas, marketing dates, and official URLs. Perfect for regulatory research, compliance monitoring, and healthcare market intelligence.


Features

  • Comprehensive coverage: Extract records for both human and veterinary medicines published by the EMA.
  • Targeted extraction: Filter medicines by name, INN/common name, status, or specific therapeutic keywords.
  • Flexible input options: Start from a search URL, a specific medicine detail page, or a JSON report URL.
  • Run size control: Define maximum results and page limits to manage resources and speed.

Use Cases

Regulatory Compliance

Track authorization statuses, updates, and regulatory timelines directly from the official source. Keep your compliance database up to date with current EMA listings.

Market Intelligence

Analyze authorized and suspended medicines to understand market trends and competitive landscapes. Identify potential therapeutic gaps and portfolio opportunities for your organization.

Clinical Research

Gather structured data on common names, therapeutic areas (MeSH), and product details for academic or clinical research. Simplify data collation for scientific reports and meta-analyses.

Healthcare Analytics

Feed downstream business intelligence dashboards and databases with clean, structured CSV or JSON records. Enable automated alerts for medicine status changes or new approvals.


Input Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| url | String | No | - | Optional EMA search URL, medicine detail URL, or JSON report URL to start scraping. |
| keyword | String | No | - | Optional keyword filter to target specific medicines or therapeutic terms. |
| results_wanted | Integer | No | 20 | Maximum number of medicine records to extract. |
| max_pages | Integer | No | 5 | Cap on the number of search pages to parse (20 records per page). |
| proxyConfiguration | Object | No | - | Proxy settings for running in restricted network environments. |
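
Because results_wanted and max_pages both limit a run, it can help to check which one will bind before starting. The sketch below is a minimal illustration, not the actor's own logic: build_run_input and effective_limit are hypothetical helpers, and the assumption that a run returns at most min(results_wanted, max_pages × 20) records is a reading of the table above that may not hold for detail-page or JSON report URLs.

```python
from typing import Any, Dict, Optional

RECORDS_PER_PAGE = 20  # taken from the max_pages description in the table


def build_run_input(
    keyword: Optional[str] = None,
    url: Optional[str] = None,
    results_wanted: int = 20,
    max_pages: int = 5,
) -> Dict[str, Any]:
    """Assemble an input object using the documented parameter names."""
    if results_wanted < 1 or max_pages < 1:
        raise ValueError("results_wanted and max_pages must be positive")
    run_input: Dict[str, Any] = {
        "results_wanted": results_wanted,
        "max_pages": max_pages,
    }
    # Optional parameters are only included when a value was given.
    if keyword:
        run_input["keyword"] = keyword
    if url:
        run_input["url"] = url
    return run_input


def effective_limit(run_input: Dict[str, Any]) -> int:
    """Assumed upper bound on records: the tighter of the two caps."""
    page_cap = run_input["max_pages"] * RECORDS_PER_PAGE
    return min(run_input["results_wanted"], page_cap)
```

Under that assumption, asking for 500 results with max_pages left at 5 would stop at 100 records, so raise both values together for larger runs.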

Output Data

Each item in the dataset contains:

| Field | Type | Description |
| --- | --- | --- |
| name_of_medicine | String | Official name of the medicine. |
| category | String | Medicine category (e.g., Human or Veterinary). |
| medicine_status | String | Current authorization status of the medicine. |
| international_non_proprietary_name_common_name | String | International Nonproprietary Name (INN) or common name. |
| therapeutic_area_mesh | String | Therapeutic area classified by MeSH terms. |
| marketing_authorisation_date | String | Date the marketing authorization was granted. |
| last_updated_date | String | Date the medicine record was last updated by the EMA. |
| ema_product_number | String | Unique product identification number assigned by the EMA. |
| medicine_url | String | Link to the official EMA medicine detail page. |

Usage Examples

Basic Run

Run with the default settings (no keyword filter) to fetch up to 20 records:

{
  "results_wanted": 20
}

Target a specific therapeutic area or medical condition:

{
  "keyword": "oncology",
  "results_wanted": 50,
  "max_pages": 5
}

Custom Search URL

Extract records matching a pre-filtered search query on the EMA website:

{
  "url": "https://www.ema.europa.eu/en/search?search_api_fulltext=vaccine",
  "results_wanted": 30
}
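
When search URLs are generated in code rather than copied from the browser, the search term needs to be URL-encoded. The sketch below assumes the search_api_fulltext query parameter seen in the example URL above is the only one required; ema_search_url is a hypothetical helper, and any additional filters on the EMA site are outside its scope.

```python
from urllib.parse import urlencode

# Base path taken from the example URL above.
EMA_SEARCH_BASE = "https://www.ema.europa.eu/en/search"


def ema_search_url(term: str) -> str:
    """Build a full-text search URL, encoding spaces and special characters."""
    query = urlencode({"search_api_fulltext": term})
    return f"{EMA_SEARCH_BASE}?{query}"


# Hypothetical usage: pass the result as the "url" input parameter.
run_input = {"url": ema_search_url("vaccine"), "results_wanted": 30}
```

For multi-word terms, urlencode replaces spaces with "+", which is the standard form encoding for query strings.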

Sample Output

{
  "category": "Human",
  "name_of_medicine": "Nuwiq",
  "ema_product_number": "EMEA/H/C/002813",
  "medicine_status": "Authorised",
  "international_non_proprietary_name_common_name": "simoctocog alfa",
  "therapeutic_area_mesh": "Hemophilia A",
  "marketing_authorisation_date": "22/07/2014",
  "last_updated_date": "21/05/2026",
  "medicine_url": "https://www.ema.europa.eu/en/medicines/human/EPAR/nuwiq"
}
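
All fields arrive as strings, so dates usually need parsing before sorting or filtering. The sketch below assumes every date uses the DD/MM/YYYY layout shown in the sample; that format is inferred from one record and is not a documented guarantee. Medicine and to_medicine are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from datetime import date, datetime
from typing import Any, Dict, Optional


def parse_ema_date(value: Optional[str]) -> Optional[date]:
    """Parse a DD/MM/YYYY string; return None for blank or missing values."""
    text = (value or "").strip()
    if not text:
        return None
    return datetime.strptime(text, "%d/%m/%Y").date()


@dataclass(frozen=True)
class Medicine:  # hypothetical container, not part of the actor
    name: str
    product_number: str
    status: str
    authorised_on: Optional[date]
    last_updated: Optional[date]


def to_medicine(item: Dict[str, Any]) -> Medicine:
    """Map one dataset item onto typed fields, tolerating missing keys."""
    return Medicine(
        name=item.get("name_of_medicine", ""),
        product_number=item.get("ema_product_number", ""),
        status=item.get("medicine_status", ""),
        authorised_on=parse_ema_date(item.get("marketing_authorisation_date")),
        last_updated=parse_ema_date(item.get("last_updated_date")),
    )
```

A value in any other format raises ValueError from strptime, which makes unexpected formats visible instead of silently dropping them.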

Tips for Best Results

Start with Low Volume

Test your configuration with a small run of 10-20 results first. Once you verify the output meets your requirements, scale up the limits.

Use Specific Keywords

Reduce extraction time and data volume by specifying precise medicine names or therapeutic terms in the keyword input.

Use Search URLs for Advanced Filtering

Perform your complex filtering directly on the official EMA website and copy the resulting search URL as input to target exactly what you need.


Proxy Configuration

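For reliable results, residential proxies are recommended. The example below enables Apify Proxy with its default settings; to select residential proxies, add "apifyProxyGroups": ["RESIDENTIAL"] to the proxyConfiguration object.

{
  "proxyConfiguration": {
    "useApifyProxy": true
  }
}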

Integrations

Connect your data with:

  • Google Sheets: Export medicine data directly to active sheets
  • Airtable: Build searchable relational drug databases
  • Zapier: Automate alerts when medicine status changes
  • Make: Sync results into custom reporting workflows
  • Webhooks: Send real-time updates to your API endpoints

Export Formats

  • JSON: For developers and system integrations
  • CSV: For spreadsheet calculations and analytics
  • Excel: For sharing reports with stakeholders
  • XML: For legacy data management systems
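
If you have already downloaded the JSON export and only need a few columns as CSV, a local conversion is straightforward. This is a minimal sketch using Python's standard csv module; records_to_csv is a hypothetical helper and is independent of the platform's built-in CSV export.

```python
import csv
import io
from typing import Any, Dict, Iterable, List


def records_to_csv(records: Iterable[Dict[str, Any]], fields: List[str]) -> str:
    """Write the chosen fields as CSV text; absent fields become empty cells."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=fields,
        extrasaction="ignore",  # skip keys that were not requested
        restval="",             # fill fields missing from a record
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()
```

Passing the field names from the Output Data table as fields keeps the column order stable between runs.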

Frequently Asked Questions

How many medicine records can I extract?

You can extract all matching records available on the portal. Set results_wanted and max_pages to a high number to run comprehensive downloads.

Can I extract details for a single medicine?

Yes. Enter the official EMA page URL of that medicine in the url parameter.

Why are some fields empty in the output?

Some fields might be blank if the information is not provided or not applicable to that specific medicine category.

Are veterinary medicines included?

Yes. The scraper collects both human and veterinary records depending on your keywords or filter URL.

Can I automate updates?

Yes. You can schedule regular runs of this actor on the Apify platform and export the new datasets automatically.
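
With scheduled runs, status changes can be detected by comparing the latest dataset against the previous one. The sketch below assumes ema_product_number is a stable key across runs, which the field description suggests but which is not verified here; index_status and status_changes are hypothetical helpers, and the "Suspended" value in the usage lines is illustrative.

```python
from typing import Any, Dict, Iterable, List, Tuple

Record = Dict[str, Any]


def index_status(records: Iterable[Record]) -> Dict[str, str]:
    """Map product number to status, skipping records without a number."""
    return {
        r["ema_product_number"]: r.get("medicine_status", "")
        for r in records
        if r.get("ema_product_number")
    }


def status_changes(
    previous: Iterable[Record], current: Iterable[Record]
) -> List[Tuple[str, str, str]]:
    """Return (product_number, old_status, new_status) for new or changed records."""
    before = index_status(previous)
    after = index_status(current)
    changes: List[Tuple[str, str, str]] = []
    for number, new_status in sorted(after.items()):
        old_status = before.get(number)
        if old_status is None:
            changes.append((number, "", new_status))  # first time seen
        elif old_status != new_status:
            changes.append((number, old_status, new_status))
    return changes


# Illustrative data only: two snapshots of the same product.
old_run = [{"ema_product_number": "EMEA/H/C/002813", "medicine_status": "Authorised"}]
new_run = [{"ema_product_number": "EMEA/H/C/002813", "medicine_status": "Suspended"}]
alerts = status_changes(old_run, new_run)
```

Records that disappear between runs are not reported by this sketch; a capped run can omit records for reasons unrelated to their status.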


Support

For issues or feature requests, contact support through the Apify Console.

This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service and applicable laws. Use data responsibly and respect rate limits.