Pricing

Pay per event

FDA Orange Book Scraper

Search public FDA Orange Book / Drugs@FDA records by brand, generic, ingredient, sponsor, or application number for pharma research.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Stas Persiianenko

Actor stats

Bookmarked

Total users

Monthly active users

3 days ago

Last modified

What does FDA Orange Book Scraper do?

FDA Orange Book Scraper queries the public openFDA Drugs@FDA API and saves normalized application-level records to an Apify dataset.

It turns FDA application JSON into export-ready rows with application numbers, sponsor names, product summaries, active ingredients, dosage forms, routes, strengths, marketing statuses, submissions, and openFDA identifiers.

The actor is API-first, so it does not need a browser, login, cookies, or a private FDA account.

Who is it for?

🧪 Regulatory affairs teams checking approved drug applications.
💊 Generic-drug portfolio analysts comparing brand and ingredient coverage.
⚖️ Pharma IP and market-access teams building patent-cliff research datasets.
📊 Competitive-intelligence teams monitoring sponsors and application families.
🔬 Healthcare data teams joining FDA application records with internal databases.

Why use it?

It provides a simple Apify interface around openFDA Drugs@FDA search.
It supports buyer-friendly inputs instead of requiring users to remember API field names.
It saves one normalized dataset row per application record.
It includes nested products and submissions for downstream auditing.
It can include the raw openFDA record when your compliance workflow needs source evidence.

Data source

The actor uses:

https://api.fda.gov/drug/drugsfda.json

This is a public FDA/openFDA endpoint.

No FDA API token is required for normal use.

What data can you extract?

Field	Description
`applicationNumber`	NDA, ANDA, or BLA application number from Drugs@FDA.
`sponsorName`	Application sponsor / applicant.
`brandNames`	Brand names found in openFDA and product data.
`genericNames`	Generic names from openFDA.
`activeIngredients`	Active ingredient names from product records.
`dosageForms`	Dosage forms across products.
`routes`	Administration routes.
`strengths`	Product strengths.
`marketingStatuses`	Product marketing statuses where provided.
`products`	Nested product summaries.
`submissions`	Nested submission summaries.
`openfda`	Original openFDA identifiers and classification fields.
`patentDataAvailable`	Whether patent records were available from the source.
`exclusivityDataAvailable`	Whether exclusivity records were available from the source.

Search modes

You can search by:

Brand name.
Generic name.
Active ingredient.
Sponsor / applicant.
Exact application number.
Raw openFDA query syntax.

Input example

{
  "queries": [
    "aspirin",
    { "term": "ibuprofen", "field": "ingredient" },
    { "term": "PFIZER", "field": "sponsor" }
  ],
  "applicationNumbers": ["NDA020639"],
  "searchField": "brand",
  "maxItems": 100,
  "includeRawRecord": false
}

Output example

{
  "searchTerm": "aspirin",
  "searchField": "brand",
  "applicationNumber": "NDA020639",
  "sponsorName": "BAYER HEALTHCARE LLC",
  "brandNames": ["ASPIRIN"],
  "activeIngredients": ["ASPIRIN"],
  "dosageForms": ["TABLET"],
  "routes": ["ORAL"],
  "products": [],
  "submissions": [],
  "patentDataAvailable": false,
  "exclusivityDataAvailable": false
}

How much does it cost to scrape FDA Orange Book data?

This actor uses pay-per-event pricing.

A small start fee is charged once per run.
A per-record fee is charged for each FDA application record saved.
Your final cost depends on the number of matching FDA application records and your Apify plan tier.

For most targeted application-number or brand-name lookups, runs are small and inexpensive.

How to run it

Open the actor on Apify.
Add one or more search terms.
Choose the default search field.
Optionally add exact application numbers.
Set maxItems to cap the export size.
Start the run.
Download the dataset as JSON, CSV, Excel, or via API.

Tips for best results

Use exact application numbers when you know them.
Use ingredient for portfolio research by active ingredient.
Use sponsor for applicant-level monitoring.
Use raw only when you already know openFDA query syntax.
Keep maxItems low for quick smoke tests.
Enable includeRawRecord for compliance audits or custom transformations.

Patent and exclusivity fields

The dataset includes patent and exclusivity compatibility fields.

In this version, the reliable public API source is openFDA Drugs@FDA. If patent or exclusivity data is not present in that source, the actor sets:

patentDataAvailable: false
patents: []
exclusivityDataAvailable: false
exclusivities: []

This makes downstream schemas stable while avoiding unreliable scraping of blocked FDA download pages.

Integrations

You can connect the dataset to:

Google Sheets for regulatory watchlists.
Snowflake or BigQuery for pharma analytics.
CRM enrichment pipelines for sponsor intelligence.
Internal dashboards that monitor generic-entry opportunities.
Apify webhooks for scheduled portfolio updates.

API usage with Node.js

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: process.env.APIFY_TOKEN });
const run = await client.actor('automation-lab/fda-orange-book-scraper').call({
  queries: ['aspirin'],
  searchField: 'brand',
  maxItems: 100
});
console.log(run.defaultDatasetId);

API usage with Python

from apify_client import ApifyClient

client = ApifyClient('YOUR_APIFY_TOKEN')
run = client.actor('automation-lab/fda-orange-book-scraper').call(run_input={
    'queries': ['aspirin'],
    'searchField': 'brand',
    'maxItems': 100,
})
print(run['defaultDatasetId'])

API usage with cURL

curl -X POST 'https://api.apify.com/v2/acts/automation-lab~fda-orange-book-scraper/runs?token=YOUR_APIFY_TOKEN' \
  -H 'Content-Type: application/json' \
  -d '{"queries":["aspirin"],"searchField":"brand","maxItems":100}'

MCP integration

Use Apify MCP to call this scraper from Claude Desktop, Claude Code, or other MCP clients.

MCP URL:

https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper

Claude Code setup:

$claude mcp add apify-fda-orange-book "https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper"

Claude Desktop JSON config:

{
  "mcpServers": {
    "apify-fda-orange-book": {
      "url": "https://mcp.apify.com/?tools=automation-lab/fda-orange-book-scraper"
    }
  }
}

Example prompts:

"Export FDA Orange Book records for ibuprofen and summarize the sponsors."
"Find Drugs@FDA applications for sponsor PFIZER and group by active ingredient."
"Run an application-number lookup for NDA020639 and return the product strengths."

Scheduling

For monitoring workflows, schedule the actor daily, weekly, or monthly.

Common schedules include:

Weekly sponsor monitoring.
Monthly ingredient portfolio exports.
Quarterly regulatory database refreshes.

Data quality notes

The actor reports the data returned by openFDA Drugs@FDA.

It does not provide medical advice.

Always verify regulatory decisions against official FDA systems and primary records.

Legality and responsible use

This actor uses public FDA/openFDA data.

You are responsible for how you use exported data, including compliance with your organization’s regulatory, medical, and legal review processes.

FAQ and troubleshooting

Why did my search return no rows?

Try a different search mode. For example, use ingredient for active ingredients and application_number for NDA/ANDA/BLA identifiers.

Why are patent arrays empty?

The MVP uses the reliable openFDA Drugs@FDA API. Patent/exclusivity download pages may be unavailable or blocked from automated environments, so the actor marks those fields unavailable when the source does not provide them.

How do I get the original FDA JSON?

Set includeRawRecord to true.

Other Automation Lab actors that can support healthcare and regulatory workflows:

Changelog

Initial version:

Public openFDA Drugs@FDA search.
Brand, generic, ingredient, sponsor, application-number, and raw query modes.
Application, product, submission, and openFDA identifier fields.

Support

If you need a missing field, include an example application number and describe the workflow you are trying to automate.

Final note

FDA Orange Book Scraper is designed for practical, repeatable exports, not one-off manual lookups.

Use it whenever your team needs FDA drug application data in a dataset, scheduled job, or API pipeline.

FDA Orange Book Scraper

crawlerbros/fda-orange-book-scraper

Scrape FDA's Orange Book - Approved Drug Products with Therapeutic Equivalence Evaluations. Search by drug name, active ingredient, applicant, or application number; browse by approval date; look up patents and marketing exclusivities. Sourced from FDA's official monthly bulk data files.

Crawler Bros

FDA Orange Book Scraper

labrat011/fda-orange-book-scraper

Extract FDA Orange Book data — drug patent expirations, exclusivity periods, generic equivalents, and therapeutic equivalence ratings. No API key required.

mick_

FDA Orange Book — Drug Patent & Exclusivity API

nexgendata/fda-orange-book-drug-patents

Track FDA Orange Book drug patents and exclusivity data. Clean JSON for IP counsel, brand teams and AI agents.

NexGenData

FDA Orange Book Patent Expirations Monitor

oobr/fda-orange-book-patent-expirations

Monthly Orange Book extracts with patent expiration dates, applicant, active ingredient — for generics timing & IP diligence.

OOBR Team

FDA Orange Book Scraper

parseforge/fda-orange-book-scraper

Scrape the FDA Orange Book of approved drug products with therapeutic equivalence evaluations. Get NDA, ingredient, dosage form, strength, route, applicant, marketing status, TE code, RLD/RS flags, exclusivity and patent data. Perfect for pharma research, payers, and generics intelligence.

ParseForge

FDA Orange Book Patent & Exclusivity Tracker — Quarterly Diff

changewire/fda-orange-book-extraction

Purpose-built FDA Orange Book diff stream — NDA/ANDA/BLA patent listings, exclusivity grants, LoE + TE-code shifts as JSONL. FDA-vocabulary-aware vs general-purpose scrapers like Browse AI; per-record metered vs Cortellis $35-50k/yr seat tax for pharma BI + IP + generics. Public runs.

ChangeWire

Drugs@FDA Approvals Scraper

parseforge/openfda-drugsfda-scraper

Export Drugs@FDA approval records from openFDA. Search 29k+ NDA, ANDA, and BLA applications by application number, sponsor, or product. Pull application details, sponsor, submissions history, products, dosage forms, routes, and marketing status.

ParseForge

FDA Drug Approval & Recall Watch — Pharma Monitor

azureblue/fda-drug-approval-watch

Daily delta monitor of FDA drug approvals (Drugs@FDA) and drug recalls (openFDA enforcement) — for Pharma Regulatory, Pharmacovigilance, and US market access teams. Webhook-ready alerts.