DailyMed Drug Label Scraper - NIH Drug Data avatar

DailyMed Drug Label Scraper - NIH Drug Data

Pricing

from $10.00 / 1,000 results

Go to Apify Store
DailyMed Drug Label Scraper - NIH Drug Data

DailyMed Drug Label Scraper - NIH Drug Data

Scrape DailyMed drug labels from the NIH. Search by drug name. Extract ingredients, dosage forms, NDC codes, labelers, and full pharmaceutical data.

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

lulz bot

lulz bot

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Categories

Share

DailyMed Drug Label Scraper

Scrape drug label data from DailyMed, the NIH's official source for FDA-approved drug labeling. Search by drug name or fetch specific labels by set ID.

Features

  • Search mode: Search drug labels by name using the DailyMed API
  • Label mode: Fetch specific drug labels by set ID
  • Rich data extraction: Active ingredients, dosage form, route, NDC codes
  • Paginated search: Automatically follows API pagination for large result sets
  • NIH API + HTML parsing: Combines JSON API search with HTML detail page parsing

Input

FieldTypeDefaultDescription
modestring"search""search" to find drugs by name, "label" for specific set IDs.
searchQueriesstring[][]Drug names to search for (e.g. ["ibuprofen", "aspirin"]).
setIdsstring[][]DailyMed SPL set IDs for direct label fetch.
limitinteger50Maximum number of labels to return per search query.
proxyConfigurationobject-Optional proxy settings.

Output

Each result includes:

FieldDescription
setIdDailyMed SPL set identifier (UUID)
splVersionSPL version number
titleFull drug label title
genericNameGeneric drug name
brandNameBrand/trade name
labelerManufacturer/labeler name
activeIngredientsArray of active ingredient names
dosageFormDosage form (tablet, capsule, solution, etc.)
routeRoute of administration (oral, topical, etc.)
marketingCategoryProduct type / marketing category
ndcCodesArray of National Drug Code numbers
publishDateDate the label was published
effectiveDateMarketing start date
sourceUrlFull URL to the DailyMed drug info page
scrapedAtISO timestamp of when the data was scraped

Usage Examples

Search by drug name

{
"mode": "search",
"searchQueries": ["ibuprofen"],
"limit": 20
}
{
"mode": "search",
"searchQueries": ["aspirin", "acetaminophen", "metformin"],
"limit": 10
}

Fetch specific labels

{
"mode": "label",
"setIds": [
"3a38add6-a114-4e18-836c-3bfc727db231",
"07edcb31-d31d-4607-98c3-dcb1e062cc70"
]
}

How It Works

  1. Search mode: Queries the DailyMed REST API (/services/v2/spls.json) for SPLs matching the drug name. Automatically paginates through results up to the limit. Then visits each drug's HTML detail page to extract rich structured data.

  2. Label mode: Directly fetches the drug info HTML page for each set ID and parses the product information tables.

  3. Data extraction: Parses DailyMed's structured HTML tables including Product Information, Active Ingredients, Product Characteristics, and Marketing Information sections.

Notes

  • DailyMed is maintained by the NIH National Library of Medicine
  • Data comes from FDA-approved drug labeling (package inserts)
  • Set IDs are UUIDs that uniquely identify each drug label
  • NDC (National Drug Code) is the universal product identifier for drugs in the US
  • The API is free and requires no authentication
  • Rate limited to 20 requests/minute to be respectful to NIH servers