DailyMed Drug Label Scraper - NIH Drug Data
Pricing
from $10.00 / 1,000 results
DailyMed Drug Label Scraper - NIH Drug Data
Scrape DailyMed drug labels from the NIH. Search by drug name. Extract ingredients, dosage forms, NDC codes, labelers, and full pharmaceutical data.
Pricing
from $10.00 / 1,000 results
Rating
0.0
(0)
Developer
lulz bot
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
a day ago
Last modified
Categories
Share
DailyMed Drug Label Scraper
Scrape drug label data from DailyMed, the NIH's official source for FDA-approved drug labeling. Search by drug name or fetch specific labels by set ID.
Features
- Search mode: Search drug labels by name using the DailyMed API
- Label mode: Fetch specific drug labels by set ID
- Rich data extraction: Active ingredients, dosage form, route, NDC codes
- Paginated search: Automatically follows API pagination for large result sets
- NIH API + HTML parsing: Combines JSON API search with HTML detail page parsing
Input
| Field | Type | Default | Description |
|---|---|---|---|
mode | string | "search" | "search" to find drugs by name, "label" for specific set IDs. |
searchQueries | string[] | [] | Drug names to search for (e.g. ["ibuprofen", "aspirin"]). |
setIds | string[] | [] | DailyMed SPL set IDs for direct label fetch. |
limit | integer | 50 | Maximum number of labels to return per search query. |
proxyConfiguration | object | - | Optional proxy settings. |
Output
Each result includes:
| Field | Description |
|---|---|
setId | DailyMed SPL set identifier (UUID) |
splVersion | SPL version number |
title | Full drug label title |
genericName | Generic drug name |
brandName | Brand/trade name |
labeler | Manufacturer/labeler name |
activeIngredients | Array of active ingredient names |
dosageForm | Dosage form (tablet, capsule, solution, etc.) |
route | Route of administration (oral, topical, etc.) |
marketingCategory | Product type / marketing category |
ndcCodes | Array of National Drug Code numbers |
publishDate | Date the label was published |
effectiveDate | Marketing start date |
sourceUrl | Full URL to the DailyMed drug info page |
scrapedAt | ISO timestamp of when the data was scraped |
Usage Examples
Search by drug name
{"mode": "search","searchQueries": ["ibuprofen"],"limit": 20}
Multiple drug search
{"mode": "search","searchQueries": ["aspirin", "acetaminophen", "metformin"],"limit": 10}
Fetch specific labels
{"mode": "label","setIds": ["3a38add6-a114-4e18-836c-3bfc727db231","07edcb31-d31d-4607-98c3-dcb1e062cc70"]}
How It Works
-
Search mode: Queries the DailyMed REST API (
/services/v2/spls.json) for SPLs matching the drug name. Automatically paginates through results up to the limit. Then visits each drug's HTML detail page to extract rich structured data. -
Label mode: Directly fetches the drug info HTML page for each set ID and parses the product information tables.
-
Data extraction: Parses DailyMed's structured HTML tables including Product Information, Active Ingredients, Product Characteristics, and Marketing Information sections.
Notes
- DailyMed is maintained by the NIH National Library of Medicine
- Data comes from FDA-approved drug labeling (package inserts)
- Set IDs are UUIDs that uniquely identify each drug label
- NDC (National Drug Code) is the universal product identifier for drugs in the US
- The API is free and requires no authentication
- Rate limited to 20 requests/minute to be respectful to NIH servers