USA HealthData.gov HHS Open Data Scraper

Collect health data catalog information from HealthData.gov. Filter by category, tags, view type, authority, and search terms to find exactly what you need. Perfect for researchers, data analysts, and healthcare professionals who need to discover and access public health datasets efficiently.

Pricing: Pay per event
Rating: 5.0 (1)
Developer: ParseForge (Maintained by Community)

πŸ₯ USA HealthData.gov HHS Open Data Scraper

🚀 Access comprehensive health data from the U.S. Department of Health and Human Services! Collect datasets, stories, charts, maps, and more from HealthData.gov - the official open data catalog for health information. Perfect for researchers, data analysts, and healthcare professionals who need to discover and access public health datasets efficiently.

This powerful tool helps you discover and collect health data from HealthData.gov, including datasets, stories, charts, maps, forms, and files. Filter by category, tags, view type, authority, and search terms to find exactly what you need. Save hours of manual searching and get structured data ready for analysis.

Target Audience: Healthcare researchers, data analysts, public health professionals, policy makers, and anyone working with health data
Primary Use Cases: Health data research, policy analysis, public health monitoring, data-driven healthcare decisions, academic research

📊 What Does HealthData.gov Scraper Do?

This tool collects comprehensive health data catalog information from HealthData.gov, supporting both research and analysis needs. It delivers:

  • Dataset Information: Complete details about health datasets including descriptions, categories, tags, and metadata
  • Data Access Points: Direct links to download data in multiple formats (CSV, JSON, XML, RDF)
  • Story Content: Full story articles with titles, authors, content, sections, and links (special extraction for story type items)
  • File Downloads: File information including download URLs, filenames, and MIME types (special extraction for file type items)
  • Metadata: Extended metadata including contact information, program codes, bureau codes, and more
  • Statistics: View counts, download counts, ratings, and engagement metrics
  • Categorization: Categories, tags, and classification information
  • Publisher Information: Publisher details, attribution, contact emails, and licensing
  • Catalog Items: Basic information for all types (datasets, charts, maps, forms, measures, calendars, etc.) from the API

Note: Stories and files have enhanced content extraction (full article content for stories, download information for files). Other types (charts, maps, forms, measures, calendars, filtered views, external datasets) provide catalog information from the API including metadata, descriptions, and access points.

Business Value: This data helps you make informed decisions about public health, track health trends, conduct research, and build data-driven healthcare solutions. All data is publicly available and ready for analysis.

🎬 How to use the HealthData.gov Scraper - Full Demo

Coming soon - Watch this space for a video walkthrough showing how easy it is to get started!

βš™οΈ Input

To start collecting health data from HealthData.gov, simply fill in the input form. You can collect data based on:

  • startUrl - Direct URL to a HealthData.gov browse page. Copy the URL from your browser when viewing search results on HealthData.gov. Example: https://healthdata.gov/browse?category=Health&sortBy=newest
  • maxItems - Maximum number of items to collect. Free users are limited to 100 items. Paid users can collect up to 1,000,000 items.
  • Search Query (q in the JSON input) - Search term to filter datasets (e.g., "diabetes", "vaccination", "hospital")
  • Category (category in the JSON input) - Filter by category such as Health, HHS, CDC, CMS, FDA, NIH, and more
  • Tags - Filter by tags (comma-separated)
  • View Type (limitTo in the JSON input) - Filter by type: Datasets, Stories, Charts, Maps, Forms, Measures, Calendars, Filtered Views, External Datasets, or Files
  • Authority - Filter by authority: Community or Official
  • Sort By (sortBy in the JSON input) - Sort order: Recently added, A to Z, Most viewed, Most relevant, or Recently updated

Important: Use either startUrl OR search filters, not both at the same time.

Here's what the input configuration looks like in JSON:

{
  "startUrl": "https://healthdata.gov/browse?sortBy=newest&page=1&pageSize=20",
  "maxItems": 10
}

Or using search filters:

{
  "q": "diabetes",
  "category": "Health",
  "limitTo": "dataset",
  "sortBy": "newest",
  "maxItems": 50
}
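
The "either startUrl OR search filters" rule can be checked before a run is started. The following is a minimal sketch, not part of the Actor itself: build_input is a hypothetical helper, and it only knows the filter keys that appear in the JSON example above (the keys for the Tags and Authority filters are not shown in this README, so they are left out).

```python
from typing import Any, Dict, Optional

# Filter keys copied from the JSON example above. The keys for the Tags and
# Authority filters are not documented here, so this sketch omits them.
FILTER_KEYS = ("q", "category", "limitTo", "sortBy")
MAX_ITEMS_CEILING = 1_000_000  # upper bound stated for paid users


def build_input(start_url: Optional[str] = None, max_items: int = 10,
                **filters: Any) -> Dict[str, Any]:
    """Return an input dict that uses either startUrl or filters, never both."""
    unknown = set(filters) - set(FILTER_KEYS)
    if unknown:
        raise ValueError(f"unsupported filter keys: {sorted(unknown)}")
    if start_url and filters:
        raise ValueError("use either startUrl or search filters, not both")
    if not 1 <= max_items <= MAX_ITEMS_CEILING:
        raise ValueError("maxItems must be between 1 and 1,000,000")

    run_input: Dict[str, Any] = {"maxItems": max_items}
    if start_url:
        run_input["startUrl"] = start_url
    else:
        run_input.update(filters)
    return run_input


# Example: the filter-based input from the README.
print(build_input(q="diabetes", category="Health", limitTo="dataset",
                  sortBy="newest", max_items=50))
```

Rejecting a mixed input early avoids paying for a run whose configuration is ambiguous.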

📤 Output

After the Actor finishes its run, the results are stored in a dataset. The number of records depends on your maxItems limit and on how many catalog items match your input. You can download the results as Excel, HTML, XML, JSON, or CSV.

The scraper provides multiple views of your data:

  • Overview: Quick summary with key fields for all item types
  • Stories: Detailed view optimized for story content with titles, authors, sections, and links
  • Files: File-specific view with download information, MIME types, and file sizes
  • Catalog Items: Complete dataset with all available fields

Here's an example of scraped HealthData.gov data you'll get if you decide to collect dataset information:

{
  "datasetId": "s4q2-58m5",
  "datasetName": "FY 2026 HHS Lapse and Contingency Plan",
  "datasetUrl": "https://healthdata.gov/HHS/FY-2026-HHS-Lapse-and-Contingency-Plan/s4q2-58m5",
  "datasetType": "dataset",
  "description": "HHS contingency plan in case of a lapse in government funding fiscal year 2026.",
  "category": "HHS",
  "categories": [],
  "tags": [],
  "publisher": "HHS",
  "attribution": "HHS ASFR",
  "contactEmail": "healthdata@hhs.gov",
  "license": "Public Domain U.S. Government",
  "licenseId": "USGOV_WORKS",
  "dataPreviewUrl": "https://healthdata.gov/HHS/FY-2026-HHS-Lapse-and-Contingency-Plan/s4q2-58m5/data_preview",
  "additionalAccessPoints": [
    {
      "urls": {
        "text/csv": "https://data.cdc.gov/api/views/5svk-8bnq/rows.csv?accessType=DOWNLOAD"
      }
    }
  ],
  "metadata": {
    "commonCore": {
      "contactEmail": "healthdata@hhs.gov",
      "contactName": "HealthData.gov Service Desk",
      "programCode": "009:000",
      "publisher": "HHS",
      "bureauCode": "009:00",
      "publicAccessLevel": "public"
    }
  },
  "viewCount": 3439,
  "downloadCount": 325,
  "lastUpdated": "2025-09-25T20:40:36.000Z",
  "scrapedTimestamp": "2025-12-11T03:16:46.678Z"
}
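
The download links sit a few levels deep, under additionalAccessPoints. As an illustration of how that nesting can be flattened, here is a short sketch; list_download_urls is a hypothetical helper name, and it assumes every record follows the shape of the example above (a list of objects, each with a "urls" map from MIME type to URL).

```python
from typing import Any, Dict, List, Tuple


def list_download_urls(record: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Return (mime_type, url) pairs found under additionalAccessPoints."""
    pairs: List[Tuple[str, str]] = []
    # "or []" / "or {}" tolerate records where the field is missing or null.
    for point in record.get("additionalAccessPoints") or []:
        for mime_type, url in (point.get("urls") or {}).items():
            pairs.append((mime_type, url))
    return pairs


# Trimmed copy of the example record above.
record = {
    "datasetId": "s4q2-58m5",
    "additionalAccessPoints": [
        {"urls": {"text/csv": "https://data.cdc.gov/api/views/5svk-8bnq/rows.csv?accessType=DOWNLOAD"}}
    ],
}

for mime_type, url in list_download_urls(record):
    print(mime_type, url)
```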

What You Get:

  • Complete Dataset Information: IDs, names, URLs, types, and descriptions
  • Categorization: Categories, tags, and classification data
  • Publisher Details: Publisher, attribution, contact information, and licensing
  • Data Access: Preview URLs and download links in multiple formats
  • Extended Metadata: Contact names, program codes, bureau codes, and access levels
  • Engagement Metrics: View counts, download counts, and ratings
  • Timestamps: Creation dates, publication dates, and update information

Download Options: CSV, Excel, or JSON formats for easy analysis in your preferred tools
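
If you export JSON and want a compact summary instead of the full records, a few lines of standard-library Python are enough. This sketch ranks records by viewCount and writes a small CSV; the field list is taken from the example record above, to_summary_csv is a hypothetical helper, and the second sample record is invented purely for illustration.

```python
import csv
import io
from typing import Any, Dict, Iterable

# Columns chosen from the example record above.
SUMMARY_FIELDS = ["datasetId", "datasetName", "category",
                  "viewCount", "downloadCount", "lastUpdated"]


def to_summary_csv(records: Iterable[Dict[str, Any]]) -> str:
    """Return CSV text with one row per record, most viewed first."""
    ranked = sorted(records, key=lambda r: r.get("viewCount") or 0, reverse=True)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=SUMMARY_FIELDS)
    writer.writeheader()
    for record in ranked:
        # Missing fields become empty cells rather than raising an error.
        writer.writerow({field: record.get(field, "") for field in SUMMARY_FIELDS})
    return buffer.getvalue()


sample = [
    # Hypothetical record, included only to show the ordering.
    {"datasetId": "xxxx-0000", "datasetName": "Illustrative item", "viewCount": 10},
    # Values from the example output above.
    {"datasetId": "s4q2-58m5", "datasetName": "FY 2026 HHS Lapse and Contingency Plan",
     "category": "HHS", "viewCount": 3439, "downloadCount": 325,
     "lastUpdated": "2025-09-25T20:40:36.000Z"},
]

print(to_summary_csv(sample))
```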

⭐ Why Choose the HealthData.gov Scraper?

  • 🎯 Comprehensive Data Collection: Access all types of health data resources - datasets, stories, charts, maps, forms, and files in one place
  • ⚡ Fast & Efficient: Process multiple items in parallel for faster data collection. Save hours compared to manual browsing and downloading
  • 🔍 Powerful Filtering: Filter by category, tags, view type, authority, and search terms to find exactly what you need
  • 📊 Multiple Output Views: Choose from overview, stories, or files views depending on your needs
  • 🛡️ Reliable & Accurate: Direct access to official HHS data ensures accuracy and reliability

Time Savings: Instead of manually browsing HealthData.gov and downloading files one by one, this tool collects catalog records automatically. Work that would take hours by hand is done in minutes.

Efficiency: Process hundreds of datasets simultaneously with parallel processing. Get structured, ready-to-analyze data instead of scattered files and web pages.

🚀 How to Use

  1. Sign Up: Create a free account with $5 credit (takes 2 minutes)
  2. Find the Scraper: Visit the HealthData.gov Scraper page
  3. Set Input: Add your search parameters or start URL (we'll show you exactly what to enter)
  4. Run It: Click "Start" and let it collect your data
  5. Download Data: Get your results in the "Dataset" tab as CSV, Excel, or JSON

Total Time: Less than 5 minutes from sign-up to downloaded data
No Technical Skills Required: Everything is point-and-click

💼 Business Use Cases

Healthcare Researchers:

  • Collect health datasets for research projects
  • Monitor new health data publications
  • Track health trends and patterns over time

Data Analysts:

  • Build comprehensive health data databases
  • Create regular health data reports
  • Support data-driven healthcare decisions

Public Health Professionals:

  • Access official health statistics
  • Monitor public health indicators
  • Track health policy implementation

Policy Makers:

  • Access health data for policy analysis
  • Monitor health outcomes and trends
  • Support evidence-based decision making

Academic Researchers:

  • Collect datasets for academic studies
  • Access historical health data
  • Build research databases

🔗 Using HealthData.gov Scraper with the Apify API

For advanced users who want to automate this process, you can control the scraper programmatically with the Apify API. This allows you to schedule regular data collection and integrate with your existing business tools.

  • Node.js: Install the apify-client NPM package
  • Python: Use the apify-client PyPI package
  • See the Apify API reference for full details
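
As a rough sketch of what a programmatic run can look like with the Python apify-client package: the code below starts a run, waits for it to finish, and reads the resulting dataset. Several things here are assumptions rather than facts from this README: ACTOR_ID is a placeholder (copy the real ID from the Actor's page), the token is read from an APIFY_TOKEN environment variable, and the run object is treated as a dict with a defaultDatasetId key, which may differ between client versions.

```python
import os
from typing import Any, Dict, List

# Placeholder, not the real ID: copy it from the Actor's page in Apify Store.
ACTOR_ID = "<username>/<actor-name>"


def collect_items(client: Any, actor_id: str,
                  run_input: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Start a run, wait for it to finish, and return the dataset items."""
    run = client.actor(actor_id).call(run_input=run_input)
    if not run:
        raise RuntimeError("the Actor run did not return run details")
    # Assumes a dict-style run object exposing the default dataset ID.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


def main() -> None:
    # Imported here so the helper above can be reused without the package.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run_input = {"q": "diabetes", "limitTo": "dataset", "maxItems": 50}
    for item in collect_items(client, ACTOR_ID, run_input):
        print(item.get("datasetId"), item.get("datasetName"))
```

Call main() once ACTOR_ID and APIFY_TOKEN are set. Passing the client into collect_items keeps the helper easy to test with a stub.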

❓ Frequently Asked Questions

Q: How does it work?
A: HealthData.gov Scraper is easy to use and requires no technical knowledge. Simply configure your search parameters or provide a start URL, and let the tool collect the data automatically from the official HealthData.gov API.

Q: How accurate is the data?
A: The data comes directly from the official HealthData.gov API maintained by the U.S. Department of Health and Human Services, ensuring accuracy and reliability. All data is publicly available and verified.

Q: Can I schedule regular runs?
A: Yes! You can schedule regular runs using the Apify platform's scheduling features or integrate with automation tools like Make or Zapier to collect data on a schedule.

Q: What types of data can I collect?
A: You can collect catalog information for all types: datasets, stories, charts, maps, forms, measures, calendars, filtered views, external datasets, and files. Stories include full article content (title, authors, sections, links), and files include download information (URLs, filenames, MIME types). Other types provide catalog metadata, descriptions, and access points from the API. Filter by type to get exactly what you need.

Q: What if I need help?
A: Our support team is here to help you get the most out of this tool. Contact us through the Apify platform for assistance.

Q: Is my data secure?
A: Yes, all data processing happens securely on the Apify platform. Your collected data is private to your account and can be downloaded or deleted at any time.

🔗 Integrate HealthData.gov Scraper with any app and automate your workflow

Last but not least, HealthData.gov Scraper can be connected with almost any cloud service or web app through integrations on the Apify platform, including automation tools such as Make and Zapier.

Alternatively, you can use webhooks to carry out an action whenever an event occurs, e.g. get a notification whenever HealthData.gov Scraper successfully finishes a run.
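
On the receiving end, a webhook handler mostly needs to pull the dataset ID out of the request body. The sketch below assumes the webhook uses Apify's default payload template (top-level eventType and resource fields, with the run's defaultDatasetId inside resource) and the ACTOR.RUN.SUCCEEDED event; if you customize the payload template, adjust the field names. dataset_id_from_webhook is a hypothetical helper, and the sample body is invented for illustration.

```python
import json
from typing import Optional


def dataset_id_from_webhook(body: str) -> Optional[str]:
    """Return the run's default dataset ID for a successful-run event, else None."""
    payload = json.loads(body)
    # Assumed event name for a successfully finished run.
    if payload.get("eventType") != "ACTOR.RUN.SUCCEEDED":
        return None
    # Assumed location of the dataset ID in the default payload template.
    return (payload.get("resource") or {}).get("defaultDatasetId")


# Invented sample body following the assumed default template.
sample_body = json.dumps({
    "eventType": "ACTOR.RUN.SUCCEEDED",
    "resource": {"defaultDatasetId": "example-dataset-id"},
})

print(dataset_id_from_webhook(sample_body))
```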

Looking for more data collection tools? Check out these related actors:

| Actor | Description | Link |
| --- | --- | --- |
| GSA eLibrary Scraper | Collects government publications and documents from GSA eLibrary | https://apify.com/parseforge/gsa-elibrary-scraper |
| PR Newswire Scraper | Extracts press releases and news from PR Newswire | https://apify.com/parseforge/pr-newswire-scraper |
| Hubspot Marketplace Scraper | Collects business app data from HubSpot marketplace | https://apify.com/parseforge/hubspot-marketplace-scraper |
| Hugging Face Model Scraper | Extracts AI model information from Hugging Face | https://apify.com/parseforge/hugging-face-model-scraper |
| AWS Marketplace Scraper | Collects software listings from AWS Marketplace | https://apify.com/parseforge/aws-marketplace-scraper |

Pro Tip: 💡 Browse our complete collection of data collection actors to find the perfect tool for your business needs.

Need Help? Our support team is here to help you get the most out of this tool.


⚠️ Disclaimer: This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by HealthData.gov, the U.S. Department of Health and Human Services (HHS), or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.