USA HealthData.gov HHS Open Data Scraper avatar

USA HealthData.gov HHS Open Data Scraper

Pricing

Pay per event

Go to Apify Store
USA HealthData.gov HHS Open Data Scraper

USA HealthData.gov HHS Open Data Scraper

Collect health data catalog information from HealthData.gov . Filter by category, tags, view type, authority, and search terms to find exactly what you need. Perfect for researchers, data analysts, and healthcare professionals who need to discover and access public health datasets efficiently.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

1

Monthly active users

21 days ago

Last modified

Share

ParseForge Banner

๐Ÿ“Š HealthData.gov Scraper

Collect health datasets, stories, charts, and maps from the US government's open health data portal. Get instant access to HHS public health data, research datasets, and epidemiological information without coding or complex setup. Perfect for healthcare researchers, data analysts, and policy professionals who need to discover and download HealthData.gov datasets without manual browsing, access bulk health data for analysis, or monitor new government health publications regularly.

The HealthData.gov Scraper collects comprehensive health datasets and metadata from the US HHS Open Data Catalog up to 1,000,000 items per run with zero coding required and multiple data views for analysis.

โœจ What Does It Do

  • ๐Ÿ“ Dataset Name and ID - Identify health datasets by unique identifier for tracking and reference
  • ๐Ÿ”— Dataset URL - Direct links to dataset pages on HealthData.gov for verification
  • ๐Ÿ“Š View and Download Count - Track dataset engagement and popularity among research communities
  • ๐Ÿ‘ค Publisher and Contact Information - Know which health authority published the data
  • ๐Ÿท๏ธ Categories and Tags - Organize datasets by topic area and health specialty
  • ๐Ÿ“„ Description and License - Understand dataset contents and permitted usage rights
  • ๐Ÿ“… Date Fields - Track creation, publication, and update timestamps for data freshness
  • ๐Ÿ“‹ File Downloads and Metadata - Access downloadable files with format information for analysis

๐Ÿ”ง Input

  • Start URL - Direct URL to a HealthData.gov browse page. Use this OR search filters below, not both. Example: https://healthdata.gov/browse?category=Health&sortBy=newest
  • Max Items - Maximum datasets to collect. Free users limited to 100, paid users up to 1,000,000
  • Search Query - Search term to find specific health datasets (e.g., diabetes, vaccination, hospital statistics)
  • Category - Filter by health authority like CDC, FDA, CMS, HHS, NIH, or topic like Hospital, State
  • Tags - Filter by health topic tags (comma-separated values)
  • View Type - Filter by content type such as Datasets, Stories, Charts, Maps, Forms, Files, or Calendars
  • Authority - Show only Official health agency data or Community-contributed content
  • Sort By - Order results by newest, A to Z, most viewed, most relevant, or recently updated

Example JSON configuration:

{
"startUrl": "https://healthdata.gov/browse?sortBy=newest&page=1&pageSize=20",
"maxItems": 50
}

๐Ÿ“Š Output

Each dataset includes up to 40 data fields. Download as JSON, CSV, or Excel.

๐Ÿ“ Dataset ID๐Ÿ“ Dataset Name๐Ÿ”— Dataset URL
๐Ÿ“Š View Count๐Ÿ“Š Download Count๐Ÿ‘ค Publisher
๐Ÿ‘ค Contact Email๐Ÿท๏ธ Categories๐Ÿท๏ธ Tags
๐Ÿ“„ Description๐Ÿ“„ License๐Ÿ“„ License ID
๐Ÿ“… Created At๐Ÿ“… Publication Date๐Ÿ“… Last Updated
๐Ÿ“… Data Updated At๐Ÿ“… Metadata Updated At๐Ÿ“… Scraped Timestamp
๐ŸŽฏ View Type๐ŸŽฏ Display Typeโš™๏ธ Provenance
๐Ÿ”“ Is Locked๐Ÿ“ˆ Page Views๐ŸŒ Domain
๐ŸŽฏ Dataset Type๐Ÿ“‹ File Downloads๐Ÿ“‹ Metadata
๐Ÿ‘ค Owner๐ŸŽญ Attributionโญ Average Rating

๐Ÿ’Ž Why Choose the HealthData.gov Scraper?

FeatureOur ActorSimilar Tools
No coding required to start collecting dataโœ”๏ธโŒ
Download results in CSV, JSON, and Excel formatsโœ”๏ธPartial
Filter by category, tags, authority, and view typeโœ”๏ธโŒ
Access story content and featured resourcesโœ”๏ธโŒ
Collect file metadata with MIME type and sizeโœ”๏ธโŒ
Automatic rate limiting and error handlingโœ”๏ธPartial
Support for both direct URLs and search filtersโœ”๏ธโŒ
Real-time pagination and progress trackingโœ”๏ธPartial
Collect up to 1,000,000 items per run (paid users)โœ”๏ธโŒ
Multi-view data organization (Overview, Stories, Files)โœ”๏ธโŒ

๐Ÿ“‹ How to Use

No technical skills required. Follow these simple steps:

  1. Sign Up: Create a free account with $5 credit
  2. Find the Tool: Search for "HealthData.gov Scraper" in the Apify Store and configure your input
  3. Run It: Click "Start" and watch your results appear

That's it. No coding, no setup, no complicated configuration. Now you can export your data in CSV, Excel, or JSON format.

๐ŸŽฏ Business Use Cases

  • ๐Ÿ“Š Public Health Researcher - Search for datasets on infectious disease trends to identify emerging health patterns and publish peer-reviewed findings
  • ๐Ÿ’ผ Policy Analyst - Filter datasets by authority like CDC and CMS to compile evidence for health policy recommendations to government agencies
  • ๐Ÿ”ฌ Academic Data Scientist - Collect epidemiological datasets across multiple categories to train machine learning models for disease prediction

โ“ FAQ

๐Ÿ” How does this actor work? The actor searches the HealthData.gov catalog based on your filters or URL and extracts dataset metadata, descriptions, and file information. All data comes from public government sources.

๐Ÿ“Š How accurate is the data? Data accuracy depends on HealthData.gov itself. The actor captures whatever information is published in the HHS Open Data Catalog. All data is collected as-is from the official source.

๐Ÿ“… Can I schedule this actor to run automatically? Yes. Use the Apify Scheduler to run this actor on a daily, weekly, or monthly basis.

โš–๏ธ Is it legal to collect this data? Yes. All data comes from HealthData.gov, which is public government data. You are responsible for complying with local laws and any data use agreements when processing the collected data.

๐Ÿ›ก๏ธ Will HealthData.gov block me? No. This actor respects rate limits and HealthData.gov is built for public access and data collection. No proxies are needed.

โšก How long does a run take? A typical run collecting 100 datasets takes 2-5 minutes. Larger collections of 1000 or more items may take 10-30 minutes depending on metadata complexity.

โš ๏ธ Are there any limits? Free users can collect up to 100 results per run. Paid users can collect up to 1,000,000 results per run.

๐Ÿ”— Integrate HealthData.gov Scraper with any app

๐Ÿ’ก More ParseForge Actors

Browse our complete collection of data extraction tools for more.

๐Ÿš€ Ready to Start?

Create a free account with $5 credit and collect your first 100 results for free. No coding, no setup.

๐Ÿ†˜ Need Help?

  • Check the FAQ section above for common questions
  • Visit the Apify support page for documentation and tutorials
  • Contact us to request a new scraper, propose a custom project, or report an issue at Tally contact form

โš ๏ธ Disclaimer

This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by HealthData.gov, the US Department of Health and Human Services, or any of its subsidiaries. All trademarks mentioned are the property of their respective owners.