USA Data.gov U.S. Government's Open Data Scrape avatar

USA Data.gov U.S. Government's Open Data Scrape

Pricing

Pay per event

Go to Apify Store
USA Data.gov U.S. Government's Open Data Scrape

USA Data.gov U.S. Government's Open Data Scrape

Stop wasting hours digging through thousands of government datasets. Our Data.gov scraper automatically gathers complete dataset details from the U.S. government's open data portal in minutes. Ideal for researchers, analysts, journalists, and teams needing reliable data without manual effort.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

14

Total users

0

Monthly active users

4 days ago

Last modified

Share

ParseForge Banner

πŸ‡ΊπŸ‡Έ Data.gov Scraper

πŸš€ Collect open dataset metadata from Data.gov in minutes. Search by keyword, topic, or category. Export dataset titles, descriptions, download links, and agency info. No coding, no account required.

πŸ•’ Last updated: 2026-04-16 Β· πŸ“Š 20+ fields per dataset Β· πŸ” 3 search filters Β· πŸ“‚ Topic + category Β· 🚫 No auth required

The Data.gov Scraper collects open data catalog metadata from the U.S. government's data portal, returning 20+ fields per dataset: title, description, agency, topic, format, download URL, update frequency, and license. Data.gov hosts over 300,000 datasets from 100+ federal agencies.

The Actor supports keyword search with topic and category filters, plus direct URL scraping.

🎯 Target AudienceπŸ’‘ Primary Use Cases
Data scientists, policy researchers, journalists, civic tech teams, academic researchers, government analystsOpen data discovery, policy research, data journalism, civic technology, federal data monitoring

πŸ“‹ What the Data.gov Scraper does

Three search filters:

  • πŸ” Keyword search. Free-text search across dataset titles and descriptions.
  • πŸ“‚ Topic filter. Subject areas (health, education, climate, finance, etc.).
  • πŸ“‚ Category filter. Detailed categorization within topics.
  • πŸ”— URL mode. Paste a direct Data.gov search URL.

Each dataset record includes title, description, agency, topic, categories, data format, download URLs, update frequency, license, and portal URL.

πŸ’‘ Why it matters: browsing Data.gov for relevant datasets means scrolling through 300,000+ listings. This Actor exports structured catalog metadata at scale for your open data pipelines or research projects.


🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


βš™οΈ Input

InputTypeDefaultBehavior
startUrlstring""Direct Data.gov search URL.
searchQuerystring""Keyword search.
topicsarray[]Subject area filters.
topicCategoriesarray[]Detailed categories within topics.

Example: climate datasets.

{
"searchQuery": "climate change",
"topics": ["Climate"],
"maxItems": 50
}

Example: health data from CDC.

{
"searchQuery": "CDC",
"topics": ["Health"],
"maxItems": 100
}

⚠️ Good to Know: Data.gov is the U.S. government's official open data portal. Datasets link to various formats (CSV, JSON, XML, API) hosted by individual agencies.


πŸ“Š Output

Each dataset record contains 20+ fields. Download the catalog as CSV, Excel, JSON, or XML.

🧾 Schema

FieldTypeExample
πŸ“ titlestring"Daily Climate Normals"
πŸ“„ descriptionstring"NOAA climate normals for U.S. stations..."
πŸ›οΈ agencystring"NOAA"
πŸ“‚ topicstring"Climate"
πŸ“‚ categoriesarray["Weather", "Environment"]
πŸ“¦ formatstring"CSV"
πŸ”— downloadUrlstring"https://catalog.data.gov/dataset/..."
πŸ“… lastUpdatedstring"2026-01-15"
πŸ”„ updateFrequencystring"Annual"
πŸ“œ licensestring"Public Domain"
πŸ”— portalUrlstring"https://catalog.data.gov/dataset/..."
πŸ•’ scrapedAtISO 8601"2026-04-16T00:00:00.000Z"

πŸ“¦ Sample records


✨ Why choose this Actor

Capability
πŸ‡ΊπŸ‡Έ300,000+ datasets. Full U.S. federal open data catalog.
πŸ”3 search filters. Keyword, topic, category.
πŸ“¦Format and download links. Direct URLs to CSV, JSON, XML, API.
πŸ›οΈAgency data. Federal agency per dataset.
πŸ“…Update frequency. Daily, monthly, annual cadence.
⚑Scalable. Quick lookups to full catalog sweeps.
🚫No authentication. Public open data portal.

πŸ“Š Data.gov hosts 300,000+ datasets from 100+ federal agencies. Structured access powers every open data project, policy research, and civic technology workflow.


πŸ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
⭐ Data.gov Scraper (this Actor)$5 free credit, then pay-per-useFull catalogLive per runkeyword, topic, category⚑ 2 min
Manual Data.gov browsingFreeOne at a timeManualUI onlyπŸ•’ Hours
CKAN API (direct)FreeFullReal-timeMany⏳ Hours

Pick this Actor when you want U.S. federal open data catalog metadata on demand, with topic and category filters.


πŸš€ How to use

  1. πŸ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. 🌐 Open the Actor. Go to the Data.gov Scraper page on the Apify Store.
  3. 🎯 Set input. Enter a keyword, pick topics and categories.
  4. πŸš€ Run it. Click Start.
  5. πŸ“₯ Download. Grab results in the Dataset tab.

⏱️ Total time: 3-5 minutes. No coding required.


πŸ’Ό Business use cases

πŸ“Š Data Science & Research

  • Discover federal datasets for research
  • Build open data catalogs
  • Track new publications by agency
  • Analyze data coverage by topic

πŸ›οΈ Policy & Civic Tech

  • Monitor federal data releases
  • Build transparency dashboards
  • Track agency publishing rates
  • Power civic apps with open data

πŸ“° Data Journalism

  • Find datasets for investigative reporting
  • Track data freshness across agencies
  • Build story pipelines from government data
  • Monitor new dataset publications

🏒 Business Intelligence

  • Enrich models with federal data
  • Track economic indicators by agency
  • Build market data pipelines
  • Monitor regulatory data releases

πŸ”Œ Automating Data.gov Scraper

  • 🟒 Node.js. Install the apify-client NPM package.
  • 🐍 Python. Use the apify-client PyPI package.
  • πŸ“š See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval. Weekly pulls catch new dataset publications.


❓ Frequently Asked Questions


πŸ”Œ Integrate with any app


πŸ’‘ Pro Tip: browse the complete ParseForge collection for more government and open data scrapers.


πŸ†˜ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the U.S. government or Data.gov. All trademarks mentioned are the property of their respective owners. Only publicly available open data catalog metadata is collected.