USA Data.gov U.S. Government's Open Data Scrape
Pricing
Pay per event
USA Data.gov U.S. Government's Open Data Scrape
Stop wasting hours digging through thousands of government datasets. Our Data.gov scraper automatically gathers complete dataset details from the U.S. government's open data portal in minutes. Ideal for researchers, analysts, journalists, and teams needing reliable data without manual effort.
Pricing
Pay per event
Rating
0.0
(0)
Developer
ParseForge
Actor stats
0
Bookmarked
14
Total users
0
Monthly active users
4 days ago
Last modified
Categories
Share

πΊπΈ Data.gov Scraper
π Collect open dataset metadata from Data.gov in minutes. Search by keyword, topic, or category. Export dataset titles, descriptions, download links, and agency info. No coding, no account required.
π Last updated: 2026-04-16 Β· π 20+ fields per dataset Β· π 3 search filters Β· π Topic + category Β· π« No auth required
The Data.gov Scraper collects open data catalog metadata from the U.S. government's data portal, returning 20+ fields per dataset: title, description, agency, topic, format, download URL, update frequency, and license. Data.gov hosts over 300,000 datasets from 100+ federal agencies.
The Actor supports keyword search with topic and category filters, plus direct URL scraping.
| π― Target Audience | π‘ Primary Use Cases |
|---|---|
| Data scientists, policy researchers, journalists, civic tech teams, academic researchers, government analysts | Open data discovery, policy research, data journalism, civic technology, federal data monitoring |
π What the Data.gov Scraper does
Three search filters:
- π Keyword search. Free-text search across dataset titles and descriptions.
- π Topic filter. Subject areas (health, education, climate, finance, etc.).
- π Category filter. Detailed categorization within topics.
- π URL mode. Paste a direct Data.gov search URL.
Each dataset record includes title, description, agency, topic, categories, data format, download URLs, update frequency, license, and portal URL.
π‘ Why it matters: browsing Data.gov for relevant datasets means scrolling through 300,000+ listings. This Actor exports structured catalog metadata at scale for your open data pipelines or research projects.
π¬ Full Demo
π§ Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.
βοΈ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
startUrl | string | "" | Direct Data.gov search URL. |
searchQuery | string | "" | Keyword search. |
topics | array | [] | Subject area filters. |
topicCategories | array | [] | Detailed categories within topics. |
Example: climate datasets.
{"searchQuery": "climate change","topics": ["Climate"],"maxItems": 50}
Example: health data from CDC.
{"searchQuery": "CDC","topics": ["Health"],"maxItems": 100}
β οΈ Good to Know: Data.gov is the U.S. government's official open data portal. Datasets link to various formats (CSV, JSON, XML, API) hosted by individual agencies.
π Output
Each dataset record contains 20+ fields. Download the catalog as CSV, Excel, JSON, or XML.
π§Ύ Schema
| Field | Type | Example |
|---|---|---|
π title | string | "Daily Climate Normals" |
π description | string | "NOAA climate normals for U.S. stations..." |
ποΈ agency | string | "NOAA" |
π topic | string | "Climate" |
π categories | array | ["Weather", "Environment"] |
π¦ format | string | "CSV" |
π downloadUrl | string | "https://catalog.data.gov/dataset/..." |
π
lastUpdated | string | "2026-01-15" |
π updateFrequency | string | "Annual" |
π license | string | "Public Domain" |
π portalUrl | string | "https://catalog.data.gov/dataset/..." |
π scrapedAt | ISO 8601 | "2026-04-16T00:00:00.000Z" |
π¦ Sample records
β¨ Why choose this Actor
| Capability | |
|---|---|
| πΊπΈ | 300,000+ datasets. Full U.S. federal open data catalog. |
| π | 3 search filters. Keyword, topic, category. |
| π¦ | Format and download links. Direct URLs to CSV, JSON, XML, API. |
| ποΈ | Agency data. Federal agency per dataset. |
| π | Update frequency. Daily, monthly, annual cadence. |
| β‘ | Scalable. Quick lookups to full catalog sweeps. |
| π« | No authentication. Public open data portal. |
π Data.gov hosts 300,000+ datasets from 100+ federal agencies. Structured access powers every open data project, policy research, and civic technology workflow.
π How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| β Data.gov Scraper (this Actor) | $5 free credit, then pay-per-use | Full catalog | Live per run | keyword, topic, category | β‘ 2 min |
| Manual Data.gov browsing | Free | One at a time | Manual | UI only | π Hours |
| CKAN API (direct) | Free | Full | Real-time | Many | β³ Hours |
Pick this Actor when you want U.S. federal open data catalog metadata on demand, with topic and category filters.
π How to use
- π Sign up. Create a free account with $5 credit (takes 2 minutes).
- π Open the Actor. Go to the Data.gov Scraper page on the Apify Store.
- π― Set input. Enter a keyword, pick topics and categories.
- π Run it. Click Start.
- π₯ Download. Grab results in the Dataset tab.
β±οΈ Total time: 3-5 minutes. No coding required.
πΌ Business use cases
π Automating Data.gov Scraper
- π’ Node.js. Install the
apify-clientNPM package. - π Python. Use the
apify-clientPyPI package. - π See the Apify API documentation for full details.
The Apify Schedules feature lets you trigger this Actor on any cron interval. Weekly pulls catch new dataset publications.
β Frequently Asked Questions
π Integrate with any app
- Make - Automate workflows
- Zapier - Connect 5,000+ apps
- Slack - Get notifications
- Airbyte - Data pipelines
- GitHub - Trigger from commits
- Google Drive - Export to Sheets
π Recommended Actors
- π¬π§ Data.gov.uk Scraper - UK open data catalog
- π USAspending Scraper - Federal spending data
- π FRED Scraper - Economic data
- π GSA eLibrary Scraper - Government contracts
- π Indexmundi Scraper - Global indicators
π‘ Pro Tip: browse the complete ParseForge collection for more government and open data scrapers.
π Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.
β οΈ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the U.S. government or Data.gov. All trademarks mentioned are the property of their respective owners. Only publicly available open data catalog metadata is collected.