USA Data.gov U.S. Government's Open Data Scrape
Pricing
Pay per event
USA Data.gov U.S. Government's Open Data Scrape
Stop wasting hours digging through thousands of government datasets. Our Data.gov scraper automatically gathers complete dataset details from the U.S. government's open data portal in minutes. Ideal for researchers, analysts, journalists, and teams needing reliable data without manual effort.
Pricing
Pay per event
Rating
0.0
(0)
Developer
ParseForge
Maintained by CommunityActor stats
0
Bookmarked
16
Total users
1
Monthly active users
6 days ago
Last modified
Categories
Share

πΊπΈ Data.gov Scraper
π Collect open dataset metadata from Data.gov in minutes. Search by keyword, topic, or category. Export dataset titles, descriptions, download links, and agency info. No coding, no account required.
π Last updated: 2026-04-23 Β· π 20+ fields per dataset Β· π 3 search filters Β· π Topic + category Β· π« No auth required
The Data.gov Scraper collects open data catalog metadata from the U.S. government's data portal, returning 20+ fields per dataset: title, description, agency, topic, format, download URL, update frequency, and license. Data.gov hosts over 300,000 datasets from 100+ federal agencies.
The Actor supports keyword search with topic and category filters, plus direct URL scraping.
| π― Target Audience | π‘ Primary Use Cases |
|---|---|
| Data scientists, policy researchers, journalists, civic tech teams, academic researchers, government analysts | Open data discovery, policy research, data journalism, civic technology, federal data monitoring |
π What the Data.gov Scraper does
Three search filters:
- π Keyword search. Free-text search across dataset titles and descriptions.
- π Topic filter. Subject areas (health, education, climate, finance, etc.).
- π Category filter. Detailed categorization within topics.
- π URL mode. Paste a direct Data.gov search URL.
Each dataset record includes title, description, agency, topic, categories, data format, download URLs, update frequency, license, and portal URL.
π‘ Why it matters: browsing Data.gov for relevant datasets means scrolling through 300,000+ listings. This Actor exports structured catalog metadata at scale for your open data pipelines or research projects.
π¬ Full Demo
π§ Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.
βοΈ Input
| Input | Type | Default | Behavior |
|---|---|---|---|
startUrl | string | "" | Direct Data.gov search URL. |
searchQuery | string | "" | Keyword search. |
topics | array | [] | Subject area filters. |
topicCategories | array | [] | Detailed categories within topics. |
Example: climate datasets.
{"searchQuery": "climate change","topics": ["Climate"],"maxItems": 50}
Example: health data from CDC.
{"searchQuery": "CDC","topics": ["Health"],"maxItems": 100}
β οΈ Good to Know: Data.gov is the U.S. government's official open data portal. Datasets link to various formats (CSV, JSON, XML, API) hosted by individual agencies.
π Output
Each dataset record contains 20+ fields. Download the catalog as CSV, Excel, JSON, or XML.
π§Ύ Schema
| Field | Type | Example |
|---|---|---|
π title | string | "Daily Climate Normals" |
π description | string | "NOAA climate normals for U.S. stations..." |
ποΈ agency | string | "NOAA" |
π topic | string | "Climate" |
π categories | array | ["Weather", "Environment"] |
π¦ format | string | "CSV" |
π downloadUrl | string | "https://catalog.data.gov/dataset/..." |
π
lastUpdated | string | "2026-01-15" |
π updateFrequency | string | "Annual" |
π license | string | "Public Domain" |
π portalUrl | string | "https://catalog.data.gov/dataset/..." |
π scrapedAt | ISO 8601 | "2026-04-16T00:00:00.000Z" |
π¦ Sample records
β¨ Why choose this Actor
| Capability | |
|---|---|
| πΊπΈ | 300,000+ datasets. Full U.S. federal open data catalog. |
| π | 3 search filters. Keyword, topic, category. |
| π¦ | Format and download links. Direct URLs to CSV, JSON, XML, API. |
| ποΈ | Agency data. Federal agency per dataset. |
| π | Update frequency. Daily, monthly, annual cadence. |
| β‘ | Scalable. Quick lookups to full catalog sweeps. |
| π« | No authentication. Public open data portal. |
π Data.gov hosts 300,000+ datasets from 100+ federal agencies. Structured access powers every open data project, policy research, and civic technology workflow.
π How it compares to alternatives
| Approach | Cost | Coverage | Refresh | Filters | Setup |
|---|---|---|---|---|---|
| β Data.gov Scraper (this Actor) | $5 free credit, then pay-per-use | Full catalog | Live per run | keyword, topic, category | β‘ 2 min |
| Manual Data.gov browsing | Free | One at a time | Manual | UI only | π Hours |
| CKAN API (direct) | Free | Full | Real-time | Many | β³ Hours |
Pick this Actor when you want U.S. federal open data catalog metadata on demand, with topic and category filters.
π How to use
- π Sign up. Create a free account with $5 credit (takes 2 minutes).
- π Open the Actor. Go to the Data.gov Scraper page on the Apify Store.
- π― Set input. Enter a keyword, pick topics and categories.
- π Run it. Click Start.
- π₯ Download. Grab results in the Dataset tab.
β±οΈ Total time: 3-5 minutes. No coding required.
πΌ Business use cases
π Beyond business use cases
Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.
π€ Ask an AI assistant about this scraper
Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:
- π¬ ChatGPT
- π§ Claude
- π Perplexity
- π Copilot
π° How much does it cost?
Apify gives you $5 in free monthly credits on the Apify Free plan, enough to test Data.gov Scraper and pull a real sample dataset. For ongoing usage:
- Starter plan ($49/month) β Recommended for individuals running Data.gov Scraper regularly. Includes higher concurrency and larger datasets.
- Scale plan ($499/month) β Recommended for teams running Data.gov Scraper at production scale.
Pay-Per-Event pricing means you only pay for what you actually use. Failed runs are never charged. See the Pricing tab on this Actor's page for exact event prices.
π‘ Tips for using Data.gov Scraper
- Start with a small
maxItems(3-10) to validate output format before running larger jobs. - Use Apify Schedules to run Data.gov Scraper on a recurring basis and keep your dataset fresh.
- Export via Integrations: Apify connects to Google Sheets, Airbyte, Make, Zapier, and direct webhooks β pipe your data anywhere.
- Monitor with webhooks: trigger downstream workflows the moment a run finishes.
- Re-run failed items: if any individual records error out, re-run with their inputs only. Failed events are not charged.
βοΈ Is it legal to use Data.gov Scraper?
Yes. Data.gov Scraper only collects publicly available data. Web scraping public data has been confirmed as legal by US courts (see hiQ Labs v. LinkedIn) and is widely used for research, market analysis, and business intelligence.
However, you are responsible for:
- Respecting the source website's Terms of Service.
- Complying with GDPR, CCPA, and other applicable data-protection laws when personal data is involved.
- Not republishing copyrighted content without permission.
If you have specific compliance concerns, consult your legal team. See the Apify legal docs for more.
β Frequently Asked Questions
π Automating Data.gov Scraper
- π’ Node.js. Install the
apify-clientNPM package. - π Python. Use the
apify-clientPyPI package. - π See the Apify API documentation for full details.
The Apify Schedules feature lets you trigger this Actor on any cron interval. Weekly pulls catch new dataset publications.
π Integrate with any app
- Make - Automate workflows
- Zapier - Connect 5,000+ apps
- Slack - Get notifications
- Airbyte - Data pipelines
- GitHub - Trigger from commits
- Google Drive - Export to Sheets
π Recommended Actors
- π¬π§ Data.gov.uk Scraper - UK open data catalog
- π USAspending Scraper - Federal spending data
- π FRED Scraper - Economic data
- π GSA eLibrary Scraper - Government contracts
- π Indexmundi Scraper - Global indicators
π‘ Pro Tip: browse the complete ParseForge collection for more government and open data scrapers.
π Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.
β οΈ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the U.S. government or Data.gov. All trademarks mentioned are the property of their respective owners. Only publicly available open data catalog metadata is collected.