Data Gov Catalog Scraper avatar

Data Gov Catalog Scraper

Pricing

from $3.50 / 1,000 results

Go to Apify Store
Data Gov Catalog Scraper

Data Gov Catalog Scraper

Pricing

from $3.50 / 1,000 results

Rating

0.0

(0)

Developer

Fortuitous Pirate

Fortuitous Pirate

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Categories

Share

Data.gov Federal Dataset Catalog

Overview

Search and discover 300,000+ federal open datasets from Data. gov. Filter by agency, format, tags, topics, and keywords.

Features

  • Search by keywords to find specific results
  • Filter results by category or type
  • Export data in JSON, CSV, or Excel formats
  • Control output volume with configurable result limits
  • Built-in proxy support for reliable data collection

Use Cases

  • Track - Track federal government data releases and updates
  • Build - Build datasets for policy research and analysis
  • Monitor - Monitor regulatory changes and compliance requirements
  • Aggregate - Aggregate public government data for transparency projects

Input Parameters

ParameterTypeDescriptionDefault
searchQuerystringKeyword search across dataset titles, descriptions, and metadata (e.g., "clim...
organizationstringFilter by federal agency slug (e.g., "nasa", "epa", "noaa", "census-gov", "us...
formatstringFilter datasets by resource format.``
tagsstringFilter by tag (e.g., "climate", "health", "transportation"). Single tag per q...
topicstringFilter by group/topic (e.g., "agriculture8571", "climate5434", "health3702")....
sortBystringSort order for results.relevance
maxItemsintegerMaximum number of datasets to return.100
proxyConfigurationobjectProxy configuration for requests. Usually not needed for this public API.

Output Example

Each result contains structured data like this:

{
"title": "Sample Government - Federal Result",
"organization": "Sample organization",
"numResources": "Sample numResources",
"tags": "Sample tags",
"metadataModified": "Sample metadataModified",
"licenseTitle": "Sample licenseTitle",
"url": "https://example.com/item/12345"
}

Pricing

This actor uses pay-per-result pricing:

  • $0.001 per result
  • $1.00 per 1,000 results

No monthly fees. You only pay for what you scrape. Apify Free plan includes $5/month in platform credits.

How to Run

Apify Console

  1. Go to the Data.gov Federal Dataset Catalog actor page
  2. Configure your input parameters
  3. Click Start and wait for the results
  4. Download data in JSON, CSV, or Excel format

API

curl -X POST "https://api.apify.com/v2/acts/fortuitous_pirate~data-gov-catalog-scraper/runs?token=YOUR_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{"maxItems": 10}'

Python SDK

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("fortuitous_pirate/data-gov-catalog-scraper").call(
run_input={"maxItems": 10}
)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

Integration

Connect Data.gov Federal Dataset Catalog with your existing tools and workflows:

  • API access - Programmatic access via Apify API
  • Webhooks - Get notified when scraping completes
  • Scheduling - Set up recurring runs on any schedule
  • Zapier / Make - Connect with 5,000+ apps via Apify integrations
  • Python / Node.js SDKs - Native client libraries for easy integration