Pricing

from $10.00 / 1,000 results

Data Gov Catalog Scraper

Scrapes dataset metadata from the Data.gov catalog API, the home of the U.S. Government open data. Access information about hundreds of thousands of federal datasets including descriptions, organizations, tags, and resource counts.

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

Donny

Actor stats

Bookmarked

Total users

Monthly active users

19 hours ago

Last modified

Data.gov Dataset Catalog Scraper

What it does

This actor connects to a public API, fetches structured data based on your search criteria, and stores the results in a clean, normalized dataset on the Apify platform. It handles pagination automatically so you can collect large volumes of results without worrying about API limits or offsets. The actor is designed to be robust with built-in error handling, request timeouts, and input validation to ensure reliable data collection every time you run it.

Why use this actor

Manually querying APIs and handling pagination, rate limits, and data normalization is tedious and error-prone. This actor automates the entire process. Simply provide your search parameters, set the maximum number of results you want, and let the actor handle the rest. The data is stored in a structured dataset that you can export as JSON, CSV, or Excel. You can integrate this actor into larger workflows using the Apify API, schedule it for recurring data collection, or trigger it from your own applications via webhooks.

Input parameters

searchQuery (string, required): The search term to query Data.gov. Default: "transportation".
maxResults (integer, optional): Maximum number of results to return. Default: 100. Range: 1-1000.

All inputs are validated at startup with sensible defaults applied when values are missing. The actor will log warnings for any misconfigured options and continue with safe defaults rather than failing outright.

Output data

Each result in the dataset contains the following fields:

id: Dataset identifier
title: Dataset title
notes: Dataset description
organization: Publishing organization name
metadata_created: Date the metadata was created
resources_count: Number of resources in the dataset
tags: Comma-separated tag names
url: URL to the dataset

All string fields are null-checked to ensure consistent data quality. Missing or undefined values are stored as null rather than empty strings or undefined values.

Example output

{
    "id": "abc-123-def",
    "title": "National Highway Traffic Safety Data",
    "notes": "Comprehensive traffic safety statistics...",
    "organization": "Department of Transportation",
    "metadata_created": "2023-01-15T00:00:00Z",
    "resources_count": 5,
    "tags": "transportation, safety, highway",
    "url": "https://catalog.data.gov/dataset/abc-123-def"
}

Pricing

This actor is available on the Apify platform with transparent usage-based pricing. Each run incurs a small startup cost of approximately $0.005 per start, plus roughly $0.01 per result collected. Actual costs depend on the number of results, API response times, and memory allocation. You can control costs by setting the maxResults parameter to limit the number of results collected per run. For high-volume use cases, consider running the actor on a schedule during off-peak hours to optimize platform resource usage.

More scrapers from brave_paradise

Check out these other useful data collection actors by brave_paradise:

Visit the brave_paradise profile on Apify to explore the full collection of specialized data scrapers and automation tools.

Data Gov Catalog Scraper

fortuitous_pirate/data-gov-catalog-scraper

Fortuitous Pirate

Data.gov Dataset Catalog Scraper

tropical_quince/data-gov-dataset-scraper

Scrape Data.gov dataset catalog. Extract titles, agencies, formats, tags, and download links.

Donny Nguyen

Data.gov Dataset Scraper

consummate_mandala/data-gov-dataset-scraper

Scrape Data.gov dataset catalog. Extract titles, agencies, formats, tags, and download links.

Donny Nguyen

USA Data.gov U.S. Government's Open Data Scrape

parseforge/data-gov-scraper

Stop wasting hours digging through thousands of government datasets. Our Data.gov scraper automatically gathers complete dataset details from the U.S. government's open data portal in minutes. Ideal for researchers, analysts, journalists, and teams needing reliable data without manual effort.

ParseForge

5.0

(1)

CMS Nursing Home Ratings Scraper

parseforge/cms-nursing-home-ratings-scraper

Extract nursing home five-star ratings, inspection results, staffing data, and penalties for all 14,700 US facilities from the official CMS database. Filter by state, city, ratings, ownership, or name. Get 47 fields per facility including quality scores, staffing hours, fines, and deficiency counts.

ParseForge

5.0

(1)

Uspto Trademark Scraper

fortuitous_pirate/uspto-trademark-scraper

Fortuitous Pirate

Data Gov UK Scraper

parseforge/data-gov-uk-scraper

Streamline UK open data research with an automated Data.gov.uk scraper. Collect detailed dataset information from the UK government’s open data portal, enabling daily updates, structured results, and seamless integration into research, analytics, or data-driven workflows.

ParseForge

5.0

(1)

USA HealthData.gov HHS Open Data Scraper

parseforge/healthdata-scraper

Collect health data catalog information from HealthData.gov . Filter by category, tags, view type, authority, and search terms to find exactly what you need. Perfect for researchers, data analysts, and healthcare professionals who need to discover and access public health datasets efficiently.

ParseForge

5.0

(1)

NY Business Entity Scraper

parseforge/ny-business-entity-scraper

Scrape all New York State business entities from the official NY DOS database. Extract names, legal status, CEO, registered agent, service of process and principal office addresses with complete filing details. Filter by name, DOS ID, entity type, county, status and date.

ParseForge

Home Depot Product Details Scraper

ecomscrape/homedepot-product-details-scraper

Professional Home Depot product data scraper that extracts comprehensive product information including pricing, specifications, reviews, and inventory data. Perfect for market research, price monitoring, and competitive analysis with reliable proxy support and structured output formats.

ecomscrape