Socrata Open Data — Any Gov Portal (CDC, HHS, NY, NYC…) avatar

Socrata Open Data — Any Gov Portal (CDC, HHS, NY, NYC…)

Pricing

Pay per usage

Go to Apify Store
Socrata Open Data — Any Gov Portal (CDC, HHS, NY, NYC…)

Socrata Open Data — Any Gov Portal (CDC, HHS, NY, NYC…)

Search and pull datasets from any Socrata government open-data portal: CDC, HHS, CMS, NY, NYC, Texas and hundreds more. Discovery search + SoQL filtered data pull. Structured JSON, no anti-bot.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

The Mine Works

The Mine Works

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

Socrata Open Data — Any Government Portal (CDC, HHS, NY, NYC & 100s more)

Search and pull data from any Socrata open-data portal — the platform behind hundreds of US federal, state and city government data sites: data.cdc.gov, datahub.hhs.gov, data.cms.gov, data.ny.gov, data.cityofnewyork.us, data.texas.gov and many more. Two modes in one actor: discover datasets across every portal by keyword, then pull the rows you want with SoQL filters. Clean structured JSON, no API key, no anti-bot.

Keywords: Socrata API, SODA API, open data, government data, data.gov alternative, CDC data, NYC open data, public data API, SoQL.


Why this actor

Socrata powers a huge share of US government open data, and it exposes two excellent public APIs — a cross-portal Discovery API and the per-dataset SODA query API with SoQL filtering. This actor wraps both so you can go from "I need X data" to clean rows in one run:

  • Discovery mode — search by keyword across every Socrata portal and get back each matching dataset's domain, ID, name and description.
  • Data mode — point at a domain + dataset ID and pull the rows, with full SoQL support: $where filters, $select field projection, $order sorting.
  • Hundreds of portals — federal (CDC, HHS, CMS), state (NY, Texas, Washington), and city (NYC, LA, Seattle, Chicago).
  • No API key required — keyless by default; add a free Socrata app token for higher rate limits.
  • No anti-bot — these are the governments' own official data APIs.

Two ways to use it

1. Find a dataset (discovery): leave domain/datasetId blank and pass a searchQuery.

{ "searchQuery": "restaurant inspections", "maxResults": 25 }

→ returns datasets like data.cityofnewyork.us / 43nn-pn8j — DOHMH NYC Restaurant Inspection Results.

2. Pull the data: pass the domain + datasetId (from discovery or the dataset URL), plus optional SoQL.

{
"domain": "data.cdc.gov",
"datasetId": "9bhg-hcku",
"select": "state,sex,age_group,covid_19_deaths",
"where": "state='United States'",
"order": "covid_19_deaths DESC",
"maxResults": 5000
}

Input

FieldTypeDescription
searchQuerystringKeyword to find datasets across all portals (discovery mode).
domainstringPortal domain, e.g. data.cdc.gov (data mode).
datasetIdstringDataset 4x4 ID, e.g. 9bhg-hcku (data mode).
wherestringSoQL $where filter.
selectstringSoQL $select (comma-separated fields).
orderstringSoQL $order sort.
appTokenstringOptional free Socrata app token (higher rate limits).
maxResultsintegerMax records (rows or datasets) to return.

Output

Data mode returns each row as the dataset defines it, plus _domain, _dataset_id, _scraped_at. Discovery mode returns one record per matching dataset:

{
"_type": "dataset",
"domain": "data.cityofnewyork.us",
"dataset_id": "43nn-pn8j",
"name": "DOHMH New York City Restaurant Inspection Results",
"description": "...",
"rows_count": 280000,
"permalink": "https://data.cityofnewyork.us/d/43nn-pn8j",
"scraped_at": "2026-06-11T13:00:00.000Z"
}

A final {"_type": "summary"} record reports the mode and record count.


Pricing

Your first 25 records are free — every Apify account, no card, no trial clock. After that it is a flat $0.001 per record.

  • First 25 records free per account (lifetime), then $0.001/record ($1 per 1,000)
  • Zero charge on empty runs
  • No monthly minimum, no rental

FAQ

Do I need an API key? No. Socrata's APIs are keyless. A free app token only raises rate limits for very large pulls.

Which portals work? Any Socrata-powered portal — hundreds of US federal, state and city government sites.

How do I find a dataset ID? Use discovery mode (search by keyword), or copy the 4x4 ID from the dataset's URL.

Can I filter server-side? Yes — full SoQL: $where, $select, $order. Filtering happens on Socrata's side, so you only pull (and pay for) the rows you want.