US Compliance & Regulatory Data API
Pricing
Pay per usage
US Compliance & Regulatory Data API
Search 33 US regulatory data sources — EPA environmental violations, NHTSA vehicle recalls, OFAC sanctions, FDIC bank data, CFPB consumer complaints, FEMA disaster records, EIA energy data, and more. Essential for compliance screening, ESG reporting, and risk assessment via SIP Public Data Gateway.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
kane liu
Actor stats
0
Bookmarked
0
Total users
0
Monthly active users
13 hours ago
Last modified
Categories
Share
US Regulatory & Compliance Data Search
Apify Actor that queries the SIP Public Data Gateway over HTTPS — no browser, no scraping. Provide search terms and toggle the regulatory domains you need; the Actor fires all queries in parallel and returns normalized compliance records enriched with source metadata.
Covers 33 active US regulatory products across EPA environmental enforcement, NHTSA vehicle safety, OFAC sanctions, FDIC bank data, CFPB consumer complaints, FEMA disaster records, EIA energy statistics, NREL renewable resource data, and CAISO electricity market prices.
What it does
For each search term you provide, the Actor:
- Determines which product categories are enabled (environmental, vehicle safety, financial, consumer protection, disaster, energy).
- Fires all enabled product searches simultaneously using
asyncio.gather— up to 33 parallel requests per term. - Enriches every result row with
_product_id,_source,_search_term, and_collected_at. - Pushes all rows to the Apify dataset in a single batch per term.
This makes it fast: a full run across all 33 products with a single search term typically completes in under 10 seconds.
Data sources
Environmental (EPA) — 3 products
| Source | product_id | What it contains |
|---|---|---|
| EPA Toxic Release Inventory | us_epa_tri_facilities | Industrial facilities reporting toxic chemical releases under EPCRA Section 313 |
| EPA Greenhouse Gas Reporting | us_epa_ghg_facilities | Large emitter facilities reporting GHG emissions under the GHGRP |
| EPA Drinking Water Systems | us_epa_drinking_water_systems | Public water systems regulated under the Safe Drinking Water Act |
Vehicle Safety (NHTSA) — 3 products
| Source | product_id | What it contains |
|---|---|---|
| NHTSA Vehicle Recalls | us_nhtsa_recalls | Safety-related defect and compliance recalls by make/model/year |
| NHTSA Consumer Complaints | us_nhtsa_complaints | Vehicle safety complaints filed by consumers with NHTSA |
| NHTSA VIN Decoder | us_nhtsa_vin_decoder | Vehicle specifications decoded from VIN |
Financial & Regulatory (OFAC, FDIC, Treasury) — 5 products
| Source | product_id | What it contains |
|---|---|---|
| OFAC Sanctions List | us_ofac_sanctions | SDN and consolidated sanctions list (individuals, entities, vessels) |
| FDIC Bank Failures | us_fdic_failures | Historical bank and thrift failures since 2000 |
| FDIC Financial Data | us_fdic_financials | Call report financials for FDIC-insured institutions |
| Treasury Debt Data | us_treasury_debt | US public debt outstanding and debt limit data |
| Treasury Exchange Rates | us_treasury_exchange_rates | Official Treasury foreign currency exchange rates |
Consumer Protection (CFPB) — 1 product
| Source | product_id | What it contains |
|---|---|---|
| CFPB Complaints | us_cfpb_complaints | Consumer financial product complaints (mortgages, credit cards, student loans, etc.) |
Disaster & Insurance (FEMA) — 2 products
| Source | product_id | What it contains |
|---|---|---|
| FEMA Disaster Declarations | us_fema_disaster_declarations | Presidentially declared disasters by state, county, and incident type |
| FEMA NFIP Claims | us_fema_nfip_claims | National Flood Insurance Program claim records |
Energy (EIA, BLS, NRC, NREL, CAISO) — 19 products
| Source | product_id | What it contains |
|---|---|---|
| EIA CO2 Emissions | us_eia_co2_emissions | Energy-related CO2 emissions by sector and fuel |
| EIA Coal Production | us_eia_coal_production | US coal production by region and mine type |
| EIA Electricity Generation | us_eia_electricity_generation | Net electricity generation by source and state |
| EIA Electricity Retail Sales | us_eia_electricity_retail_sales | Retail electricity sales, revenue, and customers by sector |
| EIA International | us_eia_international | International energy statistics (production, consumption, trade) |
| EIA Natural Gas Prices | us_eia_natural_gas_prices | Henry Hub and city-gate natural gas price series |
| EIA Nuclear Outages | us_eia_nuclear_outages | Nuclear plant outage and capacity data |
| EIA Petroleum Prices | us_eia_petroleum_prices | Crude oil and petroleum product spot and retail prices |
| EIA State Energy Data (SEDS) | us_eia_seds_consumption | State-level energy consumption by sector and fuel type |
| EIA Short-Term Energy Outlook | us_eia_steo_forecasts | STEO price and supply/demand forecasts |
| BLS Energy CPI | us_bls_energy_cpi | Bureau of Labor Statistics Consumer Price Index for energy |
| NRC Reactor Status | us_nrc_reactor_status | Daily nuclear reactor power output and operational status |
| NREL Alt Fuel Stations | us_nrel_alt_fuel_stations | Alternative fuel station locations (EV, CNG, LPG, hydrogen, etc.) |
| NREL PVWatts | us_nrel_pvwatts | Solar PV system performance estimates by location |
| NREL Solar Resource | us_nrel_solar_resource | Direct normal irradiance and global horizontal irradiance data |
| NREL Utility Rates | us_nrel_utility_rates | Utility electricity rate structures and pricing |
| CAISO Day-Ahead LMP | us_caiso_day_ahead_lmp | California ISO day-ahead locational marginal prices |
| CAISO Real-Time LMP | us_caiso_realtime_lmp | California ISO real-time locational marginal prices |
| CAISO Renewable Forecast | us_caiso_renewable_forecast | California ISO renewable energy generation forecasts |
Use cases
Compliance screening — Run a company name against OFAC sanctions and CFPB complaints in one call. Essential for KYC/KYB workflows, financial onboarding, and third-party risk management.
ESG reporting — Pull EPA TRI and GHG facility data to assess the environmental footprint of a supplier, portfolio company, or property address. Combine with EIA energy consumption data for scope 3 analysis.
Risk assessment — Identify flood-prone properties with FEMA NFIP claims history, check vehicle fleets against NHTSA recalls, and monitor FDIC-insured counterparties for financial health signals.
Due diligence — Screen acquisition targets and supply chain partners against the full regulatory record: sanctions exposure, environmental violations, consumer complaints, and energy risk.
Regulatory monitoring — Track new FEMA disaster declarations, NRC reactor outages, and NHTSA recalls affecting your industry. Run on a schedule using Apify's built-in cron to get fresh data daily or weekly.
Input
| Field | Type | Default | Description |
|---|---|---|---|
searchTerms | string[] | — | Required. Keywords to search (e.g. ["Texas", "Exxon", "Toyota"]). Each term is queried across all enabled sources in parallel. |
includeEnvironmental | boolean | true | EPA TRI, GHG, and Drinking Water Systems (3 products). |
includeVehicleSafety | boolean | true | NHTSA Recalls, Complaints, VIN Decoder (3 products). |
includeFinancialRegulatory | boolean | true | OFAC, FDIC Failures, FDIC Financials, Treasury Debt, Treasury FX Rates (5 products). |
includeConsumerProtection | boolean | true | CFPB Complaints (1 product). |
includeDisaster | boolean | false | FEMA Disaster Declarations and NFIP Claims (2 products). |
includeEnergy | boolean | false | EIA, BLS, NRC, NREL, CAISO — 19 energy products. |
maxResultsPerSource | integer | 50 | Cap per SIP product call (1–200; values above 200 are clamped to 200). |
Example input (JSON)
{"searchTerms": ["Texas", "Exxon"],"includeEnvironmental": true,"includeVehicleSafety": true,"includeFinancialRegulatory": true,"includeConsumerProtection": true,"includeDisaster": false,"includeEnergy": false,"maxResultsPerSource": 50}
Output
Each dataset item is a single regulatory record enriched with Actor metadata:
| Field | Description |
|---|---|
_product_id | SIP product identifier (e.g. us_epa_tri_facilities) |
_source | Short source label (e.g. epa_tri, ofac_sanctions, nhtsa_recalls) |
_search_term | The search term that produced this row |
_collected_at | ISO 8601 UTC timestamp of collection |
... | All native fields from the SIP product schema |
Example record — EPA TRI facility
{"facility_name": "EXXONMOBIL CHEMICAL PLANT","state": "TX","city": "Baytown","zip": "77520","chemical": "BENZENE","total_releases": 12500,"reporting_year": 2022,"_product_id": "us_epa_tri_facilities","_source": "epa_tri","_search_term": "Exxon","_collected_at": "2026-04-08T09:00:00Z"}
Example record — OFAC sanctions
{"name": "TEXAS OIL TRADING LLC","program": "IRAN","type": "Entity","sdnType": "SDN","title": null,"_product_id": "us_ofac_sanctions","_source": "ofac_sanctions","_search_term": "Texas","_collected_at": "2026-04-08T09:00:00Z"}
Configuration
Set the SIP gateway API key in the Apify Actor environment variables (recommended — mark as secret so it is not stored in run input):
| Variable | Description |
|---|---|
SIP_API_KEY | SIP X-API-Key header value. Mark as secret in Apify. |
Optional overrides:
| Variable | Description |
|---|---|
SIP_API_BASE | Override the gateway root URL. Default: https://sip.myskillstore.dev/api/v1/data. |
For local testing only, you may pass sipApiKey directly in the input JSON. Never use this on Apify — keys stored in run input are visible in run logs.
Pricing (reference)
Approximate positioning: a small per-run platform fee plus ~$3 per 1,000 results. With maxResultsPerSource=50 and 12 enabled products, a single search term may return up to 600 rows. Configure exact USD amounts in the Apify Actor Pricing tab.
Local development
cd "/Users/kane/Projects/Apify Actors/sip-us-compliance-search"python3.13 -m venv .venv.venv/bin/pip install -r requirements.txtexport SIP_API_KEY="your-key"mkdir -p storage/key_value_stores/defaultecho '{"searchTerms": ["Texas"],"includeEnvironmental": true,"includeVehicleSafety": true,"includeFinancialRegulatory": true,"includeConsumerProtection": true,"includeDisaster": false,"includeEnergy": false,"maxResultsPerSource": 5}' > storage/key_value_stores/default/INPUT.jsonapify run
Or run directly with:
$SIP_API_KEY=your-key python -m src
(with INPUT.json placed under storage/key_value_stores/default/ per Apify CLI conventions).
Deploy to Apify
apify loginapify push
Use APIFY_TOKEN or the CLI login — never commit tokens to version control.
Legal disclaimer
This Actor is an API client to your own SIP Public Data Gateway deployment. You are responsible for:
- Complying with SIP terms of use and any applicable data provider agreements.
- Complying with Apify terms of service.
- Lawful use of downstream regulatory data in accordance with applicable data protection, privacy, and export control laws (including OFAC regulations).
- Ensuring your use of government data complies with each agency's data use policies (EPA, NHTSA, FDIC, CFPB, FEMA, EIA, NREL, CAISO, etc.).
This Actor does not provide legal, financial, or compliance advice. Results are for informational purposes only.
Use as MCP Tool (AI Agent Integration)
This Actor works as an MCP tool — AI agents (Claude, GPT, Cursor) can discover and run it automatically.
Quick setup (Claude Desktop / Cursor / VS Code)
Add to your MCP config:
{"mcpServers": {"apify": {"url": "https://mcp.apify.com","headers": {"Authorization": "Bearer YOUR_APIFY_TOKEN"}}}}
Then ask your AI: "Search for chemical spill in US compliance databases"
Direct API call
from apify_client import ApifyClientclient = ApifyClient("YOUR_APIFY_TOKEN")run = client.actor("lentic_clockss/us-compliance-search").call(run_input={"searchTerms": ["chemical spill"], "maxResultsPerSource": 50})for item in client.dataset(run["defaultDatasetId"]).iterate_items():print(item)