# EPA Envirofacts Facilities Scraper (`parseforge/epa-envirofacts-facilities-scraper`) Actor

Reach the US EPA Envirofacts registry and pull regulated facility records with facilityId, primary name, full address, city, state, zip, latitude, longitude, NAICS code, EPA region, county, and operating status. Filter by table, state, city, zip, or NAICS for compliance research.

- **URL**: https://apify.com/parseforge/epa-envirofacts-facilities-scraper.md
- **Developed by:** [ParseForge](https://apify.com/parseforge) (community)
- **Categories:** Automation, Integrations, Lead generation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $7.50 / 1,000 results

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.
Since this Actor supports Apify Store discounts, the price gets lower the higher subscription plan you have.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

![ParseForge Banner](https://github.com/ParseForge/apify-assets/blob/ad35ccc13ddd068b9d6cba33f323962e39aed5b2/banner.jpg?raw=true)

## 🏭 EPA Envirofacts Public Facilities Scraper

> 🚀 **Export EPA Envirofacts public facility records in seconds. Names, addresses, NAICS codes, lat-long, and program affiliations for US regulated sites straight from the official data.epa.gov endpoint into a clean dataset.**

> 🕒 **Last updated:** 2026-06-05 · **📊 12 fields** per record · 4 million+ facilities · FRS, TRI, RCRA, NPDES, Air, SDWIS · State, city, ZIP, NAICS filters

The EPA Envirofacts Public Facilities Scraper turns EPA's master facility registry into a structured dataset. It calls the public Envirofacts REST endpoint against the table you choose (FRS, TRI, RCRA, ICIS-Air, NPDES, SDWIS) with optional state, city, ZIP, or NAICS filters, parses the response, and flattens each facility into one row.

You can pull the EPA Facility Registry Service (FRS) master record or zoom into program-specific tables for Toxics Release Inventory facilities, hazardous waste handlers under RCRA, water dischargers under NPDES, air emitters under ICIS-Air, or drinking water systems under SDWIS.

| 🎯 Target Audience | 💡 Primary Use Cases |
|---|---|
| 🌿 Environmental analysts | Map US regulated facilities by program |
| 🏢 ESG and compliance teams | Vendor and supplier screening |
| 🏛️ Local government | Inventory regulated sites in a jurisdiction |
| 🎓 Researchers | Geographic and industry studies of environmental burden |
| 📰 Journalists | Locate facilities near a community for investigative stories |
| 👩‍💻 Developers | Mirror EPA facility data into your own database |

### 📋 What this scraper does

- Calls the official Envirofacts REST endpoint against the table you select.
- Applies optional state, city, ZIP, and NAICS filters via the Envirofacts URL path syntax.
- Paginates through the result set automatically up to your `maxItems` cap.
- Normalizes fields across program tables to a common shape (facility ID, name, address, lat-long, NAICS, programs).
- Surfaces upstream errors as a single diagnostic record instead of crashing.

> 💡 **Why it matters:** Envirofacts is one of the most authoritative public registries of US regulated facilities, but its URL-path filter syntax is unusual and trips up most clients. This actor handles the syntax for you and returns clean, tabular rows.

### 🎬 Full Demo

_🚧 Coming soon._

### ⚙️ Input

<table>
<tr><th>Field</th><th>Type</th><th>Required</th><th>Description</th></tr>
<tr><td><code>table</code></td><td>enum</td><td>No</td><td>Envirofacts table. Default <code>frs.frs_facility_site</code>.</td></tr>
<tr><td><code>maxItems</code></td><td>integer</td><td>No</td><td>Free users 10. Paid users up to 1,000,000. Prefill 10.</td></tr>
<tr><td><code>state</code></td><td>string</td><td>No</td><td>Two-letter US state code.</td></tr>
<tr><td><code>city</code></td><td>string</td><td>No</td><td>City name.</td></tr>
<tr><td><code>zip</code></td><td>string</td><td>No</td><td>ZIP code (or ZIP prefix).</td></tr>
<tr><td><code>naics</code></td><td>string</td><td>No</td><td>NAICS industry code (or prefix).</td></tr>
</table>

**Example 1, all FRS facilities in California:**
````

{
"table": "frs.frs\_facility\_site",
"state": "CA",
"maxItems": 1000
}

```

**Example 2, TRI facilities with NAICS 3241 (petroleum and coal manufacturing):**
```

{
"table": "tri.tri\_facility",
"naics": "3241",
"maxItems": 500
}

````

> ⚠️ **Good to Know:** Envirofacts is a fully public service. No API key is required. Result sets can be large (millions of FRS records), so set `maxItems` thoughtfully.

### 📊 Output

Each record is a flat object. The `error` field is always last.

| Field | Type | Description |
|---|---|---|
| 🆔 `facilityId` | string | Envirofacts registry or program facility ID. |
| 🏭 `name` | string | Primary facility name. |
| 📍 `address` | string | Location street address. |
| 🏙️ `city` | string | City. |
| 🗺️ `state` | string | Two-letter US state code. |
| 📮 `zip` | string | ZIP code. |
| 🌐 `latitude` | number | Latitude (NAD83 when available). |
| 🌐 `longitude` | number | Longitude (NAD83 when available). |
| 🏷️ `naics` | string | NAICS industry code. |
| 📋 `programs` | string | EPA program acronyms or interest types. |
| 🕒 `scrapedAt` | string | ISO timestamp when the row was fetched. |
| ❌ `error` | string | Set if the upstream response was an error. |

### ✨ Why choose this Actor

| 🆓 | Works with EPA's fully public Envirofacts service. No authentication needed. |
| 🧹 | Normalizes field names across FRS, TRI, RCRA, NPDES, ICIS-Air, and SDWIS. |
| 📍 | Returns geocoordinates so you can map facilities directly. |
| 🛟 | Surfaces upstream errors as a clean diagnostic record. |
| 🔌 | Handles Envirofacts URL-path filter syntax for you. |
| 💾 | Push to dataset for instant export. |

### 📈 How it compares to alternatives

| Approach | Setup time | Path-filter syntax | Cross-table normalization | Error handling |
|---|---|---|---|---|
| Roll your own fetch | 1 hour | Manual | Manual | None |
| EPA's own report builder UI | Slow, manual | Built in | Limited | Limited |
| **This Actor** | 5 seconds, no install | Built in | Built in | Built in |

### 🚀 How to use

1. Click **Try for free**.
2. Pick a table (default is the FRS master registry).
3. (Optional) Filter by state, city, ZIP, or NAICS.
4. Click **Start**. Within seconds your dataset is ready.

### 💼 Business use cases

**🌿 Environmental due diligence.** Screen suppliers for active hazardous waste handler status under RCRA.

**🏢 ESG screening.** Cross-reference vendor addresses with TRI facilities for chemical-release exposure.

**🏛️ Local government inventory.** Pull every regulated facility in a city or ZIP for environmental justice studies.

**🤖 ML training data.** Build geographic features tied to environmental program affiliations.

### 🔌 Automating this scraper

- **Make and Zapier** trigger this actor on a monthly cadence, push results to Airtable or Google Sheets.
- **Cron schedule** uses Apify's native scheduler.
- **Webhooks** notify your endpoint the moment a run finishes.
- **Pipe to BigQuery, Snowflake, or Postgres** with native Apify integrations.

### 🌟 Beyond business use cases

**🎓 Education.** Teach environmental policy with real EPA facility data.

**🧪 Personal research.** Map regulated facilities near your community.

**🤝 Non-profit and open data.** Build public dashboards of environmental burden by ZIP or NAICS.

**🧰 Tinkering and prototyping.** Spin up a facility data feed in seconds for a mapping demo.

### 🤖 Ask an AI assistant about this scraper

Pop this README into Claude, ChatGPT, or any AI assistant and ask it to map your specific workflow to the actor's inputs.

### ❓ Frequently Asked Questions

**❓ Do I need an API key?** No. Envirofacts is fully public.

**❓ Which table should I start with?** `frs.frs_facility_site` is the master registry. Program tables (TRI, RCRA, NPDES) give program-specific detail.

**❓ Can I filter by ZIP prefix?** Yes. The scraper uses Envirofacts BEGINNING operator so a partial ZIP works.

**❓ Are coordinates cast to numbers?** Yes. Latitude and longitude come back as real numbers when available.

**❓ How do you handle errors?** Upstream errors are pushed as a single record with the `error` field populated instead of crashing.

**❓ Can I schedule runs?** Yes, use Apify's native scheduler or hook into Make or Zapier.

**❓ Is this scraping or an API?** API. The Envirofacts REST endpoint is the official public interface.

**❓ Will the schema change?** Core normalized fields are stable. Table-specific extras are passed through as-is.

**❓ What format can I download?** Use any export format offered by the Apify dataset UI.

**❓ How large is the FRS table?** About 4 million facilities. Set `maxItems` and filters to keep runs efficient.

### 🔌 Integrate with any app

Apify ships native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST API or webhook endpoint.

### 🔗 Recommended Actors

| Actor | What it does |
|---|---|
| [ParseForge USDA ERS Scraper](https://apify.com/parseforge) | USDA farm-sector data. |
| [ParseForge Denmark Statbank Scraper](https://apify.com/parseforge) | Denmark official statistics. |
| [ParseForge OurAirports Scraper](https://apify.com/parseforge/ourairports-scraper) | Global airport database. |
| [ParseForge Alpha Vantage Scraper](https://apify.com/parseforge) | Market data and indicators. |

> 💡 **Pro Tip:** browse the complete [ParseForge collection](https://apify.com/parseforge) for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.

---

**Disclaimer:** This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by any of the third-party services referenced. Users are responsible for complying with the target site's terms of service and applicable law. [Create a free account w/ $5 credit](https://console.apify.com/sign-up?fpr=vmoqkp).

# Actor input Schema

## `table` (type: `string`):

Envirofacts facility table to query.
## `maxItems` (type: `integer`):

Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000.
## `state` (type: `string`):

Optional two-letter US state code to filter facilities (for example CA, TX, NY).
## `city` (type: `string`):

Optional city name to filter facilities.
## `zip` (type: `string`):

Optional ZIP code to filter facilities.
## `naics` (type: `string`):

Optional NAICS industry code to filter facilities.

## Actor input object example

```json
{
  "table": "frs.frs_facility_site",
  "maxItems": 10
}
````

# Actor output Schema

## `results` (type: `string`):

No description

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "maxItems": 10
};

// Run the Actor and wait for it to finish
const run = await client.actor("parseforge/epa-envirofacts-facilities-scraper").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = { "maxItems": 10 }

# Run the Actor and wait for it to finish
run = client.actor("parseforge/epa-envirofacts-facilities-scraper").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "maxItems": 10
}' |
apify call parseforge/epa-envirofacts-facilities-scraper --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=parseforge/epa-envirofacts-facilities-scraper",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "EPA Envirofacts Facilities Scraper",
        "description": "Reach the US EPA Envirofacts registry and pull regulated facility records with facilityId, primary name, full address, city, state, zip, latitude, longitude, NAICS code, EPA region, county, and operating status. Filter by table, state, city, zip, or NAICS for compliance research.",
        "version": "0.1",
        "x-build-id": "fZAbkQiPOSaQNRDdi"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/parseforge~epa-envirofacts-facilities-scraper/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-parseforge-epa-envirofacts-facilities-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/parseforge~epa-envirofacts-facilities-scraper/runs": {
            "post": {
                "operationId": "runs-sync-parseforge-epa-envirofacts-facilities-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/parseforge~epa-envirofacts-facilities-scraper/run-sync": {
            "post": {
                "operationId": "run-sync-parseforge-epa-envirofacts-facilities-scraper",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "properties": {
                    "table": {
                        "title": "Envirofacts table",
                        "enum": [
                            "frs.frs_facility_site",
                            "frs.frs_program_facility",
                            "tri.tri_facility",
                            "rcra.rcra_facility",
                            "air.icis_air_facility",
                            "npdes.npdes_facility",
                            "sdwis.water_system"
                        ],
                        "type": "string",
                        "description": "Envirofacts facility table to query.",
                        "default": "frs.frs_facility_site"
                    },
                    "maxItems": {
                        "title": "Max Items",
                        "minimum": 1,
                        "maximum": 1000000,
                        "type": "integer",
                        "description": "Free users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000."
                    },
                    "state": {
                        "title": "State",
                        "type": "string",
                        "description": "Optional two-letter US state code to filter facilities (for example CA, TX, NY)."
                    },
                    "city": {
                        "title": "City",
                        "type": "string",
                        "description": "Optional city name to filter facilities."
                    },
                    "zip": {
                        "title": "ZIP code",
                        "type": "string",
                        "description": "Optional ZIP code to filter facilities."
                    },
                    "naics": {
                        "title": "NAICS code",
                        "type": "string",
                        "description": "Optional NAICS industry code to filter facilities."
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
