EPA Toxics Release Inventory (TRI) Scraper avatar

EPA Toxics Release Inventory (TRI) Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
EPA Toxics Release Inventory (TRI) Scraper

EPA Toxics Release Inventory (TRI) Scraper

Extract toxic chemical release data from the EPA Toxics Release Inventory (TRI). Over 3 million records of industrial facility emissions reported since 1987. Filter by state, year, and chemical name.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

Compute Edge

Compute Edge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Extract toxic chemical release data from the EPA Toxics Release Inventory (TRI) database — the most comprehensive public dataset of industrial chemical emissions in the United States. This Actor wraps the official EPA Envirofacts REST API to deliver structured, clean data on over 3.2 million release records reported by US industrial facilities since 1987.

The TRI is a cornerstone dataset for environmental compliance monitoring, ESG due diligence, supply chain risk assessment, and community health research. Every year, thousands of industrial facilities across the US are required by law to report the quantities of toxic chemicals they release into the air, water, and land. This Actor makes that data instantly accessible in JSON, CSV, or Excel format — no API expertise required.

Key Features

FeatureDescription
State filteringExtract releases for any US state using 2-letter codes (CA, TX, NY, etc.)
Year filteringFocus on a specific reporting year (1987-2023)
Chemical filteringSearch for specific chemicals (Lead, Mercury, Benzene, etc.)
Comprehensive data30+ fields per record including facility location, GPS coordinates, parent company, industry sector, release amounts by medium, and chemical classifications
No authenticationUses the free public EPA Envirofacts API — no API key needed
Pagination handledAutomatically pages through large result sets up to 50,000 records

What Data Can You Extract?

Each record represents one chemical reported by one facility in one year. Fields include:

FieldDescription
facilityNameName of the reporting facility
streetAddress, city, state, zipCode, countyFull facility address
latitude, longitudeGPS coordinates for mapping
parentCompanyParent company name
industrySectorIndustry classification (e.g., "Chemicals", "Primary Metals")
chemicalChemical name (e.g., "Lead", "Toluene")
casNumberCAS registry number for chemical identification
totalReleasesTotal releases in pounds
onSiteReleaseTotalOn-site releases (air, water, land)
offSiteReleaseTotalOff-site transfers for disposal
fugitiveAir, stackAir, water, underground, landfillsRelease breakdown by medium
carcinogen, metal, pfas, pbtChemical classification flags
reportingYearYear the release was reported
triFacilityId, frsId, primaryNaicsFacility identifiers

How to Scrape EPA TRI Data

  1. Go to this Actor's page on the Apify Store
  2. Click "Start" to open the input form
  3. Set your filters:
    • Enter a State code (e.g., CA for California) — or leave blank for all states
    • Enter a Reporting Year (e.g., 2023) — or leave blank for all years
    • Enter a Chemical Name (e.g., Lead) — or leave blank for all chemicals
    • Set Max Results (default: 100, max: 50,000)
  4. Click "Start" to run the Actor
  5. Download your data in JSON, CSV, or Excel format from the Dataset tab

Input Example

{
"state": "CA",
"year": 2023,
"chemical": "Lead",
"maxResults": 500
}

Output Example

{
"facilityName": "EXIDE TECHNOLOGIES",
"streetAddress": "2700 S INDIANA ST",
"city": "VERNON",
"state": "CA",
"zipCode": "90058",
"county": "LOS ANGELES",
"latitude": 34.0031,
"longitude": -118.2259,
"parentCompany": "EXIDE TECHNOLOGIES",
"industrySector": "Electrical Equipment",
"chemical": "Lead",
"casNumber": "7439-92-1",
"totalReleases": 1250,
"onSiteReleaseTotal": 850,
"offSiteReleaseTotal": 400,
"reportingYear": "2023",
"carcinogen": "NO",
"metal": "YES",
"pfas": "NO"
}

Pricing

This Actor uses pay-per-result pricing:

EventPrice
Actor start$0.00005
Per result$0.002

The EPA API is free and public. You only pay for Apify compute resources plus the per-result fee above. A typical run of 1,000 records costs approximately $2.00 in Actor fees plus minimal compute costs.

Use Cases

  • ESG & Sustainability Research: Identify the largest polluters in a region or industry for ESG scoring and investment screening
  • Environmental Compliance: Monitor facility emissions over time to track regulatory compliance
  • Supply Chain Due Diligence: Screen suppliers and manufacturing partners for toxic chemical releases
  • Community Health: Map toxic releases near specific communities for public health analysis
  • Journalism & Investigations: Data-driven reporting on industrial pollution trends
  • Academic Research: Environmental science, public health, and policy studies

Integrations

Connect this Actor to your existing workflows:

  • Export to Google Sheets for collaborative analysis
  • Send results to Slack or email for automated alerts
  • Feed into Zapier, Make, or n8n for custom automation
  • Use the Apify API to integrate directly with your application

FAQ

Yes. The EPA Toxics Release Inventory is a public database maintained by the U.S. Environmental Protection Agency. This Actor uses the official EPA Envirofacts REST API, which is free and requires no authentication. All data is publicly available government information.

How much does it cost to scrape EPA TRI?

The Actor charges $0.002 per result plus a $0.00005 Actor start fee. A typical run of 1,000 records costs approximately $2.00 in Actor fees plus minimal compute costs. See the pricing table above for details.

Can I export EPA TRI data to Excel or CSV?

Yes. Apify supports exporting data in JSON, CSV, Excel, XML, HTML, and RSS formats. After the Actor run completes, go to the Dataset tab and choose your preferred export format.

How often is the EPA TRI data updated?

You can schedule this Actor to run at any interval — daily, weekly, or monthly. The EPA updates the TRI database annually, with the most recent data typically from the previous calendar year (e.g., 2023 data available in 2024).

What date range does the TRI data cover?

The TRI database contains toxic chemical release records dating back to 1987, with over 3.2 million release records. You can filter by reporting year to focus on specific time periods.

Other Scrapers by SeatSignal

This Actor accesses publicly available data from the EPA Envirofacts API, a free public service provided by the US Environmental Protection Agency. The data is in the public domain and freely available for any use.

This Actor does not bypass any authentication, does not violate any terms of service, and respects rate limits on the EPA API. The Actor is provided as-is without warranty. Users are responsible for ensuring their use of the data complies with applicable laws and regulations.

For questions or support, please open an issue on this Actor's page.