EPA Greenhouse Gas Emitters Scraper
Pricing
from $3.00 / 1,000 results
Extract EPA Envirofacts Greenhouse Gas Reporting Program (GHGRP) facility data — names, locations, NAICS codes, and reporting years. Filter by state, year, and industry. No API key required.
Developer
Compute Edge
Maintained by Community
Extract facility data from the EPA's Greenhouse Gas Reporting Program (GHGRP). Access detailed information on major industrial emitters including location, coordinates, reporting year, and industry classification. Filter by state, year, and NAICS code for targeted facility research.
What This Actor Does
This Actor provides direct access to the EPA Envirofacts database—specifically the Public Dimension Facility table from the Greenhouse Gas Reporting Program (GHGRP). It allows you to extract facility-level greenhouse gas emissions reporting data without authentication.
Key Features
- 900,000+ facilities — Access the complete EPA GHGRP dataset
- Flexible filtering — Filter by state, reporting year, and NAICS industry code
- Location data — Latitude/longitude coordinates for mapping and geospatial analysis
- Industry codes — NAICS classification for supply-chain targeting
- No API key required — Direct access to public EPA data
- Batch processing — Efficient extraction in windows of 10,000 records
- Deduplication — Automatic removal of duplicate facilities across reporting years
- Error handling — Graceful fallback for network issues
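The batch-processing feature can be pictured with a small sketch. The Actor's internal implementation is not shown in this README, so the helper name `row_windows` and the zero-based, inclusive row ranges (mirroring the `ROWS/0:9` form in the curl example under Support) are assumptions made for illustration.

```python
from typing import Iterator, Tuple

WINDOW_SIZE = 10_000  # window size stated in Key Features


def row_windows(max_results: int, window: int = WINDOW_SIZE) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (first, last) row ranges that together cover max_results rows."""
    start = 0
    while start < max_results:
        # The final window is shortened so the total never exceeds max_results.
        last = min(start + window, max_results) - 1
        yield start, last
        start = last + 1


for first, last in row_windows(25_000):
    print(f"ROWS/{first}:{last}")
# prints ROWS/0:9999, ROWS/10000:19999, ROWS/20000:24999 (one per line)
```

Capping the last window keeps a request for 25,000 records from fetching a full third batch of 10,000.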
Typical Use Cases
- Environmental Compliance — Monitor which facilities report greenhouse gas emissions in your region
- Supply Chain Mapping — Identify suppliers and vendor locations in high-emission industries
- Geospatial Analysis — Map facilities by location and assess environmental risk
- Industry Research — Discover facility counts and distribution by NAICS code
- Carbon Credit Sourcing — Find eligible offset projects and emissions verification sources
Getting Started
Step 1: Configure Inputs (All Optional)
- State — 2-letter state code (e.g., "CA", "TX"). Leave blank for all states.
- Reporting Year — Year of emissions report (e.g., 2022, 2023). Leave as 0 for all years.
- NAICS Code — Industry code (e.g., "221112"). Leave blank for all industries.
- Max Results — Maximum facilities to return (default 1000, max 50000).
Step 2: Run the Actor
- Fill in your desired filters (or leave all blank to return up to 1,000 facilities, the default Max Results, with no filtering)
- Click Start
- Results appear in the Dataset tab
Step 3: Download Results
Export results as JSON, CSV, or other formats via the Apify UI.
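The three steps above can also be scripted. The following is a minimal sketch using the `apify-client` Python package (a third-party dependency, installed with `pip install apify-client`). The Actor ID is not given in this README, so `actor_id` is a placeholder you must supply, and the choice to pass the token as an argument is an assumption.

```python
from typing import List

# Input keys follow the Input Parameters table in this README.
RUN_INPUT = {"state": "CA", "year": 0, "naicsCode": "221112", "maxResults": 1000}


def run_actor(token: str, actor_id: str, run_input: dict) -> List[dict]:
    """Start the Actor, wait for it to finish, and return all dataset items."""
    # Imported lazily so the module loads even without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call would look like `run_actor(my_token, "<username>/<actor-name>", RUN_INPUT)`, where both arguments in angle brackets and `my_token` are hypothetical values taken from your own Apify account.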
How to scrape EPA greenhouse gas emitter data
Tutorial 1: Find All Power Generation Facilities in California
Goal: Identify electric power generation facilities (NAICS code 221112) reporting emissions in California.
Input configuration:
- State: CA
- NAICS Code: 221112
- Max Results: 1000
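Expected output (record shape only; the sample below shows a Washington facility, whereas a run with State set to CA returns only records whose state is "CA"):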
[{"facilityId": 1000001,"facilityName": "PSE Ferndale Generating Station","city": "FERNDALE","state": "WA","stateName": "Washington","zip": "98248","county": "WHATCOM COUNTY","address1": "5105 LAKE TERRELL ROAD","latitude": 48.82,"longitude": -122.68,"naicsCode": "221112","reportingYear": 2022},...]
Use case: Feed facility locations into a mapping tool to visualize power plants, or cross-reference with utility service territories to identify energy infrastructure.
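To feed results into a mapping tool, one common route is GeoJSON. The sketch below is an illustration rather than part of the Actor: the function name `to_geojson` is hypothetical, and the field names come from the Output Fields table. Records with null coordinates are skipped, since the table notes that latitude and longitude may be unavailable.

```python
from typing import Iterable


def to_geojson(records: Iterable[dict]) -> dict:
    """Convert facility records to a GeoJSON FeatureCollection of points."""
    features = []
    for rec in records:
        lat, lon = rec.get("latitude"), rec.get("longitude")
        if lat is None or lon is None:
            continue  # cannot place a facility without coordinates
        features.append({
            "type": "Feature",
            # GeoJSON orders coordinates as [longitude, latitude].
            "geometry": {"type": "Point", "coordinates": [lon, lat]},
            "properties": {k: v for k, v in rec.items()
                           if k not in ("latitude", "longitude")},
        })
    return {"type": "FeatureCollection", "features": features}


sample = [
    {"facilityId": 1000001, "facilityName": "PSE Ferndale Generating Station",
     "latitude": 48.82, "longitude": -122.68},
    {"facilityId": 2, "facilityName": "No coordinates", "latitude": None, "longitude": None},
]
collection = to_geojson(sample)
print(len(collection["features"]))  # → 1
```

The resulting dictionary can be written with `json.dump` and opened in most GIS tools.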
Tutorial 2: Track Year-Over-Year Facility Changes in Texas
Goal: Compare facilities reporting emissions in Texas for 2021 vs. 2022 to identify new emitters or facility closures.
Input configuration (Run 1):
- State: TX
- Reporting Year: 2021
- Max Results: 10000
Input configuration (Run 2):
- State: TX
- Reporting Year: 2022
- Max Results: 10000
Expected workflow:
- Export Run 1 results as CSV
- Export Run 2 results as CSV
- Use a spreadsheet to compare facility IDs between the two years
- Identify new facilities (in Run 2 but not Run 1) and closed facilities (in Run 1 but not Run 2)
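The spreadsheet comparison in this workflow can also be done with a short script. This is a sketch under two assumptions: the CSV exports contain a `facilityId` column matching the Output Fields table, and the file names in the usage note are hypothetical.

```python
import csv
from typing import Iterable, List, Set, Tuple


def load_csv(path: str) -> List[dict]:
    """Read an exported CSV into a list of row dictionaries."""
    with open(path, newline="", encoding="utf-8") as fh:
        return list(csv.DictReader(fh))


def facility_ids(rows: Iterable[dict]) -> Set[str]:
    """Collect facility IDs as strings so CSV and JSON sources compare equal."""
    return {str(row["facilityId"]).strip() for row in rows}


def compare_years(earlier: Iterable[dict], later: Iterable[dict]) -> Tuple[Set[str], Set[str]]:
    """Return (ids only in the later run, ids only in the earlier run)."""
    old_ids, new_ids = facility_ids(earlier), facility_ids(later)
    return new_ids - old_ids, old_ids - new_ids
```

Usage would be along the lines of `added, dropped = compare_years(load_csv("tx_2021.csv"), load_csv("tx_2022.csv"))`. A facility missing from the later year has not necessarily closed: as the FAQ notes, it may have been acquired or fallen below the reporting threshold. The comparison is also only reliable if neither run was truncated by Max Results.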
Use case: Track industrial development trends, identify emerging pollution sources, or monitor facility lifecycle changes.
Tutorial 3: Survey Chemical Manufacturing Facilities Nationwide
Goal: Identify all chemical manufacturing facilities (NAICS codes 325xxx) reporting greenhouse gas emissions.
Input configuration:
- State: (blank — all states)
- NAICS Code: 325 (matches all 6-digit codes starting with 325)
- Max Results: 50000
Expected output: A dataset of all chemical manufacturers reporting GHGRP emissions, with their locations and year of most recent report.
Use case: Build a database of chemical manufacturers for regulatory research, supplier discovery, or environmental risk assessment.
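Once the nationwide dataset is downloaded, a quick summary helps with the survey. The sketch below runs client-side on exported records; it does not show how the Actor itself matches NAICS prefixes, which this README does not document. The helper names and the four sample records are invented purely for illustration.

```python
from collections import Counter
from typing import Iterable, List


def with_naics_prefix(records: Iterable[dict], prefix: str) -> List[dict]:
    """Keep records whose NAICS code starts with the given prefix."""
    return [r for r in records if str(r.get("naicsCode") or "").startswith(prefix)]


def count_by(records: Iterable[dict], field: str) -> Counter:
    """Count records per value of a field, grouping missing values as UNKNOWN."""
    return Counter(r.get(field) or "UNKNOWN" for r in records)


# Invented sample rows, not real EPA data.
sample = [
    {"state": "TX", "naicsCode": "325110"},
    {"state": "TX", "naicsCode": "325199"},
    {"state": "LA", "naicsCode": "325311"},
    {"state": "CA", "naicsCode": "221112"},
]
chemical = with_naics_prefix(sample, "325")
print(count_by(chemical, "state").most_common())  # → [('TX', 2), ('LA', 1)]
```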
Input Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| state | string | "" | 2-letter state code (e.g., "CA", "TX"). Blank = all states. Case-insensitive (input "ca" works). |
| year | integer | 0 | Reporting year (e.g., 2022). Leave as 0 to include all years. |
| naicsCode | string | "" | NAICS industry code (e.g., "221112" or "325" for partial match). Blank = all industries. |
| maxResults | integer | 1000 | Maximum facilities to return (minimum 1, maximum 50000). |
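
The constraints in the table can be checked before starting a run. This is a client-side sketch only: the README does not say whether the Actor rejects or silently adjusts out-of-range values, so the function name `normalize_input` and the choice to raise `ValueError` are assumptions.

```python
def normalize_input(state: str = "", year: int = 0,
                    naics_code: str = "", max_results: int = 1000) -> dict:
    """Validate values against the documented limits and build the run input."""
    state = state.strip().upper()  # the table says state is case-insensitive
    if state and not (len(state) == 2 and state.isascii() and state.isalpha()):
        raise ValueError("state must be a 2-letter code such as 'CA'")
    naics_code = naics_code.strip()
    if naics_code and not (naics_code.isascii() and naics_code.isdigit()
                           and len(naics_code) <= 6):
        raise ValueError("naicsCode must be 1 to 6 digits")
    if year < 0:
        raise ValueError("year must be 0 (all years) or a positive year")
    if not 1 <= max_results <= 50000:
        raise ValueError("maxResults must be between 1 and 50000")
    return {"state": state, "year": year,
            "naicsCode": naics_code, "maxResults": max_results}


print(normalize_input(state="ca", naics_code="325"))
# → {'state': 'CA', 'year': 0, 'naicsCode': '325', 'maxResults': 1000}
```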
Output Fields
Each record includes:
| Field | Type | Description |
|---|---|---|
| facilityId | integer | Unique EPA facility identifier |
| facilityName | string | Name of the facility |
| city | string | City where facility is located |
| state | string | 2-letter state code |
| stateName | string | Full state name (e.g., "California") |
| zip | string | ZIP code |
| county | string | County name |
| address1 | string | Street address |
| latitude | number | Latitude for mapping (null if unavailable) |
| longitude | number | Longitude for mapping (null if unavailable) |
| naicsCode | string | NAICS industry code |
| reportingYear | integer | Year of emissions report |
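
For typed downstream code, the fields above map naturally onto a small record class. The following is one possible sketch; the class name `Facility` and its snake_case attribute names are illustrative choices, not something the Actor provides.

```python
from dataclasses import dataclass
from typing import Optional


def _opt_float(value) -> Optional[float]:
    """Convert to float, passing through null coordinates as None."""
    return None if value is None else float(value)


@dataclass(frozen=True)
class Facility:
    facility_id: int
    facility_name: str
    city: str
    state: str
    state_name: str
    zip_code: str  # kept as a string so leading zeros survive
    county: str
    address1: str
    latitude: Optional[float]
    longitude: Optional[float]
    naics_code: str
    reporting_year: int

    @classmethod
    def from_record(cls, rec: dict) -> "Facility":
        """Build a Facility from one dataset item using the documented keys."""
        return cls(
            facility_id=int(rec["facilityId"]),
            facility_name=rec["facilityName"],
            city=rec["city"],
            state=rec["state"],
            state_name=rec["stateName"],
            zip_code=str(rec["zip"]),
            county=rec["county"],
            address1=rec["address1"],
            latitude=_opt_float(rec.get("latitude")),
            longitude=_opt_float(rec.get("longitude")),
            naics_code=str(rec["naicsCode"]),
            reporting_year=int(rec["reportingYear"]),
        )

    @property
    def has_coordinates(self) -> bool:
        return self.latitude is not None and self.longitude is not None
```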
Pricing
This Actor uses no API keys or subscriptions — it accesses freely available EPA public data.
- Compute cost: ~$0.0001-0.001 per run (depends on result volume and API latency)
- Typical cost per batch: $0.01-0.05 for 1000 facilities
- Bulk runs (10,000+ facilities): ~$0.10-0.25 per run
The EPA data itself is completely free — you pay only for Apify compute time.
Output Example
{"facilityId": 1001234,"facilityName": "Acme Chemical Plant #3","city": "Houston","state": "TX","stateName": "Texas","zip": "77001","county": "HARRIS COUNTY","address1": "1234 Industrial Blvd","latitude": 29.7604,"longitude": -95.3698,"naicsCode": "325110","reportingYear": 2022}
Troubleshooting
"No records found"
- Verify the state code is valid (2 letters, e.g., "CA" not "California")
- Check that the NAICS code is correct (e.g., "221112" for power generation)
- Try running without filters (leave blank) to fetch all facilities
- Note: Not all states or years have reports; try a different state or reporting year
"Actor timed out"
- Reduce maxResults to a smaller number (e.g., 1000 instead of 10000)
- Add a state filter to narrow the dataset
- Try running multiple times with different years
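Splitting one large request into per-year runs, as the last tip suggests, can be prepared with a couple of helpers. This is a sketch: the helper names are hypothetical, and the merge step assumes a record is uniquely identified by its `facilityId` and `reportingYear` pair.

```python
from typing import Dict, Iterable, List, Tuple


def inputs_per_year(base: dict, years: Iterable[int]) -> List[dict]:
    """Create one run input per reporting year from a shared base input."""
    return [{**base, "year": year} for year in years]


def merge_runs(*runs: Iterable[dict]) -> List[dict]:
    """Combine results from several runs, keeping the first copy of each record."""
    merged: Dict[Tuple, dict] = {}
    for run in runs:
        for rec in run:
            merged.setdefault((rec["facilityId"], rec["reportingYear"]), rec)
    return list(merged.values())


base = {"state": "TX", "naicsCode": "", "maxResults": 10000}
for run_input in inputs_per_year(base, [2021, 2022]):
    print(run_input["year"], run_input["state"])
# prints "2021 TX" then "2022 TX"
```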
"Empty dataset"
- The combination of filters may not match any facilities
- Try removing filters one at a time to identify which filter is too restrictive
- Use the EPA's GHGRP search at https://ghgrp.epa.gov/ghgrpublicsearch/ to manually verify data exists for your filters
FAQ
Q: What is the EPA's Greenhouse Gas Reporting Program (GHGRP)?
A: GHGRP is a mandatory federal program requiring large industrial emitters (primarily ≥25,000 metric tons CO₂e/year) to report greenhouse gas emissions annually. The data is public and updated yearly.
Q: Why are some facilities missing?
A: Not all facilities report every year. Some may have closed, been acquired, or fall below the reporting threshold. Facilities in certain sectors (e.g., agriculture, smaller operations) may not be included.
Q: Can I filter by emissions volume?
A: No. This Actor returns facility records only, and the output contains no emissions values. To filter by emissions magnitude, join the facility IDs against the EPA's detailed emissions reports (available separately).
Q: How often is the EPA data updated?
A: Facilities submit annual reports by March 31 of the following year, and the EPA typically publishes the data in the fall. Data is refreshed when EPA updates their Envirofacts service.
Q: Can I use this for compliance monitoring?
A: Yes — the data is official and public. However, verify facility details against official EPA records before making compliance decisions.
Related Actors
Looking for other EPA or environmental data sources?
- EPA TRI Scraper — Toxic Release Inventory facility data
- EPA ECHO Scraper — Enforcement and Compliance History Online (ECHO) environmental permit and inspection records
- CISA KEV Scraper — Cybersecurity vulnerability intelligence (for compliance automation)
Legal & Support
Data Source: All data is sourced from the EPA's Envirofacts service (https://data.epa.gov/), which is in the public domain and freely available.
Disclaimer: This Actor provides access to publicly available EPA data. Verify facility records against official EPA GHGRP publications (https://ghgrp.epa.gov/) before using in regulatory or compliance contexts. The Actor is not affiliated with or endorsed by the EPA.
Terms of Use: Respect EPA's data usage terms. The EPA generally allows free public reuse of data; always attribute the EPA as the source and include a link to the original GHGRP publication.
Support: If you encounter issues:
- Verify your state code (2 letters) or NAICS code (up to 6 digits; a prefix such as "325" is accepted for partial matching)
- Test the EPA API directly:
curl "https://data.epa.gov/efservice/PUB_DIM_FACILITY/STATE/CA/ROWS/0:9/JSON"
- Check EPA Envirofacts: https://data.epa.gov/
- Open an issue on Apify Community or contact support
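The curl check in the list above has a direct Python equivalent using only the standard library. The URL pattern is copied from that curl command. The function names are illustrative, and the assumption that the service responds with a JSON array has not been verified here.

```python
import json
import urllib.request

BASE_URL = "https://data.epa.gov/efservice/PUB_DIM_FACILITY"


def envirofacts_url(state: str, first: int = 0, last: int = 9) -> str:
    """Build the same request URL as the curl example for a state and row range."""
    return f"{BASE_URL}/STATE/{state.strip().upper()}/ROWS/{first}:{last}/JSON"


def fetch_rows(url: str, timeout: float = 30.0):
    """Download and decode the JSON response (performs a network request)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


print(envirofacts_url("ca"))
# → https://data.epa.gov/efservice/PUB_DIM_FACILITY/STATE/CA/ROWS/0:9/JSON
```

If `fetch_rows(envirofacts_url("CA"))` returns rows while the Actor returns none, the problem is more likely in the Actor input than in the EPA service.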
Built with ❤️ for environmental researchers, supply chain analysts, and sustainability professionals.