🌱 Open-Data ESG Scoring Inputs
Pricing
Pay per usage
🌱 Open-Data ESG Scoring Inputs
Compile directional ESG signals using public data from OSHA, the EPA, FEC, and SEC. Augment existing models with structured US regulatory enforcement data.
ESG Enhanced Scoring
Evaluating corporate sustainability and governance requires looking beyond marketing claims to actual legal and regulatory footprints. The Open-Data ESG Scoring Inputs scraper delivers a completely transparent, public-record approach to environmental, social, and governance evaluation. Instead of relying on costly black-box ratings from legacy financial data providers, this tool extracts structured enforcement and disclosure data directly from US government portals, including the EPA ECHO system, OSHA establishment databases, SEC EDGAR filings, and FEC contribution records.
Data scientists, investment researchers, and corporate compliance teams use this web scraper to systematically track the operational reality of public and private entities. By scheduling regular crawls of these federal databases, users can automatically monitor shifts in a company’s risk profile. It is an ideal solution for alternative data ingestion, providing the raw material needed to construct custom composite scores or augment existing financial intelligence dashboards.
Extracted details include specific workplace safety violation types, environmental penalty amounts, governance filing dates, and categorized political expenditure signals. Each output row connects a target company to its empirical regulatory history. This allows teams to extract unbiased, high-fidelity data to accurately assess supply chain risks, validate corporate responsibility claims, and build robust, evidence-based ESG assessment frameworks.
Status
Scaffolded as part of Wave 17 Batch S — Tier 3 (strategic / emerging platforms + governance). Domain logic lives in src/workflow.js.
Feasibility
High — US government open-data portals (EPA ECHO, SEC EDGAR, OSHA establishment search, FEC political-donation API) expose enforcement, filings, and violations without authentication. Private-sector ESG scores (MSCI, Sustainalytics) remain out of scope.
V1 scope
Public US government open-data only: EPA ECHO enforcement, SEC EDGAR climate-related filings, OSHA establishment search, FEC committee lookups. Per company: environmental violations, governance filings, workplace-safety citations, political-spend signals, composite E/S/G directional score. OUT OF SCOPE: MSCI / Sustainalytics / Refinitiv proprietary scores, non-US regulators, carbon-accounting (Scope 1/2/3), supply-chain ESG (covered by sibling actor).
Extraction surfaces
- EPA ECHO: https://echo.epa.gov/tools/web-services/facility-search-all-data
- SEC EDGAR: https://data.sec.gov/submissions/CIK{cik}.json
- SEC EDGAR search: https://efts.sec.gov/LATEST/search-index?q={term}
- OSHA establishment: https://www.osha.gov/pls/imis/establishment.search
- FEC OpenFEC: https://api.open.fec.gov/v1/committees (requires free DEMO_KEY)
Known limitations and explicit warnings
- ESG scoring is directional, not a certified rating — this actor surfaces SIGNALS that feed into a score, not a final rating to publish.
- Coverage is US-centric (EPA, SEC, OSHA) — international subsidiaries are underrepresented.
- Company name → facility / establishment matching is fuzzy; exact CIK is always preferred.
- EPA ECHO data lags real enforcement actions by 1-3 months.
- SEC EDGAR climate filings (10-K Item 1A risk factors) are text-heavy; the actor surfaces the filing references, not full NLP-extracted metrics.
- OSHA establishment search is establishment-level, not consolidated at parent-company level.
- FEC DEMO_KEY is shared and rate-limited; consumers should supply their own api.data.gov key for volume use.
- Composite E/S/G numeric scores are simple normalized signal counts — NOT comparable to MSCI or ISS ratings.
- Historical trend data depends on each source's retention policy and is not back-filled by this actor.
- Positive ESG actions (disclosures, green-bond issuance) are NOT weighted as heavily as negative signals in V1 — V2 plans a balanced scoring rubric.
Input
- Company identifiers (name / ticker / CIK / EIN)
- Delivery mode (dataset or webhook)
- Dry-run support for local validation
Output
- Normalized
scoresarray metasection with implementation status, feasibility note, V1 scope, warnings, and notes
Local run
npm testnpm start
⭐ Was this helpful?
If this actor saved you time, please leave a ★ rating on Apify Store. It takes 10 seconds, helps other developers discover it, and keeps updates free.
Bug report or feature request? Open an issue on the Issues tab of this actor.