EDGAR JRC Global GHG Emissions Database Scraper

Pricing

Pay per event

Pull country, sector, gas, and year GHG emissions from EU JRC EDGAR — the most-cited global GHG dataset (IPCC AR6, EU Green Deal). CO2, CH4, N2O, F-gases and AR5 CO2e totals 1970-2023. Includes IEA-EDGAR CO2 fuel-combustion subset. No auth or proxy needed.

Developer

BowTiedRaccoon

Maintained by Community

Pull country-level greenhouse gas emissions from the European Commission's JRC EDGAR database, one of the most widely cited global GHG datasets. It is referenced by IPCC AR6 and the EU Green Deal and is used throughout the climate research literature.

What it does

The scraper fetches the EDGAR dataset index page to discover the current release's ZIP manifest, downloads the requested gas bundles from the EU JRC open-data server, parses the XLSX files inside each ZIP, and emits one record per country × IPCC sector × gas × year. No authentication. No proxy. No rate-limit gymnastics. The EU Commission just puts the data out there.

Coverage:

  • Gases: CO2 (IEA-EDGAR fuel combustion), CO2bio, CH4, N2O, F-gases, AR5_GHG (CO2-equivalent totals, GWP-100)
  • Countries: ~250 (ISO-3 codes)
  • Years: 1970–2023
  • Sectors: IPCC 2006 classification (~25 per country per gas) or totals-only mode
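
For a sense of scale, the approximate counts above imply a rough upper bound on rows per gas. The sketch below is plain arithmetic on those figures, with an illustrative function name; real output may be smaller wherever a country has no data for a sector or year.

```python
def max_rows_per_gas(countries: int = 250, sectors: int = 25,
                     year_from: int = 1970, year_to: int = 2023) -> int:
    """Upper bound on rows for one gas: countries x sectors x years."""
    years = year_to - year_from + 1  # the year range is inclusive
    return countries * sectors * years


print(max_rows_per_gas())                # 337500 for the full 1970-2023 range
print(max_rows_per_gas(year_from=2000))  # 150000 when starting at 2000
```

Either figure is far above the default maxItems of 200, so a full extract needs that cap raised.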

The IEA-EDGAR CO2 bundle deserves a specific note. It's the closest freely available substitute for IEA's "CO2 Emissions from Fuel Combustion" product, which is paywalled. You get the same underlying methodology. The IEA just charges for the branded report.

Output schema

One row per emission record.

| Field | Type | Description |
|---|---|---|
| country_code | string | ISO-3 country code (e.g. USA, DEU, CHN) |
| country_name | string | Country display name |
| region | string | EDGAR regional group (e.g. Annex_I, EU27) |
| year | integer | Reporting year (1970–2023) |
| gas | string | Gas identifier: CO2, CH4, N2O, F-gases, AR5_GHG |
| sector_ipcc2006 | string | IPCC 2006 sector code (e.g. 1A1a). Empty in totals-only mode. |
| sector_name | string | Human-readable sector name. Empty in totals-only mode. |
| subsector | string | fossil or bio sub-classification (null for totals rows) |
| emission_kt | number | Emission quantity in kt (or kt CO2e for AR5_GHG) |
| emission_unit | string | Always "kt" |
| dataset_version | string | EDGAR release directory, e.g. EDGAR_2024_GHG |
| source_file | string | Source ZIP filename for provenance |
| is_fuel_combustion_iea | boolean | True for IEA-EDGAR CO2 fuel combustion rows |
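
If you consume the output in Python, the schema maps naturally onto a typed record. This is a minimal sketch, not part of the actor: the `EmissionRecord` class and `parse_record` helper are hypothetical names, and every value in the sample row is a made-up placeholder rather than real EDGAR data.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class EmissionRecord:
    country_code: str
    country_name: str
    region: str
    year: int
    gas: str
    sector_ipcc2006: str
    sector_name: str
    subsector: Optional[str]  # null for totals rows
    emission_kt: float
    emission_unit: str
    dataset_version: str
    source_file: str
    is_fuel_combustion_iea: bool


def parse_record(row: dict) -> EmissionRecord:
    """Build a typed record from one output row; raises KeyError on a missing field."""
    return EmissionRecord(**{f.name: row[f.name] for f in fields(EmissionRecord)})


# Placeholder row for illustration only; the numbers and file name are invented.
sample = {
    "country_code": "DEU", "country_name": "Germany", "region": "EU27",
    "year": 2020, "gas": "CH4", "sector_ipcc2006": "1A1a",
    "sector_name": "example sector name", "subsector": None,
    "emission_kt": 1.0, "emission_unit": "kt",
    "dataset_version": "EDGAR_2024_GHG", "source_file": "example.zip",
    "is_fuel_combustion_iea": False,
}
record = parse_record(sample)
print(record.country_code, record.year, record.emission_kt)
```

Failing on a missing key is a deliberate choice here: it surfaces schema drift between releases instead of silently filling gaps.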

Input options

| Parameter | Default | Description |
|---|---|---|
| gases | ["CO2","CH4","N2O","AR5_GHG"] | Gas bundles to download. Valid: CO2, CO2bio, CH4, N2O, F-gases, AR5_GHG. |
| countries | [] (all) | ISO-3 country filter. Empty returns all ~250 countries. |
| yearFrom | 2000 | First year to include. Earliest available: 1970. |
| yearTo | 2023 | Last year to include. |
| includeIeaFuelCombustion | true | Include IEA-EDGAR CO2 fuel combustion bundle. |
| includeSectorBreakdown | true | Emit per-IPCC-sector rows. When false, emit country/gas/year totals only. |
| includeMonthly | false | Download monthly bundles (much larger, slower). |
| maxItems | 200 | Cap on output records. |
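
As an illustration of how these options combine, the sketch below builds an input object from the defaults in the table and applies a few client-side sanity checks before you submit a run. The `build_input` helper and its checks are assumptions made for this example; the actor's own validation may differ.

```python
import copy
import json

VALID_GASES = {"CO2", "CO2bio", "CH4", "N2O", "F-gases", "AR5_GHG"}

# Defaults copied from the input options table.
DEFAULT_INPUT = {
    "gases": ["CO2", "CH4", "N2O", "AR5_GHG"],
    "countries": [],
    "yearFrom": 2000,
    "yearTo": 2023,
    "includeIeaFuelCombustion": True,
    "includeSectorBreakdown": True,
    "includeMonthly": False,
    "maxItems": 200,
}


def build_input(**overrides) -> dict:
    """Merge overrides onto the defaults and check gases and the year range."""
    run_input = {**copy.deepcopy(DEFAULT_INPUT), **overrides}
    unknown_gases = set(run_input["gases"]) - VALID_GASES
    if unknown_gases:
        raise ValueError(f"unknown gases: {sorted(unknown_gases)}")
    if not 1970 <= run_input["yearFrom"] <= run_input["yearTo"]:
        raise ValueError("yearFrom must be >= 1970 and <= yearTo")
    return run_input


# Country totals only for two countries, with a higher record cap.
run_input = build_input(countries=["DEU", "USA"],
                        includeSectorBreakdown=False, maxItems=5000)
print(json.dumps(run_input, indent=2))
```

The printed JSON can be pasted into the actor's input editor or passed to whichever client you use to start runs.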

Use cases

ESG and supply-chain carbon accounting. Tools like Watershed, Persefoni, and Climatiq use country-level emission factors as baselines, and EDGAR is a standard source for them. This actor removes the need to build and maintain your own ZIP download-and-parse pipeline.

Climate research. IPCC AR6 Working Group III references EDGAR extensively. If you're reproducing or extending that analysis, this is your data source.

Journalism and NGO reporting. EU Green Deal progress tracking, national emissions comparisons, sector-specific breakdowns. The actor returns data in a format that loads directly into a spreadsheet or database.

National statistics offices. Countries building their own GHG inventories cross-check against EDGAR. The sector breakdown by IPCC 2006 codes aligns with standard reporting frameworks.

Technical notes

The actor operates as a data pipeline, not a scraper in the traditional sense. It fetches the EDGAR index page to discover the current release year's ZIP manifest, then downloads each requested bundle sequentially. Files are unzipped in memory and parsed with SheetJS. The wide-format year columns (Y_1970 through Y_2023) are pivoted to long-format rows before output.
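
The wide-to-long pivot described above can be sketched as follows. The actor itself parses with SheetJS; this Python version is only an illustration of the reshaping step. The function name, the decision to skip empty cells, and the sample values are all assumptions made for the example.

```python
import re

YEAR_COLUMN = re.compile(r"^Y_(\d{4})$")


def pivot_wide_to_long(wide_row: dict, year_from: int = 1970,
                       year_to: int = 2023) -> list:
    """Turn one wide row (one Y_<year> column per year) into one dict per year."""
    # Non-year columns are copied unchanged onto every output row.
    fixed = {k: v for k, v in wide_row.items() if not YEAR_COLUMN.match(k)}
    long_rows = []
    for key, value in wide_row.items():
        match = YEAR_COLUMN.match(key)
        if not match or value is None:
            continue  # skip metadata columns and empty cells (an assumption)
        year = int(match.group(1))
        if year_from <= year <= year_to:
            long_rows.append({**fixed, "year": year, "emission_kt": float(value)})
    return long_rows


# Placeholder values for illustration, not real EDGAR figures.
wide = {"country_code": "DEU", "gas": "CH4",
        "Y_2021": 1.5, "Y_2022": None, "Y_2023": 2.5}
for row in pivot_wide_to_long(wide, year_from=2000):
    print(row)
```

Here the empty Y_2022 cell is dropped, so the sample yields two long-format rows, one for 2021 and one for 2023.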

Memory requirement: 2 GB recommended for full multi-gas runs. Each ZIP expands to 6–50 MB once uncompressed; the EDGAR_AR5_GHG bundle is the largest, at roughly 10 MB compressed.

EDGAR releases annually, typically mid-year. The actor auto-detects the current release from the index page, so it survives annual updates without code changes. You can also pin a specific version via the datasetVersion input.

Data source

EU Commission Joint Research Centre — EDGAR (Emissions Database for Global Atmospheric Research)

Cite as: Crippa M., et al. (2024) EDGAR Community GHG Database, European Commission, JRC.