Pricing

Pay per event

US Energy.gov Data Scraper

Scrape energy-related content from Energy.gov, including articles, press releases, documents, titles, dates, offices, and types. Automate collection of structured data from the U.S. Department of Energy, ideal for researchers, journalists, and professionals needing accurate, up-to-date information.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

24 days ago

Last modified

🔬 Energy.gov Scraper

🚀 Collect U.S. Department of Energy articles, press releases, and documents in seconds. Filter by keyword, office, article type, and language. No coding, no API keys required.

🕒 Last updated: 2026-04-16 · 📊 8 fields · 🏛️ Official DOE data · 🌐 English and Spanish

Pull structured records from US Energy.gov Data — clean fields ready as CSV, JSON, JSONL, Excel, or XML for downstream pipelines.

Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this actor.

parseforge/energy-gov-scraper on Apify. Call: ApifyClient("TOKEN").actor("parseforge/energy-gov-scraper").call(run_input={...}), then client.dataset(run["defaultDatasetId"]).list_items().items for results. Key inputs: maxItems (integer, default 10), keywords (string), articleType (string), language (string), office (string), sort (string, default "date"). Full actor spec: fetch build via GET https://api.apify.com/v2/acts/parseforge~energy-gov-scraper (Bearer TOKEN). Get token: https://console.apify.com/account/integrations

The Energy.gov Scraper automates the collection of official content from the U.S. Department of Energy website. It pulls articles, press releases, congressional testimonies, blog posts, success stories, and multimedia content directly from the DOE search system. Each record includes the headline, publication date, source office, content category, direct link, and unique identifier. You can filter by keyword, article type, DOE office, and language to zero in on exactly the content you need. Free users can collect up to 10 items per run, while paid users can retrieve up to 1,000,000 results.

Whether you are tracking renewable energy policy shifts, monitoring nuclear research announcements, or building a dataset of DOE press releases for media analysis, this tool replaces hours of manual browsing with a single automated run. Results export to JSON, CSV, or Excel for immediate use in spreadsheets, dashboards, or data pipelines. Schedule recurring runs to stay current with the latest DOE publications without lifting a finger. The scraper handles pagination, deduplication, and rate limiting automatically so you can focus on analysis instead of data collection.

Target Audience	Use Cases
Policy Analysts	Monitor federal energy policy announcements and congressional testimonies
Academic Researchers	Build literature databases from DOE research publications
Energy Industry Professionals	Track regulatory changes and press releases by office
Journalists	Follow DOE news across topics like renewables, nuclear, and fossil fuels
Data Analysts	Export structured DOE content for trend analysis and reporting
Government Affairs Teams	Stay current on DOE initiatives and funding announcements

📋 What the Energy.gov Scraper does

📝 Article headlines - capture the title of every article, press release, blog post, or document published on energy.gov
🔗 Direct URLs - collect working links to each piece of content for quick reference or archival
📅 Publication dates - track when content was published to build timelines and spot trends
👤 Source offices - identify which DOE office or organization published the content (e.g., Office of Energy Efficiency and Renewable Energy)
🎯 Content categories - classify each item by type: blog, press release, document, success story, congressional testimony, or multimedia
🆔 Unique identifiers - get UUIDs for each article to manage deduplication and data integrity

The scraper connects to the DOE search system and iterates through results using your specified filters. It collects structured data from each listing, normalizes timestamps, and removes duplicate entries using unique article IDs. All results are pushed to an Apify dataset in real time, so you can preview data as the run progresses.

💡 Why it matters: Energy.gov publishes thousands of articles annually across dozens of offices. Manually tracking this content is impractical. This scraper gives you structured, filterable access to the entire catalog in minutes.

🎬 Full Demo

🚧 Coming soon...

⚙️ Input

Field	Type	Required	Description
maxItems	integer	No	Maximum articles to collect. Free users limited to 10. Paid users up to 1,000,000.
keywords	string	No	Search terms to filter articles (e.g., "renewable energy", "nuclear").
articleType	string	No	Content type filter: Blog, Press Releases, Documents, Success Stories, Multimedia, Congressional Testimony.
language	string	No	Language filter: "en" for English, "es" for Spanish.
office	string	No	Filter by DOE office or organization name.
sort	string	No	Sort order: "date" or "relevance".

Example 1: Renewable energy press releases

{
  "maxItems": 50,
  "keywords": "renewable energy",
  "articleType": "7",
  "language": "en",
  "sort": "date"
}

Example 2: All recent content from a specific office

{
  "maxItems": 100,
  "office": "Office of Energy Efficiency & Renewable Energy",
  "sort": "date"
}

⚠️ Good to Know: Free users are automatically limited to 10 items per run. Leave keywords empty to browse all available content. The articleType field uses numeric codes internally, but you can also use descriptive names.

📊 Output

🧾 Schema

Emoji	Field	Type	Description
📝	title	string	Article headline or document title
🔗	url	string	Direct link to the full article on energy.gov
📅	publicationDate	string	Date the article was published
👤	office	string	DOE office or organization that published the content
🎯	contentType	string	Category: blog, press release, document, testimony, etc.
🆔	uuid	string	Unique article identifier for deduplication
🕐	scrapedAt	string	Timestamp of when the data was collected
⚠️	error	string	Error message if the item failed to process

📦 Sample records

✨ Why choose this Actor

Feature	Energy.gov Scraper	Alternatives
Filter by DOE office	Yes	Rarely available
Content type filtering (blog, press, testimony)	Yes	Limited
English and Spanish language support	Yes	English only
Sort by date or relevance	Yes	Date only
Automatic pagination and deduplication	Yes	Manual
Up to 1,000,000 results per run	Yes	Capped lower
Export to JSON, CSV, and Excel	Yes	JSON only

📊 The DOE publishes content from 20+ offices. This scraper lets you filter by any combination of office, topic, and content type in a single run.

📈 How it compares to alternatives

Capability	This Actor	Manual Browsing	Generic Web Scrapers
DOE-specific filters (office, type, language)	Yes	Yes	No
Automatic pagination	Yes	No	Partial
Structured output with deduplication	Yes	No	Varies
Scheduled recurring runs	Yes	No	Varies
No coding required	Yes	Yes	No
Bulk export (CSV, Excel, JSON)	Yes	No	Partial

This scraper is purpose-built for energy.gov, so it handles the site's search system, pagination, and data structure out of the box.

🚀 How to use

Sign up - Create a free Apify account with $5 credit
Find the Actor - Search for "Energy.gov Scraper" in the Apify Store
Configure your filters - Set keywords, article type, office, language, and max items
Start the run - Click "Start" and watch results appear in real time
Export your data - Download as JSON, CSV, or Excel from the dataset tab

🕒 Typical run time: 30 seconds to 2 minutes for up to 100 items. Larger runs with 500+ items may take 5 to 10 minutes.

💼 Business use cases

Policy Analysis

Monitor DOE announcements weekly for regulatory shifts
Track congressional testimonies by topic area
Compare publication volume across DOE offices
Build policy briefing datasets for stakeholders

Academic Research

Compile literature reviews from DOE publications
Track funding announcements and research grants
Build citation databases from official DOE content
Monitor specific research programs over time

Media and Journalism

Track press releases for breaking energy news
Build story leads from DOE blog posts and testimonies
Monitor multiple DOE offices in a single scheduled run
Export structured data for editorial calendars

Energy Industry Intelligence

Follow regulatory updates from the Office of Fossil Energy
Track renewable energy program announcements
Monitor nuclear energy policy developments
Build competitive intelligence from DOE success stories

🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Empirical datasets for papers, thesis work, and coursework
Longitudinal studies tracking changes across snapshots
Reproducible research with cited, versioned data pulls
Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

Side projects, portfolio demos, and indie app launches
Data visualizations, dashboards, and infographics
Content research for bloggers, YouTubers, and podcasters
Hobbyist collections and personal trackers

🤝 Non-profit and civic

Transparency reporting and accountability projects
Advocacy campaigns backed by public-interest data
Community-run databases for local issues
Investigative journalism on public records

🧪 Experimentation

Prototype AI and machine-learning pipelines with real data
Validate product-market hypotheses before engineering spend
Train small domain-specific models on niche corpora
Test dashboard concepts with live input

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions

🔌 Automating Energy.gov Scraper

Node.js example:

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor('parseforge/energy-gov-scraper').call({
    maxItems: 50,
    keywords: 'renewable energy',
    sort: 'date'
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python example:

from apify_client import ApifyClient
client = ApifyClient('YOUR_API_TOKEN')
run = client.actor('parseforge/energy-gov-scraper').call(run_input={
    'maxItems': 50,
    'keywords': 'renewable energy',
    'sort': 'date'
})
items = list(client.dataset(run['defaultDatasetId']).iterate_items())
print(items)

Schedules: Set up recurring runs on daily, weekly, or monthly intervals using the Apify scheduler. Combine with Slack or email integrations to get notified whenever new DOE content matches your filters.

🔌 Integrate with any app

Make - Automate energy data workflows and route articles to your team
Zapier - Connect to 5,000+ apps and trigger actions on new DOE content
Slack - Get notifications when new articles match your filters
Airbyte - Stream DOE data into your data warehouse
GitHub - Version control your scraper configurations
Google Drive - Export results directly to Google Sheets

🔗 Recommended Actors

Actor	Description
USAspending Scraper	Extract federal spending data and contract information from USAspending.gov
GSA eLibrary Scraper	Collect government contractor and vendor data from the GSA eLibrary
PR Newswire Scraper	Collect press releases and news articles from PR Newswire
FINRA BrokerCheck Scraper	Search broker and firm registration data from the FINRA registry
FAA Aircraft Registry Scraper	Look up aircraft registration records by N-number from the FAA

💡 Pro Tip: Combine the Energy.gov Scraper with the USAspending Scraper to cross-reference DOE announcements with actual federal spending data.

🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue. We typically respond within 24 hours.

Disclaimer: This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the U.S. Department of Energy or Energy.gov. All trademarks mentioned are the property of their respective owners.

Energy Market Notices & Reports

datapilot/energy-market-notices-reports

Energy Notice Scraper collects real-time energy alerts from RSS feeds including U.S. Energy Information Administration, National Grid, Reuters, and International Energy Agency. It filters by keyword, extracts notice details, and outputs structured JSON for monitoring global energy market updates.

Data Pilot

EIA US Energy Data Search

ryanclinton/eia-energy-data

US energy data from the Energy Information Administration: electricity prices, natural gas, petroleum, and state energy profiles.

Ryan Clinton

EIA Energy Data Scraper - US Energy Prices & Stats

compute-edge/eia-energy-scraper

Extract energy data from the US EIA API. Access petroleum prices, natural gas, coal, electricity, nuclear outages, and state energy data. 14 data categories with pagination, facet filtering, and date range support.

Compute Edge

5.0

Eia Energy Scraper

fortuitous_pirate/eia-energy-scraper

Extract energy data from the EIA API v2. Petroleum prices, electricity sales, natural gas prices, and total energy statistics with date filtering and pagination.

Fortuitous Pirate

EIA Energy Intelligence MCP — Electricity, Oil & Gas Data

andrew_avina/eia-energy-mcp

An MCP (Model Context Protocol) server that exposes US energy market data from the Energy Information Administration directly to AI assistants. It covers electricity retail prices, natural gas, crude oil, renewables, and coal — the full picture of the US energy economy. A free...

Andrew Avina

EPREL Product Energy Label Scraper

eloquent_mountain/eprel-product-data-scraper

Extracts detailed energy efficiency data, product sheets (PIS), and energy labels from the official EU EPREL database.

Paco

EIA Energy Data Scraper

parseforge/eia-energy-data-scraper

Scrape US energy prices from the Energy Information Administration. Get weekly crude oil spot prices, gasoline prices by state, and diesel prices by region. Includes current, historical, and year-over-year price comparisons.

ParseForge

Energy Transition Intelligence MCP Server

ryanclinton/energy-transition-intelligence-mcp

Energy market and grid risk intelligence via the Model Context Protocol.

Ryan Clinton

Energy Star Certified Products Search (290K+ Products)

compute-edge/energystar-scraper

Search the EPA Energy Star database for certified energy-efficient products. Find appliances, HVAC equipment, and electronics by brand, product type, or certification status. 290K+ certified products with energy ratings.

Compute Edge

EU Energy & Education Data API

lentic_clockss/eu-energy-education-search

Search 18 EU energy and education data sources in parallel — ENTSO-G gas flows, Eurostat electricity prices, energy balances, renewable energy statistics, ESCO occupations and skills taxonomy, Eurostat education statistics, and OpenAIRE research publications. Official European data.