US Energy.gov Data Scraper avatar

US Energy.gov Data Scraper

Pricing

Pay per event

Go to Apify Store
US Energy.gov Data Scraper

US Energy.gov Data Scraper

Scrape energy-related content from Energy.gov, including articles, press releases, documents, titles, dates, offices, and types. Automate collection of structured data from the U.S. Department of Energy, ideal for researchers, journalists, and professionals needing accurate, up-to-date information.

Pricing

Pay per event

Rating

0.0

(0)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

2 days ago

Last modified

Share

ParseForge Banner

🔬 Energy.gov Scraper

🚀 Collect U.S. Department of Energy articles, press releases, and documents in seconds. Filter by keyword, office, article type, and language. No coding, no API keys required.

🕒 Last updated: 2026-04-16 · 📊 8 fields · 🏛️ Official DOE data · 🌐 English and Spanish

The Energy.gov Scraper automates the collection of official content from the U.S. Department of Energy website. It pulls articles, press releases, congressional testimonies, blog posts, success stories, and multimedia content directly from the DOE search system. Each record includes the headline, publication date, source office, content category, direct link, and unique identifier. You can filter by keyword, article type, DOE office, and language to zero in on exactly the content you need. Free users can collect up to 10 items per run, while paid users can retrieve up to 1,000,000 results.

Whether you are tracking renewable energy policy shifts, monitoring nuclear research announcements, or building a dataset of DOE press releases for media analysis, this tool replaces hours of manual browsing with a single automated run. Results export to JSON, CSV, or Excel for immediate use in spreadsheets, dashboards, or data pipelines. Schedule recurring runs to stay current with the latest DOE publications without lifting a finger. The scraper handles pagination, deduplication, and rate limiting automatically so you can focus on analysis instead of data collection.

Target AudienceUse Cases
Policy AnalystsMonitor federal energy policy announcements and congressional testimonies
Academic ResearchersBuild literature databases from DOE research publications
Energy Industry ProfessionalsTrack regulatory changes and press releases by office
JournalistsFollow DOE news across topics like renewables, nuclear, and fossil fuels
Data AnalystsExport structured DOE content for trend analysis and reporting
Government Affairs TeamsStay current on DOE initiatives and funding announcements

📋 What the Energy.gov Scraper does

  • 📝 Article headlines - capture the title of every article, press release, blog post, or document published on energy.gov
  • 🔗 Direct URLs - collect working links to each piece of content for quick reference or archival
  • 📅 Publication dates - track when content was published to build timelines and spot trends
  • 👤 Source offices - identify which DOE office or organization published the content (e.g., Office of Energy Efficiency and Renewable Energy)
  • 🎯 Content categories - classify each item by type: blog, press release, document, success story, congressional testimony, or multimedia
  • 🆔 Unique identifiers - get UUIDs for each article to manage deduplication and data integrity

The scraper connects to the DOE search system and iterates through results using your specified filters. It collects structured data from each listing, normalizes timestamps, and removes duplicate entries using unique article IDs. All results are pushed to an Apify dataset in real time, so you can preview data as the run progresses.

💡 Why it matters: Energy.gov publishes thousands of articles annually across dozens of offices. Manually tracking this content is impractical. This scraper gives you structured, filterable access to the entire catalog in minutes.


🎬 Full Demo

🚧 Coming soon...


⚙️ Input

FieldTypeRequiredDescription
maxItemsintegerNoMaximum articles to collect. Free users limited to 10. Paid users up to 1,000,000.
keywordsstringNoSearch terms to filter articles (e.g., "renewable energy", "nuclear").
articleTypestringNoContent type filter: Blog, Press Releases, Documents, Success Stories, Multimedia, Congressional Testimony.
languagestringNoLanguage filter: "en" for English, "es" for Spanish.
officestringNoFilter by DOE office or organization name.
sortstringNoSort order: "date" or "relevance".

Example 1: Renewable energy press releases

{
"maxItems": 50,
"keywords": "renewable energy",
"articleType": "7",
"language": "en",
"sort": "date"
}

Example 2: All recent content from a specific office

{
"maxItems": 100,
"office": "Office of Energy Efficiency & Renewable Energy",
"sort": "date"
}

⚠️ Good to Know: Free users are automatically limited to 10 items per run. Leave keywords empty to browse all available content. The articleType field uses numeric codes internally, but you can also use descriptive names.


📊 Output

🧾 Schema

EmojiFieldTypeDescription
📝titlestringArticle headline or document title
🔗urlstringDirect link to the full article on energy.gov
📅publicationDatestringDate the article was published
👤officestringDOE office or organization that published the content
🎯contentTypestringCategory: blog, press release, document, testimony, etc.
🆔uuidstringUnique article identifier for deduplication
🕐scrapedAtstringTimestamp of when the data was collected
⚠️errorstringError message if the item failed to process

📦 Sample records


✨ Why choose this Actor

FeatureEnergy.gov ScraperAlternatives
Filter by DOE officeYesRarely available
Content type filtering (blog, press, testimony)YesLimited
English and Spanish language supportYesEnglish only
Sort by date or relevanceYesDate only
Automatic pagination and deduplicationYesManual
Up to 1,000,000 results per runYesCapped lower
Export to JSON, CSV, and ExcelYesJSON only

📊 The DOE publishes content from 20+ offices. This scraper lets you filter by any combination of office, topic, and content type in a single run.


📈 How it compares to alternatives

CapabilityThis ActorManual BrowsingGeneric Web Scrapers
DOE-specific filters (office, type, language)YesYesNo
Automatic paginationYesNoPartial
Structured output with deduplicationYesNoVaries
Scheduled recurring runsYesNoVaries
No coding requiredYesYesNo
Bulk export (CSV, Excel, JSON)YesNoPartial

This scraper is purpose-built for energy.gov, so it handles the site's search system, pagination, and data structure out of the box.


🚀 How to use

  1. Sign up - Create a free Apify account with $5 credit
  2. Find the Actor - Search for "Energy.gov Scraper" in the Apify Store
  3. Configure your filters - Set keywords, article type, office, language, and max items
  4. Start the run - Click "Start" and watch results appear in real time
  5. Export your data - Download as JSON, CSV, or Excel from the dataset tab

🕒 Typical run time: 30 seconds to 2 minutes for up to 100 items. Larger runs with 500+ items may take 5 to 10 minutes.


💼 Business use cases

Policy Analysis

  • Monitor DOE announcements weekly for regulatory shifts
  • Track congressional testimonies by topic area
  • Compare publication volume across DOE offices
  • Build policy briefing datasets for stakeholders

Academic Research

  • Compile literature reviews from DOE publications
  • Track funding announcements and research grants
  • Build citation databases from official DOE content
  • Monitor specific research programs over time

Media and Journalism

  • Track press releases for breaking energy news
  • Build story leads from DOE blog posts and testimonies
  • Monitor multiple DOE offices in a single scheduled run
  • Export structured data for editorial calendars

Energy Industry Intelligence

  • Follow regulatory updates from the Office of Fossil Energy
  • Track renewable energy program announcements
  • Monitor nuclear energy policy developments
  • Build competitive intelligence from DOE success stories


🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

  • Empirical datasets for papers, thesis work, and coursework
  • Longitudinal studies tracking changes across snapshots
  • Reproducible research with cited, versioned data pulls
  • Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

  • Side projects, portfolio demos, and indie app launches
  • Data visualizations, dashboards, and infographics
  • Content research for bloggers, YouTubers, and podcasters
  • Hobbyist collections and personal trackers

🤝 Non-profit and civic

  • Transparency reporting and accountability projects
  • Advocacy campaigns backed by public-interest data
  • Community-run databases for local issues
  • Investigative journalism on public records

🧪 Experimentation

  • Prototype AI and machine-learning pipelines with real data
  • Validate product-market hypotheses before engineering spend
  • Train small domain-specific models on niche corpora
  • Test dashboard concepts with live input

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions


🔌 Automating Energy.gov Scraper

Node.js example:

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor('parseforge/energy-gov-scraper').call({
maxItems: 50,
keywords: 'renewable energy',
sort: 'date'
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python example:

from apify_client import ApifyClient
client = ApifyClient('YOUR_API_TOKEN')
run = client.actor('parseforge/energy-gov-scraper').call(run_input={
'maxItems': 50,
'keywords': 'renewable energy',
'sort': 'date'
})
items = list(client.dataset(run['defaultDatasetId']).iterate_items())
print(items)

Schedules: Set up recurring runs on daily, weekly, or monthly intervals using the Apify scheduler. Combine with Slack or email integrations to get notified whenever new DOE content matches your filters.

🔌 Integrate with any app

  • Make - Automate energy data workflows and route articles to your team
  • Zapier - Connect to 5,000+ apps and trigger actions on new DOE content
  • Slack - Get notifications when new articles match your filters
  • Airbyte - Stream DOE data into your data warehouse
  • GitHub - Version control your scraper configurations
  • Google Drive - Export results directly to Google Sheets

ActorDescription
USAspending ScraperExtract federal spending data and contract information from USAspending.gov
GSA eLibrary ScraperCollect government contractor and vendor data from the GSA eLibrary
PR Newswire ScraperCollect press releases and news articles from PR Newswire
FINRA BrokerCheck ScraperSearch broker and firm registration data from the FINRA registry
FAA Aircraft Registry ScraperLook up aircraft registration records by N-number from the FAA

💡 Pro Tip: Combine the Energy.gov Scraper with the USAspending Scraper to cross-reference DOE announcements with actual federal spending data.


🆘 Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue. We typically respond within 24 hours.


Disclaimer: This Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by the U.S. Department of Energy or Energy.gov. All trademarks mentioned are the property of their respective owners.