Google News Scraper

Pricing

$19.99/month + usage

📰 Google News Scraper collects headlines, snippets, sources, dates & links from Google News by topic, keyword, region & language. 🔎 Export to JSON/CSV for monitoring trends, competitors & PR. ⚡ Fast, reliable, proxy-ready. 🚀 Perfect for media research & market intel.

Developer

ScrapAPI

Maintained by Community

Google News Scraper

Google News Scraper collects headlines, snippets, sources, dates, and links from Google News search via RSS, making it a practical Google News API alternative for marketers, developers, analysts, and researchers. It turns Google News search results into structured data, with filters for region, language, and time period. It fetches RSS asynchronously with proxy fallback, so it can feed dashboards, trend analysis, and content pipelines at scale.

What data / output can you get?

The actor pushes the following JSON fields to the dataset for each article it finds via Google News RSS. Export results to JSON or CSV for downstream analysis, reporting, or ingestion.

| Data type | Description | Example value |
| --- | --- | --- |
| position | Result index in the output (1-based) | 1 |
| title | Article headline | "Tesla shares surge after earnings beat expectations" |
| link | Direct article URL (resolved from RSS item/redirect) | "https://www.example.com/article/tesla-earnings" |
| domain | Domain extracted from source or URL | "example.com" |
| source | Publisher/source name parsed from the RSS title | "Bloomberg" |
| date | Human-readable published time (relative) | "2 hours ago" |
| date_utc | ISO-8601 UTC timestamp derived from RSS pubDate | "2026-04-03T02:40:23+00:00" |
| snippet | Cleaned text snippet extracted from the RSS description | "Tesla reported quarterly revenue beating analyst estimates…" |
| thumbnail | Base64 data URL of a fetched article image (when available) | "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQ..." |
| block_position | Alias of position (maintained for compatibility) | 1 |

Notes:

  • Thumbnails are fetched from the article page using multiple strategies (Open Graph, Twitter Card, or in-page images) and encoded as base64; if no suitable image is found, thumbnail may be empty.
  • Snippets are cleaned from HTML in the RSS description when present.
  • Exports support JSON/CSV on Apify, making this a lightweight Google News feed parser and Google News search results scraper for analysis or integration.
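
Because thumbnails arrive as base64 data URLs, a consumer usually has to split off the MIME type and decode the payload before saving or displaying the image. The following is a minimal sketch of that step using only the standard library. The helper name `decode_thumbnail` is hypothetical, and the sample payload is a four-byte stand-in rather than a real image.

```python
import base64
from typing import Optional, Tuple


def decode_thumbnail(data_url: str) -> Optional[Tuple[str, bytes]]:
    """Split a base64 data URL into (mime_type, raw_bytes).

    Returns None when the field is empty or is not a base64 data URL,
    which matches the note that thumbnail may be empty.
    """
    if not data_url or not data_url.startswith("data:"):
        return None
    header, sep, payload = data_url.partition(",")
    if not sep or not header.endswith(";base64"):
        return None
    mime_type = header[len("data:"):-len(";base64")] or "application/octet-stream"
    return mime_type, base64.b64decode(payload)


# Stand-in payload (the first four bytes of a JPEG header), not a full image.
sample = "data:image/jpeg;base64," + base64.b64encode(b"\xff\xd8\xff\xe0").decode("ascii")
decoded = decode_thumbnail(sample)
```

Writing the returned bytes to a file with an extension derived from the MIME type is usually enough to recover the image.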

Key features

  • ⚡ Async RSS extraction with proxy fallback
    High-throughput Google News scraping without browsers using aiohttp, with resilient fallback from no proxy ➜ datacenter ➜ residential to reduce blocks.

  • 🌍 Region and language targeting
    Filter by Google Country (gl), UI Language (hl), Language Results (lr), and Country Results (cr) to tailor results by market and locale.

  • ⏱️ Time period filtering
    Limit results to last hour/day/week/month/year, or specify a custom date range for focused monitoring.

  • 🧼 Clean snippets + accurate dates
    Extracts cleaned text snippets from RSS and exposes both human-readable “time ago” and ISO-8601 UTC timestamps.

  • 🖼️ Smart thumbnail capture
    Attempts Open Graph/Twitter Card and in-page images; encodes thumbnails as base64 data URLs for portable storage or embedding.

  • 🔁 Robustness at scale
    Includes deduplication, retry logic, basic rate limiting, and real-time dataset saving — ideal for continuous pipelines and trend tracking.

  • 🧰 Developer friendly
    Consume results from Python or Node.js via the Apify API. Clean JSON makes it a practical Google News API alternative for workflows and BI tools.

  • 🔒 Built for stability and ethics
    Starts without a proxy, escalates only when needed. Designed for compliant, responsible Google News data extraction without full-page rendering.
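
The listing does not say how the actor deduplicates results. If you merge several runs downstream, a sketch like the one below can apply the same idea on your side. It assumes that `link` is the deduplication key and that positions should be renumbered afterwards; both are assumptions, and `dedupe_by_link` is a hypothetical helper.

```python
from typing import Dict, Iterable, List


def dedupe_by_link(items: Iterable[Dict]) -> List[Dict]:
    """Keep the first record seen for each link and renumber positions from 1."""
    seen = set()
    unique: List[Dict] = []
    for item in items:
        key = str(item.get("link", "")).strip()
        # Records without a link cannot be keyed, so this sketch drops them.
        if not key or key in seen:
            continue
        seen.add(key)
        rank = len(unique) + 1
        # Copy the record so the caller's data is not mutated.
        unique.append({**item, "position": rank, "block_position": rank})
    return unique


merged = dedupe_by_link([
    {"position": 1, "title": "A", "link": "https://www.example.com/a"},
    {"position": 2, "title": "B", "link": "https://www.example.com/b"},
    {"position": 1, "title": "A (repeat)", "link": "https://www.example.com/a"},
])
```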

How to use Google News Scraper - step by step

  1. Sign in to Apify
    Create a free Apify account or log in to your dashboard.

  2. Open the actor
    Find “Google News Scraper” from your dashboard or the Apify Store.

  3. Enter your input

    • query: Set the search term (e.g., “Elon Musk”).
    • maxItems: Set the cap for results (100–5000).
    • gl/hl/lr/cr: Choose country and language preferences.
    • time_period: Select “last_hour”, “last_day”, “last_week”, “last_month”, “last_year”, or “custom”. For “custom”, set time_period_min and time_period_max in MM/DD/YYYY.
    • nfpr/filter: Control autocorrect and similar/omitted results filters (0 or 1).
    • proxyConfiguration: Optionally enable Apify Proxy.

  4. Review key settings

    • Use “gl” and “hl” to match your target region and UI language.
    • Use “lr” and “cr” to further limit language and country results.
    • Start with no proxy; the actor auto-falls back to datacenter and residential proxies on blocks.

  5. Run the actor
    Click Start. If no explicit time_period is set, the actor works through multiple time strategies to collect more results. It saves items in real time and logs progress.

  6. Monitor logs
    You’ll see messages like “Found X new articles” and proxy state changes if fallback occurs.

  7. Export results
    Open the run dataset to download your results as JSON or CSV. Each record includes position, title, link, domain, source, date/date_utc, snippet, thumbnail, and block_position.

Pro Tip: Use the Apify API to connect this actor to your Python or Node.js pipelines, or skip the API client entirely and consume the dataset export feeds directly.
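
As a rough illustration of the API route, the sketch below builds a request for Apify's synchronous "run and return dataset items" endpoint using only the standard library. The actor ID `scrapapi~google-news-scraper` is a guess and should be replaced with the ID shown on the actor page; the token is a placeholder; `build_run_request` and `fetch_items` are hypothetical helper names.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"
ACTOR_ID = "scrapapi~google-news-scraper"  # assumption: copy the real ID from the actor page


def build_run_request(actor_input: dict, token: str, actor_id: str = ACTOR_ID) -> urllib.request.Request:
    """Prepare a POST that runs the actor and returns its dataset items as JSON."""
    query = urllib.parse.urlencode({"token": token, "format": "json"})
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items?{query}"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_items(request: urllib.request.Request, timeout: float = 300.0) -> list:
    """Send the prepared request and decode the JSON array of records."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


request = build_run_request(
    {"query": "Elon Musk", "maxItems": 100, "time_period": "last_day"},
    token="YOUR_APIFY_TOKEN",
)
# items = fetch_items(request)  # uncomment once a real token and actor ID are set
```

The synchronous endpoint suits short runs. For larger maxItems values, starting the run asynchronously and reading the dataset afterwards is likely the safer pattern.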

Use cases

| Use case name | Description |
| --- | --- |
| Marketing + PR monitoring | Track brand and executive mentions; export headlines/snippets to JSON/CSV for daily briefing. |
| SEO & content planning | Analyze fresh topics and keywords from Google News results to inform editorial calendars. |
| Competitive intelligence | Monitor product launches and market moves by region and language for global teams. |
| Financial & market analysis | Collect headlines on earnings and macro events to feed sentiment or event studies. |
| Academic & policy research | Build time-series datasets across locales with reproducible filters for methodology transparency. |
| Data pipelines (API) | Consume structured JSON from the Apify dataset API using Python or Node.js clients for ETL/ELT. |

Why choose Google News Scraper?

This production-ready Google News scraping tool focuses on precision, resilience, and developer-centric outputs.

  • 🎯 Accuracy with structure: Clean snippets, ISO timestamps, and parsed domains keep downstream models and dashboards tidy.
  • 🌐 Multilingual + regional: gl/hl/lr/cr parameters align results with your target markets and languages.
  • 📈 Scales reliably: Async requests, retries, and real-time saves make high-volume runs dependable.
  • 🔌 Developer-ready: A practical Google News API alternative with simple JSON, accessible from Python or Node.js through the Apify API.
  • 🔐 Ethical and robust: No headless browser; proxy fallback escalates only when necessary to reduce friction and risk.
  • 💸 Cost-effective: Monthly plan on Apify ($19.99/month + usage) with a trial period to validate your workflow.
  • 🧩 Integrations-friendly: Export to JSON/CSV and plug into BI tools, notebooks, and automation stacks.

Compared with brittle browser extensions, this Google News feed parser uses stable RSS endpoints plus smart image extraction — delivering consistent, automation-ready news data.

Is it legal to scrape Google News?

Yes, when done responsibly. This actor extracts data from publicly accessible Google News RSS feeds and article pages.

Guidelines for compliant use:

  • Respect Google’s terms of service and robots directives.
  • Avoid overloading servers; use sensible limits and schedules.
  • Use results for lawful purposes and cite original publishers when required.
  • Do not collect private or authenticated content.

For specific legal contexts or edge cases, consult your legal team.

Input parameters & output format

Example input JSON

{
  "maxItems": 300,
  "query": "Elon Musk",
  "gl": "United States",
  "hl": "English",
  "lr": "English",
  "cr": "United States",
  "time_period": "last_week",
  "nfpr": 0,
  "filter": 1,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}

Parameters

| Field | Type | Description | Default | Required |
| --- | --- | --- | --- | --- |
| maxItems | integer | Maximum number of search results to retrieve (min 100, max 5000). | 100 | Yes |
| query | string | The search term to use. | "Elon Musk" | Yes |
| gl | string (enum) | The Google country to use for the query. | None | No |
| hl | string (enum) | The Google UI language to return results. | None | No |
| lr | string (enum) | Limit the results to a specific language. | None | No |
| cr | string (enum) | Limit the results to a specific country. | None | No |
| time_period | string (enum) | Time period for results: last_hour, last_day, last_week, last_month, last_year, custom. | None | No |
| time_period_min | string | Minimum date for custom time period (MM/DD/YYYY). | None | No |
| time_period_max | string | Maximum date for custom time period (MM/DD/YYYY). | None | No |
| nfpr | integer | Exclude results from auto-corrected queries (0 or 1). | 0 | No |
| filter | integer | Enable/disable Similar Results and Omitted Results filters (0 or 1). | 1 | No |
| proxyConfiguration | object | Configure proxy; actor starts with no proxy, then falls back to datacenter and residential if needed. | None (UI prefill: useApifyProxy=false) | No |
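
Checking an input object against these constraints before starting a run can save a failed or wasted run. The sketch below encodes only the rules stated in the table (the maxItems range, the time_period values, MM/DD/YYYY dates for custom ranges, and the 0/1 flags). `validate_input` is a hypothetical client-side helper, not part of the actor, and it does not check the gl/hl/lr/cr enums because their allowed values are not listed here.

```python
from datetime import datetime
from typing import Dict, List

TIME_PERIODS = {"last_hour", "last_day", "last_week", "last_month", "last_year", "custom"}


def validate_input(actor_input: Dict) -> List[str]:
    """Return a list of problems; an empty list means the input looks valid."""
    errors: List[str] = []
    if not str(actor_input.get("query", "")).strip():
        errors.append("query is required")
    max_items = actor_input.get("maxItems")
    if not isinstance(max_items, int) or isinstance(max_items, bool) or not 100 <= max_items <= 5000:
        errors.append("maxItems must be an integer between 100 and 5000")
    period = actor_input.get("time_period")
    if period is not None and period not in TIME_PERIODS:
        errors.append(f"unknown time_period: {period!r}")
    if period == "custom":
        # Custom ranges need both bounds in MM/DD/YYYY.
        for field in ("time_period_min", "time_period_max"):
            try:
                datetime.strptime(str(actor_input.get(field, "")), "%m/%d/%Y")
            except ValueError:
                errors.append(f"{field} must be a date in MM/DD/YYYY")
    for flag in ("nfpr", "filter"):
        if actor_input.get(flag, 0) not in (0, 1):
            errors.append(f"{flag} must be 0 or 1")
    return errors


example_input = {
    "maxItems": 300,
    "query": "Elon Musk",
    "time_period": "custom",
    "time_period_min": "03/01/2026",
    "time_period_max": "04/03/2026",
    "nfpr": 0,
    "filter": 1,
}
problems = validate_input(example_input)
```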

Example output JSON

{
  "position": 1,
  "title": "Tesla shares surge after earnings beat expectations",
  "link": "https://www.example.com/markets/tesla-earnings",
  "domain": "www.example.com",
  "source": "Bloomberg",
  "date": "2 hours ago",
  "date_utc": "2026-04-03T02:40:23+00:00",
  "snippet": "Tesla reported quarterly revenue beating analyst estimates as deliveries exceeded forecasts...",
  "thumbnail": "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQ...",
  "block_position": 1
}

Notes:

  • thumbnail may be empty if no suitable image is found.
  • snippet depends on the presence and content of the RSS description.
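
Since thumbnail and snippet can be empty and date_utc is a string, it can help to normalize each record into a typed object before analysis. The following sketch shows one way to do that. The `NewsItem` class and `to_news_item` function are hypothetical names, and treating an empty snippet as `None` is a design choice of this example rather than actor behavior.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, Optional


@dataclass
class NewsItem:
    position: int
    title: str
    link: str
    source: str
    published: datetime      # timezone-aware, normalized to UTC
    snippet: Optional[str]   # None when the RSS description was empty
    has_thumbnail: bool


def to_news_item(record: Dict) -> NewsItem:
    """Convert one dataset record into a typed item with a parsed timestamp."""
    published = datetime.fromisoformat(record["date_utc"]).astimezone(timezone.utc)
    return NewsItem(
        position=int(record["position"]),
        title=record["title"],
        link=record["link"],
        source=record.get("source", ""),
        published=published,
        snippet=record.get("snippet") or None,
        has_thumbnail=bool(record.get("thumbnail")),
    )


item = to_news_item({
    "position": 1,
    "title": "Tesla shares surge after earnings beat expectations",
    "link": "https://www.example.com/markets/tesla-earnings",
    "source": "Bloomberg",
    "date_utc": "2026-04-03T02:40:23+00:00",
    "snippet": "",
    "thumbnail": "",
})
```

With `published` parsed, grouping by `item.published.date()` gives a simple daily time series.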

FAQ

Is there a free trial or free tier?

Yes, there is a trial. The Apify listing provides a trial period (120 trial minutes) so you can validate the workflow before committing; after that, the actor is billed at $19.99/month plus usage.

Does this work as a Google News API alternative?

Yes. It’s a Google News scraping tool that outputs clean JSON from RSS-based search results, making it a practical Google News API alternative for pipelines and apps.

Can I use it with Python or Node.js?

Yes. You can consume the dataset via the Apify API from Python scripts or Node.js code and automate end-to-end workflows.

How many results can I scrape per run?

You control this with maxItems (100–5000). The actor enforces these limits and, when time_period isn’t specified, works through multiple time strategies to collect more results.

Does it fetch full article text?

No. It collects headlines, links, source, timestamps, and cleaned snippets from RSS, and it attempts to fetch and encode a thumbnail image.

How does proxy handling work?

The run starts with no proxy. On blocks or errors, it automatically falls back to datacenter proxies and then residential proxies for resilience.
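
The escalation order described above can be sketched as a loop over proxy tiers. This is an illustration of the pattern only, not the actor's code: the `Blocked` exception, the tier labels, and the injected `fetch` callable are all hypothetical, and the real actor may escalate per run rather than per request.

```python
from typing import Callable, Optional, Sequence, Tuple

# Order taken from the description: no proxy, then datacenter, then residential.
PROXY_TIERS: Sequence[Optional[str]] = (None, "datacenter", "residential")


class Blocked(Exception):
    """Raised by the fetch callable when a request is blocked or fails."""


def fetch_with_fallback(
    url: str,
    fetch: Callable[[str, Optional[str]], str],
    tiers: Sequence[Optional[str]] = PROXY_TIERS,
) -> Tuple[str, Optional[str]]:
    """Try each tier in order and return (body, tier_used) for the first success."""
    last_error: Optional[Exception] = None
    for tier in tiers:
        try:
            return fetch(url, tier), tier
        except Blocked as error:
            last_error = error  # escalate to the next tier
    raise RuntimeError(f"all proxy tiers failed for {url}") from last_error


def fake_fetch(url: str, tier: Optional[str]) -> str:
    """Stand-in fetcher that simulates a block on direct requests."""
    if tier is None:
        raise Blocked("direct request blocked")
    return f"<rss via {tier}>"


body, used_tier = fetch_with_fallback("https://news.google.com/rss/search?q=tesla", fake_fetch)
```

Passing the fetcher in as a callable keeps the escalation logic testable without any network access.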

Can I target specific countries or languages?

Yes. Use gl/hl for Google country and UI language, lr to limit by language, and cr to limit by country results. You can also filter by time_period.

Is this a Google News RSS scraper or a feed parser?

Both — it builds Google News RSS queries behind the scenes, parses RSS XML, cleans fields, and saves structured JSON you can export or consume via API.
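
To make the parsing step concrete, here is a minimal sketch that turns one RSS item into the title, source, snippet, and date_utc fields. It assumes the publisher name follows the last " - " in the RSS title and that the description holds HTML to strip. The sample feed is invented for illustration, and the actor's real parsing rules may differ.

```python
import html
import re
import xml.etree.ElementTree as ET
from datetime import timezone
from email.utils import parsedate_to_datetime

# Invented sample feed; real Google News RSS items carry more elements.
SAMPLE_RSS = """<rss version="2.0"><channel><item>
<title>Tesla shares surge after earnings beat expectations - Bloomberg</title>
<link>https://www.example.com/markets/tesla-earnings</link>
<pubDate>Fri, 03 Apr 2026 02:40:23 GMT</pubDate>
<description>&lt;a href="https://www.example.com/markets/tesla-earnings"&gt;Tesla reported quarterly revenue beating analyst estimates&lt;/a&gt;</description>
</item></channel></rss>"""


def parse_item(item: ET.Element, position: int) -> dict:
    """Map one RSS <item> element to a flat record."""
    raw_title = item.findtext("title", default="")
    # Assumption: the publisher is whatever follows the last " - ".
    title, sep, source = raw_title.rpartition(" - ")
    if not sep:
        title, source = raw_title, ""
    # Strip tags, decode leftover entities, and collapse whitespace.
    snippet = re.sub(r"<[^>]+>", " ", item.findtext("description", default=""))
    snippet = " ".join(html.unescape(snippet).split())
    pub_date = item.findtext("pubDate")
    date_utc = ""
    if pub_date:
        published = parsedate_to_datetime(pub_date)
        if published.tzinfo is None:
            published = published.replace(tzinfo=timezone.utc)
        date_utc = published.astimezone(timezone.utc).isoformat()
    return {
        "position": position,
        "title": title,
        "link": item.findtext("link", default=""),
        "source": source,
        "date_utc": date_utc,
        "snippet": snippet,
    }


root = ET.fromstring(SAMPLE_RSS)
records = [parse_item(node, index) for index, node in enumerate(root.iter("item"), start=1)]
```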

Final thoughts

Google News Scraper is built for fast, structured Google News data extraction at scale. With RSS-based collection, regional and language filters, proxy fallback, and clean JSON output, it helps marketers, researchers, and developers turn headlines into actionable datasets.

Use it as your Google News API alternative to power dashboards, trend tracking, and analytics — exporting to JSON/CSV and integrating via Python or Node.js. Start extracting smarter news insights today and keep your pipelines fueled with fresh, reliable data.