Google News Scraper
Pricing
$19.99/month + usage
Google News Scraper
📰 Google News Scraper collects headlines, snippets, sources, dates & links from Google News by topic, keyword, region & language. 🔎 Export to JSON/CSV for monitoring trends, competitors & PR. ⚡ Fast, reliable, proxy-ready. 🚀 Perfect for media research & market intel.
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer
ScrapAPI
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
9 days ago
Last modified
Categories
Share
Google News Scraper
Google News Scraper is a fast, reliable Google News scraping tool that collects headlines, snippets, sources, dates, and links from Google News search via RSS — a practical Google News API alternative for marketers, developers, analysts, and researchers. It solves the challenge of monitoring news at scale by turning Google News search results into structured data, with robust filters for region, language, and time. Built as a Google News RSS scraper with proxy fallback and async fetching, it powers dashboards, trend analysis, and content pipelines at scale.
What data / output can you get?
Below are the exact JSON fields this Google News crawler pushes to the dataset when you scrape Google News headlines and articles via RSS. Export results to JSON or CSV for downstream analysis, reporting, or ingestion.
| Data type | Description | Example value |
|---|---|---|
| position | Result index in the output (1-based) | 1 |
| title | Article headline | “Tesla shares surge after earnings beat expectations” |
| link | Direct article URL (resolved from RSS item/redirect) | “https://www.example.com/article/tesla-earnings” |
| domain | Domain extracted from source or URL | “example.com” |
| source | Publisher/source name parsed from the RSS title | “Bloomberg” |
| date | Human-readable published time (relative) | “2 hours ago” |
| date_utc | ISO-8601 UTC timestamp derived from RSS pubDate | “2026-04-03T02:40:23+00:00” |
| snippet | Cleaned text snippet extracted from the RSS description | “Tesla reported quarterly revenue beating analyst estimates…” |
| thumbnail | Base64 data URL of a fetched article image (when available) | “data:image/jpeg;base64,/9j/4AAQSkZJRgABAQ...” |
| block_position | Alias of position (maintained for compatibility) | 1 |
Notes:
- Thumbnails are fetched from the article page using multiple strategies (Open Graph, Twitter Card, or in-page images) and encoded as base64; if no suitable image is found, thumbnail may be empty.
- Snippets are cleaned from HTML in the RSS description when present.
- Exports support JSON/CSV on Apify, making this a lightweight Google News feed parser and Google News search results scraper for analysis or integration.
Key features
-
⚡ Async RSS extraction with proxy fallback
High-throughput Google News scraping without browsers using aiohttp, with resilient fallback from no proxy ➜ datacenter ➜ residential to reduce blocks. -
🌍 Region and language targeting
Filter by Google Country (gl), UI Language (hl), Language Results (lr), and Country Results (cr) to tailor results by market and locale. -
⏱️ Time period filtering
Limit results to last hour/day/week/month/year, or specify a custom date range for focused monitoring. -
🧼 Clean snippets + accurate dates
Extracts cleaned text snippets from RSS and exposes both human-readable “time ago” and ISO-8601 UTC timestamps. -
🖼️ Smart thumbnail capture
Attempts Open Graph/Twitter Card and in-page images; encodes thumbnails as base64 data URLs for portable storage or embedding. -
🔁 Robustness at scale
Includes deduplication, retry logic, basic rate limiting, and real-time dataset saving — ideal for continuous pipelines and trend tracking. -
🧰 Developer friendly
Works great as a Google News scraper Python integration or Google News scraper Node.js consumer via the Apify API. Clean JSON makes it a practical Google News API alternative for workflows and BI tools. -
🔒 Built for stability and ethics
Starts without a proxy, escalates only when needed. Designed for compliant, responsible Google News data extraction without full-page rendering.
How to use Google News Scraper - step by step
-
Sign in to Apify
Create a free Apify account or log in to your dashboard. -
Open the actor
Find “Google News Scraper” from your dashboard or the Apify Store. -
Enter your input
- query: Set the search term (e.g., “Elon Musk”).
- maxItems: Set the cap for results (100–5000).
- gl/hl/lr/cr: Choose country and language preferences.
- time_period: Select “last_hour”, “last_day”, “last_week”, “last_month”, “last_year”, or “custom”. For “custom”, set time_period_min and time_period_max in MM/DD/YYYY.
- nfpr/filter: Control autocorrect and similar/omitted results filters (0 or 1).
- proxyConfiguration: Optionally enable Apify Proxy.
-
Review key settings
- Use “gl” and “hl” to match your target region and UI language.
- Use “lr” and “cr” to further limit language and country results.
- Start with no proxy; the actor auto-falls back to datacenter and residential proxies on blocks.
-
Run the actor
Click Start. The Google News RSS scraper will fetch and parse multiple time strategies (if no explicit time_period), save items in real time, and log progress. -
Monitor logs
You’ll see messages like “Found X new articles” and proxy state changes if fallback occurs. -
Export results
Open the run dataset to download your results as JSON or CSV. Each record includes position, title, link, domain, source, date/date_utc, snippet, thumbnail, and block_position.
Pro Tip: Use the Apify API to connect this Google News aggregator script to your pipelines (Python/Node.js), or integrate it as a Google News scraping without API dependency by consuming the dataset export feeds directly.
Use cases
| Use case name | Description |
|---|---|
| Marketing + PR monitoring | Track brand and executive mentions; export headlines/snippets to JSON/CSV for daily briefing. |
| SEO & content planning | Analyze fresh topics and keywords from Google News results to inform editorial calendars. |
| Competitive intelligence | Monitor product launches and market moves by region and language for global teams. |
| Financial & market analysis | Collect headlines on earnings and macro events to feed sentiment or event studies. |
| Academic & policy research | Build time-series datasets across locales with reproducible filters for methodology transparency. |
| Data pipelines (API) | Consume structured JSON from the Apify dataset API using Google News scraper Python or Node.js clients for ETL/ELT. |
Why choose Google News Scraper?
This production-ready Google News scraping tool focuses on precision, resilience, and developer-centric outputs.
- 🎯 Accuracy with structure: Clean snippets, ISO timestamps, and parsed domains keep downstream models and dashboards tidy.
- 🌐 Multilingual + regional: gl/hl/lr/cr parameters align results with your target markets and languages.
- 📈 Scales reliably: Async requests, retries, and real-time saves make high-volume runs dependable.
- 🔌 Developer-ready: Ideal as a Google News API alternative with simple JSON, approachable via Python or Node.js through the Apify API.
- 🔐 Ethical and robust: No headless browser; proxy fallback escalates only when necessary to reduce friction and risk.
- 💸 Cost-effective: Flat monthly plan on Apify with a trial period to validate your workflow.
- 🧩 Integrations-friendly: Export to JSON/CSV and plug into BI tools, notebooks, and automation stacks.
Compared with brittle browser extensions, this Google News feed parser uses stable RSS endpoints plus smart image extraction — delivering consistent, automation-ready news data.
Is it legal / ethical to use Google News Scraper?
Yes — when done responsibly. This actor extracts data from publicly accessible Google News RSS feeds and article pages.
Guidelines for compliant use:
- Respect Google’s terms of service and robots directives.
- Avoid overloading servers; use sensible limits and schedules.
- Use results for lawful purposes and cite original publishers when required.
- Do not collect private or authenticated content.
For specific legal contexts or edge cases, consult your legal team.
Input parameters & output format
Example input JSON
{"maxItems": 300,"query": "Elon Musk","gl": "United States","hl": "English","lr": "English","cr": "United States","time_period": "last_week","nfpr": 0,"filter": 1,"proxyConfiguration": {"useApifyProxy": false}}
Parameters
| Field | Type | Description | Default | Required |
|---|---|---|---|---|
| maxItems | integer | Maximum number of search results to retrieve (min 100, max 5000). | 100 | Yes |
| query | string | The search term to use. | “Elon Musk” | Yes |
| gl | string (enum) | The Google country to use for the query. | None | No |
| hl | string (enum) | The Google UI language to return results. | None | No |
| lr | string (enum) | Limit the results to a specific language. | None | No |
| cr | string (enum) | Limit the results to a specific country. | None | No |
| time_period | string (enum) | Time period for results: last_hour, last_day, last_week, last_month, last_year, custom. | None | No |
| time_period_min | string | Minimum date for custom time period (MM/DD/YYYY). | None | No |
| time_period_max | string | Maximum date for custom time period (MM/DD/YYYY). | None | No |
| nfpr | integer | Exclude results from auto-corrected queries (0 or 1). | 0 | No |
| filter | integer | Enable/disable Similar Results and Omitted Results filters (0 or 1). | 1 | No |
| proxyConfiguration | object | Configure proxy; actor starts with no proxy, then falls back to datacenter and residential if needed. | None (UI prefill: useApifyProxy=false) | No |
Example output JSON
{"position": 1,"title": "Tesla shares surge after earnings beat expectations","link": "https://www.example.com/markets/tesla-earnings","domain": "www.example.com","source": "Bloomberg","date": "2 hours ago","date_utc": "2026-04-03T02:40:23+00:00","snippet": "Tesla reported quarterly revenue beating analyst estimates as deliveries exceeded forecasts...","thumbnail": "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQ...","block_position": 1}
Notes:
- thumbnail may be empty if no suitable image is found.
- snippet depends on the presence and content of the RSS description.
FAQ
Is there a free trial or free tier?
Yes. The Apify listing provides a trial period (120 trial minutes) and a flat monthly plan ($19.99/month) so you can validate the workflow before committing.
Does this work as a Google News API alternative?
Yes. It’s a Google News scraping tool that outputs clean JSON from RSS-based search results, making it a practical Google News API alternative for pipelines and apps.
Can I use it with Python or Node.js?
Yes. You can consume the dataset via the Apify API from Google News scraper Python scripts or Google News scraper Node.js code, and automate end-to-end workflows.
How many results can I scrape per run?
You control this with maxItems (100–5000). The actor enforces these limits and will paginate strategies to collect more results when time_period isn’t specified.
Does it fetch full article text?
No. It collects headlines, links, source, timestamps, and cleaned snippets from RSS, and it attempts to fetch and encode a thumbnail image.
How does proxy handling work?
The run starts with no proxy. On blocks or errors, it automatically falls back to datacenter proxies and then residential proxies for resilience.
Can I target specific countries or languages?
Yes. Use gl/hl for Google country and UI language, lr to limit by language, and cr to limit by country results. You can also filter by time_period.
Is this a Google News RSS scraper or a feed parser?
Both — it builds Google News RSS queries behind the scenes, parses RSS XML, cleans fields, and saves structured JSON you can export or consume via API.
Closing CTA / Final thoughts
Google News Scraper is built for fast, structured Google News data extraction at scale. With RSS-based collection, regional and language filters, proxy fallback, and clean JSON output, it helps marketers, researchers, and developers turn headlines into actionable datasets.
Use it as your Google News API alternative to power dashboards, trend tracking, and analytics — exporting to JSON/CSV and integrating via Python or Node.js. Start extracting smarter news insights today and keep your pipelines fueled with fresh, reliable data.