Google News Scraper
Pricing
from $1.00 / 1,000 results
Google News Scraper
Search Google News by keyword and extract article titles, sources, publication dates, URLs, and thumbnails. Filter by time range, language, and country.
Pricing
from $1.00 / 1,000 results
Rating
0.0
(0)
Developer
SolidCode
Actor stats
0
Bookmarked
3
Total users
2
Monthly active users
a day ago
Last modified
Categories
Share
Extract fresh news articles from Google News at scale — headlines, publishers, bylines, publication dates, thumbnails, and direct publisher URLs for any topic, keyword, or news section worldwide. Built for analysts, researchers, and teams who need reliable news data without monthly rental fees.
Why This Scraper?
- Every Google News surface in one actor — keyword search, topic sections (Business, Tech, Sports…), publisher feeds, and Full Coverage story pages, all with the same clean output
- Advanced search operators — exact-phrase matching, title-only search, include/exclude specific publishers, and term exclusions — no need to craft Google query strings yourself
- Bypass the 100-article cap — set a date range and the scraper automatically sweeps it in windows to return thousands of articles per keyword
- Direct publisher URLs — optional one-click resolution of Google's
/read/...redirects into the real publisher links (nytimes.com, bbc.com, reuters.com, …) - 19 languages, 47 countries — get localized news from the US, UK, France, Germany, Japan, Brazil, India, and more
- Author bylines included — extracted automatically when Google exposes them (~30% of articles)
- Clean, predictable output — 13 typed fields, ISO 8601 timestamps, no base64 image bloat, no emoji-laden enum values
- Pay only for what you get — no monthly rental, transparent per-result pricing
Use Cases
Media Monitoring & PR
- Track brand, product, or executive mentions across global news outlets
- Monitor competitor coverage in near real-time with the "Past hour" or "Past 24 hours" filter
- Build press clip archives by publisher for stakeholder reports
Market & Trend Research
- Measure volume and sentiment around emerging topics, industries, or technologies
- Pull a full year of coverage on a topic with the date-range sweep
- Localize research by country and language for international markets
Financial & Investment Intelligence
- Gather news around public companies, tickers, or macro themes
- Track regulatory, policy, or geopolitical stories by country
- Feed fresh news into sentiment or event-driven trading models
Content & SEO
- Identify trending stories inside a niche or topic section
- Analyze which publishers dominate coverage of a given keyword
- Curate topical newsletters automatically
Academic & Journalistic Research
- Build news corpora for NLP, bias, or misinformation research
- Investigate how a single event was covered by different publishers with Full Coverage pages
- Archive news for longitudinal studies with arbitrary date ranges
Getting Started
Simple Keyword Search
The minimum input — one or more keywords:
{"keywords": ["Artificial Intelligence"],"maxResults": 50}
Recent News Only
Limit to the past 24 hours and sort newest first:
{"keywords": ["OpenAI", "Anthropic"],"timeFilter": "24h","sortBy": "date","maxResults": 100}
Advanced Operators — Precise Queries
Match an exact phrase, limit to the article title, and filter by publisher:
{"keywords": ["climate change"],"exactPhrase": true,"inTitleOnly": true,"sources": ["BBC", "Reuters", "The Guardian"],"excludeTerms": ["opinion"],"timeFilter": "week","maxResults": 100}
Historical Sweep — Beyond the 100-Article Cap
Set a custom date range to pull thousands of articles by automatically splitting the range into smaller windows:
{"keywords": ["Federal Reserve"],"dateFrom": "2024-01-01","dateTo": "2024-12-31","maxResults": 0,"sortBy": "date"}
Topic Sections, Publisher Feeds, and Story Pages
Paste any Google News URL — topic, publisher, or Full Coverage page:
{"startUrls": ["https://news.google.com/topics/CAAqJggKIiBDQkFTRWdvSUwyMHZNRGRqTVhZU0FtVnVHZ0pWVXlnQVAB","https://news.google.com/publications/CAAqBwgKMKbdrQww0L-7Aw","https://news.google.com/stories/CAAqNggKIjBDQklTSGpvSmMzUnZjbmt0TXpZd1NoRUtEd2lnODRLd0NoRC1sTU9qQmlnQVAB"],"maxResults": 100}
Direct Publisher URLs
Resolve Google's /read/... redirects into the real publisher links:
{"keywords": ["SpaceX launch"],"timeFilter": "week","resolvePublisherUrls": true,"maxResults": 25}
Localized News — Any Language, Any Country
Get news for France, in French:
{"keywords": ["élection"],"language": "fr","country": "FR","timeFilter": "week","maxResults": 50}
Input Reference
Search
| Parameter | Type | Default | Description |
|---|---|---|---|
keywords | string[] | ["Technology"] | Topics or terms to search. Each keyword runs independently. Use OR inside a single entry (e.g. "apple OR tesla") to combine two queries into one. |
exactPhrase | boolean | false | Wrap each keyword in quotes so Google matches the phrase exactly. |
inTitleOnly | boolean | false | Only match keywords that appear in the article title. Cuts noise from articles that mention the term in passing. |
excludeTerms | string[] | [] | Words to exclude from results (e.g. ["opinion", "sponsored"]). |
sources | string[] | [] | Restrict results to specific publishers (e.g. ["BBC", "Reuters"]). Multiple sources combined with OR. |
excludeSources | string[] | [] | Skip articles from specific publishers. |
URL Mode
| Parameter | Type | Default | Description |
|---|---|---|---|
startUrls | URL[] | [] | Any news.google.com URL — search, topic, publisher feed, or Full Coverage story page. Each URL is fetched and all article cards are extracted. Bypasses keyword operators. |
Either keywords or startUrls must be provided. Both can be set — keywords are processed first.
Time Filter
| Parameter | Type | Default | Description |
|---|---|---|---|
timeFilter | string | "any" | Preset window: any, hour, 24h, week, month, year. |
dateFrom | string | — | Start of custom date range (YYYY-MM-DD, inclusive). |
dateTo | string | — | End of custom date range (YYYY-MM-DD, inclusive). When both are set, the range is swept in smaller windows — can return thousands of articles per keyword. |
Localization
| Parameter | Type | Default | Description |
|---|---|---|---|
language | string | "en" | Interface language — English, Spanish, French, German, Italian, Portuguese, Dutch, Polish, Russian, Turkish, Japanese, Korean, Chinese, Arabic, Hindi, Indonesian, Thai, Vietnamese. |
country | string | "US" | Country bias for results — 47 countries supported, or "any" for no regional preference. |
Output Options
| Parameter | Type | Default | Description |
|---|---|---|---|
includeAuthor | boolean | true | Extract the article byline when shown on the card. |
resolvePublisherUrls | boolean | false | Recover the real publisher URL for each Google News redirect. Adds a small cost per resolved article. |
sortBy | string | "relevance" | "relevance" keeps Google's native ordering; "date" re-sorts by publication date (newest first). |
deduplicateAcrossKeywords | boolean | false | Drop articles that appear in more than one keyword's results. Off by default — each keyword's results are independent. |
Limits
| Parameter | Type | Default | Description |
|---|---|---|---|
maxResults | integer | 50 | Max articles per keyword or start URL. Use 0 for as many as Google returns. Without a date range, Google caps at ~100 per keyword. |
maxRequestsPerKeyword | integer | 300 | Safety cap on date-range sweep queries per keyword. |
Output
Each record is one article with up to 13 structured fields:
{"keyword": "Artificial Intelligence","sourceUrl": "https://news.google.com/search?q=Artificial+Intelligence+when%3A7d&hl=en-US&gl=US&ceid=US%3Aen","title": "OpenAI unveils new reasoning model with major benchmark gains","description": null,"source": "The New York Times","author": "Cade Metz","url": "https://news.google.com/read/CBMiiAFBVV95cUxPa...","publisherUrl": "https://www.nytimes.com/2026/04/18/technology/openai-reasoning-model.html","publishedAt": "2026-04-18T14:32:00Z","publishedRelative": "2 days ago","imageUrl": "https://lh3.googleusercontent.com/...","storyUrl": "https://news.google.com/stories/CAAqNggKIjBDQklTSGpvSmMzUnZjbmt0TXpZd1NoRUtEd2lnODRLd0NoRC1sTU9qQmlnQVAB","scrapedAt": "2026-04-20T13:45:12Z"}
All Available Fields
| Field | Type | Description |
|---|---|---|
keyword | string | null | The search keyword that produced this result. null for start-URL results. |
sourceUrl | string | The URL this record was fetched from — lets you trace records back to their input. |
title | string | Article headline. |
description | null | Always null — Google News cards don't include article snippets. Kept for schema stability. |
source | string | Publisher name (e.g. "The New York Times"). |
author | string | null | Article byline when Google exposes it on the card. |
url | string | Google News redirect URL for the article. Always present. |
publisherUrl | string | null | Direct publisher URL. Populated when resolvePublisherUrls is enabled. |
publishedAt | string | null | ISO 8601 publication timestamp (UTC). |
publishedRelative | string | Raw relative timestamp as shown by Google in the selected language (e.g. "3 days ago", "Il y a 2 heures"). |
imageUrl | string | null | Thumbnail URL (~99% of cards have one). |
storyUrl | string | null | URL to the Full Coverage page for this story, when Google groups the article into one. |
scrapedAt | string | ISO 8601 timestamp of when the record was captured. |
Tips for Best Results
- Beat the 100-article cap with a date range — any query capped at ~100 results can be expanded into thousands by setting
dateFromanddateTo. The scraper automatically splits the range into smaller windows. - Use exact-phrase + title-only for precision — combining
exactPhrase: truewithinTitleOnly: trueremoves ~90% of noise from broad keyword searches. - Narrow by publisher with
sources— ideal for monitoring a specific outlet's coverage of a topic without scraping their entire site. - Turn on
resolvePublisherUrlsselectively — it costs two extra HTTP requests per article. Leave it off for large scans and only enable when direct publisher links matter. - Match
languagetocountry— for best results, pair the market you care about with its primary language (e.g.language: "ja"+country: "JP"). Mismatches can silently return generic results. - Sort by date for freshness —
sortBy: "date"is perfect for newsfeed-style use cases where the newest articles matter most.
Pricing
$1.00 per 1,000 articles — no monthly rental, no tiered plans. You pay only for the articles you actually receive.
| Articles | Estimated Cost |
|---|---|
| 100 | $0.10 |
| 1,000 | $1.00 |
| 10,000 | $10.00 |
| 100,000 | $100.00 |
Optional: resolvePublisherUrls adds $1.00 per 1,000 resolved URLs on top (charged only when a direct publisher URL was successfully recovered). Failed decodes are never charged.
Platform fees (compute, proxy, storage) are additional and depend on your Apify plan.
Integrations
Export data in JSON, CSV, Excel, XML, or RSS. Connect to 1,500+ apps via:
- Zapier / Make / n8n — Workflow automation
- Google Sheets — Direct spreadsheet export
- Slack / Email — Notifications on new results
- Webhooks — Get notified when a run completes
- Apify API — Full programmatic access
Legal & Ethical Use
This actor is designed for legitimate media monitoring, market research, and journalistic work. Users are responsible for complying with applicable laws and Google's Terms of Service, as well as the terms of the publishers whose articles appear in the results. Respect copyright when redistributing headlines or summaries, and do not use collected data for spam, harassment, or any illegal purpose.