Google News Scraper(Full text) - Cheap
Pricing
from $2.50 / 1,000 results
Scrape Google News with 100% success. Automatically resolves encrypted JS redirects & extracts clean article text without ads. Perfect for AI/RAG, market monitoring, and sentiment analysis.
Developer
Yusuf Barış
Maintained by Community
Google News Scraper & Full-Text Extractor
Scrape Google News results by keyword, country, language, and time period, then optionally visit each source article to extract clean full article text, final publisher URLs, snippets, publication dates, source names, and main images.
This Actor is built for AI/RAG pipelines, media monitoring, market research, competitor tracking, brand monitoring, and automated news workflows.
What does this Actor do?
This Actor searches Google News RSS, resolves Google News redirect links, and returns structured news article data in an Apify dataset. When extractFullText is enabled, it visits each resolved publisher URL and extracts readable article body text with Mozilla Readability.
The results can be exported as JSON, CSV, Excel, XML, RSS, HTML, or consumed through the Apify API.
Features
- Search Google News by keyword, phrase, company, person, brand, or topic
- Localize Google News results by country and interface language
- Filter article results by language and source country
- Filter news by time period, including past hour, past day, past week, past month, past year, or a custom date range
- Resolve Google News redirect URLs to original publisher URLs
- Extract full article text when available
- Extract the main article image URL when detected
- Save clean structured dataset items
- Supports residential proxy configuration for more reliable extraction
- API-ready output for automation, dashboards, newsletters, AI workflows, and data pipelines
Use cases
- Build fresh news datasets for AI, RAG, and LLM workflows
- Monitor brand, company, product, competitor, or executive mentions
- Track market, finance, policy, technology, or industry news
- Create automated news alerts, feeds, newsletters, or dashboards
- Collect article text for sentiment analysis, topic classification, and research
- Enrich internal monitoring systems with publisher URLs, snippets, dates, and article text
- Compare coverage across countries, languages, and sources
Input
Configure the Actor with a search query, localization settings, result filters, extraction options, and proxy settings.
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| searchQuery | string | Yes | technology | Keyword, phrase, company, person, brand, or topic to search on Google News. |
| maxItems | integer | No | 5 | Maximum number of Google News items to parse from the RSS feed. Google News may return fewer results than requested. |
| googleCountry | string | Yes | US | Google News country used for localized search context. Examples: US, TR, GB, DE, FR. |
| uiLanguage | string | Yes | en | Google News interface language. Examples: en, tr, de, fr, es. |
| resultsLanguage | string | No | all | Filter results to a specific article language. Use all to allow any language. |
| resultsCountry | string | No | all | Filter results to sources associated with a specific country. Use all to allow any country. |
| timePeriod | string | No | all | Filter news by relative or custom time period. Supported values: all, 1h, 1d, 7d, 30d, 1y, custom. |
| customTimePeriodMin | string | No | empty | Start date for a custom time period in YYYY-MM-DD format. Used with timePeriod: custom. |
| customTimePeriodMax | string | No | empty | End date for a custom time period in YYYY-MM-DD format. Used with timePeriod: custom. |
| noAutocorrect | boolean | No | false | Disable Google's automatic query correction. |
| filter | boolean | No | true | Enable Google's duplicate and similar-result filtering. |
| extractFullText | boolean | No | false | Visit each article URL and extract readable article body text. Enable this when you need full article content. |
| proxyConfiguration | object | No | disabled | Proxy settings for full-text extraction. Residential proxies are recommended when extractFullText is enabled. |
Example input
{"searchQuery": "artificial intelligence startups","maxItems": 5,"googleCountry": "US","uiLanguage": "en","resultsLanguage": "en","resultsCountry": "US","timePeriod": "7d","noAutocorrect": false,"filter": true,"extractFullText": false,"proxyConfiguration": {"useApifyProxy": false}}
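As a rough illustration, the same kind of input can be assembled and checked in Python before a run is started. `build_input` and its keyword arguments are hypothetical names introduced for this sketch and are not part of the Actor; only the field names, defaults, and timePeriod values are taken from the input table above.

```python
import json

# Allowed timePeriod values, copied from the input table above.
TIME_PERIODS = {"all", "1h", "1d", "7d", "30d", "1y", "custom"}


def build_input(search_query, max_items=5, google_country="US",
                ui_language="en", time_period="all", extract_full_text=False):
    """Hypothetical helper: build an input object with the documented field names."""
    if not search_query.strip():
        raise ValueError("searchQuery is required")
    if max_items < 1:
        raise ValueError("maxItems must be at least 1")
    if time_period not in TIME_PERIODS:
        raise ValueError(f"unsupported timePeriod: {time_period!r}")
    return {
        "searchQuery": search_query,
        "maxItems": max_items,
        "googleCountry": google_country,
        "uiLanguage": ui_language,
        "timePeriod": time_period,
        "extractFullText": extract_full_text,
    }


if __name__ == "__main__":
    # Print the JSON that would be pasted into the Actor input editor.
    print(json.dumps(build_input("artificial intelligence startups",
                                 time_period="7d"), indent=2))
```

Optional fields that are left out (for example resultsLanguage or filter) should fall back to the defaults listed in the table.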
Output
Each dataset item represents one Google News result. If full-text extraction is enabled, the Actor attempts to resolve the publisher URL and extract article body text.
| Field | Type | Description |
|---|---|---|
| title | string | Article headline from Google News. |
| link | string | Final publisher article URL when resolved, or the best available source URL. |
| source | string | Publisher or source name. |
| date | string | Publication date in ISO 8601 format when available. |
| snippet | string | Short Google News summary. |
| searchQuery | string | Search query used for the run. |
| fullText | string or null | Extracted article body text. null when extraction is disabled, blocked, or unsuccessful. |
| topImage | string or null | Main article image URL when detected. |
| extractionStatus | string | Extraction result: success, failed, skipped, or blocked. |
| statusCode | integer | Optional publisher HTTP status code when the page response is available, for example 401, 403, or 429. |
| error | string | Optional error details when extraction fails or a URL cannot be resolved. |
Example output
{"title": "OpenAI announces new model for enterprise AI workflows","link": "https://example-news-site.com/technology/openai-enterprise-ai-model","source": "Example News","date": "2026-02-05T14:30:00.000Z","snippet": "OpenAI introduced a new model focused on business automation and reasoning...","searchQuery": "artificial intelligence startups","fullText": "OpenAI introduced a new model designed for enterprise AI workflows...","topImage": "https://example-news-site.com/images/openai-model.jpg","extractionStatus": "success"}
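For AI/RAG use, dataset items usually need to be reduced to the ones that actually carry article text. The following is a minimal sketch of that step, assuming the items have already been downloaded as a list of dictionaries with the fields listed above. `to_documents` and the shape of the returned records are illustrative choices, not something the Actor produces.

```python
def to_documents(items):
    """Keep only items with extracted text and reshape them for indexing.

    The "id" / "text" / "metadata" layout is an assumption made for this
    sketch; adapt it to whatever your vector store or pipeline expects.
    """
    documents = []
    for item in items:
        text = item.get("fullText")
        # fullText is null when extraction is disabled, blocked, or unsuccessful.
        if item.get("extractionStatus") != "success" or not text:
            continue
        documents.append({
            "id": item["link"],
            "text": text.strip(),
            "metadata": {
                "title": item.get("title"),
                "source": item.get("source"),
                "date": item.get("date"),
                "searchQuery": item.get("searchQuery"),
            },
        })
    return documents
```

Items that are skipped here still contain title, snippet, and link, so they can be kept separately for headline-only monitoring.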
How to use
- Enter a keyword, phrase, brand, company, person, or topic in searchQuery.
- Set maxItems to control how many Google News items to collect.
- Choose googleCountry and uiLanguage for localized Google News results.
- Optionally set resultsLanguage and resultsCountry to filter returned articles.
- Choose timePeriod, or use customTimePeriodMin and customTimePeriodMax for a custom range.
- Enable extractFullText when you need full article body text.
- Run the Actor.
- Export the dataset as JSON, CSV, Excel, XML, RSS, HTML, or use the Apify API.
Configuration options
Full-text extraction
Use extractFullText: true when you need article body text for AI, RAG, sentiment analysis, summarization, or research. This mode returns more content per item but takes longer because the Actor visits each publisher website.
Use extractFullText: false when you only need headlines, snippets, dates, sources, and URLs. This is faster and better for lightweight monitoring jobs.
Time filtering
Use timePeriod for relative filters:
- all: any time
- 1h: past hour
- 1d: past 24 hours
- 7d: past week
- 30d: past month
- 1y: past year
- custom: custom date range
For custom date ranges, set:
- customTimePeriodMin: start date, for example 2026-01-01
- customTimePeriodMax: end date, for example 2026-01-31
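A custom range is often computed rather than typed by hand. Here is a small sketch that produces the three time fields for a window ending on a given day. `custom_period` is a hypothetical helper, and treating both dates as inclusive is an assumption, since this README does not say how the Actor interprets the bounds.

```python
from datetime import date, timedelta


def custom_period(end, days):
    """Return the timePeriod fields for the `days` calendar days ending on `end`.

    Assumption: both bounds are inclusive. Verify against real results.
    """
    if days < 1:
        raise ValueError("days must be at least 1")
    start = end - timedelta(days=days - 1)
    return {
        "timePeriod": "custom",
        # date.isoformat() yields the YYYY-MM-DD format the input expects.
        "customTimePeriodMin": start.isoformat(),
        "customTimePeriodMax": end.isoformat(),
    }


if __name__ == "__main__":
    # All of January 2026, matching the example dates above.
    print(custom_period(date(2026, 1, 31), 31))
```

The returned dictionary can be merged into the rest of the Actor input, for example with `{**base_input, **custom_period(date.today(), 7)}`.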
Localization and result filters
Use googleCountry and uiLanguage to control Google News localization. Use resultsLanguage and resultsCountry when you need stricter filtering by article language or source country.
Proxies
Residential proxies are recommended for full-text extraction because publisher websites may block datacenter traffic or automated requests.
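As an illustration, a full-text run with residential proxies might be configured as sketched below. The example input in this README only shows `useApifyProxy`; the `apifyProxyGroups` key with the value `RESIDENTIAL` follows the usual Apify proxy input convention and is assumed, not confirmed, to be accepted by this Actor. `with_residential_proxy` is a hypothetical helper name.

```python
def with_residential_proxy(actor_input):
    """Return a copy of the input with full-text extraction and residential proxies on.

    Assumption: proxyConfiguration follows the standard Apify proxy input
    shape (useApifyProxy + apifyProxyGroups). Check the Actor's input
    schema before relying on it.
    """
    updated = dict(actor_input)  # shallow copy, the caller's dict is untouched
    updated["extractFullText"] = True
    updated["proxyConfiguration"] = {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    }
    return updated
```

Residential traffic is billed as platform usage, so it may be worth enabling it only for runs where extractFullText is actually needed.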
Supported countries
The input schema supports these Google News country options:
US, TR, GB, DE, FR, ES, IT, NL, BR, IN, AU, CA, JP, KR, RU, CN
Supported languages
The input schema supports these language options:
en, tr, de, fr, es, it, nl, pt, hi, ja, ko, ru, zh-CN, ar
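A quick pre-flight check against these two lists can catch typos before a run is started. The sketch below assumes the lists apply to googleCountry and uiLanguage respectively and that codes are matched exactly as written (case-sensitive); neither detail is stated in this README. `check_localization` is a hypothetical helper.

```python
# Codes copied from the two lists above.
SUPPORTED_COUNTRIES = {"US", "TR", "GB", "DE", "FR", "ES", "IT", "NL",
                       "BR", "IN", "AU", "CA", "JP", "KR", "RU", "CN"}
SUPPORTED_LANGUAGES = {"en", "tr", "de", "fr", "es", "it", "nl", "pt",
                       "hi", "ja", "ko", "ru", "zh-CN", "ar"}


def check_localization(google_country, ui_language):
    """Return a list of human-readable problems; an empty list means both codes are listed."""
    problems = []
    if google_country not in SUPPORTED_COUNTRIES:
        problems.append(f"googleCountry {google_country!r} is not in the supported list")
    if ui_language not in SUPPORTED_LANGUAGES:
        problems.append(f"uiLanguage {ui_language!r} is not in the supported list")
    return problems
```

For example, `check_localization("TR", "tr")` returns an empty list, while a lowercase country code such as `"us"` is reported as a problem under the exact-match assumption.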
Pricing
Costs depend on the pricing model shown on the Actor page and your Apify platform usage.
In general:
- Larger maxItems values can increase runtime.
- extractFullText: true can increase runtime because publisher websites are visited.
- Residential proxy usage may increase platform usage.
- Failed, blocked, or empty article extraction can still consume runtime because the Actor has to request and process the page.
Check the Actor page and your Apify run details for the exact charging model and usage breakdown.
Limitations
- Google News may return fewer results than requested.
- Some publisher websites block automated requests.
- Blocked publisher pages are saved as dataset items with extractionStatus: blocked instead of failing the run.
- Paywalled content is not guaranteed to be extracted.
- Some articles may return fullText: null if the source page blocks access or does not expose readable article content.
- topImage may be missing if the page does not provide a detectable main image.
- Results can vary by country, language, time period, and Google News availability.
- Full-text extraction quality depends on the publisher page structure.
- Very large runs may take longer depending on result count, publisher response times, and proxy conditions.
FAQ
Why is fullText sometimes null?
The publisher website may block access, return a paywall, use unsupported page markup, or fail to expose meaningful article text.
Can I use this Actor only for headlines?
Yes. Set extractFullText to false to return metadata such as title, source, date, snippet, and URL without visiting each publisher page for full text.
Can I filter by country and language?
Yes. Use googleCountry and uiLanguage for Google News localization. Use resultsLanguage and resultsCountry for stricter result filtering.
Can I use a custom date range?
Yes. Set timePeriod to custom, then provide customTimePeriodMin and/or customTimePeriodMax in YYYY-MM-DD format.
Can I use proxies?
Yes. proxyConfiguration is supported. Residential proxies are recommended for reliable full-text extraction.
Can I export the data?
Yes. Apify datasets can be exported as JSON, CSV, Excel, XML, RSS, HTML, or accessed programmatically through the Apify API.
Does this Actor support API usage?
Yes. You can run the Actor through the Apify API and consume the default dataset URL from the run output.
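One possible way to do this from Python, using only the standard library, is sketched below. It targets the Apify API's synchronous `run-sync-get-dataset-items` endpoint, which starts a run and returns the dataset items in the response. The Actor ID is not given in this README, so `actor_id` is a placeholder you must fill in from the Actor page; `build_run_request` and `run_actor` are hypothetical helper names.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id, token, actor_input):
    """Prepare a POST request that runs the Actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{quote(actor_id, safe='~')}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(actor_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",  # Apify API token
        },
        method="POST",
    )


def run_actor(actor_id, token, actor_input, timeout=300):
    """Send the request and return the parsed list of dataset items."""
    request = build_run_request(actor_id, token, actor_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    import os
    # APIFY_ACTOR_ID is a placeholder variable name, e.g. "username~actor-name".
    items = run_actor(os.environ["APIFY_ACTOR_ID"], os.environ["APIFY_TOKEN"],
                      {"searchQuery": "technology", "maxItems": 5,
                       "googleCountry": "US", "uiLanguage": "en"})
    for item in items:
        print(item["date"], item["source"], item["title"])
```

The synchronous endpoint suits short headline-only runs. For long full-text runs, starting the run asynchronously and reading the default dataset afterwards is likely the safer pattern.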
Troubleshooting
I received fewer results than maxItems
Google News may not return enough results for the selected query, country, language, and time period. Try broadening the query or using timePeriod: all.
Many articles are marked as blocked
Some publishers block automated access. Use residential proxies and consider reducing concurrency through platform settings if needed.
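To judge how severe blocking is for a given run, it can help to tally the extractionStatus and statusCode fields of the dataset items. The following is a minimal sketch, assuming the items are already loaded as a list of dictionaries; `summarize_extraction` and the keys of its return value are names invented for this example.

```python
from collections import Counter


def summarize_extraction(items):
    """Count extraction outcomes and the HTTP status codes seen on blocked pages."""
    statuses = Counter(item.get("extractionStatus", "unknown") for item in items)
    blocked_codes = Counter(
        item["statusCode"]
        for item in items
        # statusCode is optional, so only count it when present.
        if item.get("extractionStatus") == "blocked" and "statusCode" in item
    )
    total = len(items)
    return {
        "statuses": dict(statuses),
        "blockedStatusCodes": dict(blocked_codes),
        "blockedShare": statuses["blocked"] / total if total else 0.0,
    }
```

A high share of 403 or 429 codes points at publisher-side blocking or rate limiting, which is the case where residential proxies are most likely to help.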
The Actor is slower than expected
Full-text extraction depends on publisher website response times. Disable extractFullText for faster headline-only runs.
Support
If you have questions, feature requests, or find a bug, open an issue on the Actor page.