Google News Scraper(Full text) - Cheap avatar

Google News Scraper(Full text) - Cheap

Pricing

from $2.50 / 1,000 results

Go to Apify Store
Google News Scraper(Full text) - Cheap

Google News Scraper(Full text) - Cheap

Scrape Google News with 100% success. Automatically resolves encrypted JS redirects & extracts clean article text without ads. Perfect for AI/RAG, market monitoring, and sentiment analysis.

Pricing

from $2.50 / 1,000 results

Rating

0.0

(0)

Developer

Yusuf Barış

Yusuf Barış

Maintained by Community

Actor stats

0

Bookmarked

43

Total users

9

Monthly active users

4 days ago

Last modified

Categories

Share

Google News Scraper & Full-Text Extractor

Scrape Google News results by keyword, country, language, and time period, then optionally visit each source article to extract clean full article text, final publisher URLs, snippets, publication dates, source names, and main images.

This Actor is built for AI/RAG pipelines, media monitoring, market research, competitor tracking, brand monitoring, and automated news workflows.

What does this Actor do?

This Actor searches Google News RSS, resolves Google News redirect links, and returns structured news article data in an Apify dataset. When extractFullText is enabled, it visits each resolved publisher URL and extracts readable article body text with Mozilla Readability.

The results can be exported as JSON, CSV, Excel, XML, RSS, HTML, or consumed through the Apify API.

Features

  • Search Google News by keyword, phrase, company, person, brand, or topic
  • Localize Google News results by country and interface language
  • Filter article results by language and source country
  • Filter news by time period, including past hour, past day, past week, past month, past year, or a custom date range
  • Resolve Google News redirect URLs to original publisher URLs
  • Extract full article text when available
  • Extract the main article image URL when detected
  • Save clean structured dataset items
  • Supports residential proxy configuration for more reliable extraction
  • API-ready output for automation, dashboards, newsletters, AI workflows, and data pipelines

Use cases

  • Build fresh news datasets for AI, RAG, and LLM workflows
  • Monitor brand, company, product, competitor, or executive mentions
  • Track market, finance, policy, technology, or industry news
  • Create automated news alerts, feeds, newsletters, or dashboards
  • Collect article text for sentiment analysis, topic classification, and research
  • Enrich internal monitoring systems with publisher URLs, snippets, dates, and article text
  • Compare coverage across countries, languages, and sources

Input

Configure the Actor with a search query, localization settings, result filters, extraction options, and proxy settings.

FieldTypeRequiredDefaultDescription
searchQuerystringYestechnologyKeyword, phrase, company, person, brand, or topic to search on Google News.
maxItemsintegerNo5Maximum number of Google News items to parse from the RSS feed. Google News may return fewer results than requested.
googleCountrystringYesUSGoogle News country used for localized search context. Examples: US, TR, GB, DE, FR.
uiLanguagestringYesenGoogle News interface language. Examples: en, tr, de, fr, es.
resultsLanguagestringNoallFilter results to a specific article language. Use all to allow any language.
resultsCountrystringNoallFilter results to sources associated with a specific country. Use all to allow any country.
timePeriodstringNoallFilter news by relative or custom time period. Supported values: all, 1h, 1d, 7d, 30d, 1y, custom.
customTimePeriodMinstringNoemptyStart date for a custom time period in YYYY-MM-DD format. Used with timePeriod: custom.
customTimePeriodMaxstringNoemptyEnd date for a custom time period in YYYY-MM-DD format. Used with timePeriod: custom.
noAutocorrectbooleanNofalseDisable Google's automatic query correction.
filterbooleanNotrueEnable Google's duplicate and similar-result filtering.
extractFullTextbooleanNofalseVisit each article URL and extract readable article body text. Enable this when you need full article content.
proxyConfigurationobjectNodisabledProxy settings for full-text extraction. Residential proxies are recommended when extractFullText is enabled.

Example input

{
"searchQuery": "artificial intelligence startups",
"maxItems": 5,
"googleCountry": "US",
"uiLanguage": "en",
"resultsLanguage": "en",
"resultsCountry": "US",
"timePeriod": "7d",
"noAutocorrect": false,
"filter": true,
"extractFullText": false,
"proxyConfiguration": {
"useApifyProxy": false
}
}

Output

Each dataset item represents one Google News result. If full-text extraction is enabled, the Actor attempts to resolve the publisher URL and extract article body text.

FieldTypeDescription
titlestringArticle headline from Google News.
linkstringFinal publisher article URL when resolved, or the best available source URL.
sourcestringPublisher or source name.
datestringPublication date in ISO 8601 format when available.
snippetstringShort Google News summary.
searchQuerystringSearch query used for the run.
fullTextstring or nullExtracted article body text. null when extraction is disabled, blocked, or unsuccessful.
topImagestring or nullMain article image URL when detected.
extractionStatusstringExtraction result: success, failed, skipped, or blocked.
statusCodeintegerOptional publisher HTTP status code when the page response is available, for example 401, 403, or 429.
errorstringOptional error details when extraction fails or a URL cannot be resolved.

Example output

{
"title": "OpenAI announces new model for enterprise AI workflows",
"link": "https://example-news-site.com/technology/openai-enterprise-ai-model",
"source": "Example News",
"date": "2026-02-05T14:30:00.000Z",
"snippet": "OpenAI introduced a new model focused on business automation and reasoning...",
"searchQuery": "artificial intelligence startups",
"fullText": "OpenAI introduced a new model designed for enterprise AI workflows...",
"topImage": "https://example-news-site.com/images/openai-model.jpg",
"extractionStatus": "success"
}

How to use

  1. Enter a keyword, phrase, brand, company, person, or topic in searchQuery.
  2. Set maxItems to control how many Google News items to collect.
  3. Choose googleCountry and uiLanguage for localized Google News results.
  4. Optionally set resultsLanguage and resultsCountry to filter returned articles.
  5. Choose timePeriod, or use customTimePeriodMin and customTimePeriodMax for a custom range.
  6. Enable extractFullText when you need full article body text.
  7. Run the Actor.
  8. Export the dataset as JSON, CSV, Excel, XML, RSS, HTML, or use the Apify API.

Configuration options

Full-text extraction

Use extractFullText: true when you need article body text for AI, RAG, sentiment analysis, summarization, or research. This mode is more useful but may take longer because the Actor visits publisher websites.

Use extractFullText: false when you only need headlines, snippets, dates, sources, and URLs. This is faster and better for lightweight monitoring jobs.

Time filtering

Use timePeriod for relative filters:

  • all - any time
  • 1h - past hour
  • 1d - past 24 hours
  • 7d - past week
  • 30d - past month
  • 1y - past year
  • custom - custom date range

For custom date ranges, set:

  • customTimePeriodMin - start date, for example 2026-01-01
  • customTimePeriodMax - end date, for example 2026-01-31

Localization and result filters

Use googleCountry and uiLanguage to control Google News localization. Use resultsLanguage and resultsCountry when you need stricter filtering by article language or source country.

Proxies

Residential proxies are recommended for full-text extraction because publisher websites may block datacenter traffic or automated requests.

Supported countries

The input schema supports these Google News country options:

US, TR, GB, DE, FR, ES, IT, NL, BR, IN, AU, CA, JP, KR, RU, CN

Supported languages

The input schema supports these language options:

en, tr, de, fr, es, it, nl, pt, hi, ja, ko, ru, zh-CN, ar

Pricing

Costs depend on the pricing model shown on the Actor page and your Apify platform usage.

In general:

  • Larger maxItems values can increase runtime.
  • extractFullText: true can increase runtime because publisher websites are visited.
  • Residential proxy usage may increase platform usage.
  • Failed, blocked, or empty article extraction can still consume runtime because the Actor has to request and process the page.

Check the Actor page and your Apify run details for the exact charging model and usage breakdown.

Limitations

  • Google News may return fewer results than requested.
  • Some publisher websites block automated requests.
  • Blocked publisher pages are saved as dataset items with extractionStatus: blocked instead of failing the run.
  • Paywalled content is not guaranteed to be extracted.
  • Some articles may return fullText: null if the source page blocks access or does not expose readable article content.
  • topImage may be missing if the page does not provide a detectable main image.
  • Results can vary by country, language, time period, and Google News availability.
  • Full-text extraction quality depends on the publisher page structure.
  • Very large runs may take longer depending on result count, publisher response times, and proxy conditions.

FAQ

Why is fullText sometimes null?

The publisher website may block access, return a paywall, use unsupported page markup, or fail to expose meaningful article text.

Can I use this Actor only for headlines?

Yes. Set extractFullText to false to return metadata such as title, source, date, snippet, and URL without visiting each publisher page for full text.

Can I filter by country and language?

Yes. Use googleCountry and uiLanguage for Google News localization. Use resultsLanguage and resultsCountry for stricter result filtering.

Can I use a custom date range?

Yes. Set timePeriod to custom, then provide customTimePeriodMin and/or customTimePeriodMax in YYYY-MM-DD format.

Can I use proxies?

Yes. proxyConfiguration is supported. Residential proxies are recommended for reliable full-text extraction.

Can I export the data?

Yes. Apify datasets can be exported as JSON, CSV, Excel, XML, RSS, HTML, or accessed programmatically through the Apify API.

Does this Actor support API usage?

Yes. You can run the Actor through the Apify API and consume the default dataset URL from the run output.

Troubleshooting

I received fewer results than maxItems

Google News may not return enough results for the selected query, country, language, and time period. Try broadening the query or using timePeriod: all.

Many articles are marked as blocked

Some publishers block automated access. Use residential proxies and consider reducing concurrency through platform settings if needed.

The Actor is slower than expected

Full-text extraction depends on publisher website response times. Disable extractFullText for faster headline-only runs.

Support

If you have questions, feature requests, or find a bug, open an issue on the Actor page.