Google News Scraper(Full text)
Under maintenance

Pricing

from $5.00 / 1,000 results

Scrape Google News with 100% success. Automatically resolves encrypted JS redirects & extracts clean article text without ads. Perfect for AI/RAG, market monitoring, and sentiment analysis.

Rating: 5.0 (1)

Developer: Yusuf Barış (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 6
  • Monthly active users: 3
  • Last modified: 6 days ago

Google News Scraper & Full Text Extractor 🚀

Can't open Google News links? We fixed it. This is the only scraper that reliably resolves Google's encrypted redirect URLs and extracts the clean full text of articles.


🌟 Why This Actor?

Most Google News scrapers are broken. They either give you 403 errors when trying to open links, or they only provide useless snippets.

This Actor is built for business. It uses a hybrid smart engine to:

  1. 🔍 Find News: Scrape Google News for any keyword, country, or language.
  2. 🔓 Fix Links: Automatically resolve Google's complex redirection links (news.google.com/rss/articles/...) to get the real source URL.
  3. 📄 Extract Content: Visit the actual news site and extract the full article text, main image, and metadata, stripped of ads and popups.
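
To check whether a link in your own data is still an unresolved Google News redirect, a small helper like the one below can help. It is only a sketch: the function name is hypothetical, and it matches nothing beyond the news.google.com/rss/articles/ pattern mentioned above. It does not resolve the link, and it is not the Actor's internal logic.

```python
from urllib.parse import urlparse


def is_google_news_redirect(url: str) -> bool:
    """Return True if the URL looks like an unresolved Google News redirect.

    Hypothetical helper: it only recognises the
    news.google.com/rss/articles/... pattern named in this README.
    """
    parts = urlparse(url)
    return (
        parts.hostname == "news.google.com"
        and parts.path.startswith("/rss/articles/")
    )
```

Running it over the link field of each result is a quick way to confirm that the URLs you received point at the publisher rather than at Google.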

💼 What Can You Do With It?

  • AI & RAG Applications: Feed clean, full-text news data directly into your LLMs (ChatGPT, Claude, etc.) for high-quality context.
  • Market Intelligence: Monitor competitors and industry trends by reading what is being said, not just the headlines.
  • Sentiment Analysis: Analyze the entire article body for accurate sentiment scoring, avoiding misleading headlines.
  • Brand Monitoring: Track every mention of your brand or products across thousands of news sources instantly.
  • Automated News Feeds: Build your own news aggregation app or newsletter with zero manual work.

✨ Features

| Feature | Benefit |
| --- | --- |
| Full Text Extraction | Get the complete story. No clickbait, no paywalls (where possible), no ads. |
| 100% URL Resolution | Our hybrid engine handles Google's JS redirects perfectly. Say goodbye to broken links. |
| Global Scale | Support for 15+ countries and 14 languages built-in. |
| Smart Anti-Blocking | Uses intelligent proxy rotation to stay undetected. |
| Image Extraction | Automatically captures the main featured image of the article. |
| Time Range Filter | Filter news by time: last hour, 24 hours, 7 days, 30 days, or 1 year. |

⏱️ Performance Expectations

| Mode | Speed | What You Get |
| --- | --- | --- |
| Full Text ON (extractFullText: true) | ~6 minutes for 50 articles | Complete article body, main image, metadata |
| Full Text OFF (extractFullText: false) | Seconds (near-instant) | Title, source, date, snippet, link only |

💡 Tip: If you only need headlines and metadata for high-volume monitoring, set extractFullText: false for lightning-fast results. Enable it when you need the actual article content for AI/RAG applications.
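
For rough planning, the figures above can be turned into a back-of-the-envelope estimate. The sketch below assumes that run time scales linearly from the "~6 minutes for 50 articles" figure and that cost follows the listed "from $5.00 / 1,000 results" starting price. Both are simplifying assumptions, not guarantees, and the function name is hypothetical.

```python
# Figures taken from this page; linear scaling is an assumption.
FULL_TEXT_SECONDS_PER_50 = 6 * 60      # "~6 minutes for 50 articles"
STARTING_PRICE_PER_1000_USD = 5.00     # "from $5.00 / 1,000 results"


def estimate_run(max_results: int, extract_full_text: bool = True) -> dict:
    """Estimate run time and minimum cost for a given number of results."""
    if extract_full_text:
        approx_seconds = max_results * FULL_TEXT_SECONDS_PER_50 / 50
    else:
        # The page only says "seconds (near-instant)", so no number is guessed.
        approx_seconds = None
    min_cost_usd = round(max_results * STARTING_PRICE_PER_1000_USD / 1000, 2)
    return {"approx_seconds": approx_seconds, "min_cost_usd": min_cost_usd}
```

Under these assumptions, 200 full-text articles come out at about 24 minutes and at least $1.00. Actual times depend on the target sites and proxies.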

📥 Input Parameters

| Parameter | Type | Description | Default |
| --- | --- | --- | --- |
| searchQuery | String | What are you looking for? (e.g., "Tesla stock", "Elections") | - |
| maxResults | Integer | How many articles do you need? | 50 |
| country | String | Target country code (e.g., US, UK, TR) | US |
| language | String | Target language code (e.g., en, tr) | en |
| extractFullText | Boolean | Set to true to get the article body. | true |

Example Input

{
  "searchQuery": "artificial intelligence startups",
  "maxResults": 10,
  "country": "US",
  "language": "en",
  "extractFullText": true,
  "proxyConfiguration": {
    "useApifyProxy": true
  }
}
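
If you prefer to start runs from code, the input above could be built and submitted with the official apify-client Python package roughly as follows. Treat it as a sketch: ACTOR_ID is a placeholder (copy the real ID from the Actor's page), the APIFY_TOKEN environment variable is an assumed way to supply your token, and the helper names are hypothetical.

```python
import os

# Placeholder: replace with the real Actor ID shown on the Actor's page.
ACTOR_ID = "<username>/<actor-name>"


def build_input(search_query: str, max_results: int = 50, country: str = "US",
                language: str = "en", extract_full_text: bool = True) -> dict:
    """Build a run input that mirrors the documented parameters."""
    if not search_query.strip():
        raise ValueError("searchQuery must not be empty")
    if max_results < 1:
        raise ValueError("maxResults must be at least 1")
    return {
        "searchQuery": search_query,
        "maxResults": max_results,
        "country": country,
        "language": language,
        "extractFullText": extract_full_text,
        "proxyConfiguration": {"useApifyProxy": True},
    }


def run_actor(run_input: dict) -> list:
    """Start the Actor, wait for it to finish, and return the dataset items."""
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A typical call would be run_actor(build_input("artificial intelligence startups", max_results=10)). The import sits inside run_actor so that build_input can be used and tested without the client installed.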

📤 Output (Sample)

You get a clean JSON object for every article:

{
  "title": "OpenAI Announces New Model",
  "link": "https://techcrunch.com/2026/02/05/openai-new-model",
  "source": "TechCrunch",
  "date": "2026-02-05T14:30:00Z",
  "snippet": "OpenAI has just revealed its latest breakthrough in...",
  "fullText": "OpenAI today announced the release of GPT-5... [Full article content follows] ... The model shows significant improvements in reasoning.",
  "topImage": "https://techcrunch.com/wp-content/uploads/openai.jpg",
  "searchQuery": "artificial intelligence startups",
  "extractionStatus": "success"
}
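
For AI/RAG pipelines, items shaped like the sample above can be turned into documents with a few lines of post-processing. The sketch below assumes that any extractionStatus other than "success" means the full text is unusable. Only the "success" value appears in this README, so that is a guess about the other values. The function name and the output layout are hypothetical.

```python
def to_documents(items: list) -> list:
    """Keep successfully extracted articles and shape them for indexing."""
    docs = []
    for item in items:
        # Assumption: anything other than "success" is treated as a failure.
        if item.get("extractionStatus") != "success":
            continue
        full_text = (item.get("fullText") or "").strip()
        if not full_text:
            continue
        docs.append({
            "id": item["link"],
            "text": f'{item["title"]}\n\n{full_text}',
            "metadata": {
                "source": item.get("source"),
                "date": item.get("date"),
                "searchQuery": item.get("searchQuery"),
            },
        })
    return docs
```

Using the resolved link as the document ID makes it easy to deduplicate articles that show up under several search queries.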

🌍 Supported Regions

🇺🇸 USA, 🇬🇧 UK, 🇩🇪 Germany, 🇫🇷 France, 🇪🇸 Spain, 🇮🇹 Italy, 🇳🇱 Netherlands, 🇹🇷 Turkey, 🇧🇷 Brazil, 🇮🇳 India, 🇦🇺 Australia, 🇨🇦 Canada, 🇯🇵 Japan, 🇰🇷 Korea, 🇷🇺 Russia, 🇨🇳 China.


🛠️ For Developers (Technical Details)

This Actor is engineered for reliability.

  • Engine: Hybrid Node.js 20+ (Cheerio + Playwright)
  • Scraping Method: RSS Feed -> JS Redirect Resolution -> Readability Extraction
  • Output Format: JSON / CSV / Excel / XML
  • Maintenance: Actively maintained to adapt to Google's layout changes.
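
To make the first stage of that pipeline concrete, here is a minimal Python sketch of the "RSS Feed" step only. The Actor itself runs on Node.js, so this is an illustration and not its source code. It assumes the public Google News RSS search endpoint with hl, gl, and ceid query parameters in the forms shown. The function names are hypothetical, and redirect resolution and Readability extraction are deliberately left out.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def build_rss_url(query: str, country: str = "US", language: str = "en") -> str:
    """Build a Google News RSS search URL (the parameter format is an assumption)."""
    params = {
        "q": query,
        "hl": f"{language}-{country}",
        "gl": country,
        "ceid": f"{country}:{language}",
    }
    return "https://news.google.com/rss/search?" + urlencode(params)


def parse_rss(xml_text: str) -> list:
    """Extract title, link, date, and source from each <item> in an RSS feed."""
    root = ET.fromstring(xml_text)
    return [
        {
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),  # still a redirect URL
            "date": item.findtext("pubDate", default=""),
            "source": item.findtext("source", default=""),
        }
        for item in root.iter("item")
    ]
```

The links returned at this stage are the news.google.com/rss/articles/... redirects. Turning them into publisher URLs and clean text is the work of the later stages listed above.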

Need Help?

If you have any feature requests or find a bug, please create an issue in the Issues tab. We respond quickly!