Google News Scraper(Full text)
Pricing
from $5.00 / 1,000 results
Google News Scraper(Full text)
Scrape Google News with 100% success. Automatically resolves encrypted JS redirects & extracts clean article text without ads. Perfect for AI/RAG, market monitoring, and sentiment analysis.
Pricing
from $5.00 / 1,000 results
Rating
5.0
(1)
Developer
Yusuf Barış
Actor stats
0
Bookmarked
6
Total users
3
Monthly active users
6 days ago
Last modified
Categories
Share
Google News Scraper & Full Text Extractor 🚀
Can't open Google News links? We fixed it. This is the only scraper that reliably resolves Google's encrypted redirect URLs and extracts the clean full text of articles.
🌟 Why This Actor?
Most Google News scrapers are broken. They either give you 403 errors when trying to open links, or they only provide useless snippets.
This Actor is built for business. It uses a hybrid smart engine to:
- 🔍 Find News: Scrape Google News for any keyword, country, or language.
- 🔓 Fix Links: Automatically resolves Google's complex redirection links (
news.google.com/rss/articles/...) to get the real source URL. - 📄 Extract Content: Visits the actual news site and extracts the full article text, main image, and metadata—stripped of ads and popups.
💼 What Can You Do With It?
- AI & RAG Applications: Feed clean, full-text news data directly into your LLMs (ChatGPT, Claude, etc.) for high-quality context.
- Market Intelligence: Monitor competitors and industry trends by reading what is being said, not just the headlines.
- Sentiment Analysis: Analyze the entire article body for accurate sentiment scoring, avoiding misleading headlines.
- Brand Monitoring: Track every mention of your brand or products across thousands of news sources instantly.
- Automated News Feeds: Build your own news aggregation app or newsletter with zero manual work.
✨ Features
| Feature | Benefit |
|---|---|
| Full Text Extraction | Get the complete story. No clickbait, no paywalls (where possible), no ads. |
| 100% URL Resolution | Our hybrid engine handles Google's JS redirects perfectly. Say goodbye to broken links. |
| Global Scale | Support for 15+ countries and 14 languages built-in. |
| Smart Anti-Blocking | Uses intelligent proxy rotation to stay undetected. |
| Image Extraction | Automatically captures the main featured image of the article. |
| Time Range Filter | Filter news by time: last hour, 24 hours, 7 days, 30 days, or 1 year. |
⏱️ Performance Expectations
| Mode | Speed | What You Get |
|---|---|---|
Full Text ON (extractFullText: true) | ~6 minutes for 50 articles | Complete article body, main image, metadata |
Full Text OFF (extractFullText: false) | Seconds (near-instant) | Title, source, date, snippet, link only |
💡 Tip: If you only need headlines and metadata for high-volume monitoring, set
extractFullText: falsefor lightning-fast results. Enable it when you need the actual article content for AI/RAG applications.
📥 Input Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
searchQuery | String | What are you looking for? (e.g., "Tesla stock", "Elections") | - |
maxResults | Integer | How many articles do you need? | 50 |
country | String | Target country code (e.g., US, UK, TR) | US |
language | String | Target language code (e.g., en, tr) | en |
extractFullText | Boolean | Set to true to get the article body. | true |
Example Input
{"searchQuery": "artificial intelligence startups","maxResults": 10,"country": "US","language": "en","extractFullText": true,"proxyConfiguration": {"useApifyProxy": true}}
📤 Output (Sample)
You get a clean JSON object for every article:
{"title": "OpenAI Announces New Model","link": "https://techcrunch.com/2026/02/05/openai-new-model","source": "TechCrunch","date": "2026-02-05T14:30:00Z","snippet": "OpenAI has just revealed its latest breakthrough in...","fullText": "OpenAI today announced the release of GPT-5... [Full article content follows] ... The model shows significant improvements in reasoning.","topImage": "https://techcrunch.com/wp-content/uploads/openai.jpg","searchQuery": "artificial intelligence startups","extractionStatus": "success"}
🌍 Supported Regions
🇺🇸 USA, 🇬🇧 UK, 🇩🇪 Germany, 🇫🇷 France, 🇪🇸 Spain, 🇮🇹 Italy, 🇳🇱 Netherlands, 🇹🇷 Turkey, 🇧🇷 Brazil, 🇮🇳 India, 🇦🇺 Australia, 🇨🇦 Canada, 🇯🇵 Japan, 🇰🇷 Korea, 🇷🇺 Russia, 🇨🇳 China.
🛠️ For Developers (Technical Details)
This Actor is engineered for reliability.
- Engine: Hybrid Node.js 20+ (Cheerio + Playwright)
- Scraping Method: RSS Feed -> JS Redirect Resolution -> Readability Extraction
- Output Format: JSON / CSV / Excel / XML
- Maintenance: Actively maintained to adapt to Google's layout changes.
Need Help?
If you have any feature requests or find a bug, please create an issue in the Issues tab. We respond quickly!