Google News Scraper – News Monitoring & Article Data Extractor

Pricing

from $1.00 / 1,000 results

Extract the title, URL, source, publish date, and thumbnail image of each article. Well suited to news monitoring, research, and media tracking workflows.

Developer

Epic Scrapers

Maintained by Community

Google News Scraper

Gets featured articles from Google News with title, link, source, publication date and image.

What does Google News Scraper do?

Google News Scraper queries the Google News RSS API to collect news article metadata: headlines, article URLs, publisher/source, publication timestamps, and preview image URLs. It can optionally fetch the full article pages to decode the final links and extract images.

This Actor is perfect for:

  • News aggregation - Collect articles from multiple sources on specific topics
  • Media monitoring - Track mentions of your brand, competitors, or industry keywords
  • Market research - Analyze news trends and sentiment over time
  • Content curation - Gather articles for newsletters or content platforms
  • Academic research - Collect news data for analysis and studies

Unlike manually browsing Google News (which limits you to ~100 results per search), this Actor can retrieve significantly more results by automatically iterating through date ranges day by day.

Why use Google News Scraper?

  • Bypass result limits - Get more than the standard 100 results per search
  • Advanced search operators - Use intitle, inurl, site, exact match, exclusions, and boolean operators
  • Date range filtering - Search specific time windows or open-ended ranges (last hour, day, week, year)
  • Topic-based search - Scrape predefined topics or use hashed topic IDs for specific sections
  • Multilingual support - Search in 50+ languages and regions
  • Cost-effective - Uses RSS API instead of browser automation for faster, cheaper runs
  • Flexible output - Get simple RSS links or enriched data with decoded URLs and images

How to use Google News Scraper

1. Enter a search query

Enter a search query as you would in Google News. You can use advanced operators:

| Operator | Example | Description |
|---|---|---|
| intitle: | intitle:"AI" | Find articles with keyword in title |
| site: | site:bbc.com | Search within specific site |
| "" | "climate change" | Exact phrase match |
| - | apple -fruit | Exclude term |
| AND / OR | AI AND (ethics OR regulation) | Boolean operators |

Example queries:

  • intitle:"AI" AND site:forbes.com - AI articles from Forbes
  • site:reuters.com "stock market" -crypto - Stock market news excluding crypto
  • "Samsung Galaxy S25" AND (review OR comparison) - Reviews or comparisons
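As a rough illustration, a query using these operators has to be percent-encoded before it can go into a feed URL. The sketch below assumes the publicly visible `https://news.google.com/rss/search` endpoint with a `q` parameter; whether the Actor builds its requests exactly this way is not stated here, and `build_search_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Publicly visible Google News RSS search endpoint.
# Assumption: the Actor queries something equivalent to this.
RSS_SEARCH_URL = "https://news.google.com/rss/search"


def build_search_url(query: str) -> str:
    """Return an RSS search URL with the query percent-encoded."""
    params = urlencode({"q": query})
    return f"{RSS_SEARCH_URL}?{params}"


url = build_search_url('intitle:"AI" AND site:forbes.com')
print(url)
```

Quotes, colons, and spaces in the query are escaped by `urlencode`, so operators such as `intitle:"AI"` survive the trip intact.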

2. Select topics

Select predefined topics or use hashed topic IDs:

Predefined topics:

  • WORLD 🌎, NATION 🚩, BUSINESS 🪙, TECHNOLOGY 💻, ENTERTAINMENT 🎸, SPORTS 🏒, SCIENCE 🧪, HEALTH 🧑‍⚕️

Hashed topics: Copy topic IDs from Google News URLs for custom topics. Example:

CAAqJggKIiBDQkFTRWdvSUwyMHZNRGRqTVhZU0FtVnVHZ0pWVXlnQVAB/sections/CAQiQ0NCQVNMQW9JTDIwdk1EZGpNWFlTQW1WdUdnSlZVeUlOQ0FRYUNRb0hMMjB2TUcxcmVpb0pFZ2N2YlM4d2JXdDZLQUEqKggAKiYICiIgQ0JBU0Vnb0lMMjB2TURkak1YWVNBbVZ1R2dKVlV5Z0FQAVAB

This targets the "Technology > Artificial Intelligence" section.

3. Configure date range

Option A: Fixed dates

  • Date from: 2024-01-01
  • Date to: 2024-01-31

Option B: Open-ended range

  • 1h - Last hour
  • 24h - Last 24 hours
  • 7d - Last week
  • 30d - Last month
  • 1y - Last year
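If you generate these values programmatically, it can help to validate them first. The sketch below parses the `h`, `d`, and `y` suffixes shown above into a `timedelta`. Treating a year as 365 days is a simplifying assumption, and `parse_open_ended_range` is a hypothetical helper, not part of the Actor.

```python
import re
from datetime import timedelta

# Duration of one unit for each suffix.
# Assumption: a year is approximated as 365 days.
_UNIT_TO_DELTA = {
    "h": timedelta(hours=1),
    "d": timedelta(days=1),
    "y": timedelta(days=365),
}


def parse_open_ended_range(value: str) -> timedelta:
    """Convert a value such as '1h', '7d' or '1y' into a timedelta."""
    match = re.fullmatch(r"(\d+)([hdy])", value.strip())
    if match is None:
        raise ValueError(f"unsupported range: {value!r}")
    amount, unit = match.groups()
    return int(amount) * _UNIT_TO_DELTA[unit]


print(parse_open_ended_range("7d"))
```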

4. Set language and region

Choose from 50+ options like:

  • US:en - United States (English)
  • GB:en - United Kingdom (English)
  • DE:de - Germany (German)
  • FR:fr - France (French)
  • JP:ja - Japan (Japanese)
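Each option is a `REGION:language` pair. The sketch below splits such a pair into the `hl`, `gl`, and `ceid` query parameters that appear in Google News URLs. The exact mapping the Actor uses internally is an assumption, and `language_to_params` is a hypothetical helper.

```python
from typing import Dict


def language_to_params(pair: str) -> Dict[str, str]:
    """Split a pair such as 'US:en' into Google News style URL parameters."""
    region, sep, lang = pair.partition(":")
    if not sep or not region or not lang:
        raise ValueError(f"expected REGION:language, got {pair!r}")
    # Assumption: hl carries the language, gl the region, ceid the full pair.
    return {"hl": lang, "gl": region, "ceid": f"{region}:{lang}"}


print(language_to_params("DE:de"))
```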

5. Choose output detail level

Fetch article details: ON

  • Decodes RSS links to actual article URLs
  • Extracts preview images from article pages
  • Slower but more informative

Fetch article details: OFF

  • Returns RSS feed links only
  • Much faster and cheaper
  • No images, encoded URLs

Input

| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| query | string | No | - | Search query with optional advanced operators |
| topics | array | No | [] | Predefined topics (WORLD, NATION, BUSINESS, etc.) |
| topicsHashed | array | No | [] | Hashed topic IDs from Google News URLs |
| language | string | Yes | US:en | Language and region pair |
| maxItems | integer | No | - | Maximum number of items to scrape |
| fetchArticleDetails | boolean | No | true | Decode RSS links and fetch images |
| dateFrom | string | No | - | Start date (YYYY-MM-DD) |
| dateTo | string | No | - | End date (YYYY-MM-DD) |
| openEndedDateRange | string | No | - | Open-ended range (e.g., 1h, 7d, 1y) |
| proxyConfiguration | object | No | {useApifyProxy: true} | Proxy settings |

Example Input (JSON)

{
  "query": "intitle:\"artificial intelligence\" AND site:techcrunch.com",
  "language": "US:en",
  "maxItems": 50,
  "fetchArticleDetails": true,
  "openEndedDateRange": "7d",
  "proxyConfiguration": {
    "useApifyProxy": true
  }
}
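The same input can be sent from Python. The following is a minimal sketch using the third-party `apify-client` package; `ACTOR_ID` is a placeholder you would replace with the ID shown on the Actor's store page, and `build_run_input` and `run_actor` are hypothetical helper names.

```python
from typing import Any, Dict, List

# Placeholder: replace with the Actor's real ID from its Apify Store page.
ACTOR_ID = "<username>/<actor-name>"


def build_run_input(query: str, max_items: int = 50, date_range: str = "7d") -> Dict[str, Any]:
    """Assemble an input object using the fields from the Input table."""
    return {
        "query": query,
        "language": "US:en",
        "maxItems": max_items,
        "fetchArticleDetails": True,
        "openEndedDateRange": date_range,
        "proxyConfiguration": {"useApifyProxy": True},
    }


def run_actor(token: str, run_input: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Start a run, wait for it to finish, and return the dataset items."""
    # Imported lazily so build_run_input works without the package installed.
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = build_run_input('intitle:"artificial intelligence" AND site:techcrunch.com')
# items = run_actor("<YOUR_APIFY_TOKEN>", run_input)  # needs a real token and Actor ID
```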

Output

The Actor stores results in a dataset. You can download data in JSON, HTML, CSV, or Excel formats.

Output Schema

{
  "title": "Article headline",
  "link": "Direct article URL (decoded if fetchArticleDetails=true)",
  "guid": "Unique article identifier",
  "source": "Publisher name (e.g., 'BBC News')",
  "sourceUrl": "Publisher website URL",
  "publishedAt": "2024-01-15T14:30:00.000Z",
  "loadedUrl": "Final URL after redirects",
  "rssLink": "Original RSS feed link",
  "image": "Preview image URL (if available)"
}

Example Output

[
  {
    "title": "Web Scraping Optimization: Tips for Faster, Smarter Scrapers",
    "link": "https://hackernoon.com/web-scraping-optimization-tips-for-faster-smarter-scrapers",
    "guid": "CBMiiAFBVV95cUxQUXh5WVZ2RkNpNG9ndjF6V3hMRHBRTGRSVnNkelpwZDY2TWJzejBSMGZrRC1rSm5DZ1BxanpoeFFGdDRjWGpZR0tOUG9FY0kyeWFXOE9MSzBobTg1ajRiZzVhSWhtbm5nSVNJVWExSDBSaEFjUUJkT1JRRDJHSDBrMU9jU2ZZN3RN",
    "source": "hackernoon.com",
    "sourceUrl": "https://hackernoon.com",
    "publishedAt": "2024-11-15T08:00:00.000Z",
    "loadedUrl": "https://hackernoon.com/web-scraping-optimization-tips-for-faster-smarter-scrapers",
    "rssLink": "https://news.google.com/rss/articles/CBMiiAFBVV95cUxQUXh5WVZ2RkNpNG9ndjF6V3hMRHBRTGRSVnNkelpwZDY2TWJzejBSMGZrRC1rSm5DZ1BxanpoeFFGdDRjWGpZR0tOUG9FY0kyeWFXOE9MSzBobTg1ajRiZzVhSWhtbm5nSVNJVWExSDBSaEFjUUJkT1JRRDJHSDBrMU9jU2ZZN3RN?oc=5",
    "image": "https://hackernoon.imgix.net/images/0FC9YtxD4fbD3T7mPipOt4HSxY42-7y034nb.png"
  }
]

Data table

| Field | Type | Description |
|---|---|---|
| title | string | Article headline |
| link | string | Direct article URL |
| guid | string | Unique article identifier from RSS |
| source | string | News publisher name |
| sourceUrl | string | Publisher's website URL |
| publishedAt | string (ISO 8601) | Publication timestamp |
| loadedUrl | string | Final URL after following redirects |
| rssLink | string | Original Google News RSS link |
| image | string | Preview image URL (if fetched) |
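When consuming the dataset in Python, it is often convenient to turn each item into a typed record and parse `publishedAt` into a timezone-aware `datetime`. The sketch below does this for a subset of the fields above; the `Article` class and `parse_article` function are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Any, Dict, Optional


@dataclass
class Article:
    title: str
    link: str
    source: str
    published_at: datetime
    image: Optional[str] = None


def parse_article(item: Dict[str, Any]) -> Article:
    """Build an Article from one dataset item."""
    # Older Python versions reject a trailing "Z" in fromisoformat(),
    # so it is swapped for an explicit UTC offset first.
    published = datetime.fromisoformat(item["publishedAt"].replace("Z", "+00:00"))
    return Article(
        title=item["title"],
        link=item["link"],
        source=item["source"],
        published_at=published,
        image=item.get("image"),  # may be absent when details are not fetched
    )


article = parse_article({
    "title": "Web Scraping Optimization: Tips for Faster, Smarter Scrapers",
    "link": "https://hackernoon.com/web-scraping-optimization-tips-for-faster-smarter-scrapers",
    "source": "hackernoon.com",
    "publishedAt": "2024-11-15T08:00:00.000Z",
})
print(article.published_at.isoformat())
```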

Pricing

This Actor is priced at $20/month with a 7-day free trial (10,080 minutes).

Cost estimation:

| Scenario | Articles | With Article Details | Est. Cost |
|---|---|---|---|
| Quick search | 100 | No | ~$0.01 |
| Medium search | 500 | Yes | ~$0.05-0.10 |
| Large search | 2000 | Yes | ~$0.20-0.40 |

Note: Costs include Apify platform usage (compute units, proxy). Actual costs vary based on proxy type and run duration.

Tips to reduce costs:

  1. Set fetchArticleDetails to false if you only need RSS links (~10x faster)
  2. Use maxItems to limit results
  3. Use shorter date ranges for focused searches
  4. Use the FREE plan's monthly credits for small runs

Advanced Tips

Combining search methods

You can combine query, topics, and topicsHashed in a single run. The Actor will search each independently and merge results.
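If you also combine results across several runs yourself, the same article can show up more than once. The sketch below merges batches and keeps the first occurrence of each `guid`. It is a post-processing idea for your own code, not a description of how the Actor merges internally, and `merge_results` is a hypothetical helper.

```python
from typing import Any, Dict, Iterable, List


def merge_results(*batches: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Concatenate batches, keeping only the first item seen for each guid."""
    seen = set()
    merged: List[Dict[str, Any]] = []
    for batch in batches:
        for item in batch:
            guid = item["guid"]
            if guid in seen:
                continue  # duplicate article from another search
            seen.add(guid)
            merged.append(item)
    return merged


query_items = [{"guid": "a", "title": "First"}, {"guid": "b", "title": "Second"}]
topic_items = [{"guid": "b", "title": "Second"}, {"guid": "c", "title": "Third"}]
merged = merge_results(query_items, topic_items)
print([item["guid"] for item in merged])
```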

Finding hashed topic IDs

  1. Go to Google News
  2. Navigate to a topic or section
  3. Copy the ID from the URL after /topics/
  4. For sections, include the /sections/ part too
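The steps above can be sketched in code. The helper below takes everything in the URL path after `/topics/`, which keeps the `/sections/` part when present and drops the query string. The IDs in the example are short made-up placeholders, and `extract_topic_id` is a hypothetical name.

```python
from urllib.parse import urlsplit


def extract_topic_id(url: str) -> str:
    """Return the part of the URL path that follows /topics/."""
    path = urlsplit(url).path  # the query string (?hl=...) is excluded
    marker = "/topics/"
    index = path.find(marker)
    if index == -1:
        raise ValueError(f"no /topics/ segment in {url!r}")
    return path[index + len(marker):].strip("/")


# ABC123 and DEF456 are placeholders, not real topic IDs.
topic_id = extract_topic_id(
    "https://news.google.com/topics/ABC123/sections/DEF456?hl=en-US&gl=US&ceid=US%3Aen"
)
print(topic_id)
```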

Handling large result sets

If you need thousands of articles:

  1. Set maxItems to your desired count
  2. The Actor automatically iterates day-by-day when maxItems > 100
  3. Use broader date ranges for historical data
  4. Consider starting several runs of the Actor in parallel, each with a different query
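To picture the day-by-day iteration, the sketch below splits a date range into one-day windows and appends a date filter to the query for each window. The `after:` and `before:` search operators are an assumption about how such a filter could be expressed; the Actor's actual mechanism is not documented here, and both function names are hypothetical.

```python
from datetime import date, timedelta
from typing import Iterator, Tuple


def daily_windows(date_from: date, date_to: date) -> Iterator[Tuple[date, date]]:
    """Yield one (start, end) pair per day from date_from to date_to inclusive."""
    current = date_from
    while current <= date_to:
        yield current, current + timedelta(days=1)
        current += timedelta(days=1)


def windowed_query(query: str, start: date, end: date) -> str:
    """Append a date filter to the query (assumed after:/before: operators)."""
    return f"{query} after:{start.isoformat()} before:{end.isoformat()}"


windows = list(daily_windows(date(2024, 1, 1), date(2024, 1, 31)))
print(len(windows))
print(windowed_query("climate change", *windows[0]))
```

Because each window is queried separately, a per-search cap applies per day rather than to the whole range, which is why longer ranges can return more items overall.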

Language/region tricks

Get different perspectives on the same topic:

  • Search US news in German: language: DE:de + US topic hash
  • Search international news about Japan in English: language: US:en + Japan topic hash

FAQ and Support

Is scraping Google News legal?

This Actor uses the public Google News RSS API, which is designed for syndication. However, you should:

  • Respect robots.txt and terms of service
  • Not overwhelm the service with excessive requests
  • Use extracted data in compliance with copyright laws
  • Consider data privacy regulations (GDPR, CCPA) when storing/processing data

Why are some images missing?

Images are extracted from article pages using Open Graph and Twitter Card metadata. Not all websites provide this data. The image URL comes directly from the publisher's site, not Google News.
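For a sense of what this metadata lookup involves, the sketch below scans a page's `<meta>` tags for `og:image` or `twitter:image` using only the standard library. It illustrates the general technique and is not the Actor's own code; the class and function names are hypothetical.

```python
from html.parser import HTMLParser
from typing import Optional


class PreviewImageParser(HTMLParser):
    """Remember the first og:image or twitter:image value found in the page."""

    KEYS = ("og:image", "twitter:image")

    def __init__(self) -> None:
        super().__init__()
        self.image: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.image is not None:
            return
        attributes = dict(attrs)
        # Open Graph normally uses property=..., Twitter Cards use name=...
        key = attributes.get("property") or attributes.get("name")
        if key in self.KEYS and attributes.get("content"):
            self.image = attributes["content"]


def find_preview_image(html: str) -> Optional[str]:
    """Return the preview image URL, or None when the page declares none."""
    parser = PreviewImageParser()
    parser.feed(html)
    return parser.image


page = '<html><head><meta property="og:image" content="https://example.com/cover.png"></head></html>'
print(find_preview_image(page))
```

A page without either tag yields `None`, which corresponds to a missing `image` value in the output.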

Can I get full article text?

This Actor extracts metadata only. For full article text, use the extracted link field with a dedicated article scraper like Smart Article Extractor.

What if I get blocked?

The Actor uses Apify Proxy by default. If you experience issues:

  1. Check your proxy settings
  2. Reduce concurrency (built-in default is conservative)
  3. Use shorter date ranges
  4. Enable fetchArticleDetails only when needed
