Google News Scraper avatar

Google News Scraper

Pricing

from $4.00 / 1,000 results

Go to Apify Store
Google News Scraper

Google News Scraper

Scrape Google News in real-time. Supports keyword search, date filters, full-text article extraction, and image extraction.

Pricing

from $4.00 / 1,000 results

Rating

5.0

(10)

Developer

Crawler Bros

Crawler Bros

Maintained by Community

Actor stats

11

Bookmarked

2

Total users

1

Monthly active users

3 days ago

Last modified

Share

Scrape Google News articles by keyword in real-time. Get headlines, full article text, images, and author information — all structured and ready to use. No proxy needed.

What does Google News Scraper do?

This actor searches Google News for your keywords and returns structured article data. It resolves the actual article URLs (not Google redirect links), and can optionally extract the full article text and images from each page.

You can run multiple search queries in a single run. Results are automatically deduplicated across queries, so you never get the same article twice.

What data can you extract?

FieldDescription
titleArticle headline
urlDirect link to the article (resolved from Google News)
sourcePublisher name (e.g. BBC, Reuters, CNN)
domainPublisher domain (e.g. bbc.com)
publishedAtPublication date and time
snippetShort summary from Google News
fullTextFull article body text (optional)
imageUrlArticle hero image URL (optional)
authorArticle author name (optional)
languageLanguage code used for the search

Use cases

  • Media monitoring — Track news coverage of your brand, competitors, or industry topics across thousands of sources
  • Market research — Collect news about specific markets, companies, or trends for analysis
  • Content aggregation — Build news feeds or newsletters from curated topics and sources
  • Sentiment analysis — Feed article text into NLP pipelines to gauge public sentiment on any topic
  • Academic research — Gather news datasets filtered by date, region, and topic for studies
  • AI/LLM training data — Extract full article text at scale for language model fine-tuning

How to use it

  1. Add your search queries — Enter one or more keywords (e.g. "artificial intelligence", "climate change")
  2. Set your filters — Choose date range, language, country, or restrict to a specific site
  3. Choose what to extract — Enable full-text extraction and/or image extraction if needed
  4. Run the actor — Results appear in the dataset, ready to download as JSON, CSV, or Excel

Input

FieldTypeDefaultDescription
queriesstring[]requiredSearch keywords — each query is processed separately
maxResultsPerQueryinteger20Maximum articles to return per query (1–100)
languagestringenLanguage code (e.g. en, de, fr, es, ja)
countrystringUSCountry code (e.g. US, GB, DE, IN)
dateRangestringanyFilter by recency: 1h, 6h, 1d, 7d, 1m, 1y, or any
dateFromstringStart date YYYY-MM-DD (overrides dateRange)
dateTostringEnd date YYYY-MM-DD
siteFilterstringRestrict to a domain (e.g. reuters.com, bbc.com)
excludeWordsstring[][]Words or phrases to exclude from results
extractFullTextbooleanfalseExtract the full article body text
includeImagesbooleanfalseExtract the article's main image URL
maxConcurrencyinteger5Parallel browser pages (1–20). Higher = faster but more memory

Output example

Each item in the dataset represents one article:

{
"query": "artificial intelligence",
"title": "AI Achieves New Milestone in Scientific Research",
"url": "https://www.nature.com/articles/d41586-026-00123-4",
"source": "Nature",
"domain": "nature.com",
"publishedAt": "2026-03-03T09:15:00+00:00",
"snippet": "Researchers at MIT report a new breakthrough in machine learning...",
"fullText": "Full article body text here...",
"imageUrl": "https://nature.com/images/article-hero.jpg",
"author": "Dr. Jane Smith",
"language": "en",
"scrapedAt": "2026-03-03T12:00:00.000000+00:00"
}

fullText, imageUrl, and author are null unless extractFullText / includeImages are enabled.

Input examples

Headlines only — quick scan of today's news

{
"queries": ["artificial intelligence", "climate change"],
"maxResultsPerQuery": 20,
"dateRange": "1d"
}

Full articles with images

{
"queries": ["renewable energy"],
"maxResultsPerQuery": 10,
"extractFullText": true,
"includeImages": true
}

News from a specific site

{
"queries": ["technology"],
"siteFilter": "bbc.com",
"maxResultsPerQuery": 20
}

Custom date range with exclusions

{
"queries": ["elections"],
"dateFrom": "2026-01-01",
"dateTo": "2026-02-28",
"excludeWords": ["opinion", "editorial"],
"maxResultsPerQuery": 50
}

Non-English news

{
"queries": ["Nachrichten"],
"language": "de",
"country": "DE",
"maxResultsPerQuery": 10
}

Limitations

  • Paywalled articles — Full-text extraction returns null for sites behind paywalls (e.g. NYTimes, WSJ)