Google News Scraper - Fast URLs, Images & Alerts
Pricing
$0.70 / 1,000 results
Google News Scraper - Fast URLs, Images & Alerts
Fast Google News scraper with structured source rows for keywords, publishers, topics, local news, URLs, alerts, and Full Coverage stories. Returns clean rows with dates, publishers, direct URLs, images, 100+ backfill, new-article alerts, descriptions, entities, and best-effort full text.
Pricing
$0.70 / 1,000 results
Rating
5.0
(1)
Developer
VortexData
Maintained by CommunityActor stats
1
Bookmarked
21
Total users
13
Monthly active users
5 days ago
Last modified
Categories
Share
๐ฐ Google News Scraper
Fast Google News scraping for keyword search, new-article alerts, top headlines, topic feeds, publisher feeds, local news, Full Coverage stories, and Google News/RSS URLs.
The default run returns a clean Articles dataset with headlines, snippets when Google exposes them, publishers, publication dates, direct publisher URLs when they can be resolved, thumbnails, Full Coverage links, company/ticker detection, publisher-page descriptions/authors, best-effort full article text, and source/filter metadata.
Publisher-page enrichment is on by default so first runs return the richest useful rows. Turn off Extra Data toggles when you need a cheaper feed-only run.
No Google account or Google API key is required.
๐ Quick Start
Best first run:
- In Sources, keep Source type = Search query and change the value.
- Add extra source rows only when you want publisher feeds, topics, local news, Google News URLs, or Full Coverage stories.
- Pick country/language, time period, and article count.
- Keep the Extra Data toggles on for rich rows, or turn off full text/descriptions when speed and cost matter more than enrichment.
Example keyword search:
- Source type: Search query
- Value:
OpenAI - Articles to return:
20 - Time period: Last 24 hours
- Country / language edition: United States / English
The visible form uses structured source rows.
| Source type | Example values |
|---|---|
| Search query | OpenAI, "electric cars", Apple OR Microsoft, site:reuters.com AI, intitle:earnings |
| Publisher/domain | Reuters, BBC, cnn.com, reuters.com, Google News publication URL |
| Topic feed | Technology, Business, Sports, Health, World |
| Local news | New York, London, Tokyo, California |
| Google News URL | Google News search, topic, publication, section, or RSS URL |
| Full Coverage story | storyUrl, /stories/... path, or Google News story ID |
| Top headlines | Turn on Also include top headlines |
๐ New Article Alerts
Use New Article Alerts when you want scheduled runs that return only articles not seen before.
Example alert setup:
- Add two Search query rows:
Acme CorpandAcme CEO - Enable New Article Alerts
- Alert name:
Acme daily monitor - Articles to return:
100 - Time period: Last 24 hours
Recommended Apify setup:
- Test the Actor once with your search input.
- Save the configuration as an Apify Task, or add this Actor directly to an Apify Schedule.
- Keep New Article Alerts enabled and keep the same Alert name for the same alert.
- Attach an Apify Schedule for recurring runs and a webhook/integration if you want notifications or downstream automation.
The first scheduled run returns current matches and remembers them for that Alert name. Later runs with the same Alert name return only articles the actor has not returned before. Use Send all matches again when testing or when you intentionally want to reset the alert.
To run alerts automatically, save the input as an Apify Task and attach an Apify Schedule. Add a webhook if you want to send new articles to another app or endpoint.
๐งญ Sources
Use Source rows to combine several collection targets in one run. Most users only need one Search query row. Add publisher, topic, local news, Google News URL, or Full Coverage rows when you want those specific feeds too.
Turn on Also include top headlines only when you also want the selected country/language front page. Turn on New Article Alerts only for scheduled runs that should return articles not seen before.
Keyword searches support common Google operators such as exact phrases, OR, -exclude, site:domain.com, and intitle:term. You can combine several source types in one run by adding multiple source rows, and Total run limit caps the output across all enabled sources.
โจ Data Included by Default
Normal runs already return clean article rows with headline, snippet, publisher, publication date, Google News URL, source context, country/language, and the best article URL the actor can resolve.
When Google News exposes them, rows can also include thumbnails, Full Coverage links, related RSS articles, and social post cards inside Full Coverage stories.
Extra Data is on by default. You can turn these off only when you want a faster, cheaper feed-only run:
- Fetch descriptions and authors adds publisher-page canonical URL, site name, author, article description, page language, and better image metadata when available.
- Fetch full article text adds best-effort article text, word count, publisher-page dates, section, and tags when the publisher page allows extraction.
- Detect companies and tickers adds company, stock ticker, and crypto ticker signals when there is enough signal in the article data.
Publisher-page data is best-effort. Paywalls, consent pages, bot blocks, publisher downtime, and JavaScript-heavy pages can leave these fields empty. The row is still kept with its Google News metadata.
๐ฆ Output
Each dataset item is one article-like Google News result.
Common fields:
| Field | Meaning |
|---|---|
title | Article headline. |
description | Google News card or RSS snippet when available. |
source / sourceDomain | Publisher name and normalized domain. |
url | Best article URL. When URL resolving succeeds, this is the direct publisher URL. |
publisherUrl | Direct publisher article URL when resolved. |
googleNewsUrl | Original Google News result URL. |
publishedAt | Publication time in UTC. |
imageUrl | Thumbnail URL when available. |
storyUrl | Google News Full Coverage URL when available. |
platform / socialUrl / socialAuthor | Present only for Full Coverage social_post rows when Google exposes X/Twitter cards. |
search / sourceType | The keyword, topic, location, URL, or monitoring source that produced the row. |
isNew / monitorName | Present in monitoring runs. |
When description enrichment is enabled, the actor may also add canonicalUrl, siteName, publisherName, articleDescription, articleLanguage, and author.
When full-text extraction is enabled, the actor may also add fullText, wordCount, articleExtractionMethod, articlePublishedAt, articleModifiedAt, articleSection, and articleTags. Internal diagnostic fields are kept out of the default dataset so normal exports stay clean.
๐งฐ Filters
Use Search only these publishers when you want Google News itself to search only selected publishers. Enter one value per line, for example reuters.com and bbc.com.
Use publisher include/exclude filters when you want to keep or remove specific resolved publisher domains after scraping.
Use Exclude words for words or phrases such as opinion, sponsored, or press release.
๐งต Full Coverage and Publishers
Top headlines, topic, and keyword runs may return a storyUrl. Paste that value into ๐งต Full Coverage to collect the related articles in the same Google News story cluster. If Google News shows X/Twitter cards inside that story page, the actor can include them as social post rows without making separate X/Twitter requests.
Example: add a Source row with Source type Full Coverage story, then paste a storyUrl from a previous result or a Google News story URL.
For a publisher feed, enter a publisher name or domain such as Reuters, CNN, BBC, or reuters.com. You can also paste a Google News /publications/... URL or ID into ๐ข Publisher feed.
Example: add a Source row with Source type Publisher/domain and value Reuters.
โ ๏ธ Notes
- Google News ranking and availability vary by country, language, query, and time.
- A single Google News RSS feed usually exposes about 100 results. Large keyword and publisher/domain requests are split into daily windows, but the final count depends on how many unique results Google exposes. Top headlines, topics, local feeds, pasted URLs, and Full Coverage stories are limited by what that Google News surface exposes.
- Some articles do not expose thumbnails, summaries, authors, Full Coverage links, or social post cards.
- Direct URL resolving depends on Google News redirect behavior and may occasionally fail.
- This actor is not affiliated with Google. Users are responsible for lawful use and respecting publisher terms.
- Need help with a run? Open the Issues tab and include the run ID, input source, country/language edition, and expected result count.