Google News Scraper - Fast URLs, Images & Alerts avatar

Google News Scraper - Fast URLs, Images & Alerts

Pricing

$0.70 / 1,000 results

Go to Apify Store
Google News Scraper - Fast URLs, Images & Alerts

Google News Scraper - Fast URLs, Images & Alerts

Fast Google News scraper with structured source rows for keywords, publishers, topics, local news, URLs, alerts, and Full Coverage stories. Returns clean rows with dates, publishers, direct URLs, images, 100+ backfill, new-article alerts, descriptions, entities, and best-effort full text.

Pricing

$0.70 / 1,000 results

Rating

5.0

(1)

Developer

VortexData

VortexData

Maintained by Community

Actor stats

1

Bookmarked

21

Total users

13

Monthly active users

5 days ago

Last modified

Share

๐Ÿ“ฐ Google News Scraper

Fast Google News scraping for keyword search, new-article alerts, top headlines, topic feeds, publisher feeds, local news, Full Coverage stories, and Google News/RSS URLs.

The default run returns a clean Articles dataset with headlines, snippets when Google exposes them, publishers, publication dates, direct publisher URLs when they can be resolved, thumbnails, Full Coverage links, company/ticker detection, publisher-page descriptions/authors, best-effort full article text, and source/filter metadata.

Publisher-page enrichment is on by default so first runs return the richest useful rows. Turn off Extra Data toggles when you need a cheaper feed-only run.

No Google account or Google API key is required.

๐Ÿš€ Quick Start

Best first run:

  1. In Sources, keep Source type = Search query and change the value.
  2. Add extra source rows only when you want publisher feeds, topics, local news, Google News URLs, or Full Coverage stories.
  3. Pick country/language, time period, and article count.
  4. Keep the Extra Data toggles on for rich rows, or turn off full text/descriptions when speed and cost matter more than enrichment.

Example keyword search:

  • Source type: Search query
  • Value: OpenAI
  • Articles to return: 20
  • Time period: Last 24 hours
  • Country / language edition: United States / English

The visible form uses structured source rows.

Source typeExample values
Search queryOpenAI, "electric cars", Apple OR Microsoft, site:reuters.com AI, intitle:earnings
Publisher/domainReuters, BBC, cnn.com, reuters.com, Google News publication URL
Topic feedTechnology, Business, Sports, Health, World
Local newsNew York, London, Tokyo, California
Google News URLGoogle News search, topic, publication, section, or RSS URL
Full Coverage storystoryUrl, /stories/... path, or Google News story ID
Top headlinesTurn on Also include top headlines

๐Ÿ”” New Article Alerts

Use New Article Alerts when you want scheduled runs that return only articles not seen before.

Example alert setup:

  • Add two Search query rows: Acme Corp and Acme CEO
  • Enable New Article Alerts
  • Alert name: Acme daily monitor
  • Articles to return: 100
  • Time period: Last 24 hours

Recommended Apify setup:

  1. Test the Actor once with your search input.
  2. Save the configuration as an Apify Task, or add this Actor directly to an Apify Schedule.
  3. Keep New Article Alerts enabled and keep the same Alert name for the same alert.
  4. Attach an Apify Schedule for recurring runs and a webhook/integration if you want notifications or downstream automation.

The first scheduled run returns current matches and remembers them for that Alert name. Later runs with the same Alert name return only articles the actor has not returned before. Use Send all matches again when testing or when you intentionally want to reset the alert.

To run alerts automatically, save the input as an Apify Task and attach an Apify Schedule. Add a webhook if you want to send new articles to another app or endpoint.

๐Ÿงญ Sources

Use Source rows to combine several collection targets in one run. Most users only need one Search query row. Add publisher, topic, local news, Google News URL, or Full Coverage rows when you want those specific feeds too.

Turn on Also include top headlines only when you also want the selected country/language front page. Turn on New Article Alerts only for scheduled runs that should return articles not seen before.

Keyword searches support common Google operators such as exact phrases, OR, -exclude, site:domain.com, and intitle:term. You can combine several source types in one run by adding multiple source rows, and Total run limit caps the output across all enabled sources.

โœจ Data Included by Default

Normal runs already return clean article rows with headline, snippet, publisher, publication date, Google News URL, source context, country/language, and the best article URL the actor can resolve.

When Google News exposes them, rows can also include thumbnails, Full Coverage links, related RSS articles, and social post cards inside Full Coverage stories.

Extra Data is on by default. You can turn these off only when you want a faster, cheaper feed-only run:

  • Fetch descriptions and authors adds publisher-page canonical URL, site name, author, article description, page language, and better image metadata when available.
  • Fetch full article text adds best-effort article text, word count, publisher-page dates, section, and tags when the publisher page allows extraction.
  • Detect companies and tickers adds company, stock ticker, and crypto ticker signals when there is enough signal in the article data.

Publisher-page data is best-effort. Paywalls, consent pages, bot blocks, publisher downtime, and JavaScript-heavy pages can leave these fields empty. The row is still kept with its Google News metadata.

๐Ÿ“ฆ Output

Each dataset item is one article-like Google News result.

Common fields:

FieldMeaning
titleArticle headline.
descriptionGoogle News card or RSS snippet when available.
source / sourceDomainPublisher name and normalized domain.
urlBest article URL. When URL resolving succeeds, this is the direct publisher URL.
publisherUrlDirect publisher article URL when resolved.
googleNewsUrlOriginal Google News result URL.
publishedAtPublication time in UTC.
imageUrlThumbnail URL when available.
storyUrlGoogle News Full Coverage URL when available.
platform / socialUrl / socialAuthorPresent only for Full Coverage social_post rows when Google exposes X/Twitter cards.
search / sourceTypeThe keyword, topic, location, URL, or monitoring source that produced the row.
isNew / monitorNamePresent in monitoring runs.

When description enrichment is enabled, the actor may also add canonicalUrl, siteName, publisherName, articleDescription, articleLanguage, and author.

When full-text extraction is enabled, the actor may also add fullText, wordCount, articleExtractionMethod, articlePublishedAt, articleModifiedAt, articleSection, and articleTags. Internal diagnostic fields are kept out of the default dataset so normal exports stay clean.

๐Ÿงฐ Filters

Use Search only these publishers when you want Google News itself to search only selected publishers. Enter one value per line, for example reuters.com and bbc.com.

Use publisher include/exclude filters when you want to keep or remove specific resolved publisher domains after scraping.

Use Exclude words for words or phrases such as opinion, sponsored, or press release.

๐Ÿงต Full Coverage and Publishers

Top headlines, topic, and keyword runs may return a storyUrl. Paste that value into ๐Ÿงต Full Coverage to collect the related articles in the same Google News story cluster. If Google News shows X/Twitter cards inside that story page, the actor can include them as social post rows without making separate X/Twitter requests.

Example: add a Source row with Source type Full Coverage story, then paste a storyUrl from a previous result or a Google News story URL.

For a publisher feed, enter a publisher name or domain such as Reuters, CNN, BBC, or reuters.com. You can also paste a Google News /publications/... URL or ID into ๐Ÿข Publisher feed.

Example: add a Source row with Source type Publisher/domain and value Reuters.

โš ๏ธ Notes

  • Google News ranking and availability vary by country, language, query, and time.
  • A single Google News RSS feed usually exposes about 100 results. Large keyword and publisher/domain requests are split into daily windows, but the final count depends on how many unique results Google exposes. Top headlines, topics, local feeds, pasted URLs, and Full Coverage stories are limited by what that Google News surface exposes.
  • Some articles do not expose thumbnails, summaries, authors, Full Coverage links, or social post cards.
  • Direct URL resolving depends on Google News redirect behavior and may occasionally fail.
  • This actor is not affiliated with Google. Users are responsible for lawful use and respecting publisher terms.
  • Need help with a run? Open the Issues tab and include the run ID, input source, country/language edition, and expected result count.