Ultimate News Scraper - Rise of the Phoenix

Pricing

from $4.50 / 1,000 results

Powerful Apify news scraper for real-time and historical article extraction across 800+ global publishers. Built with smart fallback crawling (Scrapling, PyDoll, Selenium), category targeting, proxy support, and clean JSON output with error analytics for reliable, scalable intelligence pipelines.

Developer

Inus Grobler

Maintained by Community

Global News Scraper for Apify - Current + Historical Article Extraction

Extract structured news articles at scale from a large global publisher catalog using a resilient multi-backend pipeline (scrapling -> pydoll -> selenium).

This Apify Actor is built for teams that need reliable news scraping, historical news backfills, and structured article datasets for analytics, monitoring, AI pipelines, OSINT workflows, and research.

Why this Apify Actor

  • Scrapes current headlines or runs deep historical backfills from tracked news websites.
  • Targets all catalog sites, specific sites, or specific site-category URLs.
  • Automatically falls back across multiple fetch/extraction backends for better resilience.
  • Produces normalized article records in the default dataset.
  • Writes scrape failures and diagnostics to a dedicated error-log dataset.
  • Supports Apify Proxy and custom proxy URLs for difficult domains.
  • Uses URL-hash-based item caching to reduce repeated processing.
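
The URL-hash caching mentioned above can be sketched as follows. Using MD5 over the raw article URL is an assumption based on the `url_hash` field in the example output, not a confirmed implementation detail:

```python
import hashlib

def url_hash(article_url: str) -> str:
    """Stable cache key for an article URL (assumed: MD5 of the raw URL)."""
    return hashlib.md5(article_url.encode("utf-8")).hexdigest()

seen: set[str] = set()

def is_new(article_url: str) -> bool:
    """Return True the first time a URL is seen, False on repeats."""
    h = url_hash(article_url)
    if h in seen:
        return False
    seen.add(h)
    return True
```

A cache like this lets repeated runs skip articles that were already pushed to the dataset.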

Best use cases

  • Media monitoring and competitive intelligence
  • News aggregation and content intelligence pipelines
  • Historical event datasets for LLM/RAG ingestion
  • Topic tracking by website category
  • Regional and multilingual news collection

How it works

  1. Normalize Actor input into a validated runtime config.
  2. Resolve proxy settings (Apify Proxy or custom proxy URLs).
  3. Build a stable cache key from target scope (sites + categories + mode).
  4. Run scraper pipeline with fallback fetchers.
  5. Push successful items to the default dataset.
  6. Push error telemetry to the error-log dataset when available.
  7. Store run summary in key-value store record OUTPUT.
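
Steps 4-6 above can be pictured as a minimal fallback loop. The backend names mirror the documented pipeline order; the function signatures are hypothetical:

```python
from typing import Callable, Optional

def fetch_with_fallback(url: str,
                        backends: list[tuple[str, Callable[[str], Optional[dict]]]]):
    """Try each backend in order; return the first successful item plus
    the error telemetry collected from the backends that failed."""
    errors = []
    for name, fetch in backends:
        try:
            item = fetch(url)
            if item:
                item["scraping_tool"] = name  # matches the output field of that name
                return item, errors
        except Exception as exc:
            errors.append({"backend": name, "url": url, "error": str(exc)})
    return None, errors
```

On success the item goes to the default dataset; the accumulated `errors` list corresponds to what lands in the error-log dataset.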

Input reference

Use these input keys in your Apify run:

  • sites_to_scrape (array[string], optional): Select one or more active sites. If omitted, defaults to ["AP News"]. If passed as an empty array, the Actor scrapes all active catalog sites.
  • categories_to_scrape (array[string], optional): Category overrides in the format `Site Name|||Category URL`.
  • execution_mode (string, required): `current` or `historic`. Defaults to `current`.
  • historic_cutoff_date (string, required in historic mode): ISO-8601 cutoff (example: 2025-01-01T00:00:00Z).
  • historic_max_pages_per_category (integer, optional): Max pagination depth per category in historic mode.
  • max_items_per_site (integer, optional): Per-site cap when no_items_limit is false. Default 1.
  • no_items_limit (boolean, optional): If true, ignores max_items_per_site.
  • proxy_config (object, optional): Apify Proxy or custom proxy URLs for better reliability.
  • site_category_filters (array[object], optional): Advanced legacy override. Prefer categories_to_scrape.

Quick start

1) Current news scrape (selected sites)

{
  "sites_to_scrape": ["Reuters", "Gulf News", "AP News"],
  "execution_mode": "current",
  "max_items_per_site": 50,
  "no_items_limit": false,
  "proxy_config": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
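
The same run input can be assembled programmatically, which is handy when the site list comes from configuration. The helper name below is just for illustration; field names follow the input reference above:

```python
import json

def build_current_input(sites, max_items=50, residential_proxy=True):
    """Build a 'current' mode run input for this Actor."""
    proxy = {"useApifyProxy": True}
    if residential_proxy:
        proxy["apifyProxyGroups"] = ["RESIDENTIAL"]
    return {
        "sites_to_scrape": list(sites),
        "execution_mode": "current",
        "max_items_per_site": max_items,
        "no_items_limit": False,
        "proxy_config": proxy,
    }

run_input = build_current_input(["Reuters", "Gulf News", "AP News"])
print(json.dumps(run_input, indent=2))
```

The resulting dict can then be submitted with the apify-client package via `client.actor("<actor-id>").call(run_input=run_input)`, substituting the Actor ID shown on this Store page.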

2) Historical news scraping (backfill)

{
  "sites_to_scrape": ["The Punch", "The Guardian UK"],
  "execution_mode": "historic",
  "historic_cutoff_date": "2025-01-01T00:00:00Z",
  "historic_max_pages_per_category": 100,
  "no_items_limit": true,
  "proxy_config": {
    "useApifyProxy": true
  }
}
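
How the cutoff is applied is internal to the Actor, but the date gate can be pictured like this. It assumes articles dated before historic_cutoff_date are excluded (or flagged via the `cutoff_filtered` output field):

```python
from datetime import datetime

def parse_iso8601(value: str) -> datetime:
    """Parse an ISO-8601 timestamp; the 'Z' suffix is normalized for
    compatibility with Python versions older than 3.11."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))

def is_before_cutoff(date_published: str, cutoff: str) -> bool:
    """True when an article predates the historic cutoff."""
    return parse_iso8601(date_published) < parse_iso8601(cutoff)
```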

3) Category-targeted scraping

{
  "sites_to_scrape": ["Reuters", "Gulf News"],
  "categories_to_scrape": [
    "Reuters|||https://www.reuters.com/world/",
    "Reuters|||https://www.reuters.com/business/",
    "Gulf News|||https://gulfnews.com/business"
  ],
  "execution_mode": "current",
  "max_items_per_site": 100
}
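
Each categories_to_scrape entry packs a site name and a category URL with a "|||" separator. A tiny helper (the name is hypothetical) can validate entries before launching a run:

```python
def parse_category_override(entry: str) -> tuple[str, str]:
    """Split a 'Site Name|||Category URL' override into its two parts,
    rejecting malformed entries early."""
    site, sep, url = entry.partition("|||")
    if not sep or not url.startswith("http"):
        raise ValueError(f"bad category override: {entry!r}")
    return site, url
```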

Output

Default dataset

Successful article records are pushed to the default dataset.

Typical fields include:

  • site_name, country, region, language
  • article_title, author, article_body, tags
  • date_published, article_url, url_hash
  • main_image_url, seo_description
  • scraped_at, scraping_tool, execution_mode
  • category_url, source_html_lang, cutoff_filtered

Example item:

{
  "site_name": "Reuters",
  "country": "United Kingdom",
  "region": "Europe",
  "language": "en",
  "article_title": "Sample headline",
  "author": "Editorial Team",
  "article_body": "Full normalized article text...",
  "tags": ["markets", "energy"],
  "date_published": "2026-03-20T10:15:00Z",
  "article_url": "https://www.reuters.com/world/example-story/",
  "url_hash": "d41d8cd98f00b204e9800998ecf8427e",
  "main_image_url": "https://example.com/image.jpg",
  "seo_description": "Summary description",
  "scraped_at": "2026-03-20T10:20:00Z",
  "scraping_tool": "scrapling",
  "execution_mode": "historic",
  "category_url": "https://www.reuters.com/world/",
  "source_html_lang": "en",
  "cutoff_filtered": false
}
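
Dataset items shaped like the example above are easy to post-process downstream. This sketch groups article URLs by site_name, e.g. after fetching items with apify-client's `client.dataset(dataset_id).iterate_items()`:

```python
from collections import defaultdict

def group_by_site(items):
    """Map each site_name to the list of article URLs scraped from it."""
    grouped = defaultdict(list)
    for item in items:
        grouped[item["site_name"]].append(item["article_url"])
    return dict(grouped)
```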

Additional run artifacts

  • Named dataset error-log: extraction/fetch failures and fallback diagnostics
  • Key-value store record OUTPUT: run summary (successItemCount, errorItemCount, mode, selected scope)
  • Output tab links (configured in .actor/output_schema.json):
    • default dataset items
    • overview dataset view
    • OUTPUT record
    • run API details
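
The OUTPUT record can be retrieved with apify-client's `client.key_value_store(store_id).get_record("OUTPUT")`. Its summary fields can also be recomputed locally, as in this sketch; the selectedSites key name is an assumption, while successItemCount, errorItemCount, and mode follow the fields listed above:

```python
def build_run_summary(success_items, error_items, mode, sites):
    """Recompute an OUTPUT-style run summary from item lists."""
    return {
        "successItemCount": len(success_items),
        "errorItemCount": len(error_items),
        "mode": mode,
        "selectedSites": list(sites),  # hypothetical key name
    }
```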

Data quality and reliability notes

  • Website markup changes can affect extraction quality for specific sources.
  • Protected sites may require proxy routing for stable results.
  • Historical runs can be large; use historic_max_pages_per_category and/or max_items_per_site for faster, more controlled runs.
  • Backend fallback improves resilience but can increase runtime on difficult pages.

Performance tips

  • Use execution_mode: "current" for recurring monitoring.
  • Use execution_mode: "historic" + historic_cutoff_date for backfills.
  • Keep site scope narrow during testing before large runs.
  • Enable proxy_config.useApifyProxy for better success rates on anti-bot-protected domains.

SEO keywords

Apify news scraper, global news scraping, historical news scraper, real-time news scraping, article extraction API, structured news dataset, media monitoring scraper, web scraping for journalism, multilingual news scraping, category-based news scraping.

Compliance reminder

Use this Actor responsibly and in line with each target website's terms, robots directives, and applicable laws and regulations in your jurisdiction.