Nikkei Asia Article Scraper avatar

Nikkei Asia Article Scraper

Pricing

from $2.00 / 1,000 results

Go to Apify Store
Nikkei Asia Article Scraper

Nikkei Asia Article Scraper

Extract full article text, headline, authors and metadata from any asia.nikkei.com URL. HTTP-only, no browser. Supports `mode: latest` to pull newest headlines from the Nikkei Asia RSS feed.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Xtractoo

Xtractoo

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Categories

Share

Extract full article text, headline, authors, dates and metadata from any asia.nikkei.com article URL. Nikkei Asia is the English-language sister of Japan's leading business newspaper, covering Asia-Pacific business, politics, technology, and finance.

Why Use This Actor?

  • Asia-Pacific business intelligence — Nikkei Asia is the canonical English source for Japanese, Chinese, and ASEAN corporate news.
  • FT-quality reporting — owned by Nikkei Inc. (also the parent of the Financial Times), with the same editorial standards.
  • Live coverage support — handles Spotlight live blogs (e.g. the Trump-Xi summit) as cleanly as standard articles.
  • Free tier coverage — most articles return full body via HTTP-only extraction; premium articles fall back to intro with a warning.

How It Works

This actor uses only HTTP requests — no browser, no Selenium, no Playwright. Article body lives in <div class="ezrichtext-field articleBody"> and is extracted directly with a paragraph filter that strips ads, share blocks, and related-stories noise.

Input

{
"url": "https://asia.nikkei.com/spotlight/trump-xi-summit/example-article",
"urls": [
"https://asia.nikkei.com/article/one",
"https://asia.nikkei.com/article/two"
],
"mode": "article",
"limit": 10
}

Output

{
"url": "https://asia.nikkei.com/spotlight/trump-xi-summit/trump-xi-summit-live-us-president-discusses-outcomes-aboard-air-force-one",
"source": "Nikkei Asia",
"title": "Trump-Xi summit live: US president discusses outcomes aboard Air Force One",
"description": "Trump's day-long visit to Beijing brought few concrete deals but de-escalated tensions.",
"content": "Aboard Air Force One on Friday, US President Donald Trump reflected on his summit with Chinese President Xi Jinping in Beijing the previous day...",
"image": "https://images.ft.com/v3/image/raw/...",
"language": "en_GB",
"word_count": 4868,
"published_date": "2026-05-15T12:29:10.000Z",
"modified_date": "",
"authors": ["Staff Writer"],
"categories": "Spotlight",
"tags": ""
}

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs and titles from Nikkei Asia instead of extracting a single article.

Input:

{
"mode": "latest",
"limit": 10
}

Output — array of objects:

[
{
"url": "https://asia.nikkei.com/spotlight/example-newest-article",
"title": "US stocks fall after Trump-Xi summit leaves investors 'underwhelmed'",
"published_date": "Fri, 15 May 2026 12:29:10 GMT",
"source": "Nikkei Asia"
}
]

Source: https://asia.nikkei.com/rss/feed/nar (RSS feed)

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

  1. Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI ▸ Schedules ▸ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
  2. Webhook the dataset of the latest run into another Actor run with mode: "article" and the new URLs as input — Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
  3. The article-mode run extracts the full body, image, authors, and metadata for each URL and appends to your master dataset.

Common cron expressions:

FrequencyCron
Every 15 minutes*/15 * * * *
Hourly0 * * * *
Every 6 hours0 */6 * * *
Daily at 06:00 UTC0 6 * * *

Notes

  • Free articles return the full body; premium articles return the intro paragraphs with warning: "Nikkei Asia paywall - only intro available".
  • Works on regular articles AND /spotlight/* live blogs.