New York Times News Scraper

Pricing

from $5.00 / 1,000 results

Extracts full New York Times articles while successfully bypassing partial-render paywalls to ensure complete content delivery.

Developer

Xtractoo

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 14
Monthly active users: 8
Last modified: 5 hours ago

New York Times Article Scraper

Extracts the full article text, headline, byline, authors, publication date, categories, tags, and metadata from nytimes.com article URLs, including content behind the metered soft paywall.

Why Use This Actor?

  • Bypasses soft paywall - extracts full NYT article text available to web visitors.
  • Full article extraction - retrieves the complete article text (10,000+ characters), including long-form investigative pieces.
  • Rich metadata - extracts authors, published/modified dates, categories, tags, and description.
  • Investigative journalism research - NYT investigative pieces are a widely cited source for in-depth US stories.
  • Cultural coverage - NYT covers books, film, food, and culture alongside hard news.

How It Works

This Actor uses only HTTP requests: no browser, no Selenium, no Playwright. It bypasses Cloudflare protection without a real browser. Articles are extracted in seconds, with RAM usage well under 256 MB.

Input

{
  "url": "https://www.nytimes.com/2026/04/12/technology/example-article.html",
  "urls": [
    "https://www.nytimes.com/2026/04/12/us/politics/article-one.html",
    "https://www.nytimes.com/2026/04/11/world/article-two.html"
  ],
  "mode": "article",
  "limit": 10
}
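As a rough illustration, an input like the one above could be assembled from a list of candidate links before starting a run. The helper name `build_article_input` and the rule that only `https://www.nytimes.com` links are accepted are assumptions made for this sketch, not documented behavior of the Actor; `limit` is simply passed through as in the sample.

```python
from urllib.parse import urlparse


def build_article_input(urls, limit=10):
    """Build an article-mode run input from a list of candidate URLs.

    Hypothetical helper: the host check below is an assumption of this
    sketch, not a rule taken from the Actor itself.
    """
    cleaned = []
    for raw in urls:
        url = raw.strip()
        parts = urlparse(url)
        # Assumption: only https links on www.nytimes.com are valid targets.
        if parts.scheme != "https" or parts.hostname != "www.nytimes.com":
            continue
        if url not in cleaned:  # drop duplicates, keep first-seen order
            cleaned.append(url)
    if not cleaned:
        raise ValueError("no nytimes.com article URLs supplied")
    return {"mode": "article", "urls": cleaned, "limit": limit}


if __name__ == "__main__":
    print(build_article_input([
        "https://www.nytimes.com/2026/04/12/us/politics/article-one.html",
        "https://www.nytimes.com/2026/04/11/world/article-two.html",
    ]))
```

Filtering and de-duplicating before the run keeps obviously invalid links out of the input.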

Output

{
  "url": "https://www.nytimes.com/2026/05/15/nyregion/hezbollah-arrest-nyc-jewish-targets.html",
  "source": "New York Times",
  "title": "Kataib Hezbollah Commander Accused of Planning Attacks on N.Y.C.",
  "description": "Mohammad Baqer Saad Dawood Al-Saadi is accused of directing and urging others to attack American and Israeli interests. Prosecutors say he is a leader of Kataib Hezbollah, an Iraqi militia with ties to Iran.",
  "content": "A commander of an Iran-backed militia has been charged with plotting to attack Jewish sites in the United States, including a synagogue in New York City, federal authorities said. A criminal complaint unsealed on Friday accused the commander, Mohammad Baqer Saad Dawood Al-Saadi, of planning at least 18 attacks in Europe and Canada since late February, in retaliation for the United States and Israel’s attacks on Iran. Mr....",
  "image": "https://static01.nyt.com/newsgraphics/images/icons/defaultPromoCrop.png",
  "language": "en",
  "word_count": 313,
  "published_date": "2026-05-15T16:54:16.351Z",
  "modified_date": "",
  "authors": [],
  "categories": "",
  "tags": ""
}
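Several fields in the sample above come back as empty strings, and the dates are ISO 8601 strings. A minimal post-processing sketch, assuming records shaped like that sample, might parse the dates and turn empty strings into `None`. The function names `parse_iso` and `normalize_article` are hypothetical and not part of the Actor.

```python
from datetime import datetime


def parse_iso(value):
    """Return a timezone-aware datetime for an ISO 8601 string, or None if empty."""
    if not value:
        return None
    # Swap the trailing "Z" for an explicit offset so that
    # datetime.fromisoformat also accepts it on Python versions before 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def normalize_article(record):
    """Copy a record, parsing both date fields and mapping '' to None."""
    out = dict(record)
    out["published_date"] = parse_iso(record.get("published_date"))
    out["modified_date"] = parse_iso(record.get("modified_date"))
    for key in ("description", "image", "categories", "tags"):
        if out.get(key) == "":
            out[key] = None
    return out


if __name__ == "__main__":
    sample = {
        "title": "Example",
        "published_date": "2026-05-15T16:54:16.351Z",
        "modified_date": "",
        "authors": [],
        "categories": "",
        "tags": "",
    }
    print(normalize_article(sample))
```

The sketch does not guess at the format of non-empty `categories` or `tags` values, since the sample only shows them empty.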

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs and titles from New York Times instead of extracting a single article.

Input:

{
  "mode": "latest",
  "limit": 10
}

Output - array of objects:

[
  {
    "url": "https://www.nytimes.com/2026/04/20/world/asia/asia-pacific-iran-war-oil.html",
    "title": "The Iran War Sent Shock Waves Through Asia That Are Likely to Spread",
    "published_date": "Mon, 20 Apr 2026 05:39:43 +0000",
    "source": "New York Times"
  }
  //...
]

Source: https://rss.nytimes.com/services/xml/rss/nyt/HomePage.xml (RSS feed)
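To make the mapping from feed to output concrete, the sketch below turns RSS `<item>` elements into records shaped like the latest-mode output above, and shows that the `published_date` string can be parsed with Python's standard library. It is only an illustration under assumptions: the inline `SAMPLE_RSS` feed and the function name `latest_from_rss` are made up for this example, and the Actor's actual implementation is not shown in this document.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Hypothetical, hand-written feed used only for this sketch.
SAMPLE_RSS = """<rss version="2.0"><channel>
  <title>Sample feed</title>
  <item>
    <title>The Iran War Sent Shock Waves Through Asia That Are Likely to Spread</title>
    <link>https://www.nytimes.com/2026/04/20/world/asia/asia-pacific-iran-war-oil.html</link>
    <pubDate>Mon, 20 Apr 2026 05:39:43 +0000</pubDate>
  </item>
  <item>
    <title>Article Two</title>
    <link>https://www.nytimes.com/2026/04/11/world/article-two.html</link>
    <pubDate>Sat, 11 Apr 2026 12:00:00 +0000</pubDate>
  </item>
</channel></rss>"""


def latest_from_rss(xml_text, limit=10, source="New York Times"):
    """Map the first `limit` RSS items to latest-mode style records."""
    channel = ET.fromstring(xml_text).find("channel")
    records = []
    for item in channel.findall("item")[:limit]:
        records.append({
            "url": item.findtext("link", default="").strip(),
            "title": item.findtext("title", default="").strip(),
            "published_date": item.findtext("pubDate", default="").strip(),
            "source": source,
        })
    return records


if __name__ == "__main__":
    for record in latest_from_rss(SAMPLE_RSS, limit=1):
        # RSS dates use the RFC 822 style, which email.utils can parse.
        when = parsedate_to_datetime(record["published_date"])
        print(record["title"], when.isoformat())
```

Note that latest-mode dates use the RSS date style, while article-mode dates in the earlier sample are ISO 8601, so a combined dataset needs both parsers.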

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

  1. Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI ▸ Schedules ▸ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
  2. Pass the dataset of the latest run to another Actor run with mode: "article" and the new URLs as input. Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
  3. The article-mode run extracts the full body, image, authors, and metadata for each URL and appends to your master dataset.
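If you do want a small piece of glue between the two runs, for example to skip URLs that were already extracted, step 2 could be sketched as follows. The function name `next_article_input` and the `seen_urls` set are hypothetical; how that set is persisted between runs is left open here.

```python
def next_article_input(latest_items, seen_urls):
    """Return an article-mode input for URLs not processed before, or None.

    `latest_items` is a list of latest-mode records (dicts with a "url" key).
    `seen_urls` is a set that this function updates in place; storing it
    between scheduled runs is up to the caller (an assumption of this sketch).
    """
    new_urls = []
    for item in latest_items:
        url = item.get("url")
        if url and url not in seen_urls:
            seen_urls.add(url)
            new_urls.append(url)
    if not new_urls:
        return None  # nothing new, so no article-mode run is needed
    return {"mode": "article", "urls": new_urls}


if __name__ == "__main__":
    seen = set()
    latest = [
        {"url": "https://www.nytimes.com/2026/04/20/world/asia/asia-pacific-iran-war-oil.html"},
    ]
    print(next_article_input(latest, seen))  # first call: one new URL
    print(next_article_input(latest, seen))  # second call: nothing new
```

Returning `None` when nothing is new lets the caller skip the article-mode run entirely.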

Common cron expressions:

| Frequency | Cron |
| --- | --- |
| Every 15 minutes | */15 * * * * |
| Hourly | 0 * * * * |
| Every 6 hours | 0 */6 * * * |
| Daily at 06:00 UTC | 0 6 * * * |
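To check what these expressions mean in practice, the five fields (minute, hour, day of month, month, day of week) can be tested against a timestamp. The sketch below is a deliberately small matcher that understands only `*`, `*/n`, and a plain number, which is enough for the expressions listed here. It is not a full cron parser and is not how Apify Schedules evaluates expressions.

```python
from datetime import datetime


def field_matches(field, value, start=0):
    """Match one cron field; supports only '*', '*/n', and a plain number."""
    if field == "*":
        return True
    if field.startswith("*/"):
        # Steps count from the first value of the field's range.
        return (value - start) % int(field[2:]) == 0
    return value == int(field)


def cron_matches(expr, when):
    """Return True if the five-field expression fires at datetime `when`."""
    minute, hour, day, month, weekday = expr.split()
    # Cron counts Sunday as 0; datetime.weekday() counts Monday as 0.
    cron_weekday = (when.weekday() + 1) % 7
    # Simplification: all fields must match. Standard cron treats the two
    # day fields specially when both are restricted; that case is not covered.
    return (
        field_matches(minute, when.minute)
        and field_matches(hour, when.hour)
        and field_matches(day, when.day, start=1)
        and field_matches(month, when.month, start=1)
        and field_matches(weekday, cron_weekday)
    )


if __name__ == "__main__":
    moment = datetime(2026, 4, 20, 6, 0)  # 06:00 on 20 April 2026
    for expr in ("*/15 * * * *", "0 * * * *", "0 */6 * * *", "0 6 * * *"):
        print(expr, cron_matches(expr, moment))
```

At 06:00 all four expressions match, which shows that the more frequent schedules include the less frequent ones as special cases.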

Other News Actors

Need a different news source? All actors in this collection:

| Actor | Source |
| --- | --- |
| aljazeera-scraper | Al Jazeera |
| apnews-scraper | AP News |
| bbc-scraper | BBC News |
| cnbc-scraper | CNBC |
| forbes-scraper | Forbes |
| fortune-scraper | Fortune |
| ft-scraper | Financial Times |
| guardian-scraper | The Guardian |
| msn-scraper | MSN News |
| nytimes-scraper | New York Times |
| reuters-scraper | Reuters |
| scmp-scraper | South China Morning Post |
| techcrunch-scraper | TechCrunch |
| upi-scraper | UPI |
| yahoo-finance-scraper | Yahoo Finance |
| smart-news-loader | Any URL - adaptive HTTP loader |
| bloomberg-scraper | Bloomberg |

All actors support mode: "latest" for fetching newest article URLs from each source.