Pricing

from $20.00 / 1,000 results

Barron's Article Scraper

Extract Barron's articles (barrons.com) - title, body, authors and metadata. Fast, HTTP-only and no cookies required. Mode `latest` scrapes the homepage for the newest article URLs.

Rating

5.0

(1)

Developer

Farhan Febrian Nauval

Actor stats

Last modified

2 months ago

Why Use This Actor?

US markets coverage — Barron's is a primary venue for buy-side commentary and "next week's market" pieces.
Same CMS as WSJ — Barron's runs on the Dow Jones platform, so the article shape is identical to wsj.com (good for normalised cross-DJ pipelines).
DataDome bypass — actor clears DataDome anti-bot via primp (Rust rquest TLS stack) with a rotating profile pool.

How It Works

Barron's has the same two layers of protection as WSJ:

DataDome: returns a 401 response with a datadome cookie unless the TLS fingerprint matches a real browser. The actor gets past this with plain HTTP requests only; no browser, Selenium, or Playwright is involved.
Subscriber paywall: only subscribers get the full body, in articleData.flattenedBody. Without a subscription, the actor extracts the visible snippet and supplements it with the auto-generated bullet summary and the SEO summary.
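
The first layer can be recognised from the response alone. The following is a minimal sketch, assuming the block appears as status 401 together with a Set-Cookie header that names a datadome cookie, as described above. The function name and the header-dict shape are hypothetical and are not taken from the actor's code.

```python
def is_datadome_block(status: int, headers: dict[str, str]) -> bool:
    """Return True when a response looks like the DataDome block described above."""
    # Header names are case-insensitive, so normalise them before the lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    set_cookie = lowered.get("set-cookie", "")
    # Assumption: the block is a 401 that also sets a cookie named "datadome".
    return status == 401 and "datadome=" in set_cookie.lower()
```

A caller could use a check like this to decide when to retry with a different TLS profile or proxy session, although how the actor itself reacts is not documented here.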

Proxy requirements

A US residential proxy is required — DataDome blocks both datacenter IPs and non-US residential pools. The actor reads proxyConfiguration from input and uses it for every fetch:

{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"],
    "apifyProxyCountry": "US"
  }
}

This is set as the default in the input schema.

Input

{
  "url": "https://www.barrons.com/articles/example-article-daf7f9aa",
  "urls": [
    "https://www.barrons.com/articles/article-one"
  ],
  "mode": "article",
  "limit": 10
}
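
For callers who assemble the input programmatically, a small builder can catch mistakes before a run starts. This is a sketch only: the field names come from the examples on this page, while the helper name and its validation rules (article mode needs at least one URL, limit must be positive) are assumptions rather than documented behaviour.

```python
import json

VALID_MODES = {"article", "latest"}


def build_run_input(mode="article", urls=None, limit=10):
    """Assemble an input dict from the fields shown in the examples on this page."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {sorted(VALID_MODES)}, got {mode!r}")
    if limit < 1:
        # Assumption: a non-positive limit is never meaningful.
        raise ValueError("limit must be a positive integer")
    run_input = {
        "mode": mode,
        "limit": limit,
        # Same proxy settings as the default shown under "Proxy requirements".
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
            "apifyProxyCountry": "US",
        },
    }
    if mode == "article":
        if not urls:
            # Assumption: article mode has nothing to do without URLs.
            raise ValueError("article mode needs at least one URL")
        run_input["urls"] = list(urls)
    return run_input


print(json.dumps(build_run_input(urls=["https://www.barrons.com/articles/article-one"]), indent=2))
```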

Output

{
  "url": "https://www.barrons.com/articles/prediction-markets-conference-las-vegas-casino-daf7f9aa",
  "source": "Barron's",
  "title": "Casino Cancels Prediction Markets Conference. Not Everything Can Stay in Vegas.",
  "description": "Nevada's gambling regulator has taken a strong stance against prediction markets.",
  "content": "What happens in Vegas, stays in Vegas—or so the slogan goes. But it seems prediction markets can't stay at all.\n\nKey points:\n- The Predict 2026 conference in Las Vegas was canceled by Aria casino due to concerns over its gambling license.\n- Nevada regulators have taken a strong stance against prediction markets.\n- Prediction markets claim federal regulation, but some states view them as illegal gambling.\n\nThe Aria Resort and Casino canceled an upcoming predictions market conference. Nevada's gambling regulator has taken a strong stance against prediction markets.",
  "image": "https://images.barrons.com/im-12787006",
  "language": "en_US",
  "word_count": 89,
  "full_word_count": 636,
  "full_paragraph_count": 13,
  "published_date": "2026-05-14T19:06:00Z",
  "modified_date": "2026-05-15T00:36:00Z",
  "authors": ["Nick Devor"],
  "categories": "Daily",
  "tags": "",
  "warning": "Barron's paywall - only snippet extracted; full body is subscriber-only"
}
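
The word_count, full_word_count, and warning fields make it possible to tell how much of an article a record actually contains. The sketch below works on a record shaped like the example output. The helper names are hypothetical, and treating a non-empty warning as the paywall signal is an assumption based on that single example.

```python
def snippet_coverage(item: dict) -> float:
    """Fraction of the full article's words that are present in `content`."""
    full = item.get("full_word_count") or 0
    if full <= 0:
        return 0.0
    return item.get("word_count", 0) / full


def is_snippet_only(item: dict) -> bool:
    # Assumption: the "warning" field is set only when the paywall cut the body.
    return bool(item.get("warning"))


record = {
    "word_count": 89,
    "full_word_count": 636,
    "warning": "Barron's paywall - only snippet extracted; full body is subscriber-only",
}
print(f"{snippet_coverage(record):.0%}")  # 89 of 636 words, prints "14%"
print(is_snippet_only(record))            # prints "True"
```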

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs from Barron's homepage. The official RSS endpoint (feeds.a.dj.com/rss/BarronsFront.xml) returns 403, so we scrape barrons.com's homepage HTML and collect URLs matching the article pattern.

Input:

{
  "mode": "latest",
  "limit": 10
}

Output — array of objects:

[
  {
    "url": "https://www.barrons.com/articles/example-newest-article-664c6761",
    "title": "Treasury Yields Slip as Fed Rate Cut Bets Grow",
    "source": "Barron's"
  }
]

Source: https://www.barrons.com/ (homepage scraping via primp — no public RSS).
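
Collecting URLs that match the article pattern can be sketched with the standard library alone. The regular expression below is an assumption inferred from the example URLs on this page (a /articles/ path followed by a slug); the actor's real pattern is not documented. Dropping query strings and de-duplicating are likewise choices made for this illustration.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit

# Assumed pattern, inferred from the example URLs on this page.
ARTICLE_PATH = re.compile(r"^/articles/[A-Za-z0-9-]+$")


class _LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in document order."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def extract_article_urls(html, limit=10, base="https://www.barrons.com/"):
    """Return up to `limit` unique article URLs found in the homepage HTML."""
    collector = _LinkCollector()
    collector.feed(html)
    found = []
    for href in collector.hrefs:
        if len(found) >= limit:
            break
        parts = urlsplit(urljoin(base, href))  # resolve relative links
        if parts.netloc != "www.barrons.com" or not ARTICLE_PATH.match(parts.path):
            continue
        url = f"https://www.barrons.com{parts.path}"  # query string dropped
        if url not in found:
            found.append(url)
    return found
```

Fetching the homepage itself is left out, since a plain HTTP client would hit the DataDome block described under "How It Works".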

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI ▸ Schedules ▸ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
Feed the dataset of the latest run into another Actor run with mode: "article" and the new URLs as input. Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
The article-mode run extracts the snippet body, image, authors, and metadata for each URL and appends them to your master dataset.
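
If the chaining is done in your own code rather than through an integration, the hand-off between the two modes amounts to turning latest-mode items into an article-mode input. The sketch below shows one way to do that. The function name and the idea of tracking previously seen URLs are hypothetical additions, not features of the actor.

```python
def article_input_from_latest(latest_items, seen_urls=frozenset()):
    """Build an article-mode input from latest-mode dataset items.

    `seen_urls` is a hypothetical set of URLs already scraped in earlier runs.
    """
    new_urls = []
    for item in latest_items:
        url = item.get("url")
        # Skip items without a URL, URLs handled before, and duplicates.
        if url and url not in seen_urls and url not in new_urls:
            new_urls.append(url)
    return {"mode": "article", "urls": new_urls}


latest_items = [
    {"url": "https://www.barrons.com/articles/example-newest-article-664c6761",
     "title": "Treasury Yields Slip as Fed Rate Cut Bets Grow",
     "source": "Barron's"},
]
print(article_input_from_latest(latest_items))
```

Keeping the seen set outside the function leaves the choice of storage (a file, a key-value store, a database) to the caller.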

Common cron expressions:

Frequency	Cron
Every 15 minutes	`*/15 * * * *`
Hourly	`0 * * * *`
Every 6 hours	`0 */6 * * *`
Daily at 06:00 UTC	`0 6 * * *`
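
To check what a schedule means before saving it, the minute and hour fields can be expanded by hand. The sketch below covers only the three forms used on this page (`*`, `*/n`, and a single number); it is not a full cron parser and is not how Apify evaluates schedules.

```python
def expand_cron_field(field: str, low: int, high: int) -> list[int]:
    """Expand `*`, `*/n`, or a single number into the values it matches."""
    if field == "*":
        return list(range(low, high + 1))
    if field.startswith("*/"):
        step = int(field[2:])
        if step < 1:
            raise ValueError(f"step must be positive: {field!r}")
        return list(range(low, high + 1, step))
    value = int(field)
    if not low <= value <= high:
        raise ValueError(f"{value} is outside {low}-{high}")
    return [value]


def fire_times(expr: str) -> tuple[list[int], list[int]]:
    """Return the (minutes, hours) at which a five-field expression fires."""
    fields = expr.split()
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}: {expr!r}")
    minute, hour = fields[0], fields[1]
    return expand_cron_field(minute, 0, 59), expand_cron_field(hour, 0, 23)


minutes, hours = fire_times("0 */6 * * *")
print(minutes, hours)  # prints "[0] [0, 6, 12, 18]"
```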
