Barron's Article Scraper avatar

Barron's Article Scraper

Pricing

from $20.00 / 1,000 results

Go to Apify Store
Barron's Article Scraper

Barron's Article Scraper

Extract Barron's articles (barrons.com) - title, body, authors and metadata. Fast, HTTP-only and no cookies required. Mode `latest` scrapes the homepage for the newest article URLs.

Pricing

from $20.00 / 1,000 results

Rating

0.0

(0)

Developer

Xtractoo

Xtractoo

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

11 hours ago

Last modified

Categories

Share

Extract Barron's articles (barrons.com) — title, snippet body, authors, lead image, dates, and section. Barron's is the financial weekly published by Dow Jones / News Corp, focused on investing, markets, and corporate finance.

Why Use This Actor?

  • US markets coverage — Barron's is a primary venue for buy-side commentary and "next week's market" pieces.
  • Same CMS as WSJ — Barron's runs on the Dow Jones platform, so the article shape is identical to wsj.com (good for normalised cross-DJ pipelines).
  • DataDome bypass — actor clears DataDome anti-bot via primp (Rust rquest TLS stack) with a rotating profile pool.

How It Works

Barron's has two layers of protection identical to WSJ:

  1. DataDome returns 401 + datadome cookie unless TLS fingerprint matches a real browser. This actor uses only HTTP requests — no browser, no Selenium, no Playwright to bypass this.
  2. Subscriber paywall — Content is supplemented with the auto-generated bullet summary and SEO summary. subscribers get the full body in articleData.flattenedBody.

Proxy requirements

A US residential proxy is required — DataDome blocks both datacenter IPs and non-US residential pools. The actor reads proxyConfiguration from input and uses it for every fetch:

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "US"
}
}

This is set as the default in the input schema.

Input

{
"url": "https://www.barrons.com/articles/example-article-daf7f9aa",
"urls": [
"https://www.barrons.com/articles/article-one"
],
"mode": "article",
"limit": 10
}

Output

{
"url": "https://www.barrons.com/articles/prediction-markets-conference-las-vegas-casino-daf7f9aa",
"source": "Barron's",
"title": "Casino Cancels Prediction Markets Conference. Not Everything Can Stay in Vegas.",
"description": "Nevada's gambling regulator has taken a strong stance against prediction markets.",
"content": "What happens in Vegas, stays in Vegas—or so the slogan goes. But it seems prediction markets can't stay at all.\n\nKey points:\n- The Predict 2026 conference in Las Vegas was canceled by Aria casino due to concerns over its gambling license.\n- Nevada regulators have taken a strong stance against prediction markets.\n- Prediction markets claim federal regulation, but some states view them as illegal gambling.\n\nThe Aria Resort and Casino canceled an upcoming predictions market conference. Nevada's gambling regulator has taken a strong stance against prediction markets.",
"image": "https://images.barrons.com/im-12787006",
"language": "en_US",
"word_count": 89,
"full_word_count": 636,
"full_paragraph_count": 13,
"published_date": "2026-05-14T19:06:00Z",
"modified_date": "2026-05-15T00:36:00Z",
"authors": ["Nick Devor"],
"categories": "Daily",
"tags": "",
"warning": "Barron's paywall - only snippet extracted; full body is subscriber-only"
}

Fetch Latest News

Set mode to "latest" to fetch the newest article URLs from Barron's homepage. The official RSS endpoint (feeds.a.dj.com/rss/BarronsFront.xml) returns 403, so we scrape barrons.com's homepage HTML and collect URLs matching the article pattern.

Input:

{
"mode": "latest",
"limit": 10
}

Output — array of objects:

[
{
"url": "https://www.barrons.com/articles/example-newest-article-664c6761",
"title": "Treasury Yields Slip as Fed Rate Cut Bets Grow",
"source": "Barron's"
}
]

Source: https://www.barrons.com/ (homepage scraping via primp — no public RSS).

Cron Schedule: Auto-Fetch Newest Articles

Combine mode: "latest" and mode: "article" to keep a fresh feed running on autopilot:

  1. Schedule a recurring run of this Actor with {"mode": "latest", "limit": 20} via Apify Schedules (UI ▸ Schedules ▸ Create new). A cron expression like */30 * * * * runs it every 30 minutes.
  2. Webhook the dataset of the latest run into another Actor run with mode: "article" and the new URLs as input — Apify integrations let you chain runs via the "Actor finished" webhook without any glue code.
  3. The article-mode run extracts the snippet body, image, authors, and metadata for each URL and appends to your master dataset.

Common cron expressions:

FrequencyCron
Every 15 minutes*/15 * * * *
Hourly0 * * * *
Every 6 hours0 */6 * * *
Daily at 06:00 UTC0 6 * * *