Pbs News Scraper

Pricing

from $10.00 / 1,000 results

Developer

Donny

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: a day ago

PBS NewsHour Scraper

What Does PBS NewsHour Scraper Do?

The PBS NewsHour Scraper is an Apify actor that extracts news articles from PBS NewsHour, one of America's most respected broadcast news programs. It collects article titles, summaries, publication dates, authors, categories, and featured images from the PBS NewsHour website in a structured JSON format ready for analysis or integration into your data pipeline.

Why Scrape PBS NewsHour?

PBS NewsHour is known for balanced, in-depth journalism covering politics, the economy, science, health, and world affairs. For media researchers, content aggregators, political analysts, and educators, structured access to this content supports news monitoring, media comparison studies, content curation, and educational resource building. The scraper returns each article as a record with the same set of fields, so results can be loaded directly into analysis tools.

How to Use This PBS News Scraper

  1. Provide one or more PBS NewsHour URLs to start scraping (defaults to the latest articles page).
  2. Set the maximum number of articles to extract.
  3. Run the actor and download your structured news dataset.

The actor uses a Cheerio-based crawler for fast, lightweight extraction without requiring a full browser.

Input Parameters

Parameter    Type      Description                    Default
startUrls    array     PBS NewsHour URLs to scrape    ["https://www.pbs.org/newshour/latest"]
maxResults   integer   Maximum articles to scrape     25
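
As a rough illustration, the two input parameters can be assembled into a JSON input like this. The helper name build_input is hypothetical, and the sketch assumes startUrls is a plain array of URL strings, as the default value suggests; check the actor's input form if it expects a different shape.

```python
import json

# Defaults taken from the Input Parameters table.
DEFAULT_START_URL = "https://www.pbs.org/newshour/latest"
DEFAULT_MAX_RESULTS = 25


def build_input(start_urls=None, max_results=DEFAULT_MAX_RESULTS):
    """Return an input dict using the parameter names from the table.

    Hypothetical helper: the array-of-strings shape for startUrls is an
    assumption based on the documented default value.
    """
    urls = list(start_urls) if start_urls else [DEFAULT_START_URL]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_results, bool) or not isinstance(max_results, int):
        raise ValueError("maxResults must be an integer")
    if max_results < 1:
        raise ValueError("maxResults must be at least 1")
    return {"startUrls": urls, "maxResults": max_results}


if __name__ == "__main__":
    # Print an input that raises the article limit from 25 to 50.
    print(json.dumps(build_input(max_results=50), indent=2))
```

The printed JSON can be pasted into the actor's input editor or passed as the run input when starting the actor through an API client.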

Output Data

Each article in the dataset includes:

  • title - Article headline
  • url - Full URL to the article
  • summary - Brief description or excerpt
  • date - Publication date
  • author - Article author
  • category - Topic category (politics, world, economy, etc.)
  • imageUrl - Featured image URL
  • scrapedAt - Timestamp of the scrape
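
A minimal sketch of how downloaded records might be handled, assuming each dataset item is a JSON object with the eight field names listed above and that any field may be missing for a given article. The sample records are invented placeholders, not real scraper output, and the function names are hypothetical.

```python
from collections import defaultdict

# Field names as listed in the Output Data section.
ARTICLE_FIELDS = (
    "title", "url", "summary", "date",
    "author", "category", "imageUrl", "scrapedAt",
)


def normalize(item):
    """Return a dict with all eight fields, using None for missing ones."""
    return {field: item.get(field) for field in ARTICLE_FIELDS}


def count_by_category(items):
    """Count articles per category; items without one are 'uncategorized'."""
    counts = defaultdict(int)
    for item in items:
        category = normalize(item)["category"] or "uncategorized"
        counts[category] += 1
    return dict(counts)


# Placeholder records for illustration only (not real articles).
sample = [
    {"title": "Example headline A", "url": "https://example.com/a", "category": "world"},
    {"title": "Example headline B", "url": "https://example.com/b", "category": "politics"},
    {"title": "Example headline C", "url": "https://example.com/c"},
]

if __name__ == "__main__":
    print(count_by_category(sample))
```

The date and scrapedAt values are left as-is here because their exact formats are not documented on this page.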

Cost of Usage

The actor is billed per result, plus a small fee each time it starts:

  • Per result: $0.01
  • Per 1,000 results: $10
  • Actor start cost: $0.005

A typical run extracting 25 articles finishes in under a minute at minimal cost.
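
To make the pricing concrete, the sketch below estimates cost from the rates listed above. It assumes the start cost is charged once per run and that no other platform fees apply; the function name estimate_cost is hypothetical. For the typical 25-article run, that gives 25 × $0.01 + $0.005 = $0.255.

```python
from decimal import Decimal

# Rates from the Cost of Usage list.
PRICE_PER_RESULT = Decimal("0.01")
ACTOR_START_COST = Decimal("0.005")


def estimate_cost(results_per_run, runs=1):
    """Estimate total cost in USD for a number of identical runs.

    Assumption: the start cost is charged once per run.
    """
    if results_per_run < 0 or runs < 0:
        raise ValueError("results_per_run and runs must be non-negative")
    per_run = PRICE_PER_RESULT * results_per_run + ACTOR_START_COST
    return per_run * runs


if __name__ == "__main__":
    print(estimate_cost(25))           # one default-sized run
    print(estimate_cost(25, runs=30))  # e.g. one run per day for 30 days
```

Decimal is used instead of float so that small per-result amounts add up without rounding drift.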

Tips and Best Practices

  • Start with the /newshour/latest URL to get the most recent articles across all categories.
  • Provide section-specific URLs for targeted scraping (e.g., /newshour/world for international news).
  • Schedule regular runs to build an ongoing archive of PBS NewsHour coverage.
  • Combine with sentiment analysis tools for media research projects.
  • The Cheerio-based crawler parses static HTML without launching a browser, which keeps runs fast and lightweight.
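
Two of the tips above can be sketched in a few lines: building section-specific start URLs, and merging the results of scheduled runs into an archive without duplicates. The helper names are hypothetical, and the sketch assumes an article's url field identifies it uniquely across runs.

```python
from urllib.parse import urljoin

BASE_URL = "https://www.pbs.org"


def section_url(section):
    """Build a start URL for a section path such as 'latest' or 'world'."""
    return urljoin(BASE_URL, f"/newshour/{section.strip('/')}")


def merge_into_archive(archive, new_items):
    """Add unseen articles to archive (a dict keyed by URL).

    Returns the number of newly added articles. Items without a url
    are skipped because they cannot be de-duplicated.
    """
    added = 0
    for item in new_items:
        url = item.get("url")
        if url and url not in archive:
            archive[url] = item
            added += 1
    return added


if __name__ == "__main__":
    print(section_url("world"))
    archive = {}
    # Placeholder items standing in for two overlapping scheduled runs.
    first_run = [{"url": "https://example.com/a"}, {"url": "https://example.com/b"}]
    second_run = [{"url": "https://example.com/b"}, {"url": "https://example.com/c"}]
    print(merge_into_archive(archive, first_run))
    print(merge_into_archive(archive, second_run))
```

Keying the archive on the URL keeps the first copy of each article, so repeated runs over the latest page only add what is new.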

Related actors worth exploring: