Pbs News Scraper
PBS NewsHour Scraper
What Does PBS NewsHour Scraper Do?
The PBS NewsHour Scraper is an Apify actor that extracts news articles from PBS NewsHour, one of America's most respected broadcast news programs. It collects article titles, summaries, publication dates, authors, categories, and featured images from the PBS NewsHour website in a structured JSON format ready for analysis or integration into your data pipeline.
Why Scrape PBS NewsHour?
PBS NewsHour is known for balanced, in-depth journalism covering politics, the economy, science, health, and world affairs. For media researchers, content aggregators, political analysts, and educators, structured access to PBS NewsHour content supports news monitoring, media comparison studies, content curation, and educational resource building. This scraper provides clean, reliable access to its news catalog.
How to Use This PBS News Scraper
- Provide one or more PBS NewsHour URLs to start scraping (defaults to the latest articles page).
- Set the maximum number of articles to extract.
- Run the actor and download your structured news dataset.
The actor uses a Cheerio-based crawler for fast, lightweight extraction without requiring a full browser.
Input Parameters
| Parameter | Type | Description | Default |
|---|---|---|---|
| startUrls | array | PBS NewsHour URLs to scrape | ["https://www.pbs.org/newshour/latest"] |
| maxResults | integer | Maximum articles to scrape | 25 |
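As a rough illustration of the two parameters above, the following Python sketch builds an input object with the documented defaults. The helper name `build_run_input` and its local sanity checks (URL prefix, positive integer) are assumptions made for this example; the actor itself may validate input differently.

```python
import json

# Defaults taken from the parameter table above.
DEFAULT_START_URL = "https://www.pbs.org/newshour/latest"
DEFAULT_MAX_RESULTS = 25


def build_run_input(start_urls=None, max_results=DEFAULT_MAX_RESULTS):
    """Return a dict shaped like the actor input (hypothetical helper)."""
    urls = list(start_urls) if start_urls else [DEFAULT_START_URL]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_results, bool) or not isinstance(max_results, int) or max_results < 1:
        raise ValueError("maxResults must be a positive integer")
    for url in urls:
        # Local sanity check only; this prefix rule is an assumption.
        if not url.startswith("https://www.pbs.org/newshour"):
            raise ValueError(f"not a PBS NewsHour URL: {url}")
    return {"startUrls": urls, "maxResults": max_results}


if __name__ == "__main__":
    # Prints the default input as JSON.
    print(json.dumps(build_run_input(), indent=2))
```

A dict like this can be pasted into the actor's JSON input or, if you use the `apify-client` Python package, passed as `run_input` when calling the actor. The actor ID is not stated in this README, so it is left out here.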
Output Data
Each article in the dataset includes:
- title - Article headline
- url - Full URL to the article
- summary - Brief description or excerpt
- date - Publication date
- author - Article author
- category - Topic category (politics, world, economy, etc.)
- imageUrl - Featured image URL
- scrapedAt - Timestamp of the scrape
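For downstream code, it can help to load each dataset item into a typed record. The sketch below assumes every field arrives as a string or is missing; the README does not specify date or timestamp formats, so they are kept as plain strings. The `Article` class and the sample item are hypothetical placeholders, not real scraper output.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Article:
    """One dataset item, using the field names listed above."""
    title: Optional[str] = None
    url: Optional[str] = None
    summary: Optional[str] = None
    date: Optional[str] = None
    author: Optional[str] = None
    category: Optional[str] = None
    image_url: Optional[str] = None   # "imageUrl" in the dataset
    scraped_at: Optional[str] = None  # "scrapedAt" in the dataset

    @classmethod
    def from_item(cls, item: dict) -> "Article":
        # Missing keys become None instead of raising KeyError.
        return cls(
            title=item.get("title"),
            url=item.get("url"),
            summary=item.get("summary"),
            date=item.get("date"),
            author=item.get("author"),
            category=item.get("category"),
            image_url=item.get("imageUrl"),
            scraped_at=item.get("scrapedAt"),
        )


if __name__ == "__main__":
    # Placeholder values for illustration only.
    sample = {
        "title": "Example headline",
        "url": "https://www.pbs.org/newshour/world/example-article",
        "category": "world",
    }
    print(Article.from_item(sample))
```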
Cost of Usage
Pricing is per result, plus a small start fee per run:
- Per result: $0.01
- Per 1,000 results: $10
- Actor start cost: $0.005
A typical run extracting 25 articles finishes in under a minute and costs about $0.255 (25 × $0.01 per result plus the $0.005 start cost).
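The pricing above reduces to simple arithmetic, sketched here in Python. It assumes the listed per-result price and start cost are the only charges; `estimate_cost` is a hypothetical helper, not part of the actor.

```python
from decimal import Decimal

# Prices from the list above, kept as Decimal to avoid float rounding.
PRICE_PER_RESULT = Decimal("0.01")
ACTOR_START_COST = Decimal("0.005")


def estimate_cost(results: int, runs: int = 1) -> Decimal:
    """Estimate the cost in USD for a number of results across one or more runs."""
    if results < 0 or runs < 0:
        raise ValueError("results and runs must be non-negative")
    return ACTOR_START_COST * runs + PRICE_PER_RESULT * results


if __name__ == "__main__":
    print(estimate_cost(25))    # 0.255
    print(estimate_cost(1000))  # 10.005
```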
Tips and Best Practices
- Start with the /newshour/latest URL to get the most recent articles across all categories.
- Provide section-specific URLs for targeted scraping (e.g., /newshour/world for international news).
- Schedule regular runs to build an ongoing archive of PBS NewsHour coverage.
- Combine with sentiment analysis tools for media research projects.
- The Cheerio-based crawler parses static HTML without launching a browser, which keeps runs fast and lightweight.
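When scheduled runs are used to build an archive, consecutive runs will likely return overlapping articles. One possible way to handle this, sketched below, is to key the archive on the `url` field and keep the first copy seen. The function name and the keep-first policy are assumptions for illustration; the sample items are placeholders.

```python
def merge_into_archive(archive: dict, items: list) -> int:
    """Add new items to an archive keyed by article URL.

    Returns the number of articles that were not already present.
    Items without a URL are skipped; existing entries are left unchanged.
    """
    added = 0
    for item in items:
        url = item.get("url")
        if not url:
            continue  # cannot deduplicate without a URL
        if url not in archive:
            archive[url] = item
            added += 1
    return added


if __name__ == "__main__":
    archive = {}
    # Placeholder items standing in for two scheduled runs.
    run_1 = [{"url": "https://www.pbs.org/newshour/world/a", "title": "A"}]
    run_2 = [
        {"url": "https://www.pbs.org/newshour/world/a", "title": "A"},
        {"url": "https://www.pbs.org/newshour/world/b", "title": "B"},
    ]
    print(merge_into_archive(archive, run_1))  # 1
    print(merge_into_archive(archive, run_2))  # 1
    print(len(archive))                        # 2
```

Keeping the first copy preserves the original `scrapedAt` value; overwriting instead would favor the most recent version of each article.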
Related actors worth exploring:
- NPR Article Scraper - NPR news articles
- UN News Article Scraper - United Nations global news
- WHO Health News Scraper - World Health Organization news
