News Aggregator - RSS Feed Parser & Article Extractor avatar

News Aggregator - RSS Feed Parser & Article Extractor

Pricing

Pay per usage

Go to Apify Store
News Aggregator - RSS Feed Parser & Article Extractor

News Aggregator - RSS Feed Parser & Article Extractor

Extract structured news articles from any RSS feed. Get headlines, summaries, publication dates, authors, and source URLs in clean JSON. Perfect for media monitoring, content curation, and news aggregation pipelines.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Pierrick McD0nald

Pierrick McD0nald

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

5 days ago

Last modified

Categories

Share

News Aggregator - RSS Feed Parser — Structured News Data from Any RSS Feed

Aggregate news articles from multiple RSS feeds into clean, structured JSON data. This Actor extracts titles, links, descriptions, publication dates, categories, and source domains from standard RSS 2.0 and Atom feeds, making it easy to monitor news sources, track topics, and build content pipelines without writing custom parsers.

Use Cases

  • Content Monitoring — Track news from multiple sources in a single structured dataset for analysis, dashboards, or alerting
  • Market Intelligence — Monitor industry news, competitor mentions, and trending topics across publications
  • Research & Analysis — Collect news articles for academic research, sentiment analysis, or trend detection
  • Media Aggregation — Build curated news feeds for websites, newsletters, or internal dashboards
  • SEO & Content Strategy — Analyze publishing frequency, topic coverage, and content gaps across news sources

Input

FieldTypeRequiredDescription
feedUrlsArrayYesList of RSS or Atom feed URLs to aggregate (e.g., ['https://feeds.bbci.co.uk/news/rss.xml'])
maxItemsNumberNoMaximum total articles to extract across all feeds (default: 100)
proxyConfigurationObjectNoProxy configuration for feed requests. Disabled by default since RSS feeds rarely require proxies

Output

The Actor outputs a dataset with the following fields for each article:

{
"title": "Example News Article Title",
"link": "https://example.com/news/article",
"description": "A brief summary of the article content",
"pubDate": "Mon, 28 Jun 2026 12:00:00 GMT",
"source": "feeds.bbci.co.uk",
"category": "World",
"feedUrl": "https://feeds.bbci.co.uk/news/rss.xml",
"scrapedAt": "2026-06-28T12:00:00.000Z"
}

Pricing

Pay per event: $0.001 per article extracted (equivalent to $1.00 per 1,000 articles). No charge for failed requests or empty feeds. Pricing is based on the number of articles successfully pushed to the dataset.

Limitations

  • This Actor extracts data from RSS and Atom feeds only. It does not scrape full article content from the original websites
  • Some RSS feeds may truncate descriptions or omit categories. The Actor returns exactly what the feed provides
  • Very large feeds (10,000+ items) may be memory-intensive. Use maxItems to limit output
  • Feed parsing depends on the RSS/Atom being well-formed. Malformed feeds may return partial data or errors
  • This Actor does not deduplicate articles across feeds. The same article appearing in multiple feeds will be extracted multiple times

FAQ

Q: Can I use this with any RSS feed? A: Yes. The Actor supports standard RSS 2.0 and Atom feed formats. Simply provide the feed URL in the feedUrls input field.

Q: Do I need a proxy? A: No. RSS feeds are generally public and do not require proxy rotation. The proxy configuration is disabled by default.

Q: How many articles can I extract per run? A: The maxItems input controls the total. The default is 100. There is no hard limit, but very large runs will consume more compute.

Q: Does this Actor extract full article text? A: No. This Actor extracts the metadata provided by the RSS feed (title, link, description, date, category). To extract full article text, you would need to feed the extracted links into a separate article scraper.

Changelog

  • v1.0.0 — Initial release. RSS 2.0 and Atom feed parsing with structured JSON output, PPE charging, and multi-source aggregation.