Pricing

from $0.50 / 1,000 results

AI-Powered RSS Aggregator & Summarizer

Enterprise-grade RSS aggregator with AI-powered summarization. Collects, filters, and processes feeds from any source. Suited to content analysis, news monitoring, and building AI training data. Features keyword filtering, metadata extraction, and structured output in JSON/CSV. AI summarization uses Hugging Face transformers.

Developer

PrimeParse

🌐 RSS Aggregator: AI-Powered RSS Aggregator & Summarizer

High-quality RSS Feed Aggregator & Processor for Content Teams, Researchers, and AI Engineers

Automatically aggregates RSS feeds, filters entries by keyword, extracts the summaries that feeds provide, and optionally generates AI-powered summaries. The output is clean, structured, and ready for analysis or AI pipelines.

Built for:

Content aggregators & news monitoring teams
Researchers tracking academic papers and publications
AI/ML engineers building content datasets
Marketing teams monitoring industry trends
Data analysts processing feed data

✅ Smart keyword filtering
✅ AI-powered summarization (Hugging Face transformers)
✅ Multiple feed support (1-5 feeds recommended)
✅ Rich metadata extraction (date, author, tags, description)
✅ Rate limiting & respectful crawling
✅ AI-ready structured output

👉 Runs on Apify • No code required

🚀 Why This Aggregator

✔ Purpose-Built for RSS Processing
Intelligently aggregates and processes RSS feeds from any source — news sites, academic journals, blogs, corporate feeds.

✔ AI Summarization Ready
Optional integration with Hugging Face transformers (BART, Pegasus) for advanced AI-powered summarization of feed entries.

✔ Clean & Structured Output
Extracts only meaningful content — title, link, summary, author, tags, publication date — ready for analysis.

✔ Smart Keyword Filtering
Filter entries by custom keywords (case-insensitive) across title, summary, and tags for relevance.

✔ AI & ML Ready
Structured JSON/CSV output suited to RAG systems, LLM fine-tuning, or training datasets.

✔ Fast & Efficient
Powered by feedparser, a Python library for parsing RSS and Atom feeds. Processing is lightweight and fast.

✔ Safe & Controlled Processing
Configurable rate limiting, entry limits per feed, and graceful error handling.
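
The keyword filtering described above can be sketched in a few lines of Python. This is a minimal illustration, not the Actor's actual code: the function name `matches_keywords` and the plain substring matching are assumptions, while the field names follow the output records shown in this README.

```python
from typing import Iterable


def matches_keywords(entry: dict, keywords: Iterable[str]) -> bool:
    """Return True if any keyword appears in the title, summary, or tags.

    Hypothetical helper: matching is a case-insensitive substring test,
    and an empty keyword list keeps every entry.
    """
    wanted = [k.lower() for k in keywords if k.strip()]
    if not wanted:
        return True
    parts = [
        entry.get("title") or "",
        entry.get("summary") or "",
        *(entry.get("tags") or []),
    ]
    haystack = " ".join(parts).lower()
    return any(k in haystack for k in wanted)


entries = [
    {"title": "New GPU released", "summary": "Faster training for machine learning.", "tags": ["hardware"]},
    {"title": "Local bakery opens", "summary": "Fresh bread daily.", "tags": ["food"]},
]
kept = [e for e in entries if matches_keywords(e, ["Machine Learning"])]
print([e["title"] for e in kept])
```

One caveat with plain substring matching: a short keyword such as "AI" also matches inside longer words ("daily"). Matching on word boundaries with a regular expression would be a stricter alternative.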

💼 Use Cases

News monitoring — Track industry news and trends from multiple sources
Academic research — Aggregate papers from arXiv, PubMed, and other academic feeds
Content curation — Collect and filter relevant content for newsletters or blogs
AI training data — Generate clean datasets for LLM fine-tuning or RAG systems
Competitive intelligence — Monitor competitor blogs and news feeds
Market research — Track product announcements and industry updates

📊 Supported Sources

News feeds — TechCrunch, Reuters, BBC, Guardian, etc.
Academic feeds — arXiv, PubMed, academic journals
Blog feeds — Medium, WordPress, custom blog RSS
Corporate feeds — Company blogs, press releases, announcements
Any RSS/Atom feed — Standard-compliant feeds

⚙️ How It Works

1. Provide RSS feed URLs (1-5 feeds recommended)
2. Set custom keywords and processing options
3. Optionally enable AI summarization
4. Run the Actor
5. Download clean, structured RSS datasets
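
Conceptually, the steps above amount to a sequential loop over feeds. The sketch below shows one way such a loop might look, with a per-feed entry cap, a delay between feeds, and errors that skip a feed rather than abort the run. The `fetch` callable is a stand-in for a real parser such as feedparser; `aggregate`, `fake_fetch`, and the parameter names are illustrative and not the Actor's internals.

```python
import time
from typing import Callable


def aggregate(
    feed_urls: list[str],
    fetch: Callable[[str], list[dict]],
    max_entries_per_feed: int = 10,
    delay_between_feeds: float = 1.0,
) -> list[dict]:
    """Process feeds one at a time; a failing feed is skipped, not fatal."""
    results: list[dict] = []
    for index, url in enumerate(feed_urls):
        if index > 0:
            time.sleep(delay_between_feeds)  # simple rate limiting between feeds
        try:
            entries = fetch(url)
        except Exception as exc:  # broad on purpose: keep the run alive
            print(f"Skipping {url}: {exc}")
            continue
        if max_entries_per_feed > 0:  # 0 means unlimited
            entries = entries[:max_entries_per_feed]
        results.extend({**entry, "feedUrl": url} for entry in entries)
    return results


def fake_fetch(url: str) -> list[dict]:
    """Stand-in fetcher so the example runs without network access."""
    if "broken" in url:
        raise ValueError("feed could not be parsed")
    return [{"title": f"Entry {i}"} for i in range(3)]


items = aggregate(
    [
        "https://example.com/a.xml",
        "https://example.com/broken.xml",
        "https://example.com/b.xml",
    ],
    fetch=fake_fetch,
    max_entries_per_feed=2,
    delay_between_feeds=0.0,
)
print(len(items))
```

Passing the fetcher in as a parameter keeps the loop testable offline; in a real run it would wrap the feed parser.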

🧩 Input Configuration

Example JSON Input

{
  "rssFeeds": [
    "https://arxiv.org/rss/cs.AI",
    "https://techcrunch.com/feed/"
  ],
  "maxEntriesPerFeed": 10,
  "keywords": [
    "AI",
    "machine learning",
    "artificial intelligence"
  ],
  "enableSummarization": true,
  "enableAISummarization": true,
  "aiModelName": "facebook/bart-large-cnn",
  "aiMaxLength": 1024,
  "aiMinLength": 50,
  "aiMaxSummaryLength": 150,
  "delayBetweenFeeds": 1.0
}

Key Options

rssFeeds — List of RSS feed URLs to aggregate (required, 1-5 recommended)
maxEntriesPerFeed — Maximum entries per feed (0 = unlimited, default: 10)
keywords — Custom keywords for filtering entries (case-insensitive, empty = all entries)
enableSummarization — Extract summary/description from feeds (default: true)
enableAISummarization — Use Hugging Face AI for advanced summarization (default: false)
aiModelName — Hugging Face model identifier (default: "facebook/bart-large-cnn")
aiMaxLength — Maximum input length for AI model (default: 1024 tokens)
aiMinLength — Minimum summary length (default: 50 tokens)
aiMaxSummaryLength — Maximum summary length (default: 150 tokens)
delayBetweenFeeds — Delay in seconds between feeds for rate limiting (default: 1.0)
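
For readers who build the input programmatically, the defaults listed above can be captured in a small helper that fills in any omitted option. The option names and default values come from the list above; the `with_defaults` function and its validation rules are assumptions made for illustration.

```python
DEFAULTS = {
    "maxEntriesPerFeed": 10,
    "keywords": [],  # assumed: an empty list means no filtering
    "enableSummarization": True,
    "enableAISummarization": False,
    "aiModelName": "facebook/bart-large-cnn",
    "aiMaxLength": 1024,
    "aiMinLength": 50,
    "aiMaxSummaryLength": 150,
    "delayBetweenFeeds": 1.0,
}


def with_defaults(user_input: dict) -> dict:
    """Merge user input over the documented defaults (hypothetical helper)."""
    if not user_input.get("rssFeeds"):
        raise ValueError("rssFeeds is required and must list at least one URL")
    unknown = set(user_input) - set(DEFAULTS) - {"rssFeeds"}
    if unknown:
        raise ValueError(f"Unknown options: {sorted(unknown)}")
    merged = {**DEFAULTS, **user_input}
    if merged["aiMinLength"] > merged["aiMaxSummaryLength"]:
        raise ValueError("aiMinLength must not exceed aiMaxSummaryLength")
    return merged


config = with_defaults({"rssFeeds": ["https://techcrunch.com/feed/"], "keywords": ["AI"]})
print(config["aiModelName"], config["maxEntriesPerFeed"])
```

Rejecting unknown keys catches typos such as `enableAiSummarization` before a run starts.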

📂 Output Dataset

All entries are stored in the default Apify dataset with the following structure:

Example Output Record

{
  "title": "Adobe hit with proposed class-action, accused of misusing authors' work in AI training",
  "link": "https://techcrunch.com/2025/12/17/adobe-hit-with-proposed-class-action-accused-of-misusing-authors-work-in-ai-training/",
  "published": "2025-12-18T00:44:55",
  "summary": "The lawsuit is just the latest in a string of copyright-related legal complaints aimed at the AI industry.",
  "feedTitle": "TechCrunch",
  "feedUrl": "https://techcrunch.com/feed/",
  "author": "Lucas Ropek",
  "tags": [
    "AI",
    "Adobe",
    "Anthropic",
    "artificial intelligence"
  ]
}

With AI Summarization

When enableAISummarization is set to true, the summary field contains an AI-generated summary:

{
  "title": "Breakthrough in Quantum Computing",
  "link": "https://example.com/quantum-breakthrough",
  "published": "2025-12-15T10:30:00",
  "summary": "Researchers achieve significant milestone in quantum error correction, bringing practical quantum computing closer to reality. The new method reduces error rates by 50%...",
  "feedTitle": "Science News",
  "feedUrl": "https://example.com/feed.xml",
  "author": "Dr. Jane Smith",
  "tags": ["quantum computing", "research", "technology"]
}
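
Records in this shape are straightforward to post-process. As a sketch, the snippet below parses the `published` timestamp to sort records newest first and flattens `tags` so the records can be written as CSV. The sample records are abbreviated versions of the ones above; the `to_csv` helper and the `;` tag separator are arbitrary choices, not part of the Actor.

```python
import csv
import io
from datetime import datetime

FIELDS = ["title", "link", "published", "summary", "feedTitle", "feedUrl", "author", "tags"]


def to_csv(records: list[dict]) -> str:
    """Serialize dataset records to CSV text, newest first."""
    ordered = sorted(
        records,
        key=lambda r: datetime.fromisoformat(r["published"]),
        reverse=True,
    )
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    for record in ordered:
        row = {name: record.get(name, "") for name in FIELDS}
        row["tags"] = ";".join(record.get("tags", []))  # a list does not fit one CSV cell
        writer.writerow(row)
    return buffer.getvalue()


records = [
    {"title": "Breakthrough in Quantum Computing", "published": "2025-12-15T10:30:00",
     "feedTitle": "Science News", "tags": ["quantum computing", "research"]},
    {"title": "Adobe hit with proposed class-action", "published": "2025-12-18T00:44:55",
     "feedTitle": "TechCrunch", "tags": ["AI", "Adobe"]},
]
print(to_csv(records))
```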

🤖 AI Summarization Models

Supported Hugging Face models for summarization:

facebook/bart-large-cnn (default) — Best for news articles and general content
google/pegasus-xsum — Optimized for news summaries
Any summarization model — Compatible with Hugging Face transformers

The Actor automatically falls back to basic extraction if AI summarization fails or is unavailable.
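
The fallback behavior described above might look roughly like the following. This is a sketch under assumptions, not the Actor's source: `summarize_with_fallback` and `basic_extract` are hypothetical names, and basic extraction is modeled here as simple truncation of the feed's own description. `build_hf_summarizer` wraps the Hugging Face `pipeline("summarization", ...)` API but is not called in the example, so the snippet runs without transformers installed.

```python
from typing import Callable, Optional


def basic_extract(text: str, max_chars: int = 300) -> str:
    """Fallback: collapse whitespace and trim the feed's own description."""
    clean = " ".join(text.split())
    return clean if len(clean) <= max_chars else clean[:max_chars].rstrip() + "..."


def build_hf_summarizer(model_name: str = "facebook/bart-large-cnn",
                        min_length: int = 50,
                        max_length: int = 150) -> Callable[[str], str]:
    """Wrap a Hugging Face summarization pipeline (requires `transformers`)."""
    from transformers import pipeline  # imported lazily: optional dependency

    pipe = pipeline("summarization", model=model_name)

    def summarize(text: str) -> str:
        result = pipe(text, min_length=min_length, max_length=max_length, truncation=True)
        return result[0]["summary_text"]

    return summarize


def summarize_with_fallback(text: str,
                            summarizer: Optional[Callable[[str], str]] = None) -> str:
    """Use the AI summarizer when it works; otherwise fall back."""
    if summarizer is not None:
        try:
            summary = summarizer(text)
            if summary.strip():
                return summary
        except Exception as exc:  # model or inference errors should not drop the entry
            print(f"AI summarization failed, using basic extraction: {exc}")
    return basic_extract(text)


def failing_summarizer(text: str) -> str:
    """Simulates a model that cannot be loaded."""
    raise RuntimeError("model unavailable")


print(summarize_with_fallback("A   short\nfeed description.", failing_summarizer))
```

To try the real model, pass `build_hf_summarizer()` as the `summarizer` argument; the first call downloads the model weights.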

🏁 Getting Started

Quick Start on Apify

1. Click "Try for free" on Apify
2. Paste RSS feed URLs (e.g., https://techcrunch.com/feed/)
3. Customize keywords and options
4. Optionally enable AI summarization
5. Run and download your dataset

📈 Performance

Processing Speed — ~1-2 seconds per feed (depending on entries)
Rate Limiting — Configurable delay between feeds (default: 1s)
Memory Efficient — Processes feeds sequentially
Scalability — Handles 1-5 feeds optimally (can process more)

🔧 Advanced Configuration

Custom AI Models

You can use any Hugging Face summarization model:

{
  "enableAISummarization": true,
  "aiModelName": "google/pegasus-xsum",
  "aiMaxLength": 2048,
  "aiMinLength": 100,
  "aiMaxSummaryLength": 200
}

📧 Support

Email: kidaxxb@gmail.com
Response within 24 hours
Issues: Use Apify Issues tab

Tags: RSS, feed aggregator, content processing, AI summarization, Hugging Face, news aggregation, feed parser, content analysis, RAG, LLM training, data extraction

Built with ❤️ on Apify
