RSS Feed Aggregator & Article Extractor
Aggregate RSS/Atom feeds and extract full article content. Multi-feed ingestion, deduplication, keyword filtering, rich metadata. Returns clean JSON with full-text extraction. For news monitoring, AI training, and curation.
Developer: Oaida Adrian
Aggregate RSS/Atom feeds and extract full article content with clean, structured JSON output.
Features
- Multi-feed aggregation — Process dozens of RSS/Atom feeds in a single run
- Full article extraction — Uses trafilatura for high-precision main content extraction
- Smart deduplication — Removes duplicate articles by URL or title
- Date & category filtering — Filter articles by publish date or feed categories
- Rich metadata — Extracts authors, tags, enclosures, images, word counts, and more
- Structured output — Clean JSON schema ready for downstream processing, AI training, or content curation
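The deduplication step can be pictured with a minimal sketch. This is not the actor's own code: the function name `deduplicate` and the normalisation rules (trimming whitespace, case-insensitive title matching) are assumptions made for illustration. Only the field choices `link`, `title`, and `none` come from this README.

```python
from typing import Iterable


def deduplicate(articles: Iterable[dict], by: str = "link") -> list[dict]:
    """Keep the first article seen for each distinct value of the chosen field."""
    if by == "none":
        return list(articles)
    if by not in ("link", "title"):
        raise ValueError(f"unsupported deduplication field: {by!r}")

    seen: set[str] = set()
    unique: list[dict] = []
    for article in articles:
        key = (article.get(by) or "").strip()
        if by == "title":
            # Assumption: titles are compared case-insensitively, URLs are not.
            key = key.casefold()
        if key and key in seen:
            continue  # a duplicate of an earlier article
        if key:
            seen.add(key)
        unique.append(article)  # articles with an empty key are always kept
    return unique
```

Keeping the first occurrence preserves feed order, so whichever feed is listed first decides which copy of a syndicated article survives.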
Input Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| feedUrls | array | required | RSS/Atom feed URLs to aggregate |
| maxArticles | integer | 100 | Maximum articles to extract (0 = unlimited) |
| extractFullContent | boolean | true | Fetch and extract full article text from each URL |
| deduplicateBy | string | "link" | Deduplication field: link, title, or none |
| dateFilter | string | "" | Only include articles after this date (ISO 8601) |
| includeCategories | string | "" | Comma-separated categories to include |
| proxyConfiguration | object | Apify proxy | Proxy settings for content extraction |
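As a rough illustration of how these parameters fit together, the sketch below builds a run input and passes it to the actor through the `apify-client` Python package. The helper names `build_run_input` and `run_actor` are hypothetical, and the actor ID is left to the caller because this page does not state it. `proxyConfiguration` is omitted so that the documented default applies.

```python
def build_run_input(feed_urls, max_articles=100, extract_full_content=True,
                    deduplicate_by="link", date_filter="", include_categories=()):
    """Assemble the input object using the parameter names from the table above."""
    if not feed_urls:
        raise ValueError("feedUrls is required and must not be empty")
    if deduplicate_by not in ("link", "title", "none"):
        raise ValueError("deduplicateBy must be 'link', 'title', or 'none'")
    return {
        "feedUrls": list(feed_urls),
        "maxArticles": max_articles,            # 0 means unlimited
        "extractFullContent": extract_full_content,
        "deduplicateBy": deduplicate_by,
        "dateFilter": date_filter,              # ISO 8601 string, "" disables it
        "includeCategories": ",".join(include_categories),  # comma-separated
    }


def run_actor(token, actor_id, run_input):
    """Start a run and return its dataset items (needs `pip install apify-client`)."""
    from apify_client import ApifyClient  # imported lazily; only needed for real runs

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call might look like `run_actor(token, actor_id, build_run_input(["https://example.com/feed.xml"], max_articles=20))`, where the feed URL is a placeholder and `actor_id` is copied from the actor's page in Apify Store.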
Output Fields
| Field | Description |
|---|---|
| title | Article title |
| link | Article URL |
| pubDate | Publication date (ISO 8601) |
| source | Feed source name |
| author | Article author |
| summary | RSS feed summary/excerpt |
| fullText | Full extracted article text (if extractFullContent is enabled) |
| wordCount | Word count of extracted text |
| categories | Article categories/tags |
| tags | Extracted meta tags |
| enclosures | Media attachments (images, podcasts, etc.) |
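Downstream code typically filters and groups these records. The sketch below assumes each dataset item is a dictionary keyed by the field names in the table above; the `summarise` helper and the sample records are invented for illustration and are not real actor output.

```python
from collections import Counter


def summarise(records, min_words=0):
    """Drop short or text-less records, then count the remainder per feed source."""
    kept = [r for r in records if (r.get("wordCount") or 0) >= min_words]
    per_source = Counter(r.get("source") or "unknown" for r in kept)
    return kept, per_source


# Placeholder records using the documented field names; all values are made up.
sample = [
    {"title": "First post", "source": "Example Blog", "wordCount": 850},
    {"title": "Short note", "source": "Example Blog", "wordCount": 40},
    {"title": "Feed-only item", "source": "Example News", "wordCount": None},
]

kept, per_source = summarise(sample, min_words=100)
print([r["title"] for r in kept])
print(dict(per_source))
```

Treating a missing `wordCount` as zero means items whose full text could not be extracted fall out of any run that sets a positive `min_words`.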
Use Cases
- News monitoring — Track multiple news sources in one feed
- Content curation — Aggregate niche content for blogs or newsletters
- AI training data — Collect clean text for model training
- Media intelligence — Monitor competitors, track mentions, analyse trends
- SEO monitoring — Track industry publications and backlink opportunities
Pricing
This actor uses pay-per-event pricing. You are charged per article successfully extracted.