RSS News Aggregator
Pricing
$10.00 / 1,000 requests
RSS News Aggregator
Pricing
$10.00 / 1,000 requests
Rating
0.0
(0)
Developer
Louvre LLC
Actor stats
0
Bookmarked
4
Total users
3
Monthly active users
2 days ago
Last modified
Categories
Share
π Introduction
π― What is this actor?
The RSS News Aggregator is a powerful Apify actor designed to fetch, combine, and process multiple RSS and Atom feeds. It takes a list of feed URLs, sorts all articles by publication date, and groups them by their source, making it easy to create customized news streams.
π Key Features
- Aggregate Multiple Feeds: Combine articles from various RSS/Atom sources.
- Group by Source: Automatically groups articles by their root domain (e.g.,
marketwatch.com). - Global Sorting: Sorts all articles chronologically across all feeds.
- Flexible Output: Choose between a clean, normalized format or the raw, unprocessed data from the feeds.
- Deploy as API: Runs on the Apify platform and can be used as a serverless API.
π Use Cases
- News Dashboards: Build a personal or professional dashboard that pulls in news from all your favorite sources.
- Content Curation: Curate content for newsletters, websites, or social media by monitoring specific topics across different blogs and news sites.
- Market Research: Keep track of industry news, competitor announcements, and market trends in one place.
- Data Analysis: Use the raw data output to perform text analysis, sentiment analysis, or other research on a large corpus of articles.
π₯ Inputs
The actor requires the following inputs.
| Field | Type | Required | Description |
|---|---|---|---|
rss_feeds | Array | Yes | An array of objects, where each object contains a url key pointing to a valid RSS or Atom feed URL. |
raw_data | Boolean | No | If set to true, the actor will return the raw, unprocessed JSON for each feed, grouped by source domain, without applying normalization or pagination. Defaults to false. |
β Example Input
{"rss_feeds": [{ "url": "https://www.federalreserve.gov/feeds/press_monetary.xml" },{ "url": "https://www.marketwatch.com/rss/topstories" },{ "url": "http://feeds.bbci.co.uk/news/rss.xml" }],"raw_data": false}
π€ Outputs
The actor's output format depends on the raw_data input parameter.
Standard Output (raw_data: false)
By default, the actor performs a global sort on all articles from all feeds and then groups the results by their source. The output is an array of objects, where each object represents a source and contains its articles.
β Example Standard Output
[{"source": "federalreserve.gov","feeds": [{"title": "Fed Announces Monetary Policy Update","link": "https://www.federalreserve.gov/newsevents/pressreleases/monetary20231026a.htm","pub_date": "2023-10-26T18:00:00.000Z","source": "federalreserve.gov","guid": "https://www.federalreserve.gov/newsevents/pressreleases/monetary20231026a.htm","author": "","category": "Monetary Policy","enclosure": null}]},{"source": "marketwatch.com","feeds": [{"title": "Market Hits Record Highs After Tech Rally","link": "https://www.marketwatch.com/story/market-hits-record-highs-11635280000","pub_date": "2023-10-26T16:30:00.000Z","source": "marketwatch.com","guid": "https://www.marketwatch.com/story/market-hits-record-highs-11635280000","author": "John Doe","category": "Top Stories","enclosure": {"url": "https://example.com/image.jpg","type": "image/jpeg","length": "12345"}}]}]
Raw Data Output (raw_data: true)
When raw_data is true, the actor fetches the raw XML from each feed, converts it to JSON, and groups the full JSON objects into an array of source-specific objects. This mode does not apply any sorting. It is designed to give you the complete, original data structure for each feed.
β Example Raw Data Output
[{"source": "federalreserve.gov","feeds": [{"title": "Federal Reserve Board - Press Release","link": {"$": {"href": "https://www.federalreserve.gov/feeds/press_monetary.xml","rel": "self"}},"description": "Recent press releases on monetary policy.","lastBuildDate": "Thu, 26 Oct 2023 18:00:00 GMT","item": [{"title": "Fed Announces Monetary Policy Update","link": "https://www.federalreserve.gov/newsevents/pressreleases/monetary20231026a.htm","description": "The Federal Reserve issued a statement on its latest monetary policy decisions...","pubDate": "Thu, 26 Oct 2023 18:00:00 GMT","guid": {"_": "https://www.federalreserve.gov/newsevents/pressreleases/monetary20231026a.htm","isPermaLink": "true"}}]}]}]
βοΈ How to Use
π Running on Apify Platform
- Log in or Sign up on the Apify platform.
- Find the "RSS News Aggregator" Actor in the Apify Store.
- Click "Try actor" and then create a new task.
- Enter your desired RSS feed URLs and any other optional parameters in the Input tab.
- Start the task by clicking the "Start" button. The results will be available in the task's Dataset tab once the run is finished.
π οΈ Troubleshooting
| Issue | Possible Cause | Solution |
|---|---|---|
Invalid URL error | One or more URLs in rss_feeds are malformed or not valid RSS/Atom feeds. | Check all provided URLs to ensure they are correct and point to a valid feed. |
Parsing Error | The feed's XML structure is broken or does not conform to RSS/Atom standards. | Try a different feed from that source or contact the source's administrator. |
Empty Output | The provided feeds are empty, or there was a network issue preventing them from being fetched. | Ensure the source feeds contain articles and that your network connection is stable. |