Rss To Json Extractor avatar
Rss To Json Extractor

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Rss To Json Extractor

Rss To Json Extractor

📰 Convert RSS and Atom feeds into structured JSON format for easy integration with your applications. Fast, reliable, and production-ready.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

SimplifySME Toolbox

SimplifySME Toolbox

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

2 days ago

Last modified

Share

📰 Convert RSS and Atom feeds into structured JSON format for easy integration with your applications. Fast, reliable, and production-ready.


📺 What It Extracts

  • Feed Metadata: Title, description, link, language, last build date
  • Article Data: Title, link, publication date, content, description, author
  • Content Organization: Categories, tags, GUID identifiers
  • Timestamps: Both human-readable and ISO 8601 formatted dates

🚀 Key Features

FeatureDescription
🔄 Universal Feed SupportSupports RSS 2.0, Atom, and other feed formats
⏱️ Smart Timeouts30-second timeout with up to 5 redirects
📊 Structured LoggingAll operations logged with runId and metadata
💾 Key-Value Store IntegrationSummary stored in OUTPUT key for webhook integration
🔍 ObservabilityRun metadata included in all outputs
⚠️ Error HandlingComprehensive error logging and reporting

📥 Input

Required

  • rssUrl (string): The URL of the RSS/Atom feed to extract
    • Example: "https://example.com/feed.xml"
    • Supports RSS 2.0, Atom, and other standard feed formats

📤 Output

Returns structured feed data:

Feed Metadata

{
"feed": {
"title": "Example Blog",
"description": "A sample blog feed",
"link": "https://example.com",
"language": "en",
"lastBuildDate": "Mon, 01 Jan 2024 12:00:00 GMT"
}
}

Articles Array

{
"articles": [
{
"title": "Sample Article",
"link": "https://example.com/article",
"pubDate": "Mon, 01 Jan 2024 10:00:00 GMT",
"isoDate": "2024-01-01T10:00:00.000Z",
"content": "Article content...",
"description": "Article description",
"author": "John Doe",
"categories": ["tech", "programming"],
"guid": "article-123"
}
],
"totalArticles": 1
}

Complete Output

{
"feed": { /* feed metadata */ },
"articles": [ /* array of articles */ ],
"totalArticles": 1,
"_metadata": {
"runId": "abc123",
"actorId": "user/rss-to-json",
"processedAt": "2024-01-01T12:00:00.000Z",
"processingTimeMs": 1234,
"rssUrl": "https://example.com/feed.xml"
}
}

💡 Use Cases

  • News Dashboards - Aggregate content from multiple RSS feeds
  • Curated Newsletters - Collect articles for email newsletters
  • SEO Monitoring - Track content updates and new publications
  • Content Aggregation - Build content databases from RSS sources
  • Social Media Automation - Extract content for automated posting
  • Research & Analysis - Collect articles for content analysis

⚙️ Technical Details

  • Feed Parser: Uses rss-parser library for robust feed parsing
  • Timeout Strategy: 30-second timeout with 5 redirect maximum
  • Error Handling: Comprehensive error logging with structured metadata
  • Storage: Summary stored in Key-Value Store OUTPUT key
  • Dataset Integration: All articles pushed to default dataset

🔗 Integration

Webhooks

Configure webhooks in Apify to trigger on:

  • Run succeeded: Get summary from key-value store OUTPUT
  • Run failed: Error details in OUTPUT key

Key-Value Store

  • OUTPUT: Contains run summary with success status, article count, and metadata

Dataset

All extracted articles are pushed to the default dataset. Access via:

  • Apify API: GET /v2/datasets/{datasetId}/items
  • Webhook payload: Includes dataset ID in run metadata

📝 Example Usage

Basic Extraction

{
"rssUrl": "https://example.com/feed.xml"
}
{
"rssUrl": "https://feeds.bbci.co.uk/news/rss.xml"
}