Pricing

$6.99/month + usage

RSS Feed Scraper — News Scraper & Article Extractor

Scrape any RSS or Atom news feed. Get article title, URL, description, author, date, category, and image. 20+ built-in presets: BBC, Reuters, TechCrunch, CNN, NYT, Wired & more. Optional full article text. No login. $6.99/month. 2-hour free trial.

Pricing

$6.99/month + usage

Rating

0.0

(0)

Developer

Scrape Pilot

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

📰 RSS Feed Scraper — News Scraper & Article Extractor

The most complete RSS Feed Scraper on Apify. Extract articles from any RSS or Atom feed — BBC, Reuters, TechCrunch, CNN, NYT, Wired, The Verge, Reddit, and 20+ built-in presets — or paste any custom RSS feed URL. Get title, URL, description, author, publish date, category, and image per article. Optional full article text extraction included. No login. No API key. Instant structured output.

🔍 What Is This Actor?

RSS Feed Scraper is a production-ready Apify actor that extracts structured article data from any RSS or Atom news feed — using 20+ built-in presets for the world's top publications or any custom feed URL you provide.

Select a preset like bbc_news, techcrunch, or reuters_world — or paste your own RSS feed URL — and receive back a clean dataset of news articles: title, article URL, description, author, publish date, category, and thumbnail image. Enable the optional full article extraction mode to also retrieve the complete article body text from each linked page.

This news scraper works with any publication that provides an RSS or Atom feed — from major global outlets to niche industry blogs — making it the most versatile news article scraper available on Apify.

🚀 Why Use This RSS Feed Scraper?

Feature	This Actor	Manual Reading	Google Alerts	Other Scrapers
RSS feed scraper — any feed URL	✅	❌	❌	⚠️ Limited
20+ built-in news presets	✅	❌	❌	❌
Full article text extraction	✅ Optional	✅ Slow	❌	⚠️
Multiple feeds in one run	✅	❌	❌	❌
Author, category, image	✅ All fields	❌	❌	⚠️
RSS + Atom both supported	✅	N/A	N/A	⚠️
No login or API key	✅	✅	❌ Required	✅
Structured JSON output	✅	❌	❌	⚠️
Export to CSV / Excel	✅ Via Apify	❌	❌	❌
Scheduled runs	✅	❌	✅ Email only	❌

Bottom line: This RSS feed scraper is the only actor that combines 20+ built-in news presets, custom feed URL support, multi-feed batch runs, and optional full article body extraction — all in one tool with no credentials needed.

📡 Built-in News Feed Presets

Use any preset name directly in the preset_feed input — no URL needed:

🌍 News & World

Preset Key	Source
`bbc_news`	BBC News — Top Stories
`bbc_tech`	BBC News — Technology
`reuters_world`	Reuters — World News
`reuters_tech`	Reuters — Technology
`cnn_top`	CNN — Top Stories
`nyt_home`	New York Times — Homepage
`nyt_tech`	New York Times — Technology
`wsj_world`	Wall Street Journal — World
`google_news`	Google News — Top Stories

💻 Tech & Science

Preset Key	Source
`techcrunch`	TechCrunch — All Stories
`techcrunch_ai`	TechCrunch — AI Category
`wired`	Wired Magazine
`the_verge`	The Verge
`arstechnica`	Ars Technica
`engadget`	Engadget
`mit_tech_review`	MIT Technology Review
`hn_frontpage`	Hacker News — Front Page

🗣️ Community

Preset Key	Source
`reddit_world`	Reddit — r/worldnews
`reddit_tech`	Reddit — r/technology
`reddit_ai`	Reddit — r/artificial

Don't see your feed? Paste any RSS or Atom feed URL directly into feed_urls — the news article scraper handles any valid feed automatically.

🎯 Use Cases

📰 News Monitoring & Media Intelligence

Use this news scraper to monitor multiple publications simultaneously for breaking stories on any topic
Build automated news briefing pipelines by scheduling daily RSS feed scraper runs
Track how different outlets cover the same story by scraping multiple feeds in one run

🤖 AI & NLP Training Datasets

Build large news article datasets for text classification, summarization, or language model training
Collect article titles and descriptions from diverse news sources for headline generation research
Use the full article extraction mode to build rich training corpora from any news publication

📊 Content Research & Competitive Analysis

Monitor competitor publications by scraping their RSS feeds for topic and publishing frequency analysis
Track technology trend coverage across multiple tech publications simultaneously
Collect article metadata for content gap analysis and editorial planning

🛠️ Developer & Content Pipeline Integrations

Feed news article data into Slack bots, newsletters, dashboards, or CMS platforms automatically
Build a multi-source news aggregator using structured data from this RSS feed scraper
Integrate real-time news data into AI applications, chatbots, or research tools

🎓 Academic & Journalism Research

Collect news article datasets for media bias research, framing analysis, or agenda-setting studies
Archive news coverage of specific events across multiple outlets for longitudinal research
Build structured datasets of news articles for computational journalism or fact-checking tools

🏢 Brand & Topic Monitoring

Monitor brand mentions across major publications using keyword-relevant RSS feeds
Track industry news from trade publications by adding their feed URLs to a scheduled run
Build a real-time news alert system by combining this news scraper with Apify's scheduling

⚙️ Input Parameters

{
  "preset_feed":          "techcrunch",
  "feed_urls":            [],
  "max_results":          20,
  "fetch_full_articles":  false,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}

Parameter	Type	Default	Description
`preset_feed`	string	`""`	Built-in preset name — e.g. `"bbc_news"`, `"reuters_tech"`, `"techcrunch_ai"`. See full preset list above
`feed_urls`	array or string	`[]`	Custom RSS or Atom feed URLs — paste any valid feed. Newline-separated string also accepted. Multiple feeds processed in one run
`max_results`	integer	`20`	Maximum total articles to return across all feeds
`fetch_full_articles`	boolean	`false`	When `true`, visits each article URL and extracts the full article body text. Adds ~5–10 seconds per article
`proxyConfiguration`	object	Off	Optional proxy config — not required for most RSS feeds

Tip: You can combine preset_feed and feed_urls in the same run. The preset feed is processed first, then your custom URLs. Multiple feeds in feed_urls are all processed together with results merged into one dataset.

📋 Output Fields

Every record from this news article scraper includes:

Field	Type	Description	Example
`title`	string	Article headline (max 300 chars)	`"OpenAI releases GPT-5 with major reasoning improvements"`
`url`	string	Full article URL	`"https://techcrunch.com/2024/03/15/..."`
`description`	string	Article summary or excerpt (max 1000 chars)	`"OpenAI has announced the release of..."`
`published`	string	Publication date and time	`"Fri, 15 Mar 2024 09:30:00 GMT"`
`author`	string	Article author name	`"Sarah Perez"`
`category`	string	Article categories (up to 3, comma-separated)	`"AI, Technology, Startups"`
`image`	string	Article thumbnail or featured image URL	`"https://techcrunch.com/wp-content/..."`
`source`	string	Feed source domain	`"techcrunch.com"`
`type`	string	Feed format detected	`"rss"`, `"atom"`
`full_text`	string	Full article body text — only when `fetch_full_articles: true` (max 5000 chars)	`"OpenAI has today announced..."`

📦 Example Input & Output

Input — preset feed:

{
  "preset_feed":  "techcrunch_ai",
  "max_results":  5
}

Input — custom feed URLs:

{
  "feed_urls": [
    "https://www.wired.com/feed/rss",
    "https://www.theverge.com/rss/index.xml"
  ],
  "max_results":         10,
  "fetch_full_articles": false
}

Output (one record):

{
  "title":       "OpenAI releases GPT-5 with major reasoning improvements",
  "url":         "https://techcrunch.com/2024/03/15/openai-gpt5/",
  "description": "OpenAI has announced the release of GPT-5, featuring significant improvements in multi-step reasoning and code generation tasks.",
  "published":   "Fri, 15 Mar 2024 09:30:00 GMT",
  "author":      "Sarah Perez",
  "category":    "Artificial Intelligence, Technology",
  "image":       "https://techcrunch.com/wp-content/uploads/2024/03/gpt5.jpg",
  "source":      "techcrunch.com",
  "type":        "rss",
  "full_text":   null
}

💰 Pricing & Free Trial

Plan	Price	Includes
Free Trial	$0	2 hours full access — no credit card required
Monthly	$6.99 / month	Unlimited runs, all presets, custom feeds, full article extraction

Everything included in every plan:

✅ 20+ built-in news feed presets — BBC, Reuters, TechCrunch, CNN, NYT, and more
✅ Custom RSS and Atom feed URL support — any publication
✅ Multi-feed batch — process multiple feeds in one run
✅ Full article body text extraction (optional)
✅ Author, category, image, and publish date per article
✅ No login or API key required
✅ JSON + CSV + Excel export from Apify dataset
✅ Scheduled runs for automated news monitoring

Start your 2-hour free trial now — no credit card needed. Click Try for free at the top of this page.

⚡ Performance & Limits

Mode	Articles	Estimated Time
Single preset feed	20	~10–20 seconds
Multiple feeds	50	~30–60 seconds
With full article extraction	20	~3–5 minutes
With full article extraction	50	~8–15 minutes

Results pushed to the Apify dataset in real time as each feed is processed
Full article extraction adds approximately 5–10 seconds per article — disable for faster runs
No proxy required for most major RSS feeds
Multiple feeds are processed in sequence with automatic rate limiting

❓ FAQ

Q: Can I use this news scraper with any RSS feed — not just the presets? A: Yes. Paste any valid RSS or Atom feed URL into the feed_urls field. The actor handles both RSS 2.0 and Atom feed formats automatically. If a publication offers an RSS feed, this RSS feed scraper can extract it.

Q: Can I process multiple RSS feeds in one run? A: Yes. Add multiple URLs to the feed_urls array — or combine a preset_feed with custom URLs — and all feeds are processed in a single run. Results are merged into one output dataset.

Q: What does fetch_full_articles do? A: When enabled, the actor visits each article's URL after parsing the feed and extracts the full article body text from the page. This gives you the complete article content — not just the RSS excerpt. It adds processing time, so only enable it when you need the full text.

Q: Does this work with Atom feeds as well as RSS? A: Yes. Both RSS 2.0 and Atom feed formats are fully supported. The actor auto-detects the format and parses accordingly.

Q: Do I need a proxy for major news sites? A: No. Most major news RSS feeds are publicly accessible without any proxy. Proxy is optional and can be enabled for feeds that restrict access by geography or IP.

Q: Can I schedule this to run daily for automated news monitoring? A: Yes. Set up an Apify scheduled task with your chosen preset or feed URLs to automatically collect fresh articles every day — or at any interval you choose.

Q: What if a feed URL returns no articles? A: The actor logs a warning and skips that feed, then continues processing all remaining feeds. One failed feed never stops the rest of the run.

Q: Can I export results to Excel or CSV? A: Yes. All results are pushed to the Apify dataset, which can be exported to JSON, CSV, Excel, and more directly from the Apify Console after each run.

📜 Changelog

v1.0.0 (Current)

✅ RSS 2.0 and Atom feed parsing
✅ 20+ built-in news feed presets
✅ Custom RSS feed URL support — any publication
✅ Multi-feed batch processing in one run
✅ Full article body text extraction (optional)
✅ Article fields: title, URL, description, author, date, category, image, source
✅ Automatic RSS vs Atom format detection
✅ No proxy required for major news feeds
✅ Real-time dataset push as each feed is processed

🏷️ Tags

rss feed scraper news scraper news article scraper rss scraper atom feed scraper news feed extractor bbc news scraper techcrunch scraper reuters scraper news data extractor media monitoring news aggregator

⚖️ Legal & Terms of Use

This actor accesses publicly available RSS and Atom feed data published by news outlets and content creators for the purpose of content distribution.

Please note:

RSS feeds are intentionally published by content creators for public consumption and aggregation
Use extracted news article data only for lawful purposes — research, monitoring, NLP datasets, aggregation, and academic study are common legitimate uses
Article content is copyright of the original publisher — do not republish full article text without authorization
Respect individual publication terms of service when using full article extraction
The actor developer is not responsible for how extracted data is used

🤝 Support & Feedback

Bug report? Contact us via the Apify actor page
Feature request? Post in the Apify Community forum
Loving it? Please leave a ⭐ review — it helps other users find this actor!

Built with ❤️ on Apify
The most complete RSS Feed Scraper — 20+ news presets, custom feeds, full article extraction

💰 $6.99/month · 🆓 2-hour free trial · No credit card required

RSS & News Feed Aggregator — Multi-Source Article Scraper

joyouscam35875/rss-news-aggregator

Aggregate and parse RSS/Atom feeds from any source. Extract articles with titles, descriptions, authors, dates, images. Optionally fetch full article content. Perfect for news monitoring and AI pipelines. $0.0005/article.

Ken Digital

RSS Feed Article Monitor

seeb/rss-feed-article-monitor

Extract clean article rows from RSS, Atom, and JSON feeds for news monitoring, content research, content operations, and AI workflow inputs.

Techionik

RSS Feed Aggregator & Article Extractor

darknezz/rss-feed-aggregator

Aggregate RSS/Atom feeds and extract full article content. Multi-feed ingestion, deduplication, keyword filtering, rich metadata. Returns clean JSON with full-text extraction. For news monitoring, AI training, and curation.

Oaida Adrian

RSS & News Feed Extractor - Articles to JSON/CSV

pear_fight/rss-news-feed-extractor-articles-to-json-csv

Parse any RSS or Atom feed into clean, structured article data: title, link, author, publish date, categories, summary and full content. Handles both RSS and Atom formats. Perfect for news monitoring, content aggregation and feeding data pipelines. Export to JSON, CSV, Excel.

Harald

Google News Article Scraper

webscrap18/google-news-article-scraper

Scrape Google News, Extract full content with Title, Article Text, Images and Structured data.

WebScrap

News / RSS Monitor

civicdataworks/news-rss-monitor

Monitor RSS and Atom feeds, filter by keywords, and export feed metadata/snippet records without full-article scraping.

Rowan Mercer

News Article Scraper for Feeding LLM

proscraper/newsarticlescraper

Scrape news articles metadata to feed into LLM models. Returns article body, published date, article title, author etc.

Owais Nazir

183

Google News RSS Scraper

cloud9_ai/google-news-scraper

Scrape Google News search results via RSS feed. Returns article titles, URLs, sources, publish dates, and summaries for any keyword. No API key needed.

cloud9

News Aggregator - RSS Feed Parser & Article Extractor

klondikeking/news-aggregator

Extract structured news articles from any RSS feed. Get headlines, summaries, publication dates, authors, and source URLs in clean JSON. Perfect for media monitoring, content curation, and news aggregation pipelines.

Pierrick McD0nald

RSS / Atom Feed Scraper

rupom888/rss-atom-feed-scraper

Scrape any RSS or Atom feed. Works with news sites, blogs, podcasts, YouTube channels (/feeds/videos.xml?channel_id=...), Reddit (/r/subreddit/.rss), and any standard feed URL. Extracts title, description, author, publish date, categories, and full content.

Syed Rupom