News & Announcements to Markdown for RAG
Pricing
from $40.00 / 1,000 markdown chunks
News & Announcements to Markdown for RAG
Convert press releases, corporate announcements & news articles into clean, chunked Markdown for RAG and LLM pipelines. Article URLs or RSS feeds. No login.
Pricing
from $40.00 / 1,000 markdown chunks
Rating
0.0
(0)
Developer
NexGenData
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Share
📰 News & Announcements to Markdown for RAG
Turn press releases, corporate announcements, and news articles into clean, chunked Markdown for RAG and LLM pipelines. Feed it article URLs or RSS/Atom feeds and get LLM-ready text with citations.
⚡ What you get
| Field | Description |
|---|---|
url | Source article URL (citation) |
title | Article / release title |
chunkIndex / totalChunks | Position within the article |
markdown | Clean Markdown chunk |
🎯 Use cases
- AI engineers building news/PR RAG copilots
- Market & competitive intel feeding event data to an LLM
- PR/IR teams building searchable announcement archives
- Fintech/research products needing announcement text with citations
🚀 Sample inputs
{ "rssFeeds": ["https://www.prnewswire.com/rss/news-releases-list.rss"], "maxPerFeed": 10 }
{ "urls": ["https://www.businesswire.com/news/home/.../en/..."], "chunkWords": 600 }
📦 Sample output
{ "url": "https://www.prnewswire.com/news-releases/...", "title": "Acme Raises $50M Series B", "chunkIndex": 0, "totalChunks": 6, "markdown": "# Acme Corp Raises $50M......" }
📊 Sample Output

🛠 How it works
- Source — fetches article URLs directly, or pulls latest items from RSS/Atom feeds.
- Extract — isolates the main article (
<article>/<main>), strips nav/ads/scripts. - Convert — HTML → ATX Markdown.
- Chunk — ~
chunkWords-word chunks for embedding. - Schema — one row per chunk, with the source URL as citation.
🔗 Related Actors
💰 Pricing Example
Pay-per-event: $0.005 per run + $0.04 per Markdown chunk (document-record).
| Chunks | Cost |
|---|---|
| 100 | ~$4.00 |
| 500 | ~$20.00 |
| 2,000 | ~$80.00 |
| Apify's $5 free credit covers ~124 chunks. Start free → |
⚖️ Legal & data sources
Fetches publicly-accessible articles/feeds with an identified User-Agent. Respect each publisher's terms for your downstream use; output includes source URLs for attribution.
❓ FAQ
URLs or feeds? Either or both — feeds expand to their latest items.
Citations? Yes — every chunk keeps its source URL.
Chunk size? chunkWords (default 800).
Paywalled articles? Only public content is reachable.
Fresh? Pulled live at run time.
Dedup? Repeated URLs in one run are skipped.
🆘 Troubleshooting
- Empty markdown — the page may be JS-rendered or paywalled.
- Too much boilerplate — the article wrapper wasn't detected; try a direct article URL.
- Feed returns nothing — confirm it's a valid RSS/Atom URL.
- Huge output — lower
maxPerFeedorchunkWords.
🏷️ About NexGenData
Structured public-data tools for analysts, developers, and operators. thenextgennexus.com.