Reddit Text Cleaner — TTS-Ready Narration

Turn messy Reddit/forum text into clean, TTS-ready narration: strips markdown, links, edit-stamps and emoji, expands abbreviations (AITA, MIL, TIFU…), splits into sentences, and optionally softens profanity. No AI, instant, deterministic.

Developer

Dami's Studio

Maintained by Community

Reddit Text Cleaner

Reddit and forum text is full of stuff that wrecks text-to-speech: markdown asterisks, link syntax, "Edit:" stamps, emoji, and abbreviations like AITA that a voice reads letter by letter. This actor cleans all of that out and hands back narration that's ready to feed into a TTS engine. It's built for people generating Reddit story videos or audio at scale, where the cleanup step needs to be cheap and predictable.

How it works

Pure rules, no model. It runs a fixed pipeline of regex passes (strip markdown and links, drop edit-stamps and emoji, expand abbreviations, then optionally rewrite profanity), splits the result into sentences, and returns it. Same input always gives the same output, and it returns instantly.
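To make the pipeline concrete, here is a minimal Python sketch of the same idea: a fixed sequence of regex passes followed by a sentence split. It is not the actor's source. The patterns, the emoji ranges, the abbreviation table (including the IMO and TIFU expansions), and the names `clean` and `split_sentences` are all assumptions made for illustration.

```python
import re

# Illustrative subset only; the actor's real abbreviation list is not shown in this README.
ABBREVIATIONS = {
    "AITA": "Am I the asshole",
    "MIL": "mother-in-law",
    "IMO": "in my opinion",       # assumed expansion
    "TIFU": "Today I fucked up",  # assumed expansion
}

LINK = re.compile(r"\[([^\]]+)\]\([^)]*\)")  # [label](url) -> label
BARE_URL = re.compile(r"https?://\S+")
EMPHASIS = re.compile(r"(?<!\w)(\*\*|__|~~|\*|_|`)(.+?)\1(?!\w)")
# Drops an "Edit:" stamp at the start of a line, up to the end of its sentence.
EDIT_STAMP = re.compile(r"(?im)^[ \t]*edit\s*\d*\s*:[^.!?\n]*[.!?]?")
# Rough emoji coverage, not an exhaustive Unicode emoji set.
EMOJI = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF\uFE0F]")
ABBREV = re.compile(r"\b(" + "|".join(map(re.escape, ABBREVIATIONS)) + r")\b")
SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def clean(text: str, expand_abbreviations: bool = True) -> str:
    """Run the passes in a fixed order so the same input always gives the same output."""
    text = LINK.sub(r"\1", text)
    text = BARE_URL.sub("", text)
    text = EMPHASIS.sub(r"\2", text)
    text = EDIT_STAMP.sub("", text)
    text = EMOJI.sub("", text)
    if expand_abbreviations:
        text = ABBREV.sub(lambda m: ABBREVIATIONS[m.group(0)], text)
    return re.sub(r"\s+", " ", text).strip()


def split_sentences(cleaned: str) -> list:
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s for s in SENTENCE_END.split(cleaned) if s]
```

Because every pass is a pure string substitution with no randomness or external calls, running it twice on the same text gives identical results, which is the property the actor relies on.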

Input

No single field is required, but you must supply text through at least one of them: text for a single block, or texts for a batch. If both are present, every entry from both is processed.

| Field | Required | Notes |
| --- | --- | --- |
| text | no | One block of text to clean, e.g. a post body. |
| texts | no | Array of strings or post objects. For objects it reads scriptText, narration, selftext, body, or text, in that order. Lets you pipe the Reddit Scraper's output in directly. |
| expandAbbreviations | no | Expand internet shorthand for TTS: AITA to "Am I the asshole", MIL to "mother-in-law", IMO, TIFU, and so on. Default true. |
| profanityMode | no | "keep" leaves swears as-is, "soft" swaps in mild words (handy for ad-safe narration), "censor" masks them as f***, "remove" deletes them. Default "keep". |
| wpm | no | Words per minute used to estimate read time. Default 150. |
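The field lookup order and the four profanity modes can be sketched in Python as below. This is a guess at the behaviour described above, not the actor's code. It assumes "in that order" means the first non-empty string field wins, and the word list in `SOFT_WORDS` plus the function names are hypothetical.

```python
import re
from typing import Any, List, Optional

# Lookup order taken from the texts field description above.
TEXT_FIELDS = ("scriptText", "narration", "selftext", "body", "text")

# Hypothetical word list; the actor's curated list is not shown in this README.
SOFT_WORDS = {"damn": "darn", "hell": "heck"}
PROFANITY = re.compile(r"\b(" + "|".join(SOFT_WORDS) + r")\b", re.IGNORECASE)


def pick_text(item: Any) -> Optional[str]:
    """Return a plain string as-is, or the first non-empty known field of a post object."""
    if isinstance(item, str):
        return item
    if isinstance(item, dict):
        for field in TEXT_FIELDS:
            value = item.get(field)
            if isinstance(value, str) and value.strip():
                return value
    return None


def collect_texts(run_input: dict) -> List[str]:
    """Gather text and texts together, so both are processed when both are present."""
    candidates = []
    if run_input.get("text"):
        candidates.append(run_input["text"])
    candidates.extend(run_input.get("texts") or [])
    return [t for t in map(pick_text, candidates) if t]


def apply_profanity_mode(text: str, mode: str = "keep") -> str:
    if mode not in {"keep", "soft", "censor", "remove"}:
        raise ValueError(f"unknown profanityMode: {mode}")
    if mode == "keep":
        return text

    def _replace(match):
        word = match.group(0)
        if mode == "soft":
            mild = SOFT_WORDS[word.lower()]
            return mild.capitalize() if word[0].isupper() else mild
        if mode == "censor":
            return word[0] + "*" * (len(word) - 1)  # first letter, then stars
        return ""  # remove

    out = PROFANITY.sub(_replace, text)
    out = re.sub(r"\s+([.,!?])", r"\1", out)  # tidy space left before punctuation
    return re.sub(r"\s{2,}", " ", out).strip()
```

Post objects with none of the known fields are skipped here. Whether the actor skips them or reports an error is not stated in this README.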

Output

One dataset item per input text. The cleaned narration is in cleaned, and ttsSegments is that same text split into sentences if you want to render audio per line. You also get wordCount, sentenceCount, charCount, readTimeSeconds (based on your wpm), a hookScore for the opening line, and the truncated original.
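For a feel of how the count fields relate to each other, here is a small Python sketch that derives them from a cleaned string. The read-time formula (words divided by wpm, times 60, rounded to the nearest second) and the sentence-splitting rule are assumptions. hookScore and the truncated original are left out because this README does not say how they are computed or named.

```python
import re


def narration_stats(cleaned: str, wpm: int = 150) -> dict:
    """Build the count fields of one output item from already-cleaned text."""
    if wpm <= 0:
        raise ValueError("wpm must be positive")
    # Assumed rule: a sentence ends at ., ! or ? followed by whitespace.
    segments = [s for s in re.split(r"(?<=[.!?])\s+", cleaned.strip()) if s]
    word_count = len(cleaned.split())
    return {
        "cleaned": cleaned,
        "ttsSegments": segments,
        "wordCount": word_count,
        "sentenceCount": len(segments),
        "charCount": len(cleaned),
        # Assumed formula: minutes = words / wpm, converted to seconds.
        "readTimeSeconds": round(word_count / wpm * 60),
    }
```

Under that assumed formula, raising wpm shortens readTimeSeconds proportionally, so a 300-word story is about 120 seconds at the default 150 wpm and about 90 seconds at 200 wpm.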

Example

{
  "text": "AITA for leaving? **So** here's the _story_. Check [this](https://x.com).\n\nEdit: thanks for the awards! TL;DR: I left.",
  "expandAbbreviations": true,
  "profanityMode": "soft"
}
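One way to send an input like this from Python is through the apify-client package, sketched below. The actor ID is a placeholder because this README does not state it, and `build_run_input` and `run_cleaner` are hypothetical helper names. The `ApifyClient(...).actor(...).call(run_input=...)` and `dataset(...).iterate_items()` calls are the standard client usage.

```python
# Placeholder: the actor's real ID is not given in this README; copy it from the store page.
ACTOR_ID = "<username>/<actor-name>"

PROFANITY_MODES = {"keep", "soft", "censor", "remove"}


def build_run_input(text: str, expand_abbreviations: bool = True,
                    profanity_mode: str = "keep", wpm: int = 150) -> dict:
    """Assemble an input object using the field names from the Input table."""
    if profanity_mode not in PROFANITY_MODES:
        raise ValueError(f"unknown profanityMode: {profanity_mode}")
    return {
        "text": text,
        "expandAbbreviations": expand_abbreviations,
        "profanityMode": profanity_mode,
        "wpm": wpm,
    }


def run_cleaner(token: str, run_input: dict, actor_id: str = ACTOR_ID) -> list:
    """Start a run, wait for it to finish, and return the dataset items."""
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not complete")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

With a real token and actor ID, `run_cleaner(token, build_run_input("AITA for leaving?", profanity_mode="soft"))` should return one item per input text, with the narration under cleaned.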

Pricing

$0.0002 per text cleaned. Pay per result, no subscription.
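To budget a batch, multiply the number of texts by the per-result price. A tiny Python sketch follows; `estimate_cost` is a hypothetical helper, and it covers only the per-result price quoted above.

```python
from decimal import Decimal

PRICE_PER_TEXT = Decimal("0.0002")  # USD, the per-result price quoted above


def estimate_cost(n_texts: int) -> Decimal:
    """Per-result cost in USD for cleaning n_texts inputs."""
    if n_texts < 0:
        raise ValueError("n_texts must be non-negative")
    return PRICE_PER_TEXT * n_texts
```

At this rate 1,000 texts come to $0.20 and 50,000 texts to $10.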

Notes

Everything here is rule-based, so no OpenAI key is needed and there's no AI step to configure. The trade-off is that abbreviation and profanity handling cover a curated list rather than every possible variant, so an obscure acronym may pass through untouched.