Pricing

from $40.00 / 1,000 voiceover generateds

Try for free

Go to Apify Store

AI Text-to-Speech Voiceover

Try for free

Turns text or a script into a downloadable AI voiceover audio file (MP3, WAV, Opus, or AAC) using OpenAI TTS voices. Built for faceless YouTube narration, IVR phone menus, audiobooks, and batch app prompts.

Pricing

from $40.00 / 1,000 voiceover generateds

Rating

5.0

(2)

Developer

Dami's Studio

Actor stats

Bookmarked

Total users

Monthly active users

17 days ago

Last modified

How it works

The actor sends your text to an OpenAI-compatible TTS endpoint. Long scripts get split at sentence boundaries into chunks under ~3,500 characters, each chunk is synthesized separately, and the parts are stitched back into one file with ffmpeg (using stream copy, so there's no re-encode and no quality loss). Each finished audio file is saved to the run's key-value store and a row is pushed to the dataset.

Input

Nothing is strictly required by the schema, but in practice you need an openaiApiKey and at least one of text or texts. If neither is provided the run errors out.

Field	Required	Notes
`text`	one of `text`/`texts`	The script to voice, as a single string.
`texts`	one of `text`/`texts`	Batch mode. Array of strings, or objects keyed by `script` / `scriptText` / `text` / `narration`. One audio file per item.
`voice`	no	`alloy`, `echo`, `fable`, `onyx`, `nova`, `shimmer`. Default `onyx` (deep male). `nova` and `shimmer` are female.
`model`	no	`tts-1` (fast, default) or `tts-1-hd` (higher quality, costs more on the OpenAI side).
`format`	no	`mp3` (default), `wav`, `opus`, or `aac`.
`speed`	no	Playback speed from 0.25 to 4.0. Default `1.0`. Values outside that range are clamped.
`openaiApiKey`	yes in practice	Your OpenAI key, used for the TTS call. Stored as a secret. Falls back to the `OPENAI_API_KEY` env var if set.
`baseUrl`	no	Advanced. Point at any OpenAI-compatible `/audio/speech` endpoint. Defaults to `https://api.openai.com/v1`.

Output

Each input item produces one audio file in the key-value store and one dataset record. The record includes audioKey and audioUrl (where to fetch the file), durationSeconds, characters, chunks (how many pieces the script was split into), plus the voice, model, and resolved format. Failed items get a record with ok: false and the error message instead of stopping the whole run.

Example

{
  "text": "Welcome back to the channel. Today we're looking at one of the strangest mysteries of the deep ocean.",
  "voice": "onyx",
  "model": "tts-1",
  "format": "mp3",
  "speed": 1.0,
  "openaiApiKey": "sk-..."
}

Pricing

$0.04 per voiceover, pay per result, no subscription. The OpenAI TTS usage is billed separately on your own key.

Notes

This actor calls OpenAI for synthesis, so it needs your own OpenAI API key. Individual chunks are capped at 4,000 characters before they're sent, which keeps each request within the model's per-call limit; there's no hard limit on total script length since long inputs are chunked and concatenated.

Reddit Text Cleaner — TTS-Ready Narration

dami_studio/reddit-text-cleaner

Cleans raw Reddit/forum text into TTS-ready narration: strips markdown, links and edit-stamps, expands abbreviations like AITA, and returns cleaned text plus per-sentence segments for voiceover.

Dami's Studio

5.0

Text to speech generator

akash9078/advanced-text-to-speech

Professional-grade Text-to-Speech (TTS) actor powered by advanced AI models. Convert any text into natural, human-like speech with 50+ premium voices across 9 languages. Perfect for content creation, accessibility, voiceovers, audiobooks, podcasts, and multilingual applications.

Akash Kumar Naik

AI Faceless Video Generator — Reddit/Story to Short

dami_studio/ai-faceless-short-generator

Turn a subreddit, a story, or a topic into a finished faceless short video — script, AI voiceover, cinematic scene images with motion, and word-synced captions, fully automated. For TikTok, Reels, and YouTube Shorts. The complete pipeline in one actor.

Dami's Studio

AI Text to Speech

saswave/ai-text-to-speech

TTS high-performance utility designed to convert written text into natural, human-like speech. Leveraging neural networks, ultra-low latency audio generation and high-fidelity voice synthesis across multiple global languages. Content creators looking to automate voiceover production.

SASWAVE

Text To Speech

vivid_astronaut/text-to-speech

Convert text to natural speech using AI voices. Multiple voices and languages available. Generate audio files for podcasts, videos, accessibility, and voice assistants.

Fabio Suizu

YouTube Music to MP3 Audio Downloader

lurkapi/youtube-music-to-mp3-audio-downloader

Download audio from YouTube Music as MP3, M4A, WAV, FLAC, AAC, OPUS, Vorbis, or ALAC. Paste one or more links, pick format and bitrate, get direct download links. Supports playlists, regular YouTube, and batch processing.

LurkAPI

AI Video To Voiceover Generator

peaceful_pushpins/AI-Video-to-Voiceover-Generator

This Actor uses AI orchestration to turn short product videos into high-quality, ready-to-use voiceover ads. It analyzes visual moments, generates multiple creative ad scripts, and delivers polished audio variants—so you can launch campaigns faster with zero manual effort.

Wasim Safdar

YouTube to MP3 Audio Downloader

lurkapi/youtube-to-mp3-audio-downloader

Download YouTube audio as MP3, AAC (M4A), or WAV. Paste one or more links, pick format and bitrate, get permanent download links. Supports playlists, Shorts, and batch processing.

LurkAPI

312

5.0

Text to Speech

theapicompany/text-to-speech

Transfers your Text input into a MP3 file.This is the Text to Speech API; The Input: { "text": "Your text that will be an audio" } The Output: To get the Output, which is a MP3 Data file, you have to go to Storage, in there you need to click on Key-Value-Storage and Download the file.

Jonah

5.0

YouTube Music Downloader

maximedupre/youtube-music-downloader

Download audio from YouTube Music and YouTube URLs. Save MP3, M4A, AAC, Opus, FLAC, WAV, Vorbis, or ALAC files to Apify storage with title, channel, duration, thumbnail, upload date, likes, comments, and file size.