Pricing

from $20.00 / 1,000 results

Text To Speech

AI Text-to-Speech API that converts written text into high-quality natural voice audio. Supports multiple voices, languages, adjustable speed and pitch, ideal for audiobooks, podcasts, accessibility, automation, and voice-enabled applications.

Pricing

from $20.00 / 1,000 results

Rating

0.0

(0)

Developer

Taher Ali Badnawarwala

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Text to Speech Actor

An Apify Actor that converts text to speech using AI-powered voice synthesis technology. Simply provide the text you want to convert, along with language and voice preferences, and the Actor will generate a high-quality audio file for you.

What This Tool Does

This Actor connects to the MultipleWords text-to-speech API to convert written text into natural-sounding speech. It accepts text input along with language and voice configuration, processes the request through the API, and returns a downloadable audio file along with detailed metadata about the voice generation.

Key Features:

🎙️ Convert text to natural-sounding speech
🚀 Fast and automated audio generation
📦 Structured output with audio URLs and metadata
🔄 Reliable error handling and validation
📊 Complete generation details and timestamps
🌍 Support for multiple languages and voices

Purpose & Use Cases

This tool is designed to help individuals, businesses, and developers automate text-to-speech conversion for various purposes:

Content Creation

Generate voiceovers for videos and presentations
Create audio versions of blog posts and articles
Produce podcast introductions and outros
Generate narration for e-learning courses

Accessibility

Convert written content to audio for visually impaired users
Create audio descriptions for multimedia content
Generate spoken versions of documents and reports
Make content accessible through audio format

Business & Marketing

Create automated phone system messages
Generate voice prompts for IVR systems
Produce marketing audio content
Create audio advertisements and announcements

Development & Automation

Integrate text-to-speech into automated workflows
Batch convert multiple text documents to audio
Create audio content programmatically via API integration
Generate dynamic voice content for applications

Media Production

Quickly generate voiceovers for video projects
Create multiple voice variations for A/B testing
Produce audio content for social media
Generate voice samples and demos

Input Parameters

The Actor requires the following input:

`text` (Required)

Type: String
Description: The text content you want to convert to speech. Can be a single sentence, paragraph, or multiple paragraphs.
Example: "Technology that thinks with you, builds for you, and moves you forward. Welcome to the future—simple, smart, and powered by AI."
Best Practices:

Use clear, well-formatted text
Include proper punctuation for natural pauses
Break long content into paragraphs for better audio flow
Avoid special characters that may cause issues

`language_id` (Required)

Type: String
Description: Language identifier for the speech synthesis (e.g., 'en' for English, 'es' for Spanish, 'fr' for French).
Default: "en"
Example: "en", "es", "fr", "de"

`voice_id` (Required)

Type: String
Description: Voice identifier specifying the voice type to use (e.g., 'male' or 'female').
Default: "male"
Example: "male", "female"

`user_id` (Required)

Type: Integer
Description: User identifier for the API authentication.
Default: 663569
Example: 663569

`isPro` (Required)

Type: Integer
Description: Pro account flag indicating whether to use premium features (0 = No, 1 = Yes).
Default: 0
Valid Values: 0 or 1

Output Structure

The Actor returns structured data containing the generated audio file information:

{
  "status": "success",
  "file_name": "voice_20260127_072606_295b10fe.mp3",
  "access_url": "https://multiplewords.nyc3.cdn.digitaloceanspaces.com/text_to_voice/voice_20260127_072606_295b10fe.mp3",
  "voice_used": "en-AU-WilliamNeural",
  "details": {
    "id": 328186,
    "video_url": "https://multiplewords.nyc3.cdn.digitaloceanspaces.com/text_to_voice/voice_20260127_072606_295b10fe.mp3",
    "source_video_url": "",
    "video_description": "Technology that thinks with you, builds for you, and moves you forward.\nWelcome to the future—simple, smart, and powered by AI.",
    "video_name": "Technology that thinks with you, builds for you, and moves you forward.\nWelcome to the future—simple, smart, and powered by AI.",
    "video_description_to": "",
    "fps": 10,
    "video_length": 5,
    "creativity": 8.5,
    "isMusic": true,
    "isPro": false,
    "video_thumbnail": "https://multiplewords.nyc3.digitaloceanspaces.com/mw_assets/mw_logo_square_.jpg",
    "video_created_at": "2026-01-27T07:26:07.665739Z",
    "video_updated_at": "2026-01-27T07:26:07.665767Z",
    "engine": 35,
    "url_slug": "",
    "job_key": "0",
    "job_status": 1,
    "rating": 0,
    "feedback": "",
    "extra_1": null,
    "extra_2": null,
    "extra_3": null,
    "extra_4": null,
    "extra_5": null,
    "extra_6": null,
    "attempts": 3,
    "is_public": true,
    "isIndexed": false,
    "is_active": true,
    "number_of_times_played": 1,
    "user_id": 663569,
    "video_category_id": 1,
    "language_id": 49
  }
}

Output Fields Explained

status: Success indicator ("success" = successful conversion)
file_name: Name of the generated audio file
access_url: Direct URL to download the generated audio file (MP3 format)
voice_used: The specific voice model used for synthesis (e.g., "en-AU-WilliamNeural")
details: Complete metadata about the audio generation
- id: Unique identifier for the generated audio
- video_url: URL to the audio file (same as access_url)
- video_length: Duration of the audio in seconds
- video_created_at: Timestamp when the audio was created
- job_status: Status of the generation job
- user_id: User identifier used for the request
- language_id: Language identifier used

How to Use

Running Locally

Install dependencies:

$npm install

Run the Actor:

$apify run

Provide input when prompted, or create an input file in storage/key_value_stores/default/INPUT.json:

{
  "language_id": "en",
  "voice_id": "male",
  "user_id": 663569,
  "isPro": 0,
  "text": "Technology that thinks with you, builds for you, and moves you forward. Welcome to the future—simple, smart, and powered by AI."
}

Deploy to Apify Platform

Login to Apify:

$apify login

Deploy the Actor:

$apify push

Run on Apify Console:
- Go to Actors -> My Actors
- Select your Text to Speech Actor
- Enter your text and configuration in the input form
- Click "Start" to generate your audio file

Using via API

Once deployed, you can call the Actor via Apify API:

curl -X POST "https://api.apify.com/v2/acts/YOUR_USERNAME~text-to-speech/run-sync" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "language_id": "en",
    "voice_id": "male",
    "user_id": 663569,
    "isPro": 0,
    "text": "Create a professional voiceover for a tech product launch announcement"
  }'

Integration Examples

With Make (Integromat)

Connect the Actor to your Make workflows
Automatically generate voiceovers when new content is created
Send generated audio files to storage services or email
Create automated content pipelines with text-to-speech

With Zapier

Trigger audio generation from form submissions
Automatically add audio files to Google Drive or Dropbox
Send audio notifications via Slack or email
Convert blog posts to audio automatically

With Custom Applications

Integrate via Apify API into your web applications
Batch convert multiple text documents to audio
Create automated voiceover workflows
Build custom text-to-speech services

Technical Details

Runtime: Node.js 18+
Dependencies: Apify SDK v3.5.2+
API Endpoint: https://shorts.multiplewords.com/mwvideos/api/voice_from_text
Request Method: POST
Content Type: multipart/form-data
Response Format: JSON
Audio Format: MP3

Error Handling

The Actor includes comprehensive error handling:

Validates all required input parameters before processing
Handles API errors gracefully with detailed error messages
Provides informative error messages in logs
Returns appropriate exit codes for debugging
Validates API response status and structure

Resources

Support

For issues, questions, or feature requests, please refer to the Apify documentation or community forums.

Built with ❤️ using Apify SDK

Text To Speech

vivid_astronaut/text-to-speech

Convert text to natural speech using AI voices. Multiple voices and languages available. Generate audio files for podcasts, videos, accessibility, and voice assistants.

Fabio Suizu

Text to speech generator

akash9078/advanced-text-to-speech

Professional-grade Text-to-Speech (TTS) actor powered by advanced AI models. Convert any text into natural, human-like speech with 50+ premium voices across 9 languages. Perfect for content creation, accessibility, voiceovers, audiobooks, podcasts, and multilingual applications.

Akash Kumar Naik

Speech To Text

vivid_astronaut/speech-to-text

Convert speech to text with high accuracy using Azure AI. Supports 100+ languages, speaker detection, and timestamps. Perfect for transcription, subtitles, and voice-to-text applications.

Fabio Suizu

Google Free Text to Speech

jupri/google-speech

Use free Google Text to Speech to translate text into voice

cat

202

Text to Speech Generator

moving_beacon-owner1/my-actor-30

Convert text into natural-sounding speech in multiple languages with ease.

Jamshaid Arif

Hugging Face Audio AI

alizarin_refrigerator-owner/hugging-face-audio-ai

Audio w/Hugging Face models speech recognition, text-to-speech & audio analysis Speech-to-Text: Transcribe audio Text-to-Speech: Generate natural speech Audio Classification: Classify sounds Voice Activity Detection: Detect speech Speaker Diarization: Identify speakers Music Generation: Create music

The Howlers

Text-to-Speech Generator (OpenAI voice generator)

stanvanrooy6/text-to-speech-generator-openai-voice-generator

Convert text to speech effortlessly with our OpenAI voice generator. Choose from 6 English-optimized voices, customize settings, and get high-quality audio files fast. Simple to use, integrates with your OpenAI API key.

Stan Van Rooy

1.0

VoiceClonerTTS

lucymakeit/voiceclonertts

High-quality text-to-speech API with voice cloning.

Lucy Paureau

Speech AI MCP Server

vivid_astronaut/pronunciation-assessment-mcp

Speech AI MCP server with 9 tools: pronunciation scoring (0-100 at phoneme/word/sentence level), speech-to-text with timestamps, text-to-speech with 12 English voices, and multilingual Whisper transcription (99 languages + speaker diarization). Sub-300ms latency. Pay-per-use: $0.02/call.

Fabio Suizu

Text to Speech

theapicompany/text-to-speech

Transfers your Text input into a MP3 file.This is the Text to Speech API; The Input: { "text": "Your text that will be an audio" } The Output: To get the Output, which is a MP3 Data file, you have to go to Storage, in there you need to click on Key-Value-Storage and Download the file.

Jonah

5.0

Text To Speech

Text to Speech Actor

What This Tool Does

Purpose & Use Cases

Content Creation

Accessibility

Business & Marketing

Development & Automation

Media Production

Input Parameters

text (Required)

language_id (Required)

voice_id (Required)

user_id (Required)

isPro (Required)

Output Structure

Output Fields Explained

How to Use

Running Locally

Deploy to Apify Platform

Using via API

Integration Examples

With Make (Integromat)

With Zapier

With Custom Applications

Technical Details

Error Handling

Resources

Support

You might also like

Text To Speech

Text to speech generator

Speech To Text

Google Free Text to Speech

Text to Speech Generator

Hugging Face Audio AI

Text-to-Speech Generator (OpenAI voice generator)

VoiceClonerTTS

Speech AI MCP Server

Text to Speech

`text` (Required)

`language_id` (Required)

`voice_id` (Required)

`user_id` (Required)

`isPro` (Required)