Pricing

from $10.00 / 1,000 results

Speech To Text

Convert speech to text with high accuracy using Azure AI. Supports 100+ languages, speaker detection, and timestamps. Perfect for transcription, subtitles, and voice-to-text applications.

Developer

Fabio Suizu

Last modified: 2 months ago

Speech to Text - Audio Transcription

Convert audio files to text using AI-powered speech recognition. Supports multiple languages and engines.

Features

Fast Processing: Fast audio transcription powered by Azure
Reliable: 99.9% uptime with automatic failover
Scalable: Handle single requests or bulk operations
Secure: Enterprise-grade security with API key authentication
Well Documented: Comprehensive API documentation and examples

Use Cases

Content Generation: Automate content creation workflows
Data Analysis: Extract insights from unstructured data
Automation: Integrate AI capabilities into your apps

Input Parameters

Parameter	Type	Required	Description
`audioUrl`	string	No	URL of the audio file to transcribe
`audioBase64`	string	No	Base64-encoded audio data (alternative to URL)
`language`	string	No	Language code (e.g., 'en', 'es', 'fr'). Leave empty to auto-detect the language
`includeSegments`	boolean	No	Include time-stamped segments in the response
`engine`	string	No	Speech recognition engine to use
`detectLanguageOnly`	boolean	No	Only detect the language without full transcription
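
The parameters above can be assembled programmatically. The following is a minimal sketch, not the Actor's own code: `build_input` is a hypothetical helper, and it assumes that exactly one of `audioUrl` or `audioBase64` should be sent and that `audioBase64` expects standard Base64 without a data-URI prefix. Neither assumption is confirmed by the table.

```python
import base64
from typing import Optional


def build_input(
    audio_url: Optional[str] = None,
    audio_bytes: Optional[bytes] = None,
    language: str = "",
    include_segments: bool = False,
    detect_language_only: bool = False,
) -> dict:
    """Build an Actor input dict using the parameter names from the table."""
    # Assumption: exactly one audio source is supplied per run.
    if (audio_url is None) == (audio_bytes is None):
        raise ValueError("Provide exactly one of audio_url or audio_bytes")

    run_input = {
        "includeSegments": include_segments,
        "detectLanguageOnly": detect_language_only,
    }
    if audio_url is not None:
        run_input["audioUrl"] = audio_url
    else:
        # Assumption: plain Base64 text, no "data:audio/...;base64," prefix.
        run_input["audioBase64"] = base64.b64encode(audio_bytes).decode("ascii")
    if language:
        # An empty language is left out so the language can be auto-detected.
        run_input["language"] = language
    return run_input
```

For a local file, read it in binary mode and pass the bytes: `build_input(audio_bytes=open("clip.mp3", "rb").read())`. The `engine` parameter is left out here because the table does not list its accepted values.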

Output Format

{
  "success": true,
  "result": { ... },
  "timestamp": "2026-01-07T00:00:00Z"
}
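
A short sketch of how a client might check this envelope before using it. `parse_item` is a hypothetical helper. It assumes each dataset item has the shape shown above, and it treats `result` as opaque because its fields are not documented here.

```python
from datetime import datetime


def parse_item(item: dict) -> dict:
    """Validate one output item and parse its timestamp."""
    # Assumption: a missing or false "success" flag means the run failed.
    if not item.get("success"):
        raise RuntimeError("Transcription did not succeed")
    # Replace the trailing "Z" so fromisoformat also works before Python 3.11.
    parsed = datetime.fromisoformat(item["timestamp"].replace("Z", "+00:00"))
    return {"result": item.get("result"), "timestamp": parsed}
```

Raising on a failed item keeps error handling in one place, so calling code only ever sees successful results.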

Code Examples

JavaScript (Node.js)

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const input = {
  // "audioBase64" is the alternative to "audioUrl" for inline audio data.
  "audioUrl": "https://example.com/audio.mp3",
  "language": "en",
  "includeSegments": true,
  "engine": "azure",
  "detectLanguageOnly": false
};

const run = await client.actor("vivid_astronaut/speech-to-text").call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

run_input = {
    # "audioBase64" is the alternative to "audioUrl" for inline audio data.
    "audioUrl": "https://example.com/audio.mp3",
    "language": "en",
    "includeSegments": True,
    "engine": "azure",
    "detectLanguageOnly": False,
}

run = client.actor("vivid_astronaut/speech-to-text").call(run_input=run_input)

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

cURL

curl -X POST "https://api.apify.com/v2/acts/vivid_astronaut~speech-to-text/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
  "audioUrl": "https://example.com/audio.mp3",
  "language": "en",
  "includeSegments": true,
  "engine": "azure",
  "detectLanguageOnly": false
}'

Pricing

Model: Pay per result
Price: $0.020 per result

You only pay for successful results. Platform usage costs are included.
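
As a rough budgeting aid, the per-result price can be turned into an estimate. This sketch assumes the $0.020 rate stated in this section and that only successful results are billed. `estimate_cost` is a hypothetical helper, and the constant should be adjusted if a different rate applies to your plan.

```python
from decimal import Decimal

# Rate taken from this Pricing section; change it if your rate differs.
PRICE_PER_RESULT = Decimal("0.020")


def estimate_cost(successful_results: int) -> Decimal:
    """Return the estimated charge in USD for a number of successful results."""
    if successful_results < 0:
        raise ValueError("successful_results must be non-negative")
    return PRICE_PER_RESULT * successful_results
```

`Decimal` is used instead of `float` so that currency arithmetic does not accumulate binary rounding errors.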

API Documentation

Full API documentation is available at:

Support

Issues: Report bugs via Apify Console
Documentation: Apify Docs
Community: Apify Discord

Version History

See ./CHANGELOG.md for version history.

Powered by Azure Cloud Infrastructure
