Deprecated

Pricing

Pay per event

See alternative Actors

Go to Apify Store

Sonartext Speech To Text

Deprecated

See alternative Actors

SonarText Speech to Text Transcription Service

Pricing

Pay per event

Rating

0.0

(0)

Developer

Kyle

Actor stats

Bookmarked

Total users

Monthly active users

4 months ago

Last modified

Categories

Automation

Developer tools

Input Method

inputMethod

Required

How to provide the audio/video file

Type:string

Default:file_upload

Options:

file_uploadurlyoutubetwittergdrives3

Audio/Video File

audioFile

Optional

Upload your audio or video file (up to 2GB). Required when Input Method is 'file_upload'.

Type:string

File URL

fileUrl

Optional

Direct URL to audio/video file. Required when Input Method is 'url'.

Type:string

YouTube URL

youtubeUrl

Optional

YouTube video URL (e.g. https://youtube.com/watch?v=...). Required when Input Method is 'youtube'.

Type:string

Twitter/X URL

twitterUrl

Optional

Twitter or X post URL with video. Required when Input Method is 'twitter'.

Type:string

Google Drive URL

gdriveUrl

Optional

Google Drive shareable link to audio/video file. Required when Input Method is 'gdrive'.

Type:string

AWS S3 URL

s3Url

Optional

AWS S3 URL or presigned URL to file. Required when Input Method is 's3'.

Type:string

Language

language

Optional

Language of the audio (leave blank for auto-detect)

Type:string

Default:

Options:

enesfrdeitptrujakozharhinlsvnodafiplcshurobghrsksletlvlt

Timestamps

timestamps

Optional

Include timestamps in the transcription

Type:string

Default:segment

Options:

nonesegmentwordboth

Speaker Diarization

speakerDiarization

Optional

Identify and separate different speakers in the audio

Type:boolean

Default:false

Minimum Speakers

minSpeakers

Optional

Minimum number of speakers expected (only used when Speaker Diarization is enabled)

Type:integer

Minimum:1

Maximum:10

Default:1

Maximum Speakers

maxSpeakers

Optional

Maximum number of speakers expected (only used when Speaker Diarization is enabled)

Type:integer

Minimum:1

Maximum:20

Default:5

Response Format

responseFormat

Optional

Output format for the transcription

Type:string

Default:json

Options:

jsontextsrtvtt

Maximum Cost (cents)

maxCostCents

Optional

Optional cost limit in cents to prevent unexpected charges

Type:integer

Minimum:1

Maximum:10000

Default:500

Webhook URL

webhookUrl

Optional

Optional URL to receive completion notification

Type:string

Universal Speech to Text Transcriber

tictechid/vanzi-universal-transcriber

Transcribe audio from videos stored on Google Drive, Dropbox, GitHub raw, OneDrive, Box, iCloud, AWS S3, GCS, Azure Blob, and Backblaze B2. Convert share links to direct downloads for fast, accurate transcripts with timestamps and easy API integration.

TicTech

5.0

(1)

Speech to Text Converter (Transcript / Captcha)

saswave/speech-to-text-converter

Transform audio records to text. Get transcription from sales or customer success teams audio files. Get Captcha text from captcha audio challenge. Speech to text converter helps you analyse, build KPI with audio records and bypass captcha.

SASWAVE

Speech To Text

vivid_astronaut/speech-to-text

Convert speech to text with high accuracy using Azure AI. Supports 100+ languages, speaker detection, and timestamps. Perfect for transcription, subtitles, and voice-to-text applications.

Fabio Suizu

Video Transcriber: Instagram, X, Facebook, TikTok

invideoiq/video-transcriber

Retrieves transcripts from online video content from multiple plateforms (Instagram, X, ..) using speech-to-text models. It delivers outputs in JSON and LLM-ready formats, making it ideal for analytics, and AI-based applications. Perfect for research and building intelligent conversational agents

InVideoIQ

286

4.9

(5)

Video to Text Transcription

aizen0/video-to-text-transcription

Convert video speech to text in bulk. Supports Only Twitter/Instagram, auto-detects languages, handles large files automatically. Uses OpenAI Whisper for high accuracy.

Aizen

Instagram Content Intelligence Pro

sian.agency/instagram-content-intelligence-pro

Revolutionary AI system that delivers comprehensive speech-to-text transcription combined with premium data analytics. Pay only for successful results - no processing fees, no setup costs.

SIÁN OÜ

5.0

(1)

🏁 TikTok Video Transcriber & Downloader +12 Languages

ingeniela/tiktok-video-transcriber

Download TikTok videos without watermark & get AI transcriptions with timestamps. Extract subtitles, captions & keywords. Multi-language speech-to-text converter. Direct download links included.

Ingeniela

Instagram To Text

cheapget/instagram-to-text

AI-powered video transcription and translation. Convert video speech to text with timestamped subtitles in 100+ languages

CheapGET

5.0

(1)

Twilio API Actor

alizarin_refrigerator-owner/twilio-api-actor

Access Twilio communication data including calls, SMS/MMS, recordings, transcriptions & usage analytics. Call Logs Detailed call history SMS/MMS Message history & sending Recordings Call recordings Transcriptions Speech-to-text Account phone numbers Billing usage data Lookup Phone number validation

The Howlers

Hugging Face Audio AI

alizarin_refrigerator-owner/hugging-face-audio-ai

Audio w/Hugging Face models speech recognition, text-to-speech & audio analysis Speech-to-Text: Transcribe audio Text-to-Speech: Generate natural speech Audio Classification: Classify sounds Voice Activity Detection: Detect speech Speaker Diarization: Identify speakers Music Generation: Create music

The Howlers

Speech AI MCP Server

vivid_astronaut/pronunciation-assessment-mcp

Speech AI MCP server with 9 tools: pronunciation scoring (0-100 at phoneme/word/sentence level), speech-to-text with timestamps, text-to-speech with 12 English voices, and multilingual Whisper transcription (99 languages + speaker diarization). Sub-300ms latency. Pay-per-use: $0.02/call.

Fabio Suizu