Sonartext Speech To Text
Pricing
Pay per event
Go to Apify Store
Sonartext Speech To Text
Under maintenanceSonarText Speech to Text Transcription Service
0.0 (0)
Pricing
Pay per event
1
2
2
Last modified
16 days ago
Pricing
Pay per event
SonarText Speech to Text Transcription Service
0.0 (0)
Pricing
Pay per event
1
2
2
Last modified
16 days ago
inputMethodEnumRequired
How to provide the audio/video file
"file_upload": string"url": string"youtube": string"twitter": string"gdrive": string"s3": stringDefault value of this property is "file_upload"
audioFilestringOptional
Upload your audio or video file (up to 2GB). Required when Input Method is 'file_upload'.
youtubeUrlstringOptional
YouTube video URL (e.g. https://youtube.com/watch?v=...). Required when Input Method is 'youtube'.
twitterUrlstringOptional
Twitter or X post URL with video. Required when Input Method is 'twitter'.
gdriveUrlstringOptional
Google Drive shareable link to audio/video file. Required when Input Method is 'gdrive'.
s3UrlstringOptional
AWS S3 URL or presigned URL to file. Required when Input Method is 's3'.
languageEnumOptional
Language of the audio (leave blank for auto-detect)
"en": string"es": string"fr": string"de": string"it": string"pt": string"ru": string"ja": string"ko": string"zh": string"ar": string"hi": string"nl": string"sv": string"no": string"da": string"fi": string"pl": string"cs": string"hu": string"ro": string"bg": string"hr": string"sk": string"sl": string"et": string"lv": string"lt": stringDefault value of this property is ""
timestampsEnumOptional
Include timestamps in the transcription
"none": string"segment": string"word": string"both": stringDefault value of this property is "segment"
speakerDiarizationbooleanOptional
Identify and separate different speakers in the audio
Default value of this property is false
minSpeakersintegerOptional
Minimum number of speakers expected (only used when Speaker Diarization is enabled)
Default value of this property is 1
maxSpeakersintegerOptional
Maximum number of speakers expected (only used when Speaker Diarization is enabled)
Default value of this property is 5
responseFormatEnumOptional
Output format for the transcription
"json": string"text": string"srt": string"vtt": stringDefault value of this property is "json"