Sonartext Speech To Text
Pricing
Pay per event
Go to Apify Store
Pricing
Pay per event
Rating
0.0
(0)
Developer

Kyle
Maintained by Community
Actor stats
1
Bookmarked
6
Total users
3
Monthly active users
14 days ago
Last modified
Categories
Share
Pricing
Pay per event
Pricing
Pay per event
Rating
0.0
(0)
Developer

Kyle
Actor stats
1
Bookmarked
6
Total users
3
Monthly active users
14 days ago
Last modified
Categories
Share
inputMethodEnumRequired
How to provide the audio/video file
"file_upload": string"url": string"youtube": string"twitter": string"gdrive": string"s3": stringDefault value of this property is "file_upload"
audioFilestringOptional
Upload your audio or video file (up to 2GB). Required when Input Method is 'file_upload'.
youtubeUrlstringOptional
YouTube video URL (e.g. https://youtube.com/watch?v=...). Required when Input Method is 'youtube'.
twitterUrlstringOptional
Twitter or X post URL with video. Required when Input Method is 'twitter'.
gdriveUrlstringOptional
Google Drive shareable link to audio/video file. Required when Input Method is 'gdrive'.
s3UrlstringOptional
AWS S3 URL or presigned URL to file. Required when Input Method is 's3'.
languageEnumOptional
Language of the audio (leave blank for auto-detect)
"en": string"es": string"fr": string"de": string"it": string"pt": string"ru": string"ja": string"ko": string"zh": string"ar": string"hi": string"nl": string"sv": string"no": string"da": string"fi": string"pl": string"cs": string"hu": string"ro": string"bg": string"hr": string"sk": string"sl": string"et": string"lv": string"lt": stringDefault value of this property is ""
timestampsEnumOptional
Include timestamps in the transcription
"none": string"segment": string"word": string"both": stringDefault value of this property is "segment"
speakerDiarizationbooleanOptional
Identify and separate different speakers in the audio
Default value of this property is false
minSpeakersintegerOptional
Minimum number of speakers expected (only used when Speaker Diarization is enabled)
Default value of this property is 1
maxSpeakersintegerOptional
Maximum number of speakers expected (only used when Speaker Diarization is enabled)
Default value of this property is 5
responseFormatEnumOptional
Output format for the transcription
"json": string"text": string"srt": string"vtt": stringDefault value of this property is "json"