YouTube Transcriber
Pricing
from $5.00 / 1,000 caption transcripts
Transcribe YouTube videos. Captions when available, OpenAI Whisper fallback (BYOK) for the rest. No YouTube account needed.
YouTube video URLs (any standard format) or bare 11-character video IDs. Each URL is canonicalized and validated against a strict hostname allowlist before any download.
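The canonicalize-then-validate step described above could look roughly like the following. This is a minimal sketch, not the actor's actual code: the hostname allowlist, the supported path shapes (`/watch`, `/shorts/`, `/embed/`, `/live/`), the canonical output form, and the names `ALLOWED_HOSTS` and `canonicalize` are all assumptions made for illustration.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumed allowlist; the actor's real list is not documented here.
ALLOWED_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}
VIDEO_ID_RE = re.compile(r"[A-Za-z0-9_-]{11}")


def canonicalize(value: str) -> Optional[str]:
    """Return a canonical watch URL, or None if the input is rejected."""
    value = value.strip()
    if VIDEO_ID_RE.fullmatch(value):
        return f"https://www.youtube.com/watch?v={value}"  # bare video ID
    try:
        parsed = urlparse(value)
        host = (parsed.hostname or "").lower()
    except ValueError:
        return None  # malformed URL
    # Exact-match the hostname so look-alikes such as
    # youtube.com.evil.example are rejected.
    if parsed.scheme not in ("http", "https") or host not in ALLOWED_HOSTS:
        return None
    parts = parsed.path.strip("/").split("/")
    if host == "youtu.be":
        candidate = parts[0]
    elif parsed.path == "/watch":
        candidate = parse_qs(parsed.query).get("v", [""])[0]
    elif len(parts) >= 2 and parts[0] in ("shorts", "embed", "live"):
        candidate = parts[1]
    else:
        candidate = ""
    if not VIDEO_ID_RE.fullmatch(candidate):
        return None
    return f"https://www.youtube.com/watch?v={candidate}"
```

Matching the hostname exactly, rather than with a substring or suffix check, is what keeps the allowlist strict.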
BCP-47 language code (e.g. 'en', 'es', 'fr'). When the requested caption language isn't available, the actor falls through to Whisper.
auto = captions first, fall through to Whisper. captions = captions only. whisper = Whisper only.
Your OpenAI API key. Required when transcriptMethod='whisper'. Otherwise optional: it is needed only when a video has no captions in your preferred language and you want Whisper to transcribe it. Without a key, captions still work, and videos without captions are skipped with a clear reason. The key is never logged or stored.
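The interaction between the transcript method and the API key can be summarized as a small decision function. This is a sketch under stated assumptions: `choose_method` and its parameter names are hypothetical, and raising an error when 'whisper' is selected without a key is an assumed behavior, since the listing only says the key is required in that mode.

```python
from typing import Optional


def choose_method(transcript_method: str,
                  captions_available: bool,
                  openai_api_key: Optional[str]) -> str:
    """Return 'captions', 'whisper', or 'skip' for a single video."""
    if transcript_method not in ("auto", "captions", "whisper"):
        raise ValueError(f"unknown transcript method: {transcript_method!r}")
    if transcript_method == "whisper":
        if not openai_api_key:
            # Assumed behavior: fail fast instead of skipping every video.
            raise ValueError("an OpenAI API key is required for whisper mode")
        return "whisper"
    if captions_available:
        return "captions"  # both 'auto' and 'captions' prefer captions
    if transcript_method == "auto" and openai_api_key:
        return "whisper"   # fall through to Whisper only when a key exists
    return "skip"          # no captions and no usable fallback
```

Here `captions_available` stands for "captions exist in the requested language", matching the fall-through rule for the language setting.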
OpenAI Whisper model. Only whisper-1 supports verbose_json segment timestamps required by the JSON output format.
text = plain transcript. json = array of {start, end, text} segments.
When outputFormat=text, prefix each segment with [HH:MM:SS]. Has no effect on JSON output (timestamps always present there).
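To illustrate the two output formats, the sketch below renders a list of {start, end, text} segments as plain text, optionally prefixing each with [HH:MM:SS]. The function names, the `include_timestamps` flag, and the choice to put one segment per line are assumptions for illustration; the actor's exact text layout is not documented here.

```python
from typing import Dict, List


def format_timestamp(seconds: float) -> str:
    """Format a start offset in seconds as [HH:MM:SS], truncating fractions."""
    hours, remainder = divmod(int(seconds), 3600)
    minutes, secs = divmod(remainder, 60)
    return f"[{hours:02d}:{minutes:02d}:{secs:02d}]"


def render_text(segments: List[Dict], include_timestamps: bool = False) -> str:
    """Render segments as plain text, one segment per line (assumed layout)."""
    lines = []
    for segment in segments:
        text = segment["text"].strip()
        if include_timestamps:
            text = f"{format_timestamp(segment['start'])} {text}"
        lines.append(text)
    return "\n".join(lines)


# Example segments in the JSON output shape described above.
segments = [
    {"start": 0.0, "end": 4.2, "text": "Welcome back."},
    {"start": 3725.9, "end": 3730.0, "text": "Thanks for watching."},
]
print(render_text(segments, include_timestamps=True))
```

With JSON output the segments would be emitted as-is, which is why the timestamp setting has no effect there.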
Skip videos longer than this many minutes. The default of 18 keeps audio under OpenAI Whisper's 25 MB hard limit even at higher bitrates.
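The relationship between duration, bitrate, and the 25 MB limit is simple arithmetic: size in bytes is roughly duration in seconds times bitrate in bits per second, divided by 8. The sketch below assumes the duration setting is in minutes, the audio is constant-bitrate, and 25 MB means 25 × 1024 × 1024 bytes; the function names are hypothetical, and container overhead is ignored.

```python
# Assumption: the limit is interpreted as 25 MiB.
WHISPER_LIMIT_BYTES = 25 * 1024 * 1024


def estimated_audio_bytes(duration_minutes: float, bitrate_kbps: float) -> int:
    """Estimate file size for constant-bitrate audio, ignoring overhead."""
    return int(duration_minutes * 60 * bitrate_kbps * 1000 / 8)


def fits_whisper_limit(duration_minutes: float, bitrate_kbps: float) -> bool:
    return estimated_audio_bytes(duration_minutes, bitrate_kbps) <= WHISPER_LIMIT_BYTES


print(estimated_audio_bytes(18, 128))  # 17280000 bytes, well under the limit
print(fits_whisper_limit(18, 192))     # True: 25920000 <= 26214400
print(fits_whisper_limit(30, 128))     # False: 28800000 > 26214400
```

Under these assumptions an 18-minute video stays within the limit up to roughly 190 kbps, which leaves headroom over typical audio-stream bitrates.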
Cap on total minutes of audio sent to Whisper. Bounds your OpenAI bill per run. Set 0 for unlimited.
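A per-run cap like this can be enforced with a running total that is checked before each Whisper call. The following is a minimal sketch; the class name `WhisperBudget` and the policy of skipping any video that would push the total over the cap are assumptions, not the actor's documented behavior.

```python
class WhisperBudget:
    """Track minutes of audio sent to Whisper in a run; 0 means unlimited."""

    def __init__(self, max_minutes: float) -> None:
        self.max_minutes = max_minutes
        self.used_minutes = 0.0

    def try_reserve(self, duration_minutes: float) -> bool:
        """Reserve minutes for one video; return False if it would exceed the cap."""
        over_cap = self.used_minutes + duration_minutes > self.max_minutes
        if self.max_minutes > 0 and over_cap:
            return False
        self.used_minutes += duration_minutes
        return True


budget = WhisperBudget(max_minutes=30)
print(budget.try_reserve(18))  # True: 18 of 30 minutes used
print(budget.try_reserve(15))  # False: 33 minutes would exceed the cap
print(budget.try_reserve(10))  # True: 28 of 30 minutes used
```

Reserving before the call, instead of tallying afterwards, keeps the bill bounded even when a late video is long.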