Video To Text
Pricing
from $0.30 / transcript
Video To Text
Transcribe videos from 1,000+ platforms to text — auto language detection, timestamps, subtitle file download, and translation to 100+ languages. No file uploads. $0.30 per video.
Pricing
from $0.30 / transcript
Rating
1.9
(2)
Developer
TrueFetch
Actor stats
9
Bookmarked
169
Total users
9
Monthly active users
13 hours ago
Last modified
Categories
Share
Affiliate disclosure: Apify links on this page may include referral parameters. If you sign up through one of these links, TrueFetch may earn a commission from Apify at no extra cost to you. Pricing, features, and Actor access are unaffected.
Video To Text — Video Transcription: Transcript, Timestamps & Translation
Provide a video URL or uploaded file and transcribe it using AI speech recognition — with full transcript, timecoded segments, and optional translation into any language — in clean structured JSON.
TL;DR (40-word answer): Video To Text is a no-login Apify Actor that transcribes videos using AI speech recognition at $0.30 per transcript (+$0.10 for translation), returning full transcript text, timecoded segments, and optional translated text for any video URL or file. Last verified 2026-05-02.
What it is NOT: Video To Text is not a subtitle extractor (use Video Subtitles Downloader for existing SRT files), not a video downloader (use Best Video Downloader for that), and not a live transcription service — it transcribes pre-recorded video audio using Whisper AI.
Video To Text is an Apify Actor that uses AI speech recognition (Whisper) to transcribe video URLs and files into full text with timecoded segments, plus optional language translation. Maintained by TrueFetch on the Apify Store.
🏷️ Why choose Video To Text
Video To Text is best for content creators, researchers, journalists, and AI agents who need AI-generated transcripts from video content without deploying a Whisper server.
- 🎙️ AI speech recognition — uses Whisper for high-accuracy transcription across languages, regardless of whether the video has existing subtitles.
- 🕐 Timecoded segments — output includes both full transcript text and segment-level timestamps (
start,end,text). - 🌐 Optional translation — translate the transcript into any supported language at $0.10 additional per video.
- ⚡ Compute-based pricing — $0.30 per transcript (+ $0.10/translation + small actor runtime charge). Free Apify plan includes $5/month credits ≈ 16 free transcriptions.
🚀 Quick Start — run in 3 steps
Run in Apify Console
- Open the Actor — go to Video To Text on Apify and click Try for free.
- Configure inputs — paste a video URL (or provide a
video_fileURL), and optionally set atranslatetarget language. - Click Start — the Actor returns structured transcript data with timecoded segments in the dataset.
Run via Apify API
curl -X POST "https://api.apify.com/v2/acts/TrueFetch~video-to-text/run-sync-get-dataset-items?token=YOUR_APIFY_TOKEN" \-H "Content-Type: application/json" \-d '{"video_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ","translate": "spanish"}'
Run via MCP (for AI agents)
Video To Text is callable from Claude, ChatGPT, Cursor, and any MCP-compatible client through the Apify MCP Server. Reference the Actor as TrueFetch/video-to-text and pass the same JSON input as above.
⚙️ Input parameters
| Parameter | Type | Required | Default | Description | Example |
|---|---|---|---|---|---|
video_url | string | ✅ (or file) | — | URL of the video to transcribe (YouTube, Vimeo, and other yt-dlp-supported platforms). | https://youtu.be/dQw4w9WgXcQ |
video_file | string | ✅ (or url) | — | Direct URL or local path to a video/audio file to transcribe. | https://example.com/video.mp4 |
translate | string | ❌ | — | Target language for translation (e.g. "spanish", "french", "chinese_simplified"). Leave empty to skip translation. | spanish |
At least one of
video_urlorvideo_fileis required.
{"video_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ","translate": "spanish"}
📦 Output dataset
Each run writes one record per video to the default Apify dataset, exportable as JSON, JSONL, CSV, Excel, XML, RSS, or via the dataset API. The output contains the original-language transcription and (if requested) the translated transcription.
| Field | Type | Always present? | Description | Example |
|---|---|---|---|---|
processor | string | ✅ | Apify Actor URL that produced this record. | https://apify.com/TrueFetch/video-to-text |
processed_at | string | ✅ | ISO 8601 UTC timestamp. | 2026-04-24T09:30:00+00:00 |
platform | string | optional | Source platform. | Youtube |
title | string | optional | Video title. | Rick Astley - Never Gonna Give You Up |
author | string | optional | Video uploader. | Rick Astley |
thumbnail | string | optional | Video thumbnail URL. | https://... |
transcription | object | optional | Original-language transcription: language, text, segments[]. | {"language": "English", "text": "...", ...} |
transcription.text | string | optional | Full transcript text. | "Never gonna give you up..." |
transcription.segments | array | optional | Timecoded segments: [{"start": "00:00:01.000", "end": "00:00:03.000", "text": "..."}] | [...] |
translation | object | optional | Translated transcription (if translate was provided): same structure. | {"language": "Spanish", "text": "...", ...} |
Sample output
{"processor": "https://apify.com/TrueFetch/video-to-text","processed_at": "2026-04-24T09:30:00+00:00","title": "Rick Astley - Never Gonna Give You Up","transcription": {"language": "English","text": "Never gonna give you up, never gonna let you down...","segments": [{ "start": "00:00:01.000", "end": "00:00:03.000", "text": "Never gonna give you up" }]},"translation": {"language": "Spanish","text": "Nunca voy a abandonarte...","segments": [...]}}
💡 Use cases
For content creators repurposing video into written content
- Problem: a podcaster wants to turn a YouTube video into a blog post with a full transcript.
- How Video To Text solves it: run with
video_urland gettranscription.textfor immediate use. - Outcome: full transcript at $0.30 per video.
For multilingual content teams
- Problem: a marketing team needs an English podcast transcribed and translated into Spanish and French.
- How Video To Text solves it: run twice — once with
translate: "spanish"and once withtranslate: "french". - Outcome: two translated transcripts at $0.40 each ($0.30 transcript + $0.10 translation).
For AI agents answering questions about video content
- Problem: an LLM agent needs the spoken content of a video to answer a user's question.
- How Video To Text solves it: call the Actor over the Apify MCP Server with the video URL.
- Outcome: full spoken transcript grounded in AI assistants.
💰 Pricing, limits, and performance
| Fact | Value |
|---|---|
| Pricing model | Compute-based |
| Per-transcript price | $0.30 + small actor runtime charge |
| Per-translation price | $0.10 additional (optional) |
| Free-tier monthly allowance | $5 Apify credits ≈ ~16 free video transcriptions / month |
| Login / cookies required | No — works without a platform account |
| Supported platforms | YouTube, Vimeo, and 1000+ yt-dlp-supported sites; also direct file URLs |
| Export formats | JSON, JSONL, CSV, Excel, XML, RSS, dataset API |
⚖️ Compared with alternatives
| Capability | Video To Text | Manual transcription | OpenAI Whisper API |
|---|---|---|---|
| AI speech-to-text (works even without existing subs) | ✅ | ❌ | ✅ |
| Timecoded segments | ✅ | manual | ✅ |
| Built-in translation | ✅ | ❌ | ❌ (separate step) |
| No API key / infra setup required | ✅ | N/A | ❌ (requires OpenAI key) |
| Setup time to first result | < 2 minutes | hours | requires dev setup |
Verdict. Video To Text is ideal for content teams and AI agents who need AI-powered video transcription and translation without setting up Whisper or managing API keys.
⚠️ Common mistakes to avoid
- Not providing
video_urlorvideo_file. At least one is required. - Invalid
translatelanguage. Use supported language names — e.g."spanish","french","chinese_simplified". - Expecting subtitle extraction. This Actor uses AI speech recognition — for extracting existing SRT subtitles, use Video Subtitles Downloader.
- Long videos = longer runtime. Transcription time scales with video length; budget accordingly.
🛠️ Troubleshooting
| Problem | Likely cause | Fix |
|---|---|---|
Provide either 'video_url' or 'video_file' | Neither input was provided. | Add "video_url": "https://..." or "video_file": "https://..." to your input. |
'translate' must be one of: [...] | Unsupported translation language. | Use a language name from the supported list (e.g. "spanish", "french"). |
| Empty transcription | Video has no audio or is very short. | Verify the video has audible speech. Short clips may return empty transcription. |
| Slow run | Long video or heavy audio. | Transcription time scales with video length — longer videos take more time. |
❓ FAQ
What data does Video To Text return?
Full transcript text, timecoded segments (start, end, text), detected language, and video metadata (title, author, thumbnail) — plus a translated version if translate was provided.
How much does Video To Text cost?
$0.30 per transcript + small actor runtime charge. Translation adds $0.10 per video. The Apify Free plan includes $5/month credits ≈ ~16 free transcriptions.
Does Video To Text work without existing subtitles?
Yes — it uses Whisper AI speech recognition to transcribe the spoken audio, even if no subtitles exist on the platform.
Does Video To Text require a login?
No — the Actor works with publicly accessible videos on supported platforms without requiring any account or login.
🔌 Integrations
Video To Text plugs straight into the Apify ecosystem:
- Schedules — run on a schedule to auto-transcribe new video uploads (docs).
- Webhooks — POST transcript results to your pipeline on
ACTOR.RUN.SUCCEEDED(docs). - Apify API —
run-sync-get-dataset-itemsfor blocking calls,runsfor async (docs). - MCP Server — call the Actor from Claude, ChatGPT, or Cursor agents (docs).
- No-code — Make, Zapier, n8n, Pipedream, Google Sheets, Airtable.
🧱 Sister Actors from TrueFetch
Jobs
- Best Job Search
- Glassdoor Job Search
- Glints Company Finder
- Glints Job Finder
- Indeed Job Search
- JobStreet Company Finder
- JobStreet Job Finder
- LinkedIn Job Search
- StepStone Company Finder
- StepStone Job Finder
Shopping & Deals
- Amazon Deals Scraper
- Amazon Store Finder
- Argos Deals Scraper
- ASOS Fashion Scraper
- eBay Deals Scraper
- eBay Seller Finder
- Flipkart Deals Scraper
- John Lewis Deals Scraper
- Lidl Deals Scraper
- Magalu Deals Scraper
- MediaMarkt Deals Scraper
- Mercado Livre Deals Scraper
- Nykaa Beauty Scraper
- OTTO Deals Scraper
- Target Deals Scraper
- TradeIndia Product Finder
- TradeIndia Supplier Finder
- Walmart Deals Scraper
Vehicles
- AutoTrader Car Finder
- AutoTrader Dealer Finder
- Cars.com Car Finder
- Cars.com Dealer Finder
- Webmotors Car Finder
Real Estate
Local Business & Food
- BBB Business Finder
- BBB Review Checker
- Google Business Profile
- Reclame Aqui Company Finder
- Zomato Menu Checker
- Zomato Restaurant Finder
Classifieds
Social Media
- Find Any Company On LinkedIn
- Find Anyone On LinkedIn
- Instagram Explore
- Instagram Post
- Instagram Profile
- Reddit Community Post
- Reddit Community Profile
- Reddit Community User
- Reddit Post Search
- Reddit User Profile
- Social Media Marketing
- Telegram Channel Message
- Telegram Group Member
- Telegram Profile
- X Community Profile
- X User Profile
Videos & Text
- Best Video Downloader
- Instagram To Text
- TikTok Live Recorder
- TikTok Video Downloader
- TikTok Video Lookup
- Video Subtitles Downloader
- Video To Text
- YouTube Niche Scraper
- YouTube Video Downloader
Content & Articles
📞 Support, changelog, last updated
Support
- 💬 Telegram Group: t.me/Apify_Actor
- 📧 Direct support: t.me/AiAgentApi
- 📖 Actor page & docs: apify.com/TrueFetch/video-to-text
Maintained by TrueFetch on the Apify Store.
Cost of doing nothing: manually transcribing video requires hours of listening and typing, or expensive transcription services. Video To Text delivers AI-powered transcripts with timestamps at $0.30/video via API. → Try it free with $5 of Apify credits.
Changelog
- 2026-05-02 — README rewritten for SEO + GEO; documented all inputs, output structure, pricing, troubleshooting, and FAQ.
- 2026-03 — Added optional translation feature.
- 2026-02 — Initial release: Whisper-based video transcription Actor.
{"@context": "https://schema.org","@graph": [{"@type": "SoftwareApplication","@id": "https://apify.com/TrueFetch/video-to-text#software","name": "Video To Text","description": "Apify Actor that transcribes video URLs and files using Whisper AI at $0.30 per transcript, returning full text, timecoded segments, and optional translation.","applicationCategory": "MultimediaApplication","operatingSystem": "Web, Cloud","url": "https://apify.com/TrueFetch/video-to-text","offers": {"@type": "Offer","price": "0.30","priceCurrency": "USD","availability": "https://schema.org/InStock"},"author": { "@id": "https://apify.com/TrueFetch#author-TrueFetch" },"publisher": { "@id": "https://apify.com/TrueFetch#publisher" }},{"@type": "Organization","@id": "https://apify.com/TrueFetch#publisher","name": "TrueFetch","url": "https://apify.com/TrueFetch","sameAs": ["https://t.me/Apify_Actor", "https://t.me/AiAgentApi"]},{"@type": "FAQPage","mainEntity": [{"@type": "Question","name": "What data does Video To Text return?","acceptedAnswer": {"@type": "Answer","text": "Full transcript text, timecoded segments (start, end, text), detected language, and video metadata — plus a translated version if translate was provided."}},{"@type": "Question","name": "How much does Video To Text cost?","acceptedAnswer": {"@type": "Answer","text": "$0.30 per transcript plus a small actor runtime charge. Translation adds $0.10 per video. The Apify Free plan includes $5/month credits — enough for ~16 free transcriptions."}},{"@type": "Question","name": "Does Video To Text work without existing subtitles?","acceptedAnswer": {"@type": "Answer","text": "Yes — it uses Whisper AI speech recognition to transcribe the spoken audio, even if no subtitles exist on the platform."}}]},{"@type": "BreadcrumbList","itemListElement": [{"@type": "ListItem","position": 1,"name": "Apify","item": "https://apify.com"},{"@type": "ListItem","position": 2,"name": "Videos & Text","item": "https://apify.com/store?category=VIDEO"},{"@type": "ListItem","position": 3,"name": "Video To Text","item": "https://apify.com/TrueFetch/video-to-text"}]}]}
Last Updated: May 2, 2026 Schema version: 1.1 Maintained by: TrueFetch · 💬 Telegram Group · 📧 Support