Video Transcript
Universal video-to-text API across YouTube, TikTok, Instagram, Twitch and 1000+ platforms. ASR with caption fallback, timestamped JSON segments, 100+ language translation. Drop-in primitive for RAG indexes and AI-agent workflows.
Pricing: from $0.39 / transcript
Rating: 4.0 (6)
Developer: AgentX
Actor stats:
- Bookmarked: 13
- Total users: 555
- Monthly active users: 73
- Issues response: 3.3 days
- Last modified: an hour ago
Video Transcript - Universal Video Transcription & Speech Extraction API
Video Transcript API extracts structured transcripts from 1000+ video platforms, including YouTube, TikTok, Instagram, Twitter/X, Facebook, Reddit, Twitch, LinkedIn, and Vimeo. Each run can produce two outputs: the original-language transcript and a translation, both with SRT-style timestamps. The API supports 100+ languages with automatic language detection and returns time-segmented text plus complete video metadata as structured JSON for multilingual speech workflows, search indexes, and AI-agent retrieval pipelines.
Why Choose This Universal Video Transcription API
Dual-Output Transcription with SRT Timestamps Across 1000+ Platforms
- Universal Platform Coverage: Transcribe speech from YouTube, TikTok, Instagram, Twitter/X, Facebook, Reddit, Twitch, LinkedIn, Vimeo, Bilibili, Dailymotion, and 1000+ video platforms in a single API.
- Dual-Output System: Automatically detects the original language and generates a translation in one run. SRT-style timestamps (HH:MM:SS.mmm) support subtitle creation and video editing workflows.
- 100+ Language Support: English, Chinese, Japanese, Korean, Spanish, French, German, Russian, Arabic, Hindi, and 90+ more languages with automatic detection and translation.
- Complete Video Metadata: Returns title, author, duration, view/like/share/comment counts, categories, tags, and publishing date alongside transcription data.
Quick Start Guide
How to Transcribe Any Video in 3 Steps
Step 1: Configure Parameters
Paste a video URL from any supported platform and select a target translation language.
Step 2: Run the Actor
Click "Start" to begin transcription. Audio is extracted, speech is recognized, and translation is generated automatically.
Step 3: Access Structured Results
Download the JSON dataset containing the original transcript, translated transcript, timestamped segments, and video metadata.
Example Input
```json
{
  "video_url": "https://www.tiktok.com/@openai/video/7521583654028512525",
  "target_lang": "english"
}
```
Input Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| video_url (Video URL) | String | Yes | Video URL from any supported platform (YouTube, TikTok, Instagram, X, etc.) |
| target_lang (Target Language) | String | Yes | Translation language (e.g. "english", "japanese", "korean") or "none" for original only |
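As a minimal sketch of preparing the two parameters above, the helper below validates a URL and builds the input object before a run is started. The function name `build_run_input` is hypothetical, and lowercasing the language name is an assumption based only on the lowercase examples in the table; the Actor may accept other spellings.

```python
from urllib.parse import urlparse


def build_run_input(video_url: str, target_lang: str = "none") -> dict:
    """Build the Actor input dict with the two required keys."""
    url = video_url.strip()
    parsed = urlparse(url)
    # Reject anything that is not an absolute http(s) URL before paying for a run.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {video_url!r}")
    # Assumption: language names are lowercase, as in the documented examples.
    lang = target_lang.strip().lower()
    if not lang:
        raise ValueError("target_lang must be a language name or 'none'")
    return {"video_url": url, "target_lang": lang}
```

Calling `build_run_input(url)` without a language yields `"target_lang": "none"`, which requests the original transcript only.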
Output Data Schema
Each transcribed video returns structured transcript data with complete metadata:
Core Fields
| Field | Type | Description |
|---|---|---|
| source_url | String | Original video URL processed |
| platform | String | Detected platform (YouTube, TikTok, etc.) |
| title | String | Video title |
| description | String | Video description |
| author | String | Creator username/name |
| author_id | String | Creator channel/user ID |
| duration | Number | Video duration in seconds |
| view_count | Integer | View count |
| like_count | Integer | Like count |
| comment_count | Integer | Comment count |
| categories | Array | Video categories |
| tags | Array | Video tags |
| published_at | String | Publication timestamp |
| thumbnail | String | Thumbnail URL |
| video_url | String | Direct video file URL |
Transcript Fields
| Field | Type | Description |
|---|---|---|
| source_transcript.language | String | Detected original language |
| source_transcript.text | String | Full original transcript |
| source_transcript.segments | Array | Time-segmented text; each segment has start, end (HH:MM:SS.mmm), and text |
| target_transcript.language | String | Target translation language |
| target_transcript.text | String | Full translated transcript |
| target_transcript.segments | Array | Time-segmented translation; each segment has start, end (HH:MM:SS.mmm), and text |
Example Output
```json
{
  "source_url": "https://www.tiktok.com/@openai/video/7521583654028512525",
  "processor": "https://apify.com/agentx/video-transcript",
  "processed_at": "2025-01-15T14:23:45.123Z",
  "platform": "TikTok",
  "title": "OpenAI GPT-4 Demo",
  "author": "OpenAI",
  "duration": 95,
  "view_count": 37258,
  "like_count": 989,
  "source_transcript": {
    "language": "English",
    "text": "Welcome to this demonstration of GPT-4...",
    "segments": [
      { "start": "00:00:00.000", "end": "00:00:05.000", "text": "Welcome to this demonstration of GPT-4" }
    ]
  },
  "target_transcript": {
    "language": "Japanese",
    "text": "GPT-4のデモンストレーションへようこそ...",
    "segments": [
      { "start": "00:00:00.000", "end": "00:00:05.000", "text": "GPT-4のデモンストレーションへようこそ" }
    ]
  }
}
```
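The segments in the example output use a dot before the milliseconds (HH:MM:SS.mmm), while SubRip (.srt) files use a comma and number each cue. The sketch below shows one way to turn a `segments` array into the text of an .srt file. The function name `segments_to_srt` is hypothetical, and it assumes every segment carries `start`, `end`, and `text` as in the example.

```python
def segments_to_srt(segments: list[dict]) -> str:
    """Render transcript segments as the text of a .srt subtitle file."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        # SubRip separates milliseconds with a comma, not the dot used in the JSON.
        start = seg["start"].replace(".", ",")
        end = seg["end"].replace(".", ",")
        cues.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Joining with a newline leaves one blank line between consecutive cues.
    return "\n".join(cues)
```

Passing `item["target_transcript"]["segments"]` instead of the source segments would produce translated subtitles in the same way.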
Integration Examples
Python Integration
```python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

run_input = {
    "video_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
    "target_lang": "english",
}

# Start the Actor and wait for the run to finish
run = client.actor("aQRfpx1smqXOzVMcU").call(run_input=run_input)

# Read the results from the run's default dataset
dataset_items = client.dataset(run["defaultDatasetId"]).list_items().items
for item in dataset_items:
    print(f"Language: {item['source_transcript']['language']}")
    print(f"Transcript: {item['source_transcript']['text'][:200]}")
```
JavaScript Integration
```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const input = {
    video_url: 'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
    target_lang: 'japanese',
};

// Start the Actor and wait for the run to finish
const run = await client.actor('aQRfpx1smqXOzVMcU').call(input);

// Read the results from the run's default dataset
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.log(`Original: ${item.source_transcript.language}`);
    console.log(`Translation: ${item.target_transcript.text.substring(0, 200)}`);
});
```
Make.com (Integromat) Integration
- Add an Apify module to your Make scenario
- Select Run an Actor action
- Set Actor ID to aQRfpx1smqXOzVMcU
- Configure input JSON with video_url and target_lang
- Set Run synchronously to YES
- Add Get Dataset Items module to receive transcript data
n8n Integration
- Add an Apify node to your n8n workflow
- Set Actor to aQRfpx1smqXOzVMcU
- Configure video_url and target_lang in input JSON
- Use downstream nodes to process transcript output
JSON-LD Metadata
```json
{
  "@context": "https://schema.org",
  "@type": "SoftwareApplication",
  "name": "Video Transcript",
  "description": "Universal video transcription API with dual-output original + translation, SRT timestamps, and 1000+ platform support",
  "applicationCategory": "Developer Tools",
  "operatingSystem": "Cloud",
  "url": "https://apify.com/agentx/video-transcript?fpr=aiagentapi",
  "dateModified": "2026-06-24",
  "offers": {
    "@type": "Offer",
    "price": "0.43",
    "priceCurrency": "USD"
  }
}
```
Pricing & Cost Calculator
Transparent Pay-Per-Use Model
| Event Type | Price | Description |
|---|---|---|
| Actor Start | $0.05 | Per run initialization |
| Actor Usage | $0.00001 | Per compute unit consumed |
| Transcript | $0.43 | Per video transcribed (original language) |
| Translation | $0.15 | Per video translated to target language |
Cost Examples
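Single Video (original only, target_lang = "none"):
- Actor Start: 1 × $0.05 = $0.05
- Transcript: 1 × $0.43 = $0.43
- Total: ~$0.48
Single Video (original + translation):
- Actor Start: 1 × $0.05 = $0.05
- Transcript: 1 × $0.43 = $0.43
- Translation: 1 × $0.15 = $0.15
- Total: ~$0.63
Batch of 10 Videos with Translation (one run per video):
- Actor Starts: 10 × $0.05 = $0.50
- Transcripts: 10 × $0.43 = $4.30
- Translations: 10 × $0.15 = $1.50
- Total: ~$6.30
Totals are approximate because Actor Usage ($0.00001 per compute unit) varies by run.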
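The arithmetic behind these examples can be sketched as a small estimator. It uses the event prices from the pricing table and ignores the per-compute-unit Actor Usage fee. The function name `estimate_cost` and the default of one run per video are assumptions made for illustration; the default is inferred from the single `video_url` input.

```python
from decimal import Decimal
from typing import Optional

ACTOR_START = Decimal("0.05")   # per run initialization
TRANSCRIPT = Decimal("0.43")    # per video transcribed
TRANSLATION = Decimal("0.15")   # per video translated


def estimate_cost(videos: int, translate: bool, runs: Optional[int] = None) -> Decimal:
    """Estimate event charges in USD, excluding the compute-unit fee."""
    # Assumption: one URL per input means one run per video unless stated otherwise.
    runs = videos if runs is None else runs
    per_video = TRANSCRIPT + (TRANSLATION if translate else Decimal("0"))
    return runs * ACTOR_START + videos * per_video
```

`Decimal` is used instead of `float` so that sums of cent amounts stay exact.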
Use Cases & Applications
AI Training & NLP
- Training Data Generation: Transcribe videos at scale to generate text corpora for NLP model training. Structured segments with timestamps enable fine-grained alignment tasks.
- Multilingual Dataset Creation: Dual-output transcription generates parallel corpora (original + translation) for machine translation model training.
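A parallel corpus can be assembled from one dataset item by pairing source and translated segments. The sketch below assumes translated segments reuse the source segments' start and end times, as they do in the example output; that alignment is not guaranteed by the documentation, so unmatched segments are skipped. The function name `parallel_pairs` is hypothetical.

```python
def parallel_pairs(item: dict) -> list[tuple[str, str]]:
    """Pair source and target segment texts that share the same start/end times."""
    # target_transcript may be missing when target_lang is "none".
    target = item.get("target_transcript") or {}
    target_by_time = {
        (seg["start"], seg["end"]): seg["text"]
        for seg in target.get("segments", [])
    }
    pairs = []
    for seg in item["source_transcript"]["segments"]:
        translated = target_by_time.get((seg["start"], seg["end"]))
        # Skip segments with no time-aligned counterpart rather than guess.
        if translated is not None:
            pairs.append((seg["text"], translated))
    return pairs
```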
Content Intelligence
- Competitive Video Analysis: Transcribe competitor video content across platforms to analyze messaging, topics, and content strategies at scale.
- Social Media Monitoring: Convert video content from TikTok, Instagram, and Twitter/X into searchable text for brand monitoring and trend detection.
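For monitoring, timestamped segments make it possible to report where in a video a term is spoken. The following is a minimal sketch of a case-insensitive keyword search over the source segments; `find_mentions` is a hypothetical helper, and it assumes `start` values follow the HH:MM:SS.mmm format shown in the example output.

```python
def find_mentions(item: dict, keyword: str) -> list[tuple[float, str]]:
    """Return (start_seconds, text) for each source segment mentioning keyword."""
    needle = keyword.casefold()
    hits = []
    for seg in item["source_transcript"]["segments"]:
        if needle in seg["text"].casefold():
            # Convert HH:MM:SS.mmm into seconds from the start of the video.
            hours, minutes, seconds = seg["start"].split(":")
            start = int(hours) * 3600 + int(minutes) * 60 + float(seconds)
            hits.append((start, seg["text"]))
    return hits
```

A substring match is the simplest choice here; a production pipeline would more likely feed the segment text into a search index.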
Accessibility & Localization
- Subtitle Generation: Timestamped segments map directly onto SRT cues for subtitle creation and closed captioning workflows.
- Content Localization: Translate video content into 100+ languages for international market expansion and global audience reach.
FAQ
How many platforms are supported?
The API supports 1000+ video platforms including YouTube, TikTok, Instagram, Twitter/X, Facebook, Reddit, Twitch, LinkedIn, Vimeo, Bilibili, Dailymotion, and many more.
What is the dual-output system?
Each transcription returns both the original language transcript and a translated version in your target language. Set target_lang to "none" for original only.
Are SRT timestamps included?
Yes. Each transcript includes time-segmented text with SRT-style timestamps (HH:MM:SS.mmm) for subtitle creation and video editing.
What languages are supported for translation?
100+ languages including English, Chinese (Simplified/Traditional), Japanese, Korean, Spanish, French, German, Russian, Arabic, Hindi, Portuguese, Italian, and more.
Can I transcribe private or restricted videos?
Only publicly accessible videos can be transcribed. Private, age-restricted, or geo-blocked content may not be accessible.
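Because inaccessible videos may yield no transcript, client code may want to check the run before reading results. The sketch below relies on the `status` field that Apify includes in run objects ("SUCCEEDED" on success). How this Actor reports an inaccessible video is not documented here, so treating a failed run or an empty dataset as the failure signal is an assumption, and `first_transcript` is a hypothetical helper.

```python
from typing import Optional


def first_transcript(run: Optional[dict], items: list[dict]) -> dict:
    """Return the first dataset item, or raise if the run produced nothing usable."""
    status = (run or {}).get("status")
    if status != "SUCCEEDED":
        raise RuntimeError(f"Actor run ended with status {status!r}")
    # Assumption: an inaccessible video may leave the dataset empty.
    if not items:
        raise LookupError("run succeeded but the dataset is empty")
    return items[0]
```

With the Python client from the integration example, this would be called as `first_transcript(run, dataset_items)`.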
Trust & Certifications
- Enterprise Ready - Processing thousands of videos with 99.9% uptime
- GDPR Compliant - EU data protection standards
- 1000+ Platforms - Universal video platform coverage
- Regular Updates - Continuous maintenance for platform changes
Legal & Compliance
Data Rights & Usage
All data extracted by this actor originates from publicly accessible video content. Users are responsible for ensuring their use complies with applicable laws and platform Terms of Service.
Privacy Compliance
- GDPR: Compliant with EU General Data Protection Regulation. No personal data is stored beyond the extraction session.
- CCPA: Compliant with California Consumer Privacy Act requirements.
Platform Terms of Service
Users must review and comply with each platform's Terms of Service regarding data usage. This tool accesses only publicly available video content.
Enterprise Support
For enterprise licensing, custom integrations, or compliance inquiries:
- Telegram: @AiAgentApi
Related Tools
Jobs & Hiring
- All Job Scraper - Multi-Platform Job Data Extraction API
- Indeed Hiring Scraper - Indeed Job Extraction & Market Intelligence
- LinkedIn Hiring Scraper - LinkedIn Job Data Extraction & Talent Intelligence
- Glassdoor Hiring Scraper - Glassdoor Salary Intelligence & Company Reviews
LinkedIn
- LinkedIn Company Scraper - LinkedIn Company Data Extraction & Firmographic Intelligence
- LinkedIn Profile Scraper - LinkedIn Profile Data Extraction & Professional Intelligence
Reddit
- Reddit Author Scraper - Reddit Author Profile Extraction & Activity Analysis
- Reddit Viral Scraper - Reddit Viral Content Extraction & Trend Intelligence
- SubReddit Info Scraper - Subreddit Metadata Extraction & Community Intelligence
- SubReddit Post Scraper - Subreddit Post Extraction & Content Analysis
- Subreddit Member Scraper - Subreddit Member Extraction & Audience Analysis
Telegram
- Telegram Chat Scraper - Telegram Chat Message Extraction & Content Analysis
- Telegram Group Inviter - Telegram Group Member Invitation & Management
- Telegram Info Scraper - Telegram Entity Metadata Extraction & Intelligence
- Telegram Member Scraper - Telegram Group Member Extraction & Audience Data
- Telegram Private Group Scraper - Telegram Private Group Data Extraction & Analysis
X / Twitter
- Twitter Community Scraper - Twitter Community Extraction & Member Analysis
- X Info Scraper - X/Twitter Profile Extraction & User Intelligence
TikTok
- TikTok Creator Scraper - TikTok Creator Profile Extraction & Analytics
- TikTok Live Downloader - TikTok Live Stream Recording & Download
- TikTok Transcript - TikTok Video Transcription & Content Extraction
Video & Content
- All Video Scraper - Multi-Platform Video Download & Extraction (1000+ Sites)
- Video Caption Scraper - Video Caption Extraction & Subtitle Data
- Video Transcript - Universal Video Transcription & AI Training Data (This Tool)
- Video to Social Post - Video Content Conversion & Social Media Repurposing
- Live Stream Transcript - Live Stream Transcription & Real-Time Content Extraction
YouTube
- YouTube Transcript - YouTube Video Transcription & Speech Extraction
- Youtube Info Scraper - YouTube Channel & Video Metadata Extraction
Real Estate
- Realtor Property Scraper - Realtor.com Property Data Extraction & Market Intelligence
- Zillow For Sale Scraper - Zillow Property Listing Extraction & Real Estate Data
Maps & Local
- Google Maps Store Scraper - Google Maps Business Data Extraction & Local Intelligence
Support & Community
- Community: @Apify_Actor
- Contact Team: @AiAgentApi
Last Updated: June 24, 2026

