Video Transcript avatar

Video Transcript

Pricing

from $0.39 / transcript

Go to Apify Store
Video Transcript

Video Transcript

Universal video-to-text API across YouTube, TikTok, Instagram, Twitch and 1000+ platforms. ASR with caption fallback, timestamped JSON segments, 100+ language translation. Drop-in primitive for RAG indexes and AI-agent workflows.

Pricing

from $0.39 / transcript

Rating

4.0

(6)

Developer

AgentX

AgentX

Maintained by Community

Actor stats

13

Bookmarked

555

Total users

73

Monthly active users

3.3 days

Issues response

an hour ago

Last modified

Share

Video Transcript - Universal Video Transcription & Speech Extraction API

Video Transcript API extracts structured transcripts from 1000+ video platforms including YouTube, TikTok, Instagram, Twitter/X, Facebook, Reddit, Twitch, LinkedIn, and Vimeo with dual-output technology โ€” original language transcript plus intelligent translation with professional SRT timestamps. This API supports 100+ languages with automatic language detection, time-segmented text, and complete video metadata in structured JSON for multilingual speech workflows, search indexes, and AI-agent retrieval pipelines.

1000+ Platforms 100+ Languages SRT Timestamps


Why Choose This Universal Video Transcription API

Dual-Output Transcription with SRT Timestamps Across 1000+ Platforms

๐ŸŽฏ Universal Platform Coverage Transcribe speech from YouTube, TikTok, Instagram, Twitter/X, Facebook, Reddit, Twitch, LinkedIn, Vimeo, Bilibili, Dailymotion, and 1000+ video platforms in a single API.

๐Ÿ”„ Dual-Output System Automatically detects original language and generates translation in one run. Professional SRT timestamps (HH:MM:SS.mmm) for subtitle creation and video editing workflows.

๐ŸŒ 100+ Language Support English, Chinese, Japanese, Korean, Spanish, French, German, Russian, Arabic, Hindi, and 90+ more languages with automatic detection and intelligent translation.

๐Ÿ“Š Complete Video Metadata Returns title, author, duration, view/like/share/comment counts, categories, tags, and publishing date alongside transcription data.


Quick Start Guide

How to Transcribe Any Video in 3 Steps

Step 1: Configure Parameters

Paste a video URL from any supported platform and select a target translation language.

Video Transcript Input

Step 2: Run the Actor

Click "Start" to begin transcription. Audio is extracted, speech is recognized, and translation is generated automatically.

Step 3: Access Structured Results

Download JSON dataset with original transcript, translated transcript, SRT segments, and video metadata.

Example Input

{
"video_url": "https://www.tiktok.com/@openai/video/7521583654028512525",
"target_lang": "english"
}

Input Parameters

ParameterTypeRequiredDescription
๐Ÿ”— Video URLStringYesVideo URL from any supported platform (YouTube, TikTok, Instagram, X, etc.)
๐ŸŒ Target LanguageStringYesTranslation language (e.g. "english", "japanese", "korean") or "none" for original only

Output Data Schema

Each transcribed video returns structured transcript data with complete metadata:

Video Transcript Output

Core Fields

FieldTypeDescription
source_urlStringOriginal video URL processed
platformStringDetected platform (YouTube, TikTok, etc.)
titleStringVideo title
descriptionStringVideo description
authorStringCreator username/name
author_idStringCreator channel/user ID
durationNumberVideo duration in seconds
view_countIntegerView count
like_countIntegerLike count
comment_countIntegerComment count
categoriesArrayVideo categories
tagsArrayVideo tags
published_atStringPublication timestamp
thumbnailStringThumbnail URL
video_urlStringDirect video file URL

Transcript Fields

FieldTypeDescription
source_transcript.languageStringDetected original language
source_transcript.textStringFull original transcript
source_transcript.segmentsArraySRT time-segmented text
target_transcript.languageStringTarget translation language
target_transcript.textStringFull translated transcript
target_transcript.segmentsArraySRT time-segmented translation

Example Output

{
"source_url": "https://www.tiktok.com/@openai/video/7521583654028512525",
"processor": "https://apify.com/agentx/video-transcript",
"processed_at": "2025-01-15T14:23:45.123Z",
"platform": "TikTok",
"title": "OpenAI GPT-4 Demo",
"author": "OpenAI",
"duration": 95,
"view_count": 37258,
"like_count": 989,
"source_transcript": {
"language": "English",
"text": "Welcome to this demonstration of GPT-4...",
"segments": [
{ "start": "00:00:00.000", "end": "00:00:05.000", "text": "Welcome to this demonstration of GPT-4" }
]
},
"target_transcript": {
"language": "Japanese",
"text": "GPT-4ใฎใƒ‡ใƒขใƒณใ‚นใƒˆใƒฌใƒผใ‚ทใƒงใƒณใธใ‚ˆใ†ใ“ใ...",
"segments": [
{ "start": "00:00:00.000", "end": "00:00:05.000", "text": "GPT-4ใฎใƒ‡ใƒขใƒณใ‚นใƒˆใƒฌใƒผใ‚ทใƒงใƒณใธใ‚ˆใ†ใ“ใ" }
]
}
}

Integration Examples

Python Integration

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run_input = {
"video_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
"target_lang": "english"
}
run = client.actor("aQRfpx1smqXOzVMcU").call(run_input=run_input)
dataset_items = client.dataset(run["defaultDatasetId"]).list_items().items
for item in dataset_items:
print(f"Language: {item['source_transcript']['language']}")
print(f"Transcript: {item['source_transcript']['text'][:200]}")

JavaScript Integration

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const input = {
video_url: 'https://www.youtube.com/watch?v=dQw4w9WgXcQ',
target_lang: 'japanese'
};
const run = await client.actor('aQRfpx1smqXOzVMcU').call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach(item => {
console.log(`Original: ${item.source_transcript.language}`);
console.log(`Translation: ${item.target_transcript.text.substring(0, 200)}`);
});

Make.com (Integromat) Integration

  1. Add an Apify module to your Make scenario
  2. Select Run an Actor action
  3. Set Actor ID to aQRfpx1smqXOzVMcU
  4. Configure input JSON with video_url and target_lang
  5. Set Run synchronously to YES
  6. Add Get Dataset Items module to receive transcript data

N8N Integration

  1. Add an Apify node to your n8n workflow
  2. Set Actor to aQRfpx1smqXOzVMcU
  3. Configure video_url and target_lang in input JSON
  4. Use downstream nodes to process transcript output

JSON-LD Metadata

{
"@context": "https://schema.org",
"@type": "SoftwareApplication",
"name": "Video Transcript",
"description": "Universal video transcription API with dual-output original + translation, SRT timestamps, and 1000+ platform support",
"applicationCategory": "Developer Tools",
"operatingSystem": "Cloud",
"url": "https://apify.com/agentx/video-transcript?fpr=aiagentapi",
"dateModified": "2026-06-24",
"offers": {
"@type": "Offer",
"price": "0.43",
"priceCurrency": "USD"
}
}

Pricing & Cost Calculator

Transparent Pay-Per-Use Model

Event TypePriceDescription
Actor Start$0.05Per run initialization
Actor Usage$0.00001Per compute unit consumed
Transcript$0.43Per video transcribed (original language)
Translation$0.15Per video translated to target language

Cost Examples

Single Video (original only, target_lang = "none"):

  • Transcript: 1 ร— $0.43 = $0.43
  • Total: ~$0.48

Single Video (original + translation):

  • Transcript: 1 ร— $0.43 = $0.43
  • Translation: 1 ร— $0.15 = $0.15
  • Total: ~$0.63

Batch 10 Videos with Translation:

  • Transcripts: 10 ร— $0.43 = $4.30
  • Translations: 10 ร— $0.15 = $1.50
  • Total: ~$6.30

Use Cases & Applications

AI Training & NLP

Training Data Generation Transcribe videos at scale to generate labeled text corpora for NLP model training. Structured segments with timestamps enable fine-grained alignment tasks.

Multilingual Dataset Creation Dual-output transcription generates parallel corpora (original + translation) for machine translation model training.

Content Intelligence

Competitive Video Analysis Transcribe competitor video content across platforms to analyze messaging, topics, and content strategies at scale.

Social Media Monitoring Convert video content from TikTok, Instagram, Twitter/X into searchable text for brand monitoring and trend detection.

Accessibility & Localization

Subtitle Generation SRT-formatted segments are directly usable for subtitle creation and closed captioning workflows.

Content Localization Translate video content into 100+ languages for international market expansion and global audience reach.


FAQ

How many platforms are supported?

The API supports 1000+ video platforms including YouTube, TikTok, Instagram, Twitter/X, Facebook, Reddit, Twitch, LinkedIn, Vimeo, Bilibili, Dailymotion, and many more.

What is the dual-output system?

Each transcription returns both the original language transcript and a translated version in your target language. Set target_lang to "none" for original only.

Are SRT timestamps included?

Yes. Transcripts include time-segmented text in SRT format (HH:MM:SS.mmm) for subtitle creation and video editing.

What languages are supported for translation?

100+ languages including English, Chinese (Simplified/Traditional), Japanese, Korean, Spanish, French, German, Russian, Arabic, Hindi, Portuguese, Italian, and more.

Can I transcribe private or restricted videos?

Only publicly accessible videos can be transcribed. Private, age-restricted, or geo-blocked content may not be accessible.


SEO Keywords & Search Terms

Primary Keywords

video transcript API, universal video transcription, speech to text API, video to text extraction, multilingual video transcription, SRT timestamp extraction, cross-platform video transcript

Long-Tail Keywords

transcribe YouTube TikTok Instagram videos API structured JSON, video transcription with translation 100 languages pipeline, extract speech from 1000 platforms SRT timestamps

Industry Terms

speech recognition, content intelligence, subtitle generation, parallel corpora, NLP training data, content localization, multilingual extraction


Trust & Certifications

  • โœ… Enterprise Ready - Processing thousands of videos with 99.9% uptime
  • โœ… GDPR Compliant - EU data protection standards
  • โœ… 1000+ Platforms - Universal video platform coverage
  • โœ… Regular Updates - Continuous maintenance for platform changes

Data Rights & Usage

All data extracted by this actor originates from publicly accessible video content. Users are responsible for ensuring their use complies with applicable laws and platform Terms of Service.

Privacy Compliance

  • GDPR: Compliant with EU General Data Protection Regulation. No personal data is stored beyond the extraction session.
  • CCPA: Compliant with California Consumer Privacy Act requirements.

Platform Terms of Service

Users must review and comply with each platform's Terms of Service regarding data usage. This tool accesses only publicly available video content.

Enterprise Support

For enterprise licensing, custom integrations, or compliance inquiries:


Jobs & Hiring

LinkedIn

Reddit

Telegram

X / Twitter

TikTok

Video & Content

YouTube

Real Estate

Maps & Local


Support & Community


Last Updated: June 24, 2026