Video Caption Scraper
Subtitle / caption API for any public video URL across 1000+ platforms. Pulls native captions when available, falls back to ASR otherwise, and normalizes VTT, ASS, TTML, and SBV into clean SRT plus a timestamped subtitle map for localization, translation memory, or transcript indexing.
Video Caption Scraper - Multi-Platform Video Caption Extraction API
Video Caption Scraper is a multi-platform video caption extraction API that extracts native subtitle tracks, timestamped JSON segments, and SRT-converted output across 1,000+ video platforms in 55+ languages in a single video-URL-driven run. Video Caption Scraper returns structured records per video, including video URL, video ID, video title, video description, author/channel name, source platform, video duration in seconds, available subtitle language array, selected language code, native subtitle format (VTT/ASS/TTML/SBV), SRT file URL hosted on Apify key-value store, timestamped JSON segment array (start, end, text), thumbnail image URL, and ASR fallback flag when native captions are unavailable. Coverage spans YouTube, TikTok, Vimeo, Bilibili, Dailymotion, Twitch VOD, and 1,000+ additional platforms supported by the underlying extractor. Built for AI training corpora construction, video-localization pipelines, accessibility compliance toolchains, content-search indexing, video-summarization workflows, and multilingual-research datasets. Per-video pay-per-result pricing at $0.48 with no monthly minimum.
Why Choose This API
Structured Caption Data for AI & Localization Pipelines
1,000+ Platform Coverage
Extract subtitles and captions from YouTube, TikTok, Vimeo, Dailymotion, and 1,000+ additional video platforms, enabling cross-platform caption intelligence from a single consistent API endpoint.
55+ Language Support
The languages input parameter supports 55+ languages including English, Spanish, French, German, Chinese (Simplified and Traditional), Arabic, Hindi, Japanese, Korean, and more, with an "all" option to extract all available subtitle tracks simultaneously.
Timestamped Caption Output
The subtitle object provides timecode-keyed caption segments (e.g., "00:00:01,000 --> 00:00:03,500": "text"), enabling precise speech-text alignment for AI training data, automatic translation workflows, and accessibility engineering.
SRT File Export
The srt_url field provides a direct download link to the generated SRT subtitle file, enabling drop-in integration with video players, subtitle editors, and localization management systems.
Multi-Format Auto-Conversion
Native caption formats (VTT, ASS, TTML, SBV) are automatically converted to standardized SRT and timestamped JSON, eliminating format normalization overhead in downstream ingestion pipelines.
Quick Start Guide
How to Extract Video Captions in 3 Steps
Step 1: Enter the Video URL
Enter any video URL from YouTube, TikTok, or 1,000+ supported platforms (e.g., https://www.youtube.com/shorts/noWlKvEPOWk).
Step 2: Select Target Languages
Choose one or more languages from the 55+ supported options, or select "all" to extract every available subtitle track.
Step 3: Download Structured Caption Data
Each language track produces one structured record containing timestamped subtitle content and an SRT download URL.
Input Parameters
Configuration Fields
| Parameter | Type | Required | Description | Example Values |
|---|---|---|---|---|
| video_url | string | Yes | Video URL from any supported platform | "https://www.youtube.com/watch?v=...", "https://www.tiktok.com/@user/video/..." |
| languages | array | Yes | Language(s) for subtitle extraction | ["english"], ["english", "spanish"], ["all"] |
Supported Languages (55+)
all, afrikaans, arabic, armenian, belarusian, bengali, bulgarian, cantonese, catalan, chinese_simplified, chinese_traditional, croatian, czech, danish, dutch, english, esperanto, estonian, filipino, finnish, french, german, greek, gujarati, hebrew, hindi, hungarian, icelandic, indonesian, italian, japanese, korean, latvian, lithuanian, malay, marathi, norwegian, persian, polish, portuguese, romanian, russian, serbian, slovak, slovenian, spanish, swahili, swedish, tamil, telugu, thai, turkish, ukrainian, urdu, vietnamese
Example Input Configuration
    {
      "video_url": "https://www.youtube.com/shorts/noWlKvEPOWk",
      "languages": ["english", "spanish"]
    }
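A run input like the one above can be checked locally before it is sent. The sketch below is only an illustration: `build_run_input` is a hypothetical helper (not part of the actor or the Apify client), and `SUPPORTED_LANGUAGES` is an abbreviated copy of the documented language names.

```python
from urllib.parse import urlparse

# Abbreviated copy of the documented language names; extend as needed.
SUPPORTED_LANGUAGES = {
    "all", "arabic", "chinese_simplified", "chinese_traditional", "english",
    "french", "german", "hindi", "japanese", "korean", "spanish",
}


def build_run_input(video_url: str, languages: list[str]) -> dict:
    """Return a run-input dict, raising ValueError on obviously bad values."""
    parsed = urlparse(video_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {video_url!r}")
    if not languages:
        raise ValueError("at least one language is required")
    unknown = [lang for lang in languages if lang not in SUPPORTED_LANGUAGES]
    if unknown:
        raise ValueError(f"unsupported language name(s): {unknown}")
    return {"video_url": video_url, "languages": list(languages)}


run_input = build_run_input(
    "https://www.youtube.com/shorts/noWlKvEPOWk", ["english", "spanish"]
)
print(run_input)
```

Catching a misspelled language name locally avoids paying for a run that cannot return the track you wanted.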
Output Data Schema
Complete Caption Record Structure
Each extracted subtitle track produces one record with the following fields:
Caption & Metadata Fields
| Field | Type | Description |
|---|---|---|
| processor | string | Apify actor URL that processed this record |
| processed_at | string | ISO 8601 timestamp (UTC) when scraped |
| thumbnail | string | Video thumbnail URL |
| platform | string | Source platform name |
| title | string | Video title |
| description | string | Video description text |
| published_at | string | Video publication timestamp |
| author | string | Video creator username or display name |
| language | string | Subtitle track language |
| subtitle | object | Timecode-keyed caption segments |
| srt_url | string | Direct download link to SRT subtitle file |
Example JSON Output
    {
      "processor": "https://apify.com/agentx/video-caption-scraper?fpr=aiagentapi",
      "processed_at": "2026-05-01T10:30:00.000Z",
      "platform": "YouTube",
      "title": "Product Demo: Key Features Overview",
      "author": "TechChannel",
      "language": "english",
      "subtitle": {
        "00:00:01,000 --> 00:00:03,500": "Welcome to our product demo.",
        "00:00:04,000 --> 00:00:06,200": "Today we cover the key features."
      },
      "srt_url": "https://api.apify.com/v2/key-value-stores/abc123/records/captions.srt"
    }
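Downstream code often needs numeric start and end times rather than timecode strings. The following is a minimal sketch, assuming every key of the subtitle object follows the "HH:MM:SS,mmm --> HH:MM:SS,mmm" pattern shown in the example; `Segment` and `parse_subtitle_map` are illustrative names, not part of the actor's output.

```python
import re
from typing import NamedTuple

TIMECODE = re.compile(
    r"(\d{2}):(\d{2}):(\d{2}),(\d{3}) --> (\d{2}):(\d{2}):(\d{2}),(\d{3})"
)


class Segment(NamedTuple):
    start: float  # seconds from the start of the video
    end: float
    text: str


def parse_subtitle_map(subtitle: dict[str, str]) -> list[Segment]:
    """Convert {"HH:MM:SS,mmm --> HH:MM:SS,mmm": text} into sorted segments."""
    segments = []
    for key, text in subtitle.items():
        match = TIMECODE.fullmatch(key.strip())
        if match is None:
            raise ValueError(f"unexpected timecode key: {key!r}")
        h1, m1, s1, ms1, h2, m2, s2, ms2 = map(int, match.groups())
        start = h1 * 3600 + m1 * 60 + s1 + ms1 / 1000
        end = h2 * 3600 + m2 * 60 + s2 + ms2 / 1000
        segments.append(Segment(start, end, text))
    # NamedTuples sort field by field, so this orders by start time first.
    return sorted(segments)


example = {
    "00:00:01,000 --> 00:00:03,500": "Welcome to our product demo.",
    "00:00:04,000 --> 00:00:06,200": "Today we cover the key features.",
}
segments = parse_subtitle_map(example)
for seg in segments:
    print(f"{seg.start:.1f}s to {seg.end:.1f}s: {seg.text}")
```

Rejecting keys that do not match the pattern keeps a malformed track from silently producing misaligned segments.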
Export Formats
- JSON - Structured caption data with timestamped segments
- SRT - Direct subtitle file via srt_url download link
- CSV - Caption metadata for analysis pipelines
- API Access - Programmatic access via Apify Client SDK
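If you fetch records as JSON and want a CSV of the metadata yourself, the flattening step might look like the sketch below. It assumes records shaped like the output schema; `COLUMNS` and `records_to_csv` are hypothetical names, and the nested subtitle object is deliberately left out because it does not fit a flat row.

```python
import csv
import io

# Metadata columns taken from the output schema.
COLUMNS = ["platform", "title", "author", "language", "published_at", "srt_url"]


def records_to_csv(records: list[dict]) -> str:
    """Serialize caption records to CSV text, one row per language track."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    for record in records:
        # Missing fields become empty cells instead of raising KeyError.
        writer.writerow({col: record.get(col, "") for col in COLUMNS})
    return buffer.getvalue()


records = [
    {
        "platform": "YouTube",
        "title": "Product Demo: Key Features Overview",
        "author": "TechChannel",
        "language": "english",
        "subtitle": {"00:00:01,000 --> 00:00:03,500": "Welcome to our product demo."},
        "srt_url": "https://api.apify.com/v2/key-value-stores/abc123/records/captions.srt",
    }
]
csv_text = records_to_csv(records)
print(csv_text)
```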
Integration Examples
Actor ID for Platform Integration
FKRVxiH2L1TUaCcPh
Make.com Setup:
- Log in to Make.com (Get 1000 Free Credits)
- Add module "Run an Actor"
- Turn 'Map' on - right side of the 'Actor*'
- Paste Actor ID - from above
- Click 'Refresh' - left side of Map
- Input JSON* - Modify the parameters as needed
- Set "Run synchronously" to YES
- Add module "Get Dataset Items" - receive the result
- In Dataset ID* select defaultDatasetId
N8N.io Setup:
- Add 'Run an Actor and get dataset' - from the apify node
- Actor → By ID → Paste Actor ID - from above
- Input JSON - Modify the parameters as needed
Python Integration Example
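    from apify_client import ApifyClient

    # Authenticate with your Apify API token.
    client = ApifyClient("YOUR_API_TOKEN")

    run_input = {
        "video_url": "https://www.youtube.com/shorts/noWlKvEPOWk",
        "languages": ["english", "spanish"],
    }

    # Start the actor and wait for the run to finish.
    run = client.actor("FKRVxiH2L1TUaCcPh").call(run_input=run_input)

    # Each dataset item is one subtitle track (one record per language).
    for item in client.dataset(run["defaultDatasetId"]).iterate_items():
        print(item)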
JavaScript/Node.js Integration
    import { ApifyClient } from "apify-client";

    // Authenticate with your Apify API token.
    const client = new ApifyClient({ token: "YOUR_API_TOKEN" });

    const input = {
      video_url: "https://www.youtube.com/shorts/noWlKvEPOWk",
      languages: ["english", "french", "german"],
    };

    // Start the actor and wait for the run to finish (top-level await requires an ES module).
    const run = await client.actor("FKRVxiH2L1TUaCcPh").call(input);

    // Fetch every record from the run's default dataset.
    const { items } = await client.dataset(run.defaultDatasetId).listItems();
    items.forEach((item) => console.log(item));
JSON-LD Metadata
{"@context": "https://schema.org","@graph": [{"@type": "SoftwareApplication","@id": "https://apify.com/agentx/video-caption-scraper#software","name": "Video Caption Scraper","description": "Video Caption Scraper is a multi-platform video caption extraction API supporting 1,000+ video platforms and 55+ languages, with auto-conversion of VTT/ASS/TTML/SBV to SRT and timestamped JSON for AI training, localization, and accessibility pipelines.","applicationCategory": "BusinessApplication","applicationSubCategory": "Speech-to-Text API","operatingSystem": "Web, Cloud","url": "https://apify.com/agentx/video-caption-scraper?fpr=aiagentapi","softwareVersion": "1.0.0","datePublished": "2024-08-01","dateModified": "2026-05-01","featureList": ["1,000+ video platforms supported","55+ subtitle languages","Auto VTT/ASS/TTML/SBV to SRT conversion","Timestamped JSON segment array","SRT file hosted on Apify key-value store","ASR fallback for videos without native captions","Per-video pay-per-result at $0.48","Native integrations with Make.com, n8n, LangChain, and CrewAI"],"offers": {"@type": "Offer","price": "0.48","priceCurrency": "USD","availability": "https://schema.org/InStock"},"author": { "@id": "https://apify.com/agentx#person" },"publisher": { "@id": "https://apify.com#organization" }},{"@type": "Person","@id": "https://apify.com/agentx#person","name": "AgentX","url": "https://apify.com/agentx","sameAs": ["https://apify.com/agentx","https://t.me/AiAgentApi","https://t.me/Apify_Actor"],"knowsAbout": ["video captions","subtitle extraction","SRT conversion","video localization","AI training data"]},{"@type": "BreadcrumbList","itemListElement": [{"@type": "ListItem","position": 1,"name": "Apify","item": "https://apify.com"},{"@type": "ListItem","position": 2,"name": "AgentX","item": "https://apify.com/agentx"},{"@type": "ListItem","position": 3,"name": "Video Caption Scraper","item": "https://apify.com/agentx/video-caption-scraper"}]}]}
Pricing & Cost Calculator
PAY_PER_EVENT Pricing
| Event | Price |
|---|---|
| Actor Start | $0.01 per start (per GB memory) |
| Actor Usage | $0.00001 per usage unit |
| Subtitles Extraction | $0.48 per video |
Cost Calculator Examples
| Videos | Actor Start | Subtitles Cost | Total |
|---|---|---|---|
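| 1 video | $0.01 | $0.48 | ~$0.49 |
| 10 videos | $0.10 | $4.80 | ~$4.90 |
| 50 videos | $0.50 | $24.00 | ~$24.50 |
| 100 videos | $1.00 | $48.00 | ~$49.00 |
Each run processes one video, so the Actor Start fee applies once per video (shown here at 1 GB of memory). Subtitle cost is per video regardless of how many languages are selected. Totals exclude the per-unit Actor Usage charge.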
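For budgeting, the event prices can be combined in a few lines. This is a rough sketch, not an official calculator: it assumes one run (and therefore one Actor Start event) per video, as stated above, and it leaves out Actor Usage because the number of usage units per run is not documented here. `estimate_cost` and the `memory_gb` parameter are hypothetical names.

```python
from decimal import Decimal

ACTOR_START_PER_GB = Decimal("0.01")   # per run start, per GB of memory
SUBTITLES_PER_VIDEO = Decimal("0.48")  # per video, any number of languages


def estimate_cost(videos: int, memory_gb: int = 1) -> Decimal:
    """Estimate start and subtitle charges for `videos` single-video runs."""
    if videos < 0 or memory_gb < 1:
        raise ValueError("videos must be >= 0 and memory_gb >= 1")
    start_fees = ACTOR_START_PER_GB * memory_gb * videos
    subtitle_fees = SUBTITLES_PER_VIDEO * videos
    return start_fees + subtitle_fees


for n in (1, 10, 50, 100):
    print(f"{n:>3} videos: ${estimate_cost(n)}")
```

`Decimal` is used instead of `float` so that cent amounts add up exactly.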
Use Cases & Applications
AI Training Data & NLP Pipelines
Speech-Text Alignment Dataset Construction
The timestamped subtitle object provides precise speech-text alignment data across 55+ languages, enabling multilingual ASR model training, subtitle quality evaluation, and cross-language alignment corpus construction for NLP research and AI training pipelines.
Multilingual Corpus Building
Extract subtitles across multiple languages for the same video to build parallel translation corpora, language model fine-tuning datasets, and multilingual NLP training sets from video content at scale.
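One way to turn per-language records into sentence pairs is to join tracks on their timecode keys. The sketch below assumes that tracks for the same video share identical timecode keys, which is not guaranteed for every source; segments without a counterpart are dropped. `build_parallel_pairs` is a hypothetical helper and the Spanish text is made-up sample data.

```python
def build_parallel_pairs(records, source, target):
    """Pair source/target caption text that shares the same timecode key."""
    by_language = {r["language"]: r["subtitle"] for r in records}
    src, tgt = by_language[source], by_language[target]
    # Keep only segments present in both tracks, in source order.
    return [(src[key], tgt[key]) for key in src if key in tgt]


records = [
    {
        "language": "english",
        "subtitle": {
            "00:00:01,000 --> 00:00:03,500": "Welcome to our product demo.",
            "00:00:04,000 --> 00:00:06,200": "Today we cover the key features.",
            "00:00:07,000 --> 00:00:08,000": "Thanks for watching.",
        },
    },
    {
        "language": "spanish",
        "subtitle": {
            "00:00:01,000 --> 00:00:03,500": "Bienvenidos a nuestra demostración.",
            "00:00:04,000 --> 00:00:06,200": "Hoy cubrimos las funciones clave.",
        },
    },
]
pairs = build_parallel_pairs(records, "english", "spanish")
for en, es in pairs:
    print(f"{en} | {es}")
```

When timecodes differ between tracks, an overlap-based alignment on parsed start and end times would be needed instead of an exact key match.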
Localization & Accessibility Engineering
Video Localization Pipeline
Extract native subtitle tracks in the source language, feed timestamped segments to translation APIs, and re-import the translated SRT files to build end-to-end video localization pipelines for content internationalization.
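The translate-and-re-export step can be sketched as follows. This is an illustration under assumptions: `to_srt` is a hypothetical helper, and the `translate` callable stands in for whatever translation API you use (here `str.upper` is only a placeholder so the example runs offline).

```python
from typing import Callable


def to_srt(subtitle: dict[str, str],
           translate: Callable[[str], str] = lambda text: text) -> str:
    """Render a timecode-keyed subtitle map as SRT text."""
    blocks = []
    # Zero-padded HH:MM:SS,mmm keys sort chronologically as plain strings.
    for index, timecode in enumerate(sorted(subtitle), start=1):
        blocks.append(f"{index}\n{timecode}\n{translate(subtitle[timecode])}\n")
    # A blank line separates consecutive SRT cues.
    return "\n".join(blocks)


subtitle = {
    "00:00:04,000 --> 00:00:06,200": "Today we cover the key features.",
    "00:00:01,000 --> 00:00:03,500": "Welcome to our product demo.",
}
srt_text = to_srt(subtitle, translate=str.upper)
print(srt_text)
```

Because the timecodes are reused unchanged, the translated file stays in sync with the original video.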
Accessibility Caption Generation
The srt_url download link provides ready-to-use SRT files for closed caption editors, accessibility compliance workflows, and video platform caption upload pipelines.
FAQ
What video platforms are supported?
This actor supports 1,000+ video platforms including YouTube (all formats: standard, Shorts, live), TikTok, Vimeo, Dailymotion, and many others. Any platform that hosts subtitle/caption tracks can be extracted.
What subtitle formats are supported?
Native formats (VTT, ASS, TTML, SBV) are automatically converted to SRT and timestamped JSON. The srt_url field provides an SRT download link; the subtitle field provides the timestamped JSON representation.
Can I extract all languages at once?
Yes. Set languages to ["all"] to extract every available subtitle track. Each language track produces one separate record in the output dataset.
Is there an ASR fallback for videos without subtitles?
Yes. For videos without native captions, the actor falls back to ASR (automatic speech recognition) transcription to generate subtitle content.
SEO Keywords & Search Terms
Primary Keywords
video caption scraper API, video subtitle extraction API, multi-platform subtitle extractor, YouTube caption API, TikTok subtitle extraction, video captions structured data, SRT file extractor API, multilingual subtitle API, video caption pipeline, timestamped subtitle extractor
Long-Tail Keywords
how to extract video captions programmatically, YouTube subtitle extraction API multi-language, video caption to SRT conversion API, multilingual video subtitle pipeline, AI training caption dataset extractor, video accessibility caption API, TikTok caption scraper API, VTT to SRT conversion pipeline, timestamped subtitle JSON extractor, cross-platform caption intelligence API
Industry Terms
video caption intelligence API, multilingual subtitle extraction pipeline, ASR caption fallback API, speech-text alignment dataset, SRT file generation API, video localization pipeline, accessibility caption tool, NLP corpus caption extraction, subtitle format conversion API, cross-platform video caption data
Trust & Certifications
- Production-Grade Infrastructure: runs on the Apify cloud platform with managed proxy rotation and automatic retries
- GDPR & CCPA-Region Aligned: extracts only publicly available video captions; no personal contact data retained beyond the run session
- Pay-Per-Result Billing: transparent $0.48 per video with no monthly minimum or seat fees
- Continuously Maintained: platform extractors and format converters updated as video sources evolve
Legal & Compliance
Data Rights & Usage
All data extracted by this actor originates from publicly accessible video captions and subtitle tracks. Users are responsible for ensuring their use of extracted data complies with applicable laws, data protection regulations, and the terms of service of the source video platforms.
Privacy Compliance
- GDPR: Compliant with EU GDPR for data processing workflows.
- CCPA: Compliant with California Consumer Privacy Act requirements.
Platform Terms of Service
Users must review and comply with the terms of service of each source video platform when using extracted caption data.
Enterprise Support
For enterprise licensing, custom integrations, or compliance inquiries:
- Telegram: @AiAgentApi
Related Tools
Jobs & Hiring
- All Jobs Scraper - Multi-Platform Job Listings Intelligence API
- Glassdoor Hiring Scraper - Glassdoor Job Listings Intelligence API
- Glints Employer Data Extractor - Glints Employer Intelligence API
- Glints Jobs Data Extractor - Glints Job Listings Intelligence API
- Indeed Hiring Scraper - Indeed Job Listings Intelligence API
- JobStreet Employer Data Extractor - JobStreet Employer Intelligence API
- JobStreet Jobs Data Extractor - JobStreet Job Listings Intelligence API
- LinkedIn Hiring Scraper - LinkedIn Job Listings Intelligence API
- StepStone Employer Data Extractor - StepStone Employer Intelligence API
- StepStone Jobs Data Extractor - StepStone Job Listings Intelligence API
Social Media
- Instagram Creator Scraper - Instagram Creator Intelligence API
- Instagram Post Reel Scraper - Instagram Post & Reel Intelligence API
- Instagram Trending Scraper - Instagram Trending Content Intelligence API
- LinkedIn Company Scraper - LinkedIn Company Intelligence API
- LinkedIn Profile Scraper - LinkedIn Profile Intelligence API
- Medium Author Data Extractor - Medium Author Intelligence API
- Medium Content Data Extractor - Medium Content Intelligence API
- Reddit Author Scraper - Reddit Author Intelligence API
- Reddit Viral Scraper - Reddit Viral Content Intelligence API
- SubReddit Info Scraper - Subreddit Intelligence API
- Subreddit Member Scraper - Subreddit Member Intelligence API
- SubReddit Post Scraper - Subreddit Post Intelligence API
- Telegram Chat Scraper - Telegram Chat Intelligence API
- Telegram Group Inviter - Telegram Group Invitation API
- Telegram Info Scraper - Telegram Channel Intelligence API
- Telegram Member Scraper - Telegram Member Intelligence API
- Telegram Private Group Scraper - Telegram Private Group Intelligence API
- TikTok Creator Scraper - TikTok Creator Intelligence API
- Twitter Community Scraper - Twitter Community Intelligence API
- Twitter Info Scraper - Twitter Account Intelligence API
- YouTube Market Intelligence - YouTube Channel Intelligence API
Video & Transcript
- All Video Scraper - Multi-Platform Video Intelligence API
- Live Stream Transcript - Live Stream Transcript Intelligence API
- TikTok Live Downloader - TikTok Live Stream Download API
- TikTok Transcript - TikTok Video Transcript Intelligence API
- Video Caption Scraper - Multi-Platform Video Caption Extraction API
- Video to Social Post - Video to Social Post Generation API
- Video Transcript - Multi-Platform Video Transcript Intelligence API
- YouTube Transcript - YouTube Video Transcript Intelligence API
E-Commerce & Retail
- Amazon Product Data Extractor - Amazon Product Intelligence API
- Amazon Storefront Data Extractor - Amazon Storefront Intelligence API
- Argos Retail Data Extractor - Argos Product Intelligence API
- ASOS Product Catalog Extractor - ASOS Product Catalog Intelligence API
- eBay Marketplace Data Extractor - eBay Marketplace Intelligence API
- eBay Seller Leads API - eBay Seller Lead Intelligence API
- Flipkart Product Data Extractor - Flipkart Product Intelligence API
- John Lewis Retail Data Extractor - John Lewis Product Intelligence API
- Lidl Offers Data Extractor - Lidl Offers Intelligence API
- Magalu Product Data Extractor - Magalu Product Intelligence API
- MediaMarkt Product Data Extractor - MediaMarkt Product Intelligence API
- Mercado Livre Marketplace Data Extractor - Mercado Livre Marketplace Intelligence API
- Nykaa Product Data Extractor - Nykaa Product Intelligence API
- OTTO Product Data Extractor - OTTO Product Intelligence API
- Target Product Data Extractor - Target Product Intelligence API
- Walmart Product Data Extractor - Walmart Product Intelligence API
Classifieds & Automotive
- AutoTrader Dealer Leads API - AutoTrader Dealer Lead Intelligence API
- AutoTrader Vehicle Listings API - AutoTrader Vehicle Listings Intelligence API
- Cars.com Dealer Leads API - Cars.com Dealer Lead Intelligence API
- Cars.com Vehicle Listings API - Cars.com Vehicle Listings Intelligence API
- Kleinanzeigen Classifieds Data Extractor - Kleinanzeigen Classifieds Intelligence API
- Kleinanzeigen Seller Leads API - Kleinanzeigen Seller Lead Intelligence API
- OLX Classifieds Data Extractor - OLX Classifieds Intelligence API
- OLX Seller Leads API - OLX Seller Lead Intelligence API
- Webmotors Vehicle Listings API - Webmotors Vehicle Listings Intelligence API
Real Estate
- MagicBricks Builder Leads API - MagicBricks Builder Lead Intelligence API
- MagicBricks Property Data Extractor - MagicBricks Real Estate Data API
- Realtor Property Scraper - Realtor.com Property Listings Intelligence API
- Zillow For Sale Scraper - Zillow Property Listings Intelligence API
Business Intelligence & Reviews
- BBB Business Data Extractor - BBB Business Intelligence API
- BBB Reputation Data Extractor - BBB Reputation Intelligence API
- Google Maps Store Scraper - Google Maps Business Intelligence API
- Reclame Aqui Company Data Extractor - Reclame Aqui Company Intelligence API
- TradeIndia Product Data Extractor - TradeIndia Product Intelligence API
- TradeIndia Supplier Leads API - TradeIndia Supplier Lead Intelligence API
- Zomato Menu Data Extractor - Zomato Restaurant Menu Intelligence API
- Zomato Restaurant Data Extractor - Zomato Restaurant Search Intelligence API
Other
- Global Weather Scraper - Global Weather Intelligence API
Support & Community
- Community: @Apify_Actor
- Contact Team: @AiAgentApi
Last Updated: May 01, 2026