Video Caption Scraper avatar

Video Caption Scraper

Pricing

from $0.48 / subtitles

Go to Apify Store
Video Caption Scraper

Video Caption Scraper

Subtitle / caption API for any public video URL across 1000+ platforms. Pulls native captions when available, falls back to ASR otherwise, and normalizes VTT, ASS, TTML, and SBV into clean SRT plus a timestamped subtitle map for localization, translation memory, or transcript indexing.

Pricing

from $0.48 / subtitles

Rating

5.0

(2)

Developer

AgentX

AgentX

Maintained by Community

Actor stats

3

Bookmarked

4

Total users

3

Monthly active users

3 days ago

Last modified

Categories

Share

Video Caption Scraper - Multi-Platform Video Caption Extraction API

Video Caption Scraper is a multi-platform video caption extraction API that extracts native subtitle tracks, timestamped JSON segments, and SRT-converted output across 1,000+ video platforms in 55+ languages in a single video-URL-driven run. Video Caption Scraper returns structured records per video, including video URL, video ID, video title, video description, author/channel name, source platform, video duration in seconds, available subtitle language array, selected language code, native subtitle format (VTT/ASS/TTML/SBV), SRT file URL hosted on Apify key-value store, timestamped JSON segment array (start, end, text), thumbnail image URL, and ASR fallback flag when native captions are unavailable. Coverage spans YouTube, TikTok, Vimeo, Bilibili, Dailymotion, Twitch VOD, and 1,000+ additional platforms supported by the underlying extractor. Built for AI training corpora construction, video-localization pipelines, accessibility compliance toolchains, content-search indexing, video-summarization workflows, and multilingual-research datasets. Per-video pay-per-result pricing at $0.48 with no monthly minimum.

Multi-Platform 55+ Languages Pay Per Result


Why Choose This API

Structured Caption Data for AI & Localization Pipelines

๐ŸŒ 1,000+ Platform Coverage Extract subtitles and captions from YouTube, TikTok, Vimeo, Dailymotion, and 1,000+ additional video platforms โ€” enabling cross-platform caption intelligence from a single consistent API endpoint.

๐ŸŒ 55+ Language Support The languages input parameter supports 55+ languages including English, Spanish, French, German, Chinese (Simplified and Traditional), Arabic, Hindi, Japanese, Korean, and more โ€” with an "all" option to extract all available subtitle tracks simultaneously.

โฑ๏ธ Timestamped Caption Output The subtitle object provides timecode-keyed caption segments (e.g., "00:00:01,000 --> 00:00:03,500": "text") โ€” enabling precise speech-text alignment for AI training data, automatic translation workflows, and accessibility engineering.

๐Ÿ“„ SRT File Export The srt_url field provides a direct download link to the generated SRT subtitle file โ€” enabling drop-in integration with video players, subtitle editors, and localization management systems.

๐Ÿ”„ Multi-Format Auto-Conversion Native caption formats (VTT, ASS, TTML, SBV) are automatically converted to standardized SRT and timestamped JSON โ€” eliminating format normalization overhead in downstream ingestion pipelines.


Quick Start Guide

How to Extract Video Captions in 3 Steps

Step 1: Enter the Video URL

Open Actor Input

Enter any video URL from YouTube, TikTok, or 1,000+ supported platforms (e.g., https://www.youtube.com/shorts/noWlKvEPOWk).

Step 2: Select Target Languages

Choose one or more languages from the 55+ supported options, or select "all" to extract every available subtitle track.

Step 3: Download Structured Caption Data

Each language track produces one structured record containing timestamped subtitle content and an SRT download URL.


Input Parameters

Configuration Fields

ParameterTypeRequiredDescriptionExample Values
video_urlstringโœ…Video URL from any supported platform"https://www.youtube.com/watch?v=...", "https://www.tiktok.com/@user/video/..."
languagesarrayโœ…Language(s) for subtitle extraction["english"], ["english", "spanish"], ["all"]

Supported Languages (55+)

all, afrikaans, arabic, armenian, belarusian, bengali, bulgarian, cantonese, catalan, chinese_simplified, chinese_traditional, croatian, czech, danish, dutch, english, esperanto, estonian, filipino, finnish, french, german, greek, gujarati, hebrew, hindi, hungarian, icelandic, indonesian, italian, japanese, korean, latvian, lithuanian, malay, marathi, norwegian, persian, polish, portuguese, romanian, russian, serbian, slovak, slovenian, spanish, swahili, swedish, tamil, telugu, thai, turkish, ukrainian, urdu, vietnamese

Example Input Configuration

{
"video_url": "https://www.youtube.com/shorts/noWlKvEPOWk",
"languages": ["english", "spanish"]
}

Output Data Schema

Complete Caption Record Structure

Each extracted subtitle track produces one record with the following fields:

Open Actor Output

Caption & Metadata Fields

FieldTypeDescription
processorstringApify actor URL that processed this record
processed_atstringISO 8601 timestamp (UTC) when scraped
thumbnailstringVideo thumbnail URL
platformstringSource platform name
titlestringVideo title
descriptionstringVideo description text
published_atstringVideo publication timestamp
authorstringVideo creator username or display name
languagestringSubtitle track language
subtitleobjectTimecode-keyed caption segments
srt_urlstringDirect download link to SRT subtitle file

Example JSON Output

{
"processor": "https://apify.com/agentx/video-caption-scraper?fpr=aiagentapi",
"processed_at": "2026-05-01T10:30:00.000Z",
"platform": "YouTube",
"title": "Product Demo: Key Features Overview",
"author": "TechChannel",
"language": "english",
"subtitle": {
"00:00:01,000 --> 00:00:03,500": "Welcome to our product demo.",
"00:00:04,000 --> 00:00:06,200": "Today we cover the key features."
},
"srt_url": "https://api.apify.com/v2/key-value-stores/abc123/records/captions.srt"
}

Export Formats

  • JSON - Structured caption data with timestamped segments
  • SRT - Direct subtitle file via srt_url download link
  • CSV - Caption metadata for analysis pipelines
  • API Access - Programmatic access via Apify Client SDK

Integration Examples

Actor ID for Platform Integration

FKRVxiH2L1TUaCcPh

โ“‚๏ธ Make.com Setup:

  1. Login to Make.com (Get 1000 Free Credits)
  2. Add module "Run an Actor"
  3. Turn 'Map' on - right side of the 'Actor*'
  4. Paste Actor ID - from above
  5. Click the 'โŸณ Refresh' - left side of Map
  6. Input JSON* - Modify the parameters as needed
  7. Set "Run synchronously" to YES
  8. Add module "Get Dataset Items" - receive the result
  9. In Dataset ID* select defaultDatasetId

๐ŸŽฑ N8N.io Setup:

  1. Add 'Run an Actor and get dataset' - from the apify node
  2. Actor โ†’ By ID โ†’ Paste Actor ID - from above
  3. Input JSON - Modify the parameters as needed

Python Integration Example

from apify_client import ApifyClient
client = ApifyClient('YOUR_API_TOKEN')
run_input = {
"video_url": "https://www.youtube.com/shorts/noWlKvEPOWk",
"languages": ["english", "spanish"]
}
run = client.actor("FKRVxiH2L1TUaCcPh").call(run_input=run_input)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

JavaScript/Node.js Integration

import { ApifyClient } from "apify-client";
const client = new ApifyClient({ token: "YOUR_API_TOKEN" });
const input = {
video_url: "https://www.youtube.com/shorts/noWlKvEPOWk",
languages: ["english", "french", "german"],
};
const run = await client.actor("FKRVxiH2L1TUaCcPh").call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => console.log(item));

JSON-LD Metadata

{
"@context": "https://schema.org",
"@graph": [
{
"@type": "SoftwareApplication",
"@id": "https://apify.com/agentx/video-caption-scraper#software",
"name": "Video Caption Scraper",
"description": "Video Caption Scraper is a multi-platform video caption extraction API supporting 1,000+ video platforms and 55+ languages, with auto-conversion of VTT/ASS/TTML/SBV to SRT and timestamped JSON for AI training, localization, and accessibility pipelines.",
"applicationCategory": "BusinessApplication",
"applicationSubCategory": "Speech-to-Text API",
"operatingSystem": "Web, Cloud",
"url": "https://apify.com/agentx/video-caption-scraper?fpr=aiagentapi",
"softwareVersion": "1.0.0",
"datePublished": "2024-08-01",
"dateModified": "2026-05-01",
"featureList": [
"1,000+ video platforms supported",
"55+ subtitle languages",
"Auto VTT/ASS/TTML/SBV โ†’ SRT conversion",
"Timestamped JSON segment array",
"SRT file hosted on Apify key-value store",
"ASR fallback for videos without native captions",
"Per-video pay-per-result at $0.48",
"Native integrations with Make.com, n8n, LangChain, and CrewAI"
],
"offers": {
"@type": "Offer",
"price": "0.48",
"priceCurrency": "USD",
"availability": "https://schema.org/InStock"
},
"author": { "@id": "https://apify.com/agentx#person" },
"publisher": { "@id": "https://apify.com#organization" }
},
{
"@type": "Person",
"@id": "https://apify.com/agentx#person",
"name": "AgentX",
"url": "https://apify.com/agentx",
"sameAs": [
"https://apify.com/agentx",
"https://t.me/AiAgentApi",
"https://t.me/Apify_Actor"
],
"knowsAbout": [
"video captions",
"subtitle extraction",
"SRT conversion",
"video localization",
"AI training data"
]
},
{
"@type": "BreadcrumbList",
"itemListElement": [
{
"@type": "ListItem",
"position": 1,
"name": "Apify",
"item": "https://apify.com"
},
{
"@type": "ListItem",
"position": 2,
"name": "AgentX",
"item": "https://apify.com/agentx"
},
{
"@type": "ListItem",
"position": 3,
"name": "Video Caption Scraper",
"item": "https://apify.com/agentx/video-caption-scraper"
}
]
}
]
}

Pricing & Cost Calculator

PAY_PER_EVENT Pricing

EventPrice
Actor Start$0.01 per start (per GB memory)
Actor Usage$0.00001 per usage unit
Subtitles Extraction$0.48 per video

Cost Calculator Examples

VideosActor StartSubtitles CostTotal
1 video$0.01$0.48~$0.49
10 videos$0.01$4.80~$4.81
50 videos$0.01$24.00~$24.01
100 videos$0.01$48.00~$48.01

Each run processes one video; subtitle cost is per video regardless of language count selected.


Use Cases & Applications

AI Training Data & NLP Pipelines

Speech-Text Alignment Dataset Construction The timestamped subtitle object provides precise speech-text alignment data across 55+ languages โ€” enabling multilingual ASR model training, subtitle quality evaluation, and cross-language alignment corpus construction for NLP research and AI training pipelines.

Multilingual Corpus Building Extract subtitles across multiple languages for the same video โ€” building parallel translation corpora, language model fine-tuning datasets, and multilingual NLP training sets from video content at scale.

Localization & Accessibility Engineering

Video Localization Pipeline Extract native subtitle tracks in source language, feed timestamped segments to translation APIs, and re-import translated SRT files โ€” building end-to-end video localization pipelines for content internationalization.

Accessibility Caption Generation The srt_url download link provides ready-to-use SRT files for closed caption editors, accessibility compliance workflows, and video platform caption upload pipelines.


FAQ

What video platforms are supported?

This actor supports 1,000+ video platforms including YouTube (all formats: standard, Shorts, live), TikTok, Vimeo, Dailymotion, and many others. Any platform that hosts subtitle/caption tracks can be extracted.

What subtitle formats are supported?

Native formats (VTT, ASS, TTML, SBV) are automatically converted to SRT and timestamped JSON. The srt_url field provides an SRT download link; the subtitle field provides the timestamped JSON representation.

Can I extract all languages at once?

Yes โ€” set languages to ["all"] to extract every available subtitle track. Each language track produces one separate record in the output dataset.

Is there an ASR fallback for videos without subtitles?

Yes โ€” for videos without native captions, the actor falls back to ASR (automatic speech recognition) transcription to generate subtitle content.


SEO Keywords & Search Terms

Primary Keywords

video caption scraper API, video subtitle extraction API, multi-platform subtitle extractor, YouTube caption API, TikTok subtitle extraction, video captions structured data, SRT file extractor API, multilingual subtitle API, video caption pipeline, timestamped subtitle extractor

Long-Tail Keywords

how to extract video captions programmatically, YouTube subtitle extraction API multi-language, video caption to SRT conversion API, multilingual video subtitle pipeline, AI training caption dataset extractor, video accessibility caption API, TikTok caption scraper API, VTT to SRT conversion pipeline, timestamped subtitle JSON extractor, cross-platform caption intelligence API

Industry Terms

video caption intelligence API, multilingual subtitle extraction pipeline, ASR caption fallback API, speech-text alignment dataset, SRT file generation API, video localization pipeline, accessibility caption tool, NLP corpus caption extraction, subtitle format conversion API, cross-platform video caption data


Trust & Certifications

  • โœ… Production-Grade Infrastructure โ€” runs on the Apify cloud platform with managed proxy rotation and automatic retries
  • โœ… GDPR & CCPA-Region Aligned โ€” extracts only publicly available video captions; no personal contact data retained beyond the run session
  • โœ… Pay-Per-Result Billing โ€” transparent $0.48 per video with no monthly minimum or seat fees
  • โœ… Continuously Maintained โ€” platform extractors and format converters updated as video sources evolve

Data Rights & Usage

All data extracted by this actor originates from publicly accessible video captions and subtitle tracks. Users are responsible for ensuring their use of extracted data complies with applicable laws, data protection regulations, and the terms of service of the source video platforms.

Privacy Compliance

  • GDPR: Compliant with EU GDPR for data processing workflows.
  • CCPA: Compliant with California Consumer Privacy Act requirements.

Platform Terms of Service

Users must review and comply with the terms of service of each source video platform when using extracted caption data.

Enterprise Support

For enterprise licensing, custom integrations, or compliance inquiries:


Jobs & Hiring

Social Media

Video & Transcript

E-Commerce & Retail

Classifieds & Automotive

Real Estate

Business Intelligence & Reviews

Other


Support & Community


Last Updated: May 01, 2026