Video Caption Scraper

Pricing

from $0.48 / subtitles

Video caption extraction API across 1000+ source platforms. Native captions plus ASR fallback, auto-conversion of VTT, ASS, TTML and SBV into SRT and timestamped JSON. Drop-in for AI training corpora and localization pipelines.

Rating: 5.0 (2)

Developer: AgentX

Maintained by Community

Actor stats

Bookmarked: 3
Total users: 2
Monthly active users: 1
Last modified: 4 hours ago

Video Caption Scraper - Multi-Platform Video Caption Extraction & Subtitle API

Extract structured captions from 1000+ video platforms with automatic SRT conversion and timestamped JSON output. This video caption extraction API delivers native captions plus ASR fallback, automatic format conversion (VTT, ASS, TTML, SBV → SRT), and structured subtitle dictionaries. Engineered for native-caption recovery, localization pipelines, and subtitle-normalization workflows.

1000+ Platforms · SRT Conversion · AI-Ready JSON · Multi-Language


Why Choose This Video Caption API

The Only Unified Multi-Platform Caption Extraction Solution

🎯 Universal Platform Coverage: Extract captions from YouTube, TikTok, Instagram, Twitter/X, Facebook, Vimeo, and 1000+ additional platforms. No login is required: public video captions are accessed without platform-specific accounts or OAuth configuration.

🔄 Automatic Format Conversion: Converts non-SRT caption formats (VTT, ASS, SSA, TTML, LRC, SRV1/2/3, SBV, JSON3) into industry-standard SRT, so your pipeline needs no manual format handling.

📊 Structured AI-Ready Output: Transforms raw caption files into structured JSON dictionaries of the form {"timestamp": "text"} for direct use with OpenAI, Claude, LangChain, and other LLM frameworks. Each record includes video metadata alongside the parsed subtitles.

🌍 Multi-Language Extraction: Select specific languages or extract all available captions at once for global content analysis, localization workflows, and multi-language training datasets.

⚡ Enterprise-Grade Reliability: Built-in error handling with intelligent retries for network fluctuations and platform rate limits. Scales from short social media clips to full-length documentaries.

Competitive Comparison

| Feature | Video Caption Scraper | Bright Data | Octoparse | Platform APIs |
|---|---|---|---|---|
| Platform Scope | 1000+ Sites ⭐ | 5-10 Sites | 1-2 Sites | Site-specific |
| Auto SRT Conversion | Included ⭐ | Manual | No | Multi-format |
| AI-Ready JSON | Yes ⭐ | Limited | No | Limited |
| Multi-Language | Yes ⭐ | Limited | No | Partial |
| Setup Time | Instant ⭐ | Complex | Coding req. | OAuth setup |

Why Choose This API

This actor is designed for structured output, repeatable automation, and downstream integration in monitoring, enrichment, or ingestion pipelines.

Quick Start Guide

How to Extract Video Captions in 3 Steps

Step 1: Configure Your Extraction

Provide the video URL and select the languages you want to extract.

Step 2: Run the Actor

Click "Start" and the actor will detect available captions, download and convert them, and produce structured JSON output.

Step 3: Download Results

Access your caption data as structured JSON with timestamped subtitle dictionaries and downloadable SRT files.

Example Input

{
  "video_url": "https://www.youtube.com/shorts/noWlKvEPOWk",
  "languages": ["english", "chinese_simplified"]
}

Input Parameters

| Parameter | Type | Required | Description |
|---|---|---|---|
| 🔗 Video URL (video_url) | String | Yes | URL of the video to extract captions from. Supports 1000+ platforms including YouTube, TikTok, Instagram, and Vimeo. |
| 🌍 Languages (languages) | Array | Yes | List of languages to extract. Use "all" for every available language, or name individual languages such as "english" or "chinese_simplified". |

Output Data Schema

Each caption extraction produces a structured JSON record:

| Field | Type | Description |
|---|---|---|
| processor | String | URL of the Apify actor that processed this data |
| processed_at | String | ISO 8601 timestamp (UTC) when the data was processed |
| platform | String | Source platform (e.g., YouTube, TikTok, Instagram) |
| title | String | Video title |
| author | String | Video author/creator |
| srt_url | String | Public URL to the converted SRT file in key-value store |
| language | String | Language of the extracted caption |
| subtitle | Object | Dictionary of timestamps and matched caption text |

Example Output

{
  "processor": "https://apify.com/agentx/video-caption-scraper?fpr=aiagentapi",
  "processed_at": "2026-01-27T17:42:00+00:00",
  "platform": "Youtube",
  "title": "Amazing Nature Clip",
  "author": "NatureWorld",
  "srt_url": "https://api.apify.com/v2/key-value-stores/.../ENGLISH.srt",
  "language": "ENGLISH",
  "subtitle": {
    "00:00:01,000 --> 00:00:03,000": "Look at this amazing scenery.",
    "00:00:03,500 --> 00:00:06,000": "The birds are singing in the forest."
  }
}
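For downstream processing it is often easier to work with numeric times than with the string keys of the subtitle object. The following is a minimal sketch, assuming the keys always follow the "HH:MM:SS,mmm --> HH:MM:SS,mmm" pattern shown in the example; `to_seconds` and `parse_segments` are illustrative names, not part of the actor.

```python
import re

# Matches one SRT-style timestamp such as "00:00:03,500".
_TS = re.compile(r"(\d+):(\d{2}):(\d{2}),(\d{3})")


def to_seconds(stamp: str) -> float:
    """Convert "HH:MM:SS,mmm" to seconds as a float."""
    match = _TS.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"unexpected timestamp: {stamp!r}")
    hours, minutes, seconds, millis = map(int, match.groups())
    return hours * 3600 + minutes * 60 + seconds + millis / 1000


def parse_segments(subtitle: dict) -> list:
    """Turn {"start --> end": text} into a list of dicts sorted by start time."""
    segments = []
    for key, text in subtitle.items():
        start, end = key.split("-->")
        segments.append(
            {"start": to_seconds(start), "end": to_seconds(end), "text": text}
        )
    return sorted(segments, key=lambda seg: seg["start"])


subtitle = {
    "00:00:01,000 --> 00:00:03,000": "Look at this amazing scenery.",
    "00:00:03,500 --> 00:00:06,000": "The birds are singing in the forest.",
}
segments = parse_segments(subtitle)
for seg in segments:
    print(seg["start"], seg["end"], seg["text"])
```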

Integration Examples

Python Integration Example

from apify_client import ApifyClient

# Authenticate with your Apify API token.
client = ApifyClient("YOUR_API_TOKEN")

run_input = {
    "video_url": "https://www.youtube.com/shorts/noWlKvEPOWk",
    "languages": ["english"],
}

# Start the actor and wait for the run to finish.
run = client.actor("FKRVxiH2L1TUaCcPh").call(run_input=run_input)

# Read the extracted caption records from the run's default dataset.
dataset_items = client.dataset(run["defaultDatasetId"]).list_items().items
for item in dataset_items:
    print(f"{item['language']}: {len(item['subtitle'])} segments")
    for timestamp, text in item["subtitle"].items():
        print(f" {timestamp}: {text}")

JavaScript/Node.js Integration

import { ApifyClient } from "apify-client";

const client = new ApifyClient({ token: "YOUR_API_TOKEN" });

const input = {
  video_url: "https://www.youtube.com/shorts/noWlKvEPOWk",
  languages: ["english"],
};

const run = await client.actor("FKRVxiH2L1TUaCcPh").call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();

items.forEach((item) => {
  console.log(
    `${item.language}: ${Object.keys(item.subtitle).length} segments`,
  );
});

Actor ID for Platform Integration

FKRVxiH2L1TUaCcPh

Ⓜ️ Make.com Setup:

  1. Log in to Make.com (Get 1000 Free Credits)
  2. Add the module "Run an Actor"
  3. Turn on 'Map', to the right of the 'Actor*' field
  4. Paste the Actor ID from above
  5. Click '⟳ Refresh', to the left of 'Map'
  6. Input JSON*: modify the parameters as needed
  7. Set "Run synchronously" to YES
  8. Add the module "Get Dataset Items" to receive the result
  9. In Dataset ID*, select defaultDatasetId

🎱 N8N.io Setup:

  1. Add 'Run an Actor and get dataset' from the Apify node
  2. Actor → By ID → paste the Actor ID from above
  3. Input JSON: modify the parameters as needed

JSON-LD Metadata

{
  "@context": "https://schema.org",
  "@type": "SoftwareApplication",
  "name": "Video Caption Scraper",
  "description": "Multi-platform video caption extraction API with SRT conversion and timestamped JSON output",
  "applicationCategory": "Developer Tools",
  "operatingSystem": "Cloud",
  "url": "https://apify.com/agentx/video-caption-scraper?fpr=aiagentapi",
  "dateModified": "2026-04-21",
  "offers": {
    "@type": "Offer",
    "price": "0.48",
    "priceCurrency": "USD"
  }
}

Pricing & Cost Calculator

Transparent Pay-Per-Use Model

| Event Type | Price | Description |
|---|---|---|
| Actor Start | $0.01 | One-time charge per Actor run |
| Actor Usage | $0.00001 | Charged for runtime, proxy, and storage based on resource consumption |
| Subtitles | $0.48 | Charged per video for caption extraction and metadata |

Cost Examples

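Each run processes one video URL, so the Actor Start fee applies once per video. Actor Usage is billed on top and depends on resource consumption, so the totals below are approximate.

Small Scale (10 videos):

  • Subtitles: 10 × $0.48 = $4.80
  • Actor Start: 10 × $0.01 = $0.10
  • Total: ~$4.90

Medium Scale (100 videos):

  • Subtitles: 100 × $0.48 = $48.00
  • Actor Start: 100 × $0.01 = $1.00
  • Total: ~$49.00

Large Scale (1,000 videos):

  • Subtitles: 1,000 × $0.48 = $480.00
  • Actor Start: 1,000 × $0.01 = $10.00
  • Total: ~$490.00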
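The arithmetic can be sketched as a small helper. It assumes one actor run per video (the FAQ states that each run processes one video URL), so the Actor Start fee is counted once per video, and it leaves out the variable Actor Usage charge because that depends on runtime. The function name `estimate_cost` is illustrative.

```python
from decimal import Decimal
from typing import Optional

# Prices taken from the pricing table in this document (USD).
ACTOR_START = Decimal("0.01")  # per actor run
PER_VIDEO = Decimal("0.48")    # per video with extracted subtitles


def estimate_cost(videos: int, runs: Optional[int] = None) -> Decimal:
    """Estimate the fixed charges for a batch, excluding Actor Usage."""
    if videos < 0:
        raise ValueError("videos must be non-negative")
    # Assumption: one run per video unless a different run count is given.
    if runs is None:
        runs = videos
    return videos * PER_VIDEO + runs * ACTOR_START


for count in (10, 100, 1000):
    print(f"{count} videos: ${estimate_cost(count)}")
```

Decimal is used instead of float so that amounts such as 0.48 and 0.01 add up without binary rounding noise.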

Use Cases & Applications

AI & Machine Learning

Training Data for Language Models: Extract timestamped captions from video datasets to build training corpora for speech recognition, NLU, and video understanding models. The structured JSON output can be fed into LangChain, CrewAI, and AutoGPT workflows.

Content Intelligence Pipelines: Analyze video captions at scale to detect brand mentions, sentiment patterns, and topic trends across platforms. Feed structured subtitle data into NLP engines for competitive intelligence.
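Before caption text goes into a language-model or NLP pipeline, it is commonly joined into a transcript and cut into size-limited chunks. The sketch below shows one simple way to do that with the subtitle dictionary; `chunk_transcript` and the character-based limit are assumptions for illustration and are not tied to any particular framework.

```python
def chunk_transcript(subtitle: dict, max_chars: int = 200) -> list:
    """Join caption texts in time order into chunks of at most max_chars.

    A single caption longer than max_chars is kept whole as its own chunk.
    """
    chunks = []
    current = ""
    # Zero-padded SRT keys sort chronologically as plain strings.
    for key in sorted(subtitle):
        text = subtitle[key].strip()
        candidate = f"{current} {text}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = text
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


subtitle = {
    "00:00:01,000 --> 00:00:03,000": "Look at this amazing scenery.",
    "00:00:03,500 --> 00:00:06,000": "The birds are singing in the forest.",
}
chunks = chunk_transcript(subtitle, max_chars=40)
print(chunks)
```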

Localization & Accessibility

Multi-Language Subtitle Workflows: Extract source-language captions and convert them to SRT format for translation pipelines. Supports 100+ languages for global content distribution.

Compliance Archiving: Archive video captions with metadata for regulatory compliance, accessibility audits, and content governance workflows.
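Each record already links to a converted SRT file through srt_url. When only the subtitle object is at hand, for example after editing or translating the text, an SRT document can be rebuilt locally. This is a minimal sketch that assumes the keys are valid SRT timing lines with two-digit, zero-padded hours; `subtitle_to_srt` is an illustrative name.

```python
def subtitle_to_srt(subtitle: dict) -> str:
    """Render {"start --> end": text} as numbered SRT cue blocks."""
    blocks = []
    # Assumption: keys are zero-padded, so string order is time order.
    for index, key in enumerate(sorted(subtitle), start=1):
        blocks.append(f"{index}\n{key}\n{subtitle[key]}\n")
    # Cue blocks are separated by one blank line.
    return "\n".join(blocks)


subtitle = {
    "00:00:01,000 --> 00:00:03,000": "Look at this amazing scenery.",
    "00:00:03,500 --> 00:00:06,000": "The birds are singing in the forest.",
}
srt_text = subtitle_to_srt(subtitle)
print(srt_text)
```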

Content Intelligence

Trend Mining: Analyze caption text across thousands of videos to identify emerging topics, keywords, and content patterns for editorial planning and SEO strategy.

Brand Monitoring: Monitor video captions across 1000+ platforms for brand mentions, product references, and competitive intelligence signals.
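A keyword scan over the subtitle dictionary is one simple starting point for this kind of monitoring. The sketch below does a case-insensitive substring match and returns the timing key for each hit; `find_mentions` is a hypothetical helper, and real monitoring would likely need tokenisation or fuzzy matching on top.

```python
def find_mentions(subtitle: dict, terms: list) -> list:
    """Return (timing_key, term, text) for every caption containing a term."""
    lowered_terms = [term.lower() for term in terms]
    hits = []
    for key in sorted(subtitle):
        text = subtitle[key]
        lowered_text = text.lower()
        for term in lowered_terms:
            if term in lowered_text:
                hits.append((key, term, text))
    return hits


subtitle = {
    "00:00:01,000 --> 00:00:03,000": "Look at this amazing scenery.",
    "00:00:03,500 --> 00:00:06,000": "The birds are singing in the forest.",
}
hits = find_mentions(subtitle, ["birds", "Scenery"])
for key, term, text in hits:
    print(f"{key} [{term}] {text}")
```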


FAQ

Which platforms are supported?

Over 1000 platforms including YouTube, TikTok, Instagram, Twitter/X, Facebook, Vimeo, Twitch, and hundreds of additional video hosting sites.

What caption formats are supported?

The actor handles VTT, ASS, SSA, TTML, LRC, SRV1/2/3, SBV, and JSON3 formats. All non-standard formats are automatically converted to industry-standard SRT.
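To give a feel for what such a conversion involves, here is a minimal WebVTT-to-SRT sketch. It is not the actor's implementation: it only renumbers cues, switches the millisecond separator from "." to ",", fills in a missing hour field, and drops cue settings, header, NOTE, and STYLE blocks. Inline markup inside cue text is left untouched.

```python
import re

# One WebVTT timing line; the hour field is optional in WebVTT.
_CUE = re.compile(
    r"(?:(\d{2,}):)?(\d{2}):(\d{2})\.(\d{3})\s+-->\s+"
    r"(?:(\d{2,}):)?(\d{2}):(\d{2})\.(\d{3})"
)


def vtt_to_srt(vtt: str) -> str:
    """Convert simple WebVTT text to SRT text (illustrative sketch)."""
    out = []
    index = 0
    blocks = re.split(r"\n\s*\n", vtt.replace("\r\n", "\n").strip())
    for block in blocks:
        lines = block.split("\n")
        # Locate the timing line; blocks without one are skipped.
        for i, line in enumerate(lines):
            match = _CUE.match(line)
            if match:
                break
        else:
            continue
        h1, m1, s1, ms1, h2, m2, s2, ms2 = match.groups()
        timing = f"{h1 or '00'}:{m1}:{s1},{ms1} --> {h2 or '00'}:{m2}:{s2},{ms2}"
        index += 1
        text = "\n".join(lines[i + 1:])
        out.append(f"{index}\n{timing}\n{text}\n")
    return "\n".join(out)


sample_vtt = (
    "WEBVTT\n\n"
    "00:00:01.000 --> 00:00:03.000\n"
    "Look at this amazing scenery.\n\n"
    "00:03.500 --> 00:06.000 align:start\n"
    "The birds are singing in the forest.\n"
)
converted = vtt_to_srt(sample_vtt)
print(converted)
```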

Can I extract captions in multiple languages?

Yes. Set the languages parameter to all to extract every available caption track, or specify individual languages like english, chinese_simplified, etc.

What happens if a video has no captions?

When native captions are unavailable, the ASR fallback attempts to generate captions from the audio track. If no captions can be produced for the specified languages, the actor returns a record indicating that no subtitles were found.
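Client code may want to separate usable records from those without captions before processing them. The exact shape of the "no subtitles" record is not documented here, so the sketch below assumes that such a record has a missing or empty subtitle field; adjust the check once the real record shape is known.

```python
def split_by_captions(items: list) -> tuple:
    """Split dataset items into (with_captions, without_captions).

    Assumption: a record without captions has no "subtitle" key or an
    empty one. This is a guess, not documented actor behaviour.
    """
    with_captions = []
    without_captions = []
    for item in items:
        if item.get("subtitle"):
            with_captions.append(item)
        else:
            without_captions.append(item)
    return with_captions, without_captions


# Hand-written sample records for illustration only.
items = [
    {"language": "ENGLISH", "subtitle": {"00:00:01,000 --> 00:00:03,000": "Hi"}},
    {"language": "FRENCH", "subtitle": {}},
    {"language": "GERMAN"},
]
usable, empty = split_by_captions(items)
print(len(usable), "with captions,", len(empty), "without")
```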

How is the subtitle dictionary structured?

Each subtitle entry uses an SRT timing line ("HH:MM:SS,mmm --> HH:MM:SS,mmm") as its key and the caption text as its value. This format enables direct timestamp-based lookups in AI and search workflows.
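A timestamp-based lookup can be sketched with a sorted list of start times and a binary search. The class name `CaptionIndex` is illustrative, and the sketch assumes that cues do not overlap in time.

```python
import bisect


def _seconds(stamp: str) -> float:
    """Convert "HH:MM:SS,mmm" to seconds."""
    hms, millis = stamp.strip().split(",")
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds + int(millis) / 1000


class CaptionIndex:
    """Find the caption shown at a given playback time (in seconds)."""

    def __init__(self, subtitle: dict):
        rows = []
        for key, text in subtitle.items():
            start, end = key.split("-->")
            rows.append((_seconds(start), _seconds(end), text))
        rows.sort()
        self._rows = rows
        self._starts = [row[0] for row in rows]

    def at(self, t: float):
        # Last cue whose start time is <= t, if any.
        i = bisect.bisect_right(self._starts, t) - 1
        if i >= 0 and t <= self._rows[i][1]:
            return self._rows[i][2]
        return None  # t falls before the first cue or in a gap


index = CaptionIndex({
    "00:00:01,000 --> 00:00:03,000": "Look at this amazing scenery.",
    "00:00:03,500 --> 00:00:06,000": "The birds are singing in the forest.",
})
print(index.at(2.0))
print(index.at(3.2))
```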

Can I process multiple videos in one run?

Each run processes one video URL. For batch processing, use the Apify API to trigger multiple parallel runs or schedule sequential runs with a list of URLs.
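One way to organise parallel runs is a thread pool that calls a per-URL function and collects either the result or the error. In the sketch below `run_batch` and `fake_run` are hypothetical: `fake_run` is a stand-in so the example works offline, and in practice it would be replaced by a function that calls the actor for one URL and returns the dataset items, as in the Python integration example.

```python
from concurrent.futures import ThreadPoolExecutor


def run_batch(urls: list, run_one, max_workers: int = 4) -> dict:
    """Call run_one(url) for each URL in parallel.

    Returns {url: result}, where result is the return value of run_one
    or the exception it raised, so one failure does not stop the batch.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {url: pool.submit(run_one, url) for url in urls}
        for url, future in futures.items():
            try:
                results[url] = future.result()
            except Exception as exc:
                results[url] = exc
    return results


def fake_run(url: str) -> list:
    """Offline stand-in for a real actor call (illustration only)."""
    if "bad" in url:
        raise RuntimeError("run failed")
    return [{"language": "ENGLISH", "subtitle": {}}]


results = run_batch(
    ["https://example.com/a", "https://example.com/bad"], fake_run
)
for url, outcome in results.items():
    status = "error" if isinstance(outcome, Exception) else "ok"
    print(url, status)
```

Keeping `max_workers` small is a cautious default, since each parallel run is billed separately and may be subject to account concurrency limits.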



Trust & Certifications

  • ✅ Enterprise Ready - Production-grade reliability
  • ✅ GDPR Compliant - EU data protection standards and privacy regulations
  • ✅ 99.9% Uptime - Reliable cloud infrastructure with automatic failover
  • ✅ Regular Updates - Continuous maintenance for platform changes

Data Rights & Usage

All data extracted by this actor originates from publicly available video captions and metadata. Users are responsible for ensuring their use of extracted data complies with applicable laws and regulations in their jurisdiction.

Privacy Compliance

  • GDPR: Compliant with EU General Data Protection Regulation. No personal data is stored beyond the extraction session.
  • CCPA: Compliant with California Consumer Privacy Act requirements. Data processing follows privacy-by-design principles.

Platform Terms of Service

Users must review and comply with the terms of service of each video platform when using extracted caption data. This tool accesses only publicly available information.

Enterprise Support

For enterprise licensing, custom integrations, or compliance inquiries, contact our team:



Last Updated: April 21, 2026