
YouTube Subtitle & Transcript Scraper


Extract YouTube subtitles & transcripts from videos, Shorts, playlists, and channels. Output as JSON, SRT, VTT, or clean LLM-ready text. 100+ languages. Rich metadata: views, description, thumbnail. Multi-fallback engine for maximum reliability. Fair billing — failures are free.

Pricing

from $5.00 / 1,000 transcripts extracted




Extract subtitles and transcripts from any YouTube video — fast, reliable, and ready for AI pipelines.

Supports single videos, Shorts, playlists, and entire channels. Works with 100+ languages including auto-generated captions.

What you get

For each video, the scraper returns:

  • Full transcript text with timestamps
  • Rich video metadata — title, channel, description, view count, thumbnail, publish date
  • Language info — detected language, auto-generated flag, all available languages listed
  • Multiple output formats — pick what fits your workflow

Output formats

Format | Best for
JSON | Apps, databases, APIs — structured data with timestamps per segment
SRT | Video editors, media players — standard subtitle file format
VTT | Web players, HTML5 video — WebVTT subtitle format
Text | Search indexing, content analysis — plain text joined together
LLM | AI/ML pipelines, RAG, fine-tuning — clean text with annotations stripped

The LLM format automatically removes [Music], [Applause], speaker labels, and other non-speech annotations so you get pure spoken content ready for language models.
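To give a feel for that cleanup, here is a rough sketch of the kind of filtering the LLM format applies. This is our own illustration, not the actor's implementation, and the annotation patterns are assumptions:

import re

def strip_annotations(segments):
    # Illustrative only: drop bracketed cues like [Music] or [Applause]
    # and assumed speaker labels like "HOST:" at the start of a segment.
    cleaned = []
    for seg in segments:
        text = re.sub(r"\[[^\]]*\]", "", seg["text"])
        text = re.sub(r"^[A-Z][A-Z .'-]+:\s*", "", text)
        text = text.strip()
        if text:
            cleaned.append(text)
    return " ".join(cleaned)

print(strip_annotations([
    {"text": "[Music]"},
    {"text": "We're no strangers to love"},
]))
# -> "We're no strangers to love"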

Supported URL types

You can pass any of these as input:

  • https://www.youtube.com/watch?v=dQw4w9WgXcQ — standard video
  • https://youtu.be/dQw4w9WgXcQ — short link
  • https://www.youtube.com/shorts/dQw4w9WgXcQ — YouTube Shorts
  • https://www.youtube.com/playlist?list=PLxxxxx — full playlist
  • https://www.youtube.com/@channelname — all videos from a channel
  • dQw4w9WgXcQ — just the video ID

Mix and match in a single run — the scraper handles them all.
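For instance, a single run could combine several of the supported URL types in one input (using the placeholder playlist and channel URLs from the list above):

{
  "urls": [
    "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
    "https://www.youtube.com/shorts/dQw4w9WgXcQ",
    "https://www.youtube.com/playlist?list=PLxxxxx",
    "https://www.youtube.com/@channelname",
    "dQw4w9WgXcQ"
  ]
}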

Input options

Option | Default | Description
urls | | List of YouTube URLs or video IDs to process
outputFormat | json | Output format: json, srt, vtt, text, or llm
languages | ["en"] | Preferred languages in priority order (e.g. ["en", "ja", "de"])
includeAutoGenerated | true | Use YouTube's auto-generated captions when manual ones aren't available
maxVideos | 0 (unlimited) | Limit how many videos to process from playlists/channels
maxConcurrency | 3 | How many videos to process in parallel (1–10)
proxy | Apify Proxy | Proxy settings — residential proxies recommended

You can also use startUrls (the [{url: "..."}] format) instead of urls — both work.

Example input

{
  "urls": [
    "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
    "https://youtu.be/JGwWNGJdvx8"
  ],
  "outputFormat": "llm",
  "languages": ["en"],
  "maxConcurrency": 2
}
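
If you run the actor programmatically, the same input can be passed through the Apify API client. The sketch below uses the Python client; the actor ID is a placeholder, so substitute the real ID from the Apify Store page:

from apify_client import ApifyClient

ACTOR_ID = "username/youtube-subtitle-transcript-scraper"  # placeholder; use the real actor ID

client = ApifyClient("YOUR_APIFY_TOKEN")

run = client.actor(ACTOR_ID).call(run_input={
    "urls": [
        "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
        "https://youtu.be/JGwWNGJdvx8",
    ],
    "outputFormat": "llm",
    "languages": ["en"],
    "maxConcurrency": 2,
})

# Each successfully processed video is one item in the run's default dataset.
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["videoId"], item.get("wordCount"), item.get("error"))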

Example output

Each video produces one result in the dataset:

{
  "videoId": "dQw4w9WgXcQ",
  "url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
  "title": "Rick Astley - Never Gonna Give You Up (Official Video)",
  "channelName": "Rick Astley",
  "channelId": "UCuAXFkgsw1L7xaCfnd5JJOw",
  "description": "The official video for \"Never Gonna Give You Up\" by Rick Astley...",
  "publishDate": "2009-10-25",
  "viewCount": 1761003712,
  "thumbnail": "https://i.ytimg.com/vi/dQw4w9WgXcQ/sddefault.jpg",
  "availableLanguages": ["en", "de-DE", "ja", "pt-BR", "es-419"],
  "language": "en",
  "languageName": "English",
  "isAutoGenerated": false,
  "duration": 213,
  "wordCount": 487,
  "segmentCount": 61,
  "text": "We're no strangers to love, you know the rules and so do I...",
  "segments": [
    { "text": "We're no strangers to love", "start": 18.64, "end": 21.88 },
    { "text": "You know the rules and so do I", "start": 22.64, "end": 26.96 }
  ],
  "extractedAt": "2026-04-10T07:00:00.000Z",
  "error": null
}

When using SRT or VTT format, the result includes an srt or vtt field with the formatted subtitle file content.
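As an illustration, assuming a finished run that used outputFormat: "srt", you could write those fields out to subtitle files like this (only the srt and videoId fields come from the output above; the rest is our own glue code):

from pathlib import Path

from apify_client import ApifyClient

client = ApifyClient("YOUR_APIFY_TOKEN")
DATASET_ID = "YOUR_DATASET_ID"  # defaultDatasetId of a finished run

for item in client.dataset(DATASET_ID).iterate_items():
    if item.get("srt"):
        # One .srt file per video, named after its video ID.
        Path(f"{item['videoId']}.srt").write_text(item["srt"], encoding="utf-8")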

Recommendations

For best results:

  • Use residential proxies (the default) — they work much better with YouTube than datacenter proxies (see the example configuration after this list)
  • Start with maxConcurrency: 1 if you're processing many videos, then increase gradually
  • Set languages to your target language — the scraper picks the best available match
  • Use the LLM format if you're feeding transcripts into AI models — it strips all the noise
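
For the proxy option, the field follows Apify's standard proxy configuration object; assuming this actor uses that standard schema, a residential setup would typically look like this:

{
  "proxy": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}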

For large jobs:

  • Use playlists or channel URLs to batch-process videos in one run
  • Set maxVideos to limit playlist/channel scrapes during testing
  • The scraper handles failures gracefully — if one video fails, the rest still process. Failed videos show up in the results with an error field so you can retry them later

For AI/ML workflows:

  • The LLM output format gives you clean, annotation-free text optimized for context windows
  • JSON format preserves timestamps, which is useful for building time-aligned datasets
  • The segments array gives you natural sentence boundaries from the original captions (see the chunking sketch after this list)
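
As a sketch of that last point, here is one way to turn the segments array into time-aligned chunks for a retrieval pipeline. The chunking policy is our own; only the segment fields (text, start, end) come from the output documented above:

def chunk_segments(segments, max_chars=500):
    # Group consecutive caption segments into ~max_chars chunks, keeping the
    # start time of the first segment and the end time of the last one.
    chunks, current, length = [], [], 0

    def flush():
        if current:
            chunks.append({
                "text": " ".join(s["text"] for s in current),
                "start": current[0]["start"],
                "end": current[-1]["end"],
            })

    for seg in segments:
        if current and length + len(seg["text"]) > max_chars:
            flush()
            current, length = [], 0
        current.append(seg)
        length += len(seg["text"]) + 1
    flush()
    return chunks

# Example with the two segments from the output above:
print(chunk_segments([
    {"text": "We're no strangers to love", "start": 18.64, "end": 21.88},
    {"text": "You know the rules and so do I", "start": 22.64, "end": 26.96},
]))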

Fair billing

You're never charged for videos that fail to extract. You only pay for successful results.

Language support

The scraper supports all languages that YouTube captions are available in — over 100 languages. Set your preferred languages in priority order and the scraper will pick the best available match.

If manual captions aren't available in your language, YouTube's auto-generated captions are used as a fallback (unless you disable this with includeAutoGenerated: false).
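
For example, to prefer Japanese manual captions, fall back to English, and skip auto-generated tracks entirely, the input would look like this:

{
  "urls": ["https://www.youtube.com/watch?v=dQw4w9WgXcQ"],
  "languages": ["ja", "en"],
  "includeAutoGenerated": false
}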

Error handling

The scraper is designed to be resilient:

  • If a video has no captions, it reports the error and moves on
  • If YouTube rate-limits a request, the scraper retries with backoff
  • If one extraction method fails, it automatically tries alternatives
  • Failed videos appear in the dataset with a descriptive error field — successful videos have error: null
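
Because failed videos stay in the dataset with their url and error fields, retrying them is a matter of collecting those URLs and starting a new run. A minimal sketch with the Python client, again using a placeholder actor ID:

from apify_client import ApifyClient

client = ApifyClient("YOUR_APIFY_TOKEN")
ACTOR_ID = "username/youtube-subtitle-transcript-scraper"  # placeholder; use the real actor ID
DATASET_ID = "YOUR_DATASET_ID"  # dataset of the run you want to retry from

# Collect the URLs of videos whose extraction failed.
failed_urls = [
    item["url"]
    for item in client.dataset(DATASET_ID).iterate_items()
    if item.get("error")
]

if failed_urls:
    # Start a new run with only the failed videos; failed extractions were not billed.
    client.actor(ACTOR_ID).call(run_input={"urls": failed_urls})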

Need help?

If you run into issues or have questions, open an issue on the Apify Store page.