Pricing

from $3.50 / 1,000 results

YouTube Transcripts & Captions Scraper (Subtitles at Scale)

Extract transcripts and captions from YouTube videos at scale. Returns full text, per-segment timing, and all available languages (manual + auto-generated). For RAG, sentiment analysis, video summarization, and agent workflows. No API key.

Pricing

from $3.50 / 1,000 results

Rating

0.0

(0)

Developer

Thirdwatch

Actor stats

Bookmarked

Total users

Monthly active users

5 days ago

Last modified

What you get

A clean, structured transcript record per video. Choose a preferred language or let the actor fall back to whatever's available. Returns both the full joined transcript_text (ideal for vector stores) and a segments array with per-line timestamps (ideal for subtitle overlays and chapter generation).

Output fields

Field	Description
`video_id`	11-character YouTube video ID
`video_url`	Canonical watch URL
`language_code`	Actual caption language returned (ISO 639-1)
`language_name`	Human-readable language name
`is_auto_generated`	`true` if auto-captions, `false` if human-written
`auto_translated`	`true` if auto-translated into the requested language
`available_languages`	Array of `{code, name, is_auto_generated}` for every track on the video
`transcript_text`	Full transcript joined into one string
`segments`	Array of `{text, start, duration}` per caption line
`segment_count`	Number of caption lines
`total_duration_seconds`	Total covered duration
`data_source`	Origin tag
`used_residential_proxy`	`true` only when the direct request was blocked and the paid fallback succeeded

Failures are not written to the paid result dataset. They are available in the run's ERRORS key-value-store record with codes such as no_captions_available, private_video, region_locked, age_restricted_or_login_required, video_unavailable, no_player_response, transcript_fetch_failed, and empty_transcript.

Example output

{
    "video_id": "dQw4w9WgXcQ",
    "video_url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
    "language_code": "en",
    "language_name": "English",
    "is_auto_generated": false,
    "auto_translated": false,
    "available_languages": [
        {"code": "en", "name": "English", "is_auto_generated": false},
        {"code": "es", "name": "Spanish", "is_auto_generated": false},
        {"code": "en", "name": "English (auto-generated)", "is_auto_generated": true}
    ],
    "transcript_text": "We're no strangers to love You know the rules and so do I ...",
    "segment_count": 58,
    "total_duration_seconds": 212.48,
    "segments": [
        {"text": "We're no strangers to love", "start": 18.8, "duration": 7.0},
        {"text": "You know the rules and so do I", "start": 25.8, "duration": 3.5},
        {"text": "A full commitment's what I'm thinking of", "start": 29.3, "duration": 3.7}
    ],
    "data_source": "youtube_timedtext"
}

Example unbilled error diagnostic (no captions published)

{
    "failed_count": 1,
    "failed_videos": [
        {
            "video_id": "abc123XYZ_0",
            "video_url": "https://www.youtube.com/watch?v=abc123XYZ_0",
            "error": "no_captions_available",
            "available_languages": [],
            "data_source": "youtube_innertube"
        }
    ]
}

Input parameters

Parameter	Required	Description
`videoUrls`	One of	YouTube URLs. Supports `watch?v=`, `youtu.be/`, `shorts/`, `embed/`.
`videoIds`	One of	Raw 11-character video IDs. Either `videoUrls` or `videoIds` must be provided.
`languageCode`	No	Preferred caption language. Default `en`.
`preferManual`	No	Prefer human-written captions over auto-generated. Default `true`.
`includeTimestamps`	No	Include the `segments` array. Default `true`. Turn off for smaller RAG payloads.
`includeAutoTranslate`	No	Fall back to YouTube auto-translate if the requested language isn't available. Default `false`.
`useResidentialProxy`	No	Allow residential retry after a direct IP-dependent failure. Direct HTTP is always tried first. Default `true`.
`maxResults`	No	Cap on transcripts returned. Default 5, max 10000.

Pricing

Only successfully delivered transcripts are billed. Ordinary direct-HTTP transcripts start at $0.003 per video; subscriber tiers reduce that to $0.0025, $0.002, and $0.0015. If YouTube blocks direct cloud egress and the residential fallback succeeds, that transcript has one additional $0.004 proxy fallback event. Videos without captions and other failed inputs are recorded under ERRORS and are not billed as results.

Use cases

AI engineers (RAG): index thousands of talks, lectures, tutorials for semantic search. transcript_text drops straight into your vector store.
Content marketers: generate written blog posts, newsletters, and social clips from podcast and YouTube content at scale.
Product & research teams: run sentiment analysis across competitor channels and track topic drift over time.
Accessibility & compliance: build closed-caption corpora in bulk for ADA / WCAG compliance.
Agent builders: plug into Claude / GPT / MCP workflows so an agent can "read" a YouTube URL.
Language learners & translators: grab multilingual caption tracks side-by-side for study material.
Video summarization tools: feed full transcripts into an LLM to generate chapter markers, key takeaways, and tl;drs.

Limitations

Not all videos have transcripts — some uploaders disable captions entirely. Those videos are listed in the unbilled ERRORS record with no_captions_available.
Auto-generated transcripts are lower quality — especially for music, accents, and technical content. Set preferManual: true (default) to pick human-written tracks whenever available.
Age-restricted and private videos are blocked — they appear in the unbilled ERRORS record rather than the paid result dataset. Region-locked videos behave the same way.
YouTube occasionally throttles heavy uninterrupted runs; the actor backs off automatically on rate limits.

Compared to alternatives

Apify's pintostudio/youtube-transcript-scraper — similar scope at $0.01 per transcript; our direct path starts at $0.003 and keeps failed videos unbilled.
youtube-transcript-api (Python library) — free to run yourself, but you handle the proxy, consent cookie, and retry logic. This actor is a hosted drop-in with built-in reliability.
Official YouTube Data API — captions endpoint requires OAuth and channel ownership; not usable for third-party videos.

Pairs well with

YouTube Scraper — pull video metadata (title, description, views, likes, channel) first, then feed the IDs here for transcripts.
Google News Scraper — enrich news-video transcripts with source articles.
Reddit Scraper — cross-reference discussion threads with the video's transcript.

FAQ

Does this work on YouTube Shorts? Yes — youtube.com/shorts/{id} URLs work identically to regular videos.

Do I need an API key? No. The actor uses YouTube's public caption endpoints — no OAuth, no Google Cloud project.

Can I get transcripts for private videos? No. Private videos return a private_video error. Only publicly published videos are supported.

Which languages are supported? Every language a video has published captions for. Use languageCode to pick your preferred track, or enable includeAutoTranslate to cross-translate.

What happens if captions are disabled? The video is omitted from paid results and recorded under ERRORS with error: "no_captions_available", so your pipeline can filter, retry, or skip it without paying for a transcript that was not delivered.

Can I feed this straight into a vector DB? Yes — the transcript_text field is a single joined string designed for RAG ingestion. Turn off includeTimestamps to drop the segments array and shrink payloads further.

Built by Thirdwatch. Questions? Open an issue or reach out on the Apify Store listing.

Last verified: 2026-05

YouTube Transcript Scraper - Captions & Auto Subtitles

nominated_tupelo/yt-transcript-scraper

Extract transcripts and subtitles from any YouTube video. Supports auto-generated captions, manual subtitles, multiple languages, and batch video processing. No API key required.

kade

YouTube Transcript API

glassventures/youtube-transcript-api

Extract transcripts, captions, and subtitles from YouTube videos. Supports 100+ languages, auto-generated captions, SRT/VTT export, playlists, and channels.

Glass Ventures

YouTube Transcript Scraper — Get Subtitles & Captions

thriftykiwi/youtube-transcript-scraper

Extract subtitles and captions from any YouTube video. List available languages and download full transcripts. No API key required — uses the public youtube-transcript service.

Thrifty Kiwi

YouTube Transcript Scraper

cloud9_ai/youtube-transcript-scraper

Extract transcripts and captions from YouTube videos via InnerTube API. Support for auto-generated and manual captions in multiple languages. Get timestamped text segments.

cloud9

YouTube Transcript Scraper

elaborate_statue/youtube-transcript-scraper

Extract transcripts (captions) from YouTube videos with timestamps. Supports manual and auto-generated captions in 50+ languages. Outputs JSON, plain text, or SRT format.

Alex Kim

YouTube Transcript Scraper

akash9078/youtube-transcript-scraper

YouTube Transcript Scraper & Extractor API — Extract transcripts, captions & subtitles from YouTube videos, Shorts & VODs without an API key. Supports auto-generated and manual captions in 100+ languages with translation, batch extraction & clean JSON for AI agents, RAG, SEO & automation.

Akash Kumar Naik

1.1K

4.9

YouTube Transcript Extractor — Captions & Timestamps

junipr/youtube-transcript-extractor

Extract available YouTube transcripts and captions with timestamps, languages, metadata, and text exports for research or RAG workflows.

junipr

YouTube Full Channel Transcripts Extractor

scrapier/youtube-full-channel-transcripts-extractor

Extract transcripts from all videos on a YouTube channel with a single run. Collect captions, subtitles, video metadata, and spoken text at scale for content research, SEO, AI training, sentiment analysis, competitive intelligence, and workflow automation.

Scrapier

YouTube Transcripts Subtitles Captions Extractor. ⚡

lume/yt-transcripts

YouTube transcript extractor, subtitle downloader, captions scraper, and video transcript crawler. Extract, download, and save YouTube video transcripts, subtitles, and captions for one or many Youtube Videos.

Lume

338

5.0

YouTube Transcript & Subtitle Scraper API — Captions + Metadata

herus13/youtube-transcript-scraper

Extract YouTube transcripts, subtitles, and captions with timestamps plus video metadata (title, channel, views, duration). For RAG, analysis, and content workflows. No official API key needed.