Extract YouTube Transcripts in Seconds (No API Key Needed)
Pricing
from $1.00 / 1,000 results
Scrape YouTube video transcripts reliably using a smart fallback chain: optional custom transcript URL template → Invidious captions → YouTube internal player API → YouTube watch-page caption tracks → youtube-transcript-api → youtubetranscript.com → optional yt-dlp (last resort). Outputs transcript pieces (multiple dataset items per video) with language, source, and error details. No official YouTube Data API required.
Developer
Inus Grobler
YouTube Transcript Scraper API | Extract Captions & Subtitle Segments (Apify Actor)
Extract YouTube transcripts, captions, and subtitle text segments from video URLs or video IDs. This Apify Actor is built for transcript scraping at scale with resilient fallback sources and Apify Proxy support.
Important Output Behavior
- Output is always segment/piece based.
- One input video can produce many dataset items.
- This increases result volume and can increase pay-per-result usage/costs on Apify.
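To make the cost effect concrete, here is a rough back-of-the-envelope sketch. It assumes the listed starting rate of $1.00 per 1,000 results applies to every dataset item; the function name and the workload numbers (50 videos, 200 pieces per video) are hypothetical, and actual pricing may differ by plan.

```python
def estimate_cost_usd(videos: int, avg_pieces_per_video: float,
                      price_per_1000_results: float = 1.00) -> float:
    """Estimate pay-per-result cost when each transcript piece is one dataset item."""
    total_results = videos * avg_pieces_per_video
    return total_results / 1000 * price_per_1000_results


# Hypothetical workload: 50 videos averaging 200 pieces each = 10,000 dataset items.
print(f"${estimate_cost_usd(50, 200):.2f}")  # prints $10.00
```

The same 50 videos would count as only 50 results under a one-row-per-video model, so it is worth estimating piece counts before a large run.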
How It Works
Fallback chain (reliability-first):
- optional customTranscriptUrlTemplate (legacy/API use)
- Invidious captions
- YouTube internal player API
- YouTube watch-page caption tracks
- youtube-transcript-api
- youtubetranscript.com
- optional yt-dlp fallback (last resort)
- automatic retry pass for transient failures
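The chain above can be sketched as an ordered list of sources that are tried until one returns text, followed by a single retry pass over the sources that failed transiently. This is an illustration only, not the Actor's actual code: `fetch_with_fallback`, `TransientError`, and the stub sources are hypothetical names, and the real retry policy may differ.

```python
from typing import Callable, Optional

# A source takes a video ID and returns transcript text, or None when it has nothing.
Source = Callable[[str], Optional[str]]


class TransientError(Exception):
    """Hypothetical marker for failures worth retrying (timeouts, rate limits)."""


def fetch_with_fallback(video_id: str, sources: list) -> dict:
    """Try (name, source) pairs in order, then retry transient failures once."""
    retry_later = []
    last_error = None

    def attempt(name, source, allow_retry):
        nonlocal last_error
        try:
            return source(video_id)
        except TransientError as exc:
            if allow_retry:
                retry_later.append((name, source))
            last_error = f"{name}: {exc}"
        except Exception as exc:  # permanent failure: move on to the next source
            last_error = f"{name}: {exc}"
        return None

    def found(name, text):
        return {"video_id": video_id, "status": "found",
                "source": name, "text": text, "error": None}

    # First pass: every source in priority order.
    for name, source in sources:
        text = attempt(name, source, allow_retry=True)
        if text:
            return found(name, text)

    # Retry pass: one more attempt for sources that failed transiently.
    for name, source in retry_later:
        text = attempt(name, source, allow_retry=False)
        if text:
            return found(name, text)

    return {"video_id": video_id, "status": "missing",
            "source": None, "text": None, "error": last_error}


# Stub sources for demonstration: one times out once, one has no captions.
calls = {"n": 0}

def flaky_source(video_id):
    calls["n"] += 1
    if calls["n"] == 1:
        raise TransientError("timeout")
    return "hello world"

def empty_source(video_id):
    return None

result = fetch_with_fallback(
    "abc123", [("invidious", flaky_source), ("watch_page", empty_source)]
)
print(result["status"], result["source"])  # prints: found invidious
```

Keeping each source behind the same call signature means optional sources (such as a yt-dlp fallback) can be added or removed from the list without touching the loop.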
Input (Simplified)
Use .actor/input_schema.json in the Apify UI.
Primary fields:
- videoUrls (required)
- preferredLanguages
- timeoutSec
- maxConcurrency
- maxChars
- proxyCountry (optional)
- proxyPoolSize
- fetchVideoMeta
- youtubeCookies (optional)
- enableYtDlpFallback (optional)
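As an illustration of how these fields might be assembled into a run input, the sketch below builds a dictionary and rejects unknown keys. The field names come from the list above, but the value types (lists of strings, integers, booleans), the example values, and the helper `build_run_input` are assumptions; .actor/input_schema.json remains the authoritative definition.

```python
import json

# Optional field names taken from the list above; value types are assumed.
OPTIONAL_FIELDS = {
    "timeoutSec", "maxConcurrency", "maxChars", "proxyCountry",
    "proxyPoolSize", "fetchVideoMeta", "youtubeCookies", "enableYtDlpFallback",
}


def build_run_input(video_urls, preferred_languages=("en",), **optional):
    """Build a run-input dict; videoUrls is the only required field."""
    if not video_urls:
        raise ValueError("videoUrls is required and must not be empty")
    unknown = set(optional) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    run_input = {
        "videoUrls": list(video_urls),
        "preferredLanguages": list(preferred_languages),
    }
    run_input.update(optional)
    return run_input


run_input = build_run_input(
    ["https://www.youtube.com/watch?v=VIDEO_ID"],  # placeholder URL
    preferred_languages=["en", "de"],
    maxConcurrency=5,           # assumed to be an integer
    enableYtDlpFallback=False,  # assumed to be a boolean
)
print(json.dumps(run_input, indent=2))
```

A dictionary like this could then be pasted into the Apify UI as JSON or passed as the run input through the Apify API.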
Proxy Behavior
- On Apify, the Actor defaults to Apify Proxy automatically.
- It tries RESIDENTIAL first (when available), then falls back to account-default groups.
- Proxy sessions are pre-created and rotated per video using proxyPoolSize.
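One plausible reading of "pre-created and rotated per video" is a fixed pool of session IDs handed out round-robin. The sketch below shows that idea only; the session ID format, the helper names, and the round-robin policy are assumptions, and the Actor's real rotation logic may differ.

```python
import uuid


def create_session_pool(pool_size: int) -> list:
    """Pre-create a fixed number of hypothetical proxy session IDs."""
    if pool_size < 1:
        raise ValueError("pool_size must be at least 1")
    return [f"session_{uuid.uuid4().hex[:8]}" for _ in range(pool_size)]


def assign_sessions(video_ids, pool):
    """Pair each video with a session, cycling through the pool in order."""
    return [(vid, pool[i % len(pool)]) for i, vid in enumerate(video_ids)]


pool = create_session_pool(3)  # plays the role of proxyPoolSize = 3
assignments = assign_sessions(["v1", "v2", "v3", "v4", "v5"], pool)
for video_id, session_id in assignments:
    print(video_id, session_id)
```

With a pool of 3, the fourth video reuses the first session, so a larger proxyPoolSize spreads requests over more sessions.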
Output
Dataset rows are transcript pieces, not one row per video.
Common output fields:
- video_id, url, title
- status (found or missing)
- language, source, transcript_url
- piece_index, piece_count, piece_start, piece_dur
- text, word_count, transcript_word_count, error
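Because rows are pieces rather than whole transcripts, a consumer usually needs to stitch them back together. A minimal sketch, assuming each dataset item is a dictionary with the fields listed above and that piece_index defines the order within a video; the sample items are invented for illustration.

```python
from collections import defaultdict


def join_pieces(items):
    """Rebuild one transcript string per video from piece-level dataset items."""
    by_video = defaultdict(list)
    for item in items:
        # Skip rows for videos without a transcript (status "missing").
        if item.get("status") == "found" and item.get("text"):
            by_video[item["video_id"]].append(item)
    return {
        video_id: " ".join(
            piece["text"] for piece in sorted(pieces, key=lambda p: p["piece_index"])
        )
        for video_id, pieces in by_video.items()
    }


# Invented sample rows, deliberately out of order.
sample_items = [
    {"video_id": "abc", "status": "found", "piece_index": 1, "text": "world"},
    {"video_id": "abc", "status": "found", "piece_index": 0, "text": "hello"},
    {"video_id": "abc", "status": "found", "piece_index": 2, "text": "again"},
    {"video_id": "xyz", "status": "missing", "piece_index": 0, "text": "",
     "error": "no captions"},
]
transcripts = join_pieces(sample_items)
print(transcripts)  # prints {'abc': 'hello world again'}
```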
Key-value store:
- OUTPUT: run metadata + totals + warnings
- OUTPUT.meta.source_timings: per-source attempts, success/failure, latency stats (avg_ms, p95_ms, max_ms)
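For readers unfamiliar with these latency statistics, the sketch below computes avg_ms, p95_ms, and max_ms from a list of per-attempt latencies. The nearest-rank percentile method is an assumption; the Actor may compute p95 differently, and the sample latencies are invented.

```python
def latency_stats(latencies_ms):
    """Compute average, nearest-rank 95th percentile, and maximum latency."""
    if not latencies_ms:
        return {"avg_ms": None, "p95_ms": None, "max_ms": None}
    ordered = sorted(latencies_ms)
    n = len(ordered)
    # Nearest-rank p95: smallest rank r with r >= 0.95 * n, in integer arithmetic.
    rank = (95 * n + 99) // 100
    return {
        "avg_ms": sum(ordered) / n,
        "p95_ms": ordered[rank - 1],
        "max_ms": ordered[-1],
    }


# Invented latencies for one source: four fast attempts and one slow outlier.
stats = latency_stats([100, 200, 300, 400, 1000])
print(stats)  # prints {'avg_ms': 400.0, 'p95_ms': 1000, 'max_ms': 1000}
```

A p95 far above the average, as in this sample, suggests that a source is usually fast but occasionally stalls, which is the kind of signal these per-source timings are meant to expose.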
Notes
- Transcript availability depends on caption availability and endpoint accessibility.
- This Actor does not use the official YouTube Data API.
- Legacy/advanced fields are still accepted for backward compatibility via API calls.