Pricing

from $2.50 / 1,000 results

YouTube Transcript Scraper

Extract YouTube captions, timestamps, SRT, VTT, and plain text from public videos in bulk without browser automation.

Rating

0.0

(0)

Developer

太郎山田

Last modified

2 days ago

YouTube Transcript Bulk API

After this run

Turn this Actor's output into a capped paid report with YouTube Channel Transcript RAG Intelligence. Use it when AI builders, content teams, and knowledge-base owners need to decide whether a YouTube corpus is clean enough for RAG before building embeddings or a chatbot.

First report: $9 / corpus_snapshot_report; set maxChargeUsd to $9.
Deeper report: $29 / rag_readiness_report; use only when the first result calls for competitor-level or action-level depth.
This is an internal Apify flow aid. It is not revenue proof until accounted paid usage appears.

Next report-style Actors

If you already have data from this Actor, these follow-on Actors turn public or user-provided inputs into decision-ready reports. They are optional, capped by maxChargeUsd, and do not make business outcome claims.

YouTube Channel Transcript RAG Intelligence - audit transcript coverage, missing-caption risk, and RAG readiness.
Website RAG Readiness Audit Report - audit public web pages that will be mixed with transcript corpora.

Extract transcripts from public YouTube videos in bulk. The actor is built for AI pipelines, RAG ingestion, content repurposing, subtitle export, research, and searchable video archives.

What It Does

You provide YouTube video URLs, video IDs, playlist URLs, or channel URLs. The actor resolves visible video IDs, fetches public caption tracks, selects the best matching language, downloads the timed transcript XML, and returns one dataset row per video.

The implementation is HTTP-first and does not use browser automation. That keeps Apify hosting cost low and makes the pricing predictable.
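The "selects the best matching language" step can be sketched as follows. This is a minimal illustration, not the actor's actual code: the track shape (languageCode, isAutoGenerated) and the function name pickCaptionTrack are assumptions, as is the tie-breaking order (manual captions first, then an exact language match over a regional variant).

```javascript
// Hypothetical track shape: { languageCode: "en-GB", isAutoGenerated: false }
function pickCaptionTrack(tracks, language, includeAutoGenerated) {
  const want = language.toLowerCase();
  // "en-GB" and "en" share the base language "en".
  const base = (code) => code.toLowerCase().split("-")[0];

  // Manual captions outrank auto-generated ones; an exact code match breaks ties.
  const score = (track) =>
    (track.isAutoGenerated ? 0 : 2) +
    (track.languageCode.toLowerCase() === want ? 1 : 0);

  const candidates = tracks
    .filter((t) => includeAutoGenerated || !t.isAutoGenerated)
    .filter((t) => base(t.languageCode) === base(want));

  if (candidates.length === 0) return null; // caller would emit an error row
  return candidates.reduce((best, t) => (score(t) > score(best) ? t : best));
}
```

Returning null instead of throwing fits the one-row-per-video model: the caller can turn a missing track into a no-charge error row rather than failing the run.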

Input

videoUrls: YouTube watch, Shorts, embed, live, or youtu.be URLs.
videoIds: Direct 11-character YouTube video IDs.
playlistUrls: Public playlist pages to expand into video IDs.
channelUrls: Public channel pages to expand into visible video IDs.
dedupeVideos: Remove duplicate video IDs after combining direct, playlist, and channel sources.
language: Preferred caption language such as en or ja.
includeAutoGenerated: Allows auto-generated captions when manual captions are not available.
translationLanguage: Optional YouTube transcript translation target.
outputFormat: json, text, srt, or vtt.
delivery / webhookUrl: Write to dataset only, or also send the final payload to a webhook.
maxVideos: Maximum videos to process.
dryRun: Validate input and emit preview rows without fetching YouTube.
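As an illustration of how videoUrls, videoIds, and dedupeVideos might interact, the sketch below normalizes the URL shapes listed above into 11-character IDs and optionally removes duplicates. The helper names extractVideoId and collectVideoIds are hypothetical, and the set of accepted hosts is an assumption rather than the actor's documented behavior.

```javascript
// YouTube video IDs are 11 characters from this alphabet.
const VIDEO_ID = /^[A-Za-z0-9_-]{11}$/;

function extractVideoId(input) {
  const value = String(input).trim();
  if (VIDEO_ID.test(value)) return value; // already a bare ID

  let url;
  try {
    url = new URL(value);
  } catch {
    return null; // neither an ID nor a parseable URL
  }

  const host = url.hostname.replace(/^(www\.|m\.)/, "");
  const parts = url.pathname.split("/").filter(Boolean);
  let candidate = null;

  if (host === "youtu.be") {
    candidate = parts[0]; // https://youtu.be/<id>
  } else if (host === "youtube.com" || host === "youtube-nocookie.com") {
    if (parts[0] === "watch") candidate = url.searchParams.get("v");
    else if (["shorts", "embed", "live"].includes(parts[0])) candidate = parts[1];
  }
  return candidate && VIDEO_ID.test(candidate) ? candidate : null;
}

function collectVideoIds(inputs, dedupeVideos = true) {
  const ids = inputs.map(extractVideoId).filter((id) => id !== null);
  // A Set preserves first-seen order while dropping repeats.
  return dedupeVideos ? [...new Set(ids)] : ids;
}
```

For example, a watch URL, a youtu.be link, and a bare ID that all point at the same video collapse to a single entry when dedupeVideos is true.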

Output

Each video produces one row with transcript data and billing audit fields:

videoId, videoUrl, videoTitle, channelTitle, sourceType
status, chargedEvent, sourceUrls, errors
language, sourceLanguage, isAutoGenerated, captionTrackName
segmentCount, fullText, segments, formattedTranscript
errorCode, errorMessage, scrapedAt

Unavailable captions, deleted videos, private videos, and request failures are returned as no-charge error rows instead of failing the full run.
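If you request outputFormat json and want to build subtitles yourself, the segments field can be converted to SRT along these lines. This is a sketch under a stated assumption: each segment is taken to look like { start, duration, text } with times in seconds, which the field list above does not confirm. The function names are illustrative.

```javascript
// Format seconds as an SRT timestamp: HH:MM:SS,mmm
function toSrtTimestamp(seconds) {
  const totalMs = Math.round(seconds * 1000);
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  const pad = (n, width = 2) => String(n).padStart(width, "0");
  return `${pad(h)}:${pad(m)}:${pad(s)},${pad(ms, 3)}`;
}

// Assumed segment shape: { start: number, duration: number, text: string }
function segmentsToSrt(segments) {
  const cues = segments.map((seg, i) => {
    const end = seg.start + seg.duration;
    // Each cue: 1-based index, time range, then the caption text.
    return `${i + 1}\n${toSrtTimestamp(seg.start)} --> ${toSrtTimestamp(end)}\n${seg.text}`;
  });
  return cues.join("\n\n") + "\n";
}
```

WebVTT differs mainly in starting with a WEBVTT header line and using a period instead of a comma before the milliseconds, so the same approach adapts with small changes.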

Pricing Events

Recommended pay-per-event (PPE) launch target:

apify-actor-start: keep Apify default $0.00005.
transcript_extracted: $0.0025 for successful transcript rows.
transcript_partial: $0.001 for useful partial transcript rows if enabled later.
dry_run, error, and unavailable-caption rows are no-charge.
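To see how these events add up, the sketch below estimates the charge for one run from the prices listed above. It assumes a single apify-actor-start charge per run, which may not match how Apify actually bills that event, and the function name estimateRunCostUsd is hypothetical. Prices are held as integer units of $0.00001 to avoid floating-point drift.

```javascript
// Prices from the list above, in units of $0.00001.
const PRICE_UNITS = {
  "apify-actor-start": 5,      // $0.00005
  transcript_extracted: 250,   // $0.0025
  transcript_partial: 100,     // $0.001
};
const UNITS_PER_USD = 100000;

// dry_run, error, and unavailable-caption rows are no-charge, so they are not counted.
function estimateRunCostUsd({ extracted = 0, partial = 0 } = {}) {
  const units =
    PRICE_UNITS["apify-actor-start"] + // assumption: charged once per run
    extracted * PRICE_UNITS.transcript_extracted +
    partial * PRICE_UNITS.transcript_partial;
  return units / UNITS_PER_USD;
}
```

Under these assumptions, 1,000 successful transcripts cost $2.50 in transcript_extracted events plus the start event, which lines up with the listed "from $2.50 / 1,000 results".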

The current cost model assumes plain HTTP requests with no browser and no residential proxy. Publication should remain blocked if live cost probes show that a residential proxy is required.

Limits

Only public videos with public caption tracks are supported.
Age-restricted, private, deleted, or captionless videos return an error row.
Playlist and channel expansion uses public page HTML and only discovers visible video IDs.
YouTube may change its watch page payload shape. The canary should run daily against a known captioned video.

Local Run

npm test
npm start

The default input.json uses dryRun: true so local startup does not depend on live YouTube access.

Premium Report Pack

Use these premium report actors when a raw dataset is ready to become a buyer-facing audit, watch summary, or agency deliverable. All three keep sourceDatasetId as advanced-only; first runs should use pasted input, URLs, demo mode, and reportTier.

CSV Local Business List Scoring & SEO Gap Report - Score pasted local business CSV lists and produce agency-ready lead/SEO gap reports.
SaaS Pricing Page Monitor & Competitor Price Change Alerts - Turn public pricing pages into snapshots, competitor reports, and weekly pricing watch summaries.
Ad Landing Page Offer Intelligence & CRO Gap Report - Analyze user-provided landing pages and pasted ad copy for offer, CTA, proof, and CRO gaps.

Recommended flow from this actor: run the current extraction/check first, export the useful dataset or copy the relevant URLs, then choose entry, premium, or bundle in the report actor with maxChargeUsd as the safety cap.

Turn Transcripts Into A Corpus Audit

After bulk transcript extraction, use YouTube Transcript Corpus Audit & RAG Readiness when the next question is not "can I get text?" but "is this corpus ready for RAG?" Start with reportTier="corpus_snapshot"; use rag_readiness only when the snapshot has enough caption coverage.

Use these follow-on Actors when you want a capped, decision-ready report instead of more raw rows. They use public or user-provided inputs, respect maxChargeUsd, and do not promise rankings, revenue, conversion lifts, or sales outcomes.

If this Actor gave you raw rows or source context, these follow-on report Actors are designed for a small capped paid run. They help make a decision, not just collect more data.

YouTube Channel Transcript RAG Intelligence - decide whether a YouTube corpus is clean enough for RAG before building embeddings or a chatbot. Entry $9 / corpus_snapshot_report; premium $29 / rag_readiness_report.

Keep maxChargeUsd equal to the selected tier. Internal links are traffic aids only; real proof requires accounted paid usage.
