YouTube Transcript API - AI Training Data (Batch)

Pricing

from $10.00 / 1,000 transcripts extracted

Batch extract YouTube transcripts at scale. Process thousands of videos in one run, up to 10 in parallel, with AI-optimized output: smart chunking, token estimation, and SRT/VTT export. $10 per 1,000 transcripts.

Rating: 0.0 (0)

Developer: Tan Analytics (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 2 days ago

YouTube Transcript Extractor - Batch Edition

Extract YouTube video transcripts in bulk. Process hundreds or thousands of videos in a single run, with up to 10 extracted in parallel.


Why Batch Processing?

| Feature | Single Video | Batch Edition |
|---|---|---|
| Process 1 video | Yes | Yes |
| Process 100 videos | No | Yes |
| Process 1000 videos | No | Yes |
| Parallel processing | - | Up to 10 concurrent |
| Convenience | One run per video | All in one run |

Use Cases

  • AI Training Data - Build large datasets from thousands of videos
  • Content Research - Analyze playlists, channels, or topic collections
  • Subtitle Generation - Export SRT/VTT for video libraries
  • Academic Research - Extract lectures, interviews, documentaries at scale

Features

⚡ Batch Processing

  • Process multiple videos in one run
  • Parallel extraction (up to 10 concurrent)
  • Progress tracking for large batches
  • Automatic proxy fallback for reliability

🎯 AI-Optimized Output

  • Smart chunking - Fit any LLM context window
  • Token estimation - Know your token count
  • Clean mode - Remove noise markers such as ♪, [music], and [applause]
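
The three features above can be approximated locally. The following is a minimal sketch, not the actor's actual implementation: the noise pattern, the word-boundary splitting rule, and the 1.33 tokens-per-word ratio are all assumptions (the ratio merely happens to match the sample output further down, where 367 words map to 488 estimated tokens).

```python
import re

# Assumption: noise markers are the ♪ symbol and any [bracketed] tag.
NOISE = re.compile(r"♪|\[[^\]]*\]")


def clean_transcript(text: str) -> str:
    """Drop noise markers and collapse the whitespace they leave behind."""
    return " ".join(NOISE.sub(" ", text).split())


def chunk_text(text: str, chunk_size: int = 2000) -> list[str]:
    """Split on word boundaries so chunks stay within chunk_size characters.

    chunk_size <= 0 disables chunking. A single word longer than
    chunk_size is kept whole as its own oversized chunk.
    """
    if chunk_size <= 0:
        return [text] if text else []
    chunks, current = [], ""
    for word in text.split():
        candidate = f"{current} {word}" if current else word
        if len(candidate) > chunk_size and current:
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def estimate_tokens(text: str, tokens_per_word: float = 1.33) -> int:
    """Rough token count from the word count (heuristic, not a tokenizer)."""
    return int(len(text.split()) * tokens_per_word)
```

For exact token counts, run the target model's own tokenizer over the chunks instead of relying on a word-based ratio.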

📄 Multiple Formats

  • Plain text
  • SRT subtitles
  • VTT subtitles
  • Timestamps
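
To illustrate what the SRT format looks like, here is a small sketch that renders timed segments as SRT cues. The `(start, end, text)` tuple shape is an assumption made for this example; the actor's internal segment representation is not documented here.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Render (start_seconds, end_seconds, text) tuples as numbered SRT cues."""
    cues = []
    for index, (start, end, text) in enumerate(segments, start=1):
        cues.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}"
        )
    return "\n\n".join(cues)
```

WebVTT is closely related: the file starts with a `WEBVTT` header line and timestamps use a period rather than a comma before the milliseconds.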

📊 Metadata Enrichment

  • Video title & author
  • Thumbnail URL
  • Duration
  • Word count & tokens

Pricing

$10 per 1,000 transcript extractions

| Videos | Cost |
|---|---|
| 100 | $1.00 |
| 1,000 | $10.00 |
| 10,000 | $100.00 |

No monthly commitment. Pay per video extracted.
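
For budgeting, the pricing reduces to simple arithmetic. This sketch assumes the flat listed rate of $10.00 per 1,000 transcripts; the listing says "from $10.00", so treat the result as an estimate rather than a quote.

```python
from decimal import Decimal

# Assumption: flat rate taken from the pricing table on this page.
PRICE_PER_1000 = Decimal("10.00")


def estimate_cost(video_count: int) -> Decimal:
    """Estimated cost in USD for extracting video_count transcripts."""
    if video_count < 0:
        raise ValueError("video_count must be non-negative")
    return (PRICE_PER_1000 * video_count / 1000).quantize(Decimal("0.01"))
```

For example, `estimate_cost(2500)` evaluates to `Decimal("25.00")`.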


Quick Start

Input

{
  "videoUrls": [
    "https://www.youtube.com/watch?v=...",
    "https://www.youtube.com/watch?v=...",
    "https://youtube.com/shorts/..."
  ],
  "maxConcurrency": 5,
  "chunkSize": 2000,
  "cleanTranscript": true,
  "outputFormat": "text"
}
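
One way to submit this input programmatically is through the `apify-client` Python package. The sketch below is hedged: `ACTOR_ID` is a placeholder that must be replaced with the ID shown on the actor's store page, the `APIFY_TOKEN` environment variable is an assumed convention, and `build_run_input` is a hypothetical helper that only mirrors the parameters documented on this page.

```python
import os

# Placeholder: copy the real "<username>/<actor-name>" ID from the store page.
ACTOR_ID = "<username>/<actor-name>"


def build_run_input(video_urls, max_concurrency=5, chunk_size=2000,
                    clean=True, output_format="text"):
    """Build the run input, checking the limits documented on this page."""
    if not video_urls:
        raise ValueError("videoUrls is required and must not be empty")
    if not 1 <= max_concurrency <= 10:
        raise ValueError("maxConcurrency must be between 1 and 10")
    if output_format not in {"text", "srt", "vtt"}:
        raise ValueError("outputFormat must be text, srt, or vtt")
    return {
        "videoUrls": list(video_urls),
        "maxConcurrency": max_concurrency,
        "chunkSize": chunk_size,
        "cleanTranscript": clean,
        "outputFormat": output_format,
    }


def run_batch(video_urls):
    """Start the actor, wait for it to finish, and return its dataset items."""
    from apify_client import ApifyClient  # pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=build_run_input(video_urls))
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

Validating the input locally before calling the actor avoids starting a run that would be rejected for an out-of-range setting.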

Output

Each video returns:

{
  "videoUrl": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
  "videoId": "dQw4w9WgXcQ",
  "transcript": "♪ We're no strangers to love ♪",
  "transcriptClean": "We're no strangers to love",
  "chunks": [
    {"id": 0, "text": "...", "start": 1, "end": 110, "wordCount": 230}
  ],
  "metadata": {
    "title": "Rick Astley - Never Gonna Give You Up",
    "author": "Rick Astley",
    "duration": 211,
    "wordCount": 367,
    "estimatedTokens": 488
  },
  "transcriptSRT": "1\n00:00:01,360 --> 00:00:03,040\n♪ We're no strangers to love ♪"
}
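
As an illustration of using these records for a training dataset, the sketch below converts result items into JSON Lines. The field names come from the sample output above; the record layout and the rule of skipping items without transcript text are assumptions made for this example.

```python
import json


def to_training_records(items):
    """Keep items that have transcript text, preferring the cleaned version."""
    records = []
    for item in items:
        text = item.get("transcriptClean") or item.get("transcript")
        if not text:
            continue  # assumption: failed videos carry no transcript text
        meta = item.get("metadata") or {}
        records.append({
            "id": item.get("videoId"),
            "title": meta.get("title"),
            "text": text,
            "tokens": meta.get("estimatedTokens", 0),
        })
    return records


def write_jsonl(records, path):
    """Write one JSON object per line, keeping non-ASCII characters readable."""
    with open(path, "w", encoding="utf-8") as handle:
        for record in records:
            handle.write(json.dumps(record, ensure_ascii=False) + "\n")
```

Summing the `tokens` field over all records gives a quick estimate of the dataset size before any training or embedding job is started.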

Input Parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| videoUrls | array | required | YouTube video URLs |
| language | string | "en" | Transcript language |
| maxConcurrency | integer | 5 | Videos processed in parallel (max 10) |
| chunkSize | integer | 2000 | Maximum characters per chunk (0 disables chunking) |
| cleanTranscript | boolean | false | Remove ♪ and [music] markers |
| includeMetadata | boolean | true | Include video metadata |
| outputFormat | string | "text" | text, srt, or vtt |
| includeTimestamps | boolean | true | Add [MM:SS] markers |

Supported URLs

  • https://www.youtube.com/watch?v=VIDEO_ID
  • https://youtu.be/VIDEO_ID
  • https://www.youtube.com/shorts/VIDEO_ID
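
To pre-filter a URL list before submitting it, the video ID can be pulled out of each of the three URL shapes above. This is an illustrative sketch using only the standard library (Python 3.9+); how the actor itself parses URLs is not documented here.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID for watch, youtu.be, and shorts URLs, else None."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower().removeprefix("www.")
    parts = [part for part in parsed.path.split("/") if part]
    if host == "youtu.be" and parts:
        return parts[0]
    if host == "youtube.com":
        if parsed.path == "/watch":
            return parse_qs(parsed.query).get("v", [None])[0]
        if len(parts) >= 2 and parts[0] == "shorts":
            return parts[1]
    return None
```

URLs for which the function returns `None` can be dropped or logged before the run starts.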

FAQ

Q: How many videos can I process?
A: No hard limit. Process hundreds or thousands in one run.

Q: What if a video has no transcript?
A: That video returns an error. Other videos still process.

Q: How fast is batch processing?
A: Up to 10 videos in parallel. Depends on video length and proxy availability.

Q: Which videos work?
A: Any YouTube video with available captions/subtitles.
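
The behavior described in the answers above (bounded parallelism, with one failing video not stopping the rest) follows a common pattern that can be sketched as below. This is not the actor's code: `fetch` is a hypothetical callable that returns a transcript or raises, and the `error` field name is an assumption.

```python
from concurrent.futures import ThreadPoolExecutor


def process_batch(video_urls, fetch, max_concurrency=5):
    """Run fetch over all URLs with bounded parallelism and per-video errors."""
    workers = max(1, min(max_concurrency, 10))  # documented cap of 10

    def safe_fetch(url):
        try:
            return {"videoUrl": url, "transcript": fetch(url)}
        except Exception as exc:  # isolate the failure to this one video
            return {"videoUrl": url, "error": str(exc)}

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in input order, regardless of finish order.
        return list(pool.map(safe_fetch, video_urls))
```

Because each failure is captured as a record rather than raised, callers can split the results into successes and errors after the batch completes.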


Support

Open an issue on GitHub, or contact the developer for enterprise pricing on batches of 100K+ videos.

