Unlimited YouTube Subtitle & Transcript Scraper Pro

Pricing

$20.00/month + usage

๐—ฌ๐—ผ๐˜‚๐—ง๐˜‚๐—ฏ๐—ฒ ๐—ฆ๐˜‚๐—ฏ๐˜๐—ถ๐˜๐—น๐—ฒ & ๐—ง๐—ฟ๐—ฎ๐—ป๐˜€๐—ฐ๐—ฟ๐—ถ๐—ฝ๐˜ ๐——๐—ผ๐˜„๐—ป๐—น๐—ผ๐—ฎ๐—ฑ๐—ฒ๐—ฟ - Extract subtitles and transcripts in ๐—บ๐˜‚๐—น๐˜๐—ถ๐—ฝ๐—น๐—ฒ ๐—น๐—ฎ๐—ป๐—ด๐˜‚๐—ฎ๐—ด๐—ฒ๐˜€ from any YouTube video. Supports ๐—ฏ๐˜‚๐—น๐—ธ ๐—ฑ๐—ผ๐˜„๐—ป๐—น๐—ผ๐—ฎ๐—ฑ๐˜€, auto-generated captions & custom language selection.

Developer

Xtech

Maintained by Community

YouTube Transcript Scraper Pro

Extract clean transcripts from one or many YouTube videos in a single run.

This Actor accepts YouTube video URLs or video IDs and returns transcripts in the format you need: plain text, timestamped text, SRT, VTT, JSON, CSV, or all formats at once.

What you get

  • Bulk processing for multiple videos
  • Support for standard YouTube URLs, short URLs, and raw video IDs
  • Automatic preference for manually created subtitles when available
  • Structured dataset output for downstream automations
  • Output format control for different workflows (LLM prep, subtitle editing, QA, analytics)

Input

videos (required)

List of YouTube videos.

Supported values:

  • Video ID: dQw4w9WgXcQ
  • Watch URL: https://www.youtube.com/watch?v=dQw4w9WgXcQ
  • Short URL: https://youtu.be/dQw4w9WgXcQ
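
If you want to normalize these three forms yourself before building a run, a minimal sketch follows. It is not the Actor's own parsing logic. The helper name extract_video_id is hypothetical, and the 11-character ID pattern is an assumption based on the IDs shown above.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumption: video IDs are 11 characters drawn from letters, digits, "_" and "-".
_ID_RE = re.compile(r"^[A-Za-z0-9_-]{11}$")
_WATCH_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com"}


def extract_video_id(value: str) -> Optional[str]:
    """Return the video ID for a raw ID, watch URL, or short URL, else None."""
    value = value.strip()
    if _ID_RE.match(value):
        return value  # already a raw video ID

    parsed = urlparse(value)
    host = (parsed.hostname or "").lower()
    if host == "youtu.be":
        # Short URL: the ID is the first path segment.
        candidate = parsed.path.lstrip("/").split("/")[0]
    elif host in _WATCH_HOSTS and parsed.path == "/watch":
        # Watch URL: the ID is the "v" query parameter.
        candidate = parse_qs(parsed.query).get("v", [""])[0]
    else:
        return None
    return candidate if _ID_RE.match(candidate) else None
```

Inputs that do not match any of the three forms return None, so they can be reported before a run is started.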

type (optional)

Output format for the transcript.

Allowed values:

  • plain (default)
  • timestamped
  • srt
  • vtt
  • json
  • csv
  • all

keepMusicMarkers (optional)

  • Type: boolean
  • Default: false
  • When false, markers like [Music] are removed.

dropEmpty (optional)

  • Type: boolean
  • Default: true
  • Removes empty transcript segments after cleanup.
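
To illustrate how keepMusicMarkers and dropEmpty interact, here is a rough sketch of a comparable cleanup step. The Actor's real cleanup rules are not documented here. The function clean_segments is hypothetical, and treating any short bracketed tag such as [Music] or [Applause] as a marker is an assumption.

```python
import re
from typing import Any, Dict, List

# Assumption: markers are bracketed tags such as "[Music]" or "[Applause]".
_MARKER_RE = re.compile(r"\[[^\[\]]*\]")


def clean_segments(
    segments: List[Dict[str, Any]],
    keep_music_markers: bool = False,
    drop_empty: bool = True,
) -> List[Dict[str, Any]]:
    """Strip bracketed markers, then optionally drop segments left empty."""
    cleaned = []
    for seg in segments:
        text = seg["text"]
        if not keep_music_markers:
            text = _MARKER_RE.sub("", text)
        text = " ".join(text.split())  # collapse leftover whitespace
        if drop_empty and not text:
            continue  # the segment contained nothing but a marker
        cleaned.append({**seg, "text": text})
    return cleaned
```

With both defaults, a segment whose only text is [Music] disappears entirely. With dropEmpty set to false it would remain as a segment with empty text.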

maxConcurrency (optional)

  • Type: integer
  • Default: 3
  • Range: 1 to 10
  • Controls how many videos are processed in parallel.
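
The input fields and their documented defaults and ranges can be checked client-side before a run is submitted. The sketch below assumes nothing beyond the values listed above. The helper build_run_input is a hypothetical name, not part of the Actor.

```python
from typing import Any, Dict, List

# Allowed values for "type", as listed in the Input section.
ALLOWED_TYPES = {"plain", "timestamped", "srt", "vtt", "json", "csv", "all"}


def build_run_input(
    videos: List[str],
    output_type: str = "plain",
    keep_music_markers: bool = False,
    drop_empty: bool = True,
    max_concurrency: int = 3,
) -> Dict[str, Any]:
    """Validate the documented constraints and return the Actor input object."""
    if not videos:
        raise ValueError("videos is required and must not be empty")
    if output_type not in ALLOWED_TYPES:
        raise ValueError(f"type must be one of {sorted(ALLOWED_TYPES)}")
    if not 1 <= max_concurrency <= 10:
        raise ValueError("maxConcurrency must be between 1 and 10")
    return {
        "videos": list(videos),
        "type": output_type,
        "keepMusicMarkers": keep_music_markers,
        "dropEmpty": drop_empty,
        "maxConcurrency": max_concurrency,
    }
```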

Output

Each dataset item contains:

  • video_id (string)
  • language (string or null)
  • is_generated (boolean or null)
  • transcript (string, array, object, or null, depending on type; with type set to all it is an object keyed by format, as in the example output item)

If a video fails, the item is still returned with:

  • language: null
  • is_generated: null
  • transcript: ""
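
Because failed videos still produce a dataset item, downstream code needs to separate them from successful ones. One possible check, based only on the three fields listed above, is sketched here. The function name split_results is hypothetical.

```python
from typing import Any, Dict, List, Tuple


def split_results(
    items: List[Dict[str, Any]],
) -> Tuple[List[Dict[str, Any]], List[str]]:
    """Return (successful items, video IDs of failed items)."""
    ok: List[Dict[str, Any]] = []
    failed: List[str] = []
    for item in items:
        is_failure = (
            item.get("language") is None
            and item.get("is_generated") is None
            and not item.get("transcript")  # covers "" as well as None
        )
        if is_failure:
            failed.append(item["video_id"])
        else:
            ok.append(item)
    return ok, failed
```

The list of failed IDs can then be logged or retried in a later run.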

Example input

{
  "videos": [
    "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
    "https://youtu.be/jNQXAC9IVRw",
    "M7FIvfx5J10"
  ],
  "type": "all",
  "keepMusicMarkers": false,
  "dropEmpty": true,
  "maxConcurrency": 4
}
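
An input like the one above can be submitted programmatically. The sketch below assumes the official apify-client Python package and uses its actor(...).call(run_input=...) and dataset(...).iterate_items() methods. The Actor ID and API token are placeholders you must supply, and fetch_transcripts is a hypothetical helper name.

```python
from typing import Any, Dict, List


def fetch_transcripts(
    client: Any, actor_id: str, run_input: Dict[str, Any]
) -> List[Dict[str, Any]]:
    """Run the Actor, wait for it to finish, and return all dataset items."""
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return a result")
    # Each finished run exposes its results through a default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Usage (requires `pip install apify-client`; both values are placeholders):
#
#   from apify_client import ApifyClient
#   client = ApifyClient("<YOUR_APIFY_TOKEN>")
#   items = fetch_transcripts(client, "<ACTOR_ID>", {
#       "videos": ["https://youtu.be/jNQXAC9IVRw", "M7FIvfx5J10"],
#       "type": "all",
#   })
```

Passing the client in as an argument keeps the helper easy to test with a stub.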

Example output item

{
  "video_id": "dQw4w9WgXcQ",
  "language": "en",
  "is_generated": false,
  "transcript": {
    "plain": "Never gonna give you up...",
    "timestamped": "00:00:01.200 Never gonna give you up...",
    "srt": "1\n00:00:01,200 --> 00:00:03,800\nNever gonna give you up...",
    "vtt": "WEBVTT\n\n00:00:01.200 --> 00:00:03.800\nNever gonna give you up...",
    "json": [
      {
        "text": "Never gonna give you up...",
        "start": 1.2,
        "duration": 2.6
      }
    ],
    "csv": "start,duration,text\n1.2,2.6,\"Never gonna give you up...\""
  }
}
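
When type is all, the srt and vtt strings in an item like the one above can be written straight to subtitle files. This is a small sketch under that assumption. The function save_subtitles and the <video_id>.<ext> naming scheme are illustrative choices, not something the Actor prescribes.

```python
from pathlib import Path
from typing import Any, Dict, List


def save_subtitles(item: Dict[str, Any], out_dir: str) -> List[Path]:
    """Write the srt and vtt variants of one item to <out_dir>/<video_id>.<ext>."""
    transcript = item.get("transcript")
    if not isinstance(transcript, dict):
        return []  # failed item, or a run that did not use type "all"

    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    written: List[Path] = []
    for ext in ("srt", "vtt"):
        content = transcript.get(ext)
        if content:
            path = target / f"{item['video_id']}.{ext}"
            path.write_text(content, encoding="utf-8")
            written.append(path)
    return written
```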

Typical use cases

  • Build subtitle files (srt or vtt) for editing or publishing
  • Prepare clean text for AI/LLM pipelines (plain or json)
  • Create timestamped transcripts for compliance or QA reviews
  • Batch-export transcripts for research and analytics

Notes

  • The Actor deduplicates repeated video IDs in one run.
  • Transcript availability depends on whether subtitles exist for a given video.
  • For best throughput on larger batches, raise maxConcurrency gradually from the default of 3 toward the maximum of 10.
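
Since repeated video IDs are deduplicated within a run, a run can return fewer items than the number of inputs you submitted. If you want your own list to match, you can deduplicate it first. A minimal order-preserving sketch follows. The helper dedupe_video_ids is hypothetical and assumes the inputs are already normalized to bare video IDs.

```python
from typing import List


def dedupe_video_ids(video_ids: List[str]) -> List[str]:
    """Remove repeated IDs while keeping the first occurrence of each."""
    # dict preserves insertion order, so the first occurrence wins.
    return list(dict.fromkeys(video_ids))
```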