Youtube Transcript Scraper

Pricing: $19.99/month + usage
🎬 YouTube Transcript Scraper (youtube-transcript-scraper) pulls clean video transcripts/captions with timestamps, multi-language, and batch export (JSON/CSV). 🔎 Ideal for SEO, keyword research, summaries, accessibility, and content repurposing. ⚡ Fast, reliable, API-ready.

Developer: ScrapeEngine (Maintained by Community)
Youtube Transcript Scraper

Youtube Transcript Scraper is a fast, reliable YouTube transcript extractor that turns public captions into clean, structured data — no copy-paste, no extensions. It solves manual transcription by letting you download YouTube transcripts as plain text or detailed timestamped captions you can export to JSON/CSV. Built for marketers, developers, data analysts, and researchers, this YouTube transcript scraper tool scales from single videos to bulk lists with proxy-backed reliability so you can get YouTube video transcripts at speed and feed them into your workflows.

What data / output can you get?

The actor streams results to your Apify dataset as each URL finishes. Here are the exact fields you’ll see in the output, with examples.

| Data type | Description | Example value |
| --- | --- | --- |
| id | YouTube video ID extracted from the input URL | 4KbrxIpQgkM |
| url | Canonical video URL composed from the ID | https://www.youtube.com/watch?v=4KbrxIpQgkM |
| input | The original input URL you provided | https://youtu.be/4KbrxIpQgkM |
| transcripts | Array of transcript variants by language | […] |
| transcripts[].language | Human-readable language label from YouTube | English |
| transcripts[].content (text) | Full transcript merged into a single string when outputFormat="text" | Hello everyone and welcome… |
| transcripts[].content[] (timestamp) | Array of caption segments when outputFormat="timestamp" | […] |
| transcripts[].content[].startMs | Segment start time in milliseconds | 1250 |
| transcripts[].content[].endMs | Segment end time in milliseconds | 4280 |
| transcripts[].content[].startTime | Segment start time in mm:ss | 0:01 |
| transcripts[].content[].text | Caption text for the segment | Welcome to the channel. |

Notes:

  • Set outputFormat to "text" for one merged string per language, or "timestamp" for structured caption timing.
  • Export results from the Apify dataset in JSON or CSV for downstream analysis, SEO, accessibility, or automation.
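To see how the two output shapes relate, the timestamped segments can be collapsed into the plain-text form yourself. This is an illustrative post-processing sketch, not the actor's internal code; it assumes the dataset item shape documented above:

```python
def merge_segments(item):
    """Collapse a "timestamp"-format dataset item into one plain string
    per language, mirroring the "text" output shape."""
    merged = []
    for variant in item["transcripts"]:
        text = " ".join(seg["text"] for seg in variant["content"])
        merged.append({"language": variant["language"], "content": text})
    return {**item, "transcripts": merged}

item = {
    "id": "dQw4w9WgXcQ",
    "transcripts": [{
        "language": "English",
        "content": [
            {"startMs": 0, "endMs": 2140, "startTime": "0:00",
             "text": "We're no strangers to love"},
            {"startMs": 2140, "endMs": 4280, "startTime": "0:02",
             "text": "You know the rules and so do I"},
        ],
    }],
}
print(merge_segments(item)["transcripts"][0]["content"])
# → We're no strangers to love You know the rules and so do I
```

This lets you request "timestamp" once and derive the merged text locally, rather than running the actor twice.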

Key features

  • ⚡ Fast, flexible transcript output: Choose outputFormat="text" for a single merged transcript or "timestamp" for detailed time‑coded captions.
  • 🗣️ Language filters you control: Toggle includeEnglishAG to include English auto‑generated captions and includeNonEnglish to include non‑English transcripts.
  • 📦 Batch processing at scale: Provide multiple YouTube URLs and save each completed item immediately — perfect for a bulk YouTube transcript downloader workflow.
  • 🔒 Proxy‑backed reliability: Uses Apify RESIDENTIAL proxy by default (when enabled) to reduce IP blocks and stabilize large runs.
  • 💻 Developer‑friendly & API‑ready: Access via the Apify API; outputs are clean JSON for pipelines, integrations, and automation.
  • 🔄 Dataset‑first workflow: Every video’s result is pushed as soon as it finishes so you can stream, monitor, and export during long jobs.

How to use Youtube Transcript Scraper - step by step

  1. Sign in to Apify and open the Youtube Transcript Scraper actor.
  2. Paste one or more video links into urls (accepts both youtube.com/watch?v=… and youtu.be/… formats).
  3. Choose your outputFormat:
    • text for a single merged transcript per language.
    • timestamp for per‑segment timing with startMs, endMs, startTime, text.
  4. Set language filters as needed:
    • includeEnglishAG to include English auto‑generated captions.
    • includeNonEnglish to include non‑English transcripts.
  5. (Optional) Configure proxyConfiguration. When enabled, the actor uses the Apify RESIDENTIAL proxy group by default to mitigate blocking.
  6. Start the run. Each processed URL is appended to the dataset immediately as a separate item.
  7. Export the dataset as JSON or CSV and feed it into your analytics, SEO, or automation pipeline.
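Step 2 accepts both URL shapes; as a stand-alone illustration (not the actor's internal code), here is how they map to the id and canonical url fields shown in the output table:

```python
from urllib.parse import urlparse, parse_qs

def video_id(url):
    """Extract the video ID from either accepted URL form."""
    parsed = urlparse(url)
    if parsed.hostname == "youtu.be":
        # short form: the ID is the path itself
        return parsed.path.lstrip("/")
    # long form: youtube.com/watch?v=<ID>
    return parse_qs(parsed.query)["v"][0]

def canonical_url(url):
    """Compose the canonical watch URL from the extracted ID."""
    return f"https://www.youtube.com/watch?v={video_id(url)}"

print(video_id("https://youtu.be/4KbrxIpQgkM"))                 # 4KbrxIpQgkM
print(video_id("https://www.youtube.com/watch?v=dQw4w9WgXcQ"))  # dQw4w9WgXcQ
```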

Pro tip: Orchestrate runs via the Apify API and pipe results to Make, n8n, or Python for an automated YouTube transcript API workflow.

Use cases

| Use case | Description |
| --- | --- |
| SEO + content repurposing | Convert YouTube subtitles to text to create blogs, social captions, and keyword-rich summaries. |
| Research & academia | Scrape YouTube transcripts for topic modeling, text mining, and qualitative analysis at scale. |
| Accessibility workflows | Download auto-generated YouTube captions and official subtitles to improve accessibility or QA. |
| Marketing & social teams | Extract quotes and highlights to accelerate campaign assets and video summaries. |
| Developer pipelines (API) | Use the structured JSON output to power chatbots, search indexes, or RAG systems. |
| Competitive & trend analysis | Bulk-download transcripts for channels or lists to analyze messaging and themes. |

Why choose Youtube Transcript Scraper?

This production‑ready YouTube caption downloader focuses on precision, scale, and automation — not brittle, manual alternatives.

  • 🎯 Accurate, structured output: Choose simple text or rich timestamps with startMs/endMs/startTime/text.
  • 🌍 Multilingual control: Include English auto‑generated and/or non‑English transcripts when available.
  • 📈 Built for scale: Process many URLs per run and stream results to your dataset as they complete.
  • 💻 Developer access: Clean JSON fits directly into APIs, Python scripts, and automation tools.
  • 🛡️ Reliable vs. extensions: Apify runtime with optional RESIDENTIAL proxy reduces friction compared to browser add‑ons.
  • 💸 Export‑ready: Download your results in JSON/CSV without extra formatting.

Is it legal to scrape YouTube transcripts?

Yes — when done responsibly. This actor automates access to transcripts and captions available on YouTube. Use it for analysis, accessibility, or internal research in line with platform terms and applicable laws.

Guidelines:

  • Scrape only public videos and captions you’re allowed to use.
  • Respect YouTube’s Terms of Service and local regulations (e.g., GDPR/CCPA where applicable).
  • Don’t redistribute transcripts commercially without rights from content owners.
  • Do not attempt to access private or restricted content.
  • Consult your legal team for edge cases or commercial redistribution.

Input parameters & output format

Example JSON input

{
  "urls": [
    "https://www.youtube.com/watch?v=4KbrxIpQgkM",
    "https://youtu.be/dQw4w9WgXcQ"
  ],
  "includeEnglishAG": true,
  "includeNonEnglish": false,
  "outputFormat": "text",
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}

Parameter reference

| Field | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| urls | array | Yes | [] | One or more YouTube video URLs to process. Each completed URL is appended immediately to the dataset. |
| includeEnglishAG | boolean | No | true | Whether to include English auto-generated transcripts. |
| includeNonEnglish | boolean | No | false | Whether to include non-English transcripts. |
| outputFormat | string (enum: "timestamp", "text") | No | "text" | Format of transcript output: "timestamp" returns detailed timestamps; "text" returns plain text. |
| proxyConfiguration | object | No | {} | Proxy configuration. Uses the Apify RESIDENTIAL proxy group by default to mitigate YouTube IP blocking; if not configured, the actor attempts to use Apify Proxy automatically. |

Notes on defaults and filtering:

  • UI defaults come from the input schema above.
  • If you omit fields in a raw API call, runtime fallbacks apply in code: includeEnglishAG defaults to false, includeNonEnglish defaults to true, and outputFormat defaults to "text". To avoid surprises, explicitly set these flags.
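To make the divergence between UI defaults and raw-API fallbacks concrete, a small helper that applies the documented runtime fallbacks to a partial input could look like this (the helper name and structure are illustrative, not part of the actor):

```python
# Runtime fallbacks applied in code when a raw API call omits a field,
# per the notes above (these differ from the UI defaults).
RUNTIME_FALLBACKS = {
    "includeEnglishAG": False,
    "includeNonEnglish": True,
    "outputFormat": "text",
}

def resolve_run_input(raw_input):
    """Fill in the documented runtime fallbacks for any omitted fields;
    explicitly set fields always win."""
    return {**RUNTIME_FALLBACKS, **raw_input}

print(resolve_run_input({"urls": ["https://youtu.be/4KbrxIpQgkM"]}))
```

Setting all three flags explicitly in every call, as the note recommends, makes this fallback logic irrelevant.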

Example JSON output (outputFormat="text")

{
  "id": "4KbrxIpQgkM",
  "url": "https://www.youtube.com/watch?v=4KbrxIpQgkM",
  "input": "https://youtu.be/4KbrxIpQgkM",
  "transcripts": [
    {
      "language": "English",
      "content": "Hello everyone and welcome to our video. Today we will cover..."
    }
  ]
}

Example JSON output (outputFormat="timestamp")

{
  "id": "dQw4w9WgXcQ",
  "url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
  "input": "https://www.youtube.com/watch?v=dQw4w9WgXcQ",
  "transcripts": [
    {
      "language": "English",
      "content": [
        {
          "startMs": 0,
          "endMs": 2140,
          "startTime": "0:00",
          "text": "We're no strangers to love"
        },
        {
          "startMs": 2140,
          "endMs": 4280,
          "startTime": "0:02",
          "text": "You know the rules and so do I"
        }
      ]
    }
  ]
}

Output field notes:

  • transcripts[].content is either a string (when "text") or an array of segments with startMs, endMs, startTime, text (when "timestamp").
  • The actor pushes one dataset item per input URL as soon as it completes.

FAQ

Do I need a YouTube transcript Chrome extension to use this?

No. This runs on Apify’s infrastructure and exposes a dataset/API, so you can get YouTube video transcripts without installing any browser extension.

Can I scrape YouTube transcripts in bulk?

Yes. Add multiple video links to urls and the actor will process each, saving results to the dataset as they finish — ideal for a bulk YouTube transcript downloader flow.

Does it support auto-generated captions?

Yes. Set includeEnglishAG to true to include English auto‑generated captions. You can also enable includeNonEnglish to include non‑English transcripts when available.

Can I extract subtitles as plain text or with timestamps?

Yes. Set outputFormat to "text" for one merged transcript per language, or "timestamp" for detailed, per‑segment timings.

Can I export SRT files?

Not directly. The actor outputs either plain text or timestamped segments in JSON. You can convert the timestamped JSON to SRT in a post‑processing step if needed.
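Because the timestamped output carries startMs and endMs, that post-processing step is short. This sketch assumes the segment shape shown in the example output above:

```python
def ms_to_srt(ms):
    """Format milliseconds as an SRT timecode: HH:MM:SS,mmm."""
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, millis = divmod(rem, 1_000)
    return f"{h:02}:{m:02}:{s:02},{millis:03}"

def segments_to_srt(segments):
    """Turn "timestamp"-format caption segments into an SRT document."""
    blocks = []
    for i, seg in enumerate(segments, start=1):
        blocks.append(
            f"{i}\n"
            f"{ms_to_srt(seg['startMs'])} --> {ms_to_srt(seg['endMs'])}\n"
            f"{seg['text']}"
        )
    return "\n\n".join(blocks) + "\n"

segments = [
    {"startMs": 0, "endMs": 2140, "text": "We're no strangers to love"},
    {"startMs": 2140, "endMs": 4280, "text": "You know the rules and so do I"},
]
print(segments_to_srt(segments))
```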

Is there a YouTube transcript API for developers?

You can run this actor via the Apify API and receive structured JSON/CSV outputs, making it a practical YouTube transcript API alternative for programmatic pipelines.

Will it work without proxies?

It can run without a proxy, but the actor is designed to use the Apify RESIDENTIAL proxy by default (when enabled) to reduce YouTube IP blocking, especially for large jobs.

How are results exported?

All results are pushed to an Apify dataset. From there, you can download in JSON or CSV and integrate into analytics or automation workflows.

Does it handle non-English transcripts?

Yes. Enable includeNonEnglish to include non‑English transcripts when YouTube provides them.

How does this differ from a YouTube transcript Chrome extension?

It’s infrastructure‑backed and API‑ready. You can automate, run in bulk, and export clean JSON/CSV without a browser, making it more robust than extension‑based tools.

Final thoughts

Youtube Transcript Scraper is built to turn YouTube captions into clean, structured data for analysis and reuse. With configurable language filters, plain text or timestamped outputs, batch processing, and proxy‑backed reliability, it’s ideal for marketers, developers, analysts, and researchers. Use the Apify API to automate at scale, export to JSON/CSV, and plug results into your content, accessibility, or AI workflows. Start extracting smarter transcripts today.