Pricing

from $1.50 / 1,000 results

Lightning YouTube Scraper (Transcript & Metadata)

Extract full transcripts (subtitles) and metadata from YouTube videos instantly without opening a browser. Perfect for AI, LLMs, and content summarization.

Pricing

from $1.50 / 1,000 results

Rating

0.0

(0)

Developer

Tan Yegen

Actor stats

Bookmarked

Total users

Monthly active users

5 days ago

Last modified

⚡ Lightning YouTube Scraper (Transcript & Metadata)

🤖 Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this actor.

tyegen/universal-youtube-transcript-extractor on Apify. Call: ApifyClient("TOKEN").actor("tyegen/universal-youtube-transcript-extractor").call(run_input={"startUrls": ["URL_HERE"]}), then client.dataset(run["defaultDatasetId"]).list_items().items for results.

Unlock the hidden textual data of YouTube at unprecedented speeds. Extract full transcripts (subtitles) and rich metadata from any YouTube video instantly—without the overhead of browser automation, without battling official API quota limits, and at nearly $0.00 in Compute Unit costs.

🚀 The Game-Changing Technology (How it Works)

Most YouTube scrapers on the market rely on heavy, resource-intensive browser automation tools like Playwright or Puppeteer. They physically open a browser, load the heavy YouTube interface, scroll down to force elements to render, and scrape the DOM. This is slow, prone to breaking, and expensive. Alternatively, they use the official YouTube Data API, which requires API keys and imposes strict, costly quota limits.

This actor uses a hidden backdoor approach: It targets the internal ytInitialPlayerResponse JSON embedded directly within the raw HTML payload of a YouTube video page. It then talks directly to Google's backend caption servers to download the subtitle XML files.

✨ Unbeatable Features

Ultra-Lightning Speed: Extracts a 2-hour long podcast transcript in exactly 1 second.
No API Keys Needed: Completely bypasses the YouTube Data API limitations. Zero setup required.
Incredibly Cost-Effective: Uses pure, lightweight HTTP requests. It costs a fraction of a cent per video in Apify Compute Units.
Clean, Formatted Output: Automatically decodes messy XML entities (like & or ') and merges timestamped captions into a beautiful, readable, and continuous text block.
Rich Metadata Included: Alongside the transcript, it fetches the video title, author, view count, length in seconds, and high-res thumbnail URL.

🎯 Ideal Use Cases & Target Audience

AI & LLM Training (RAG Pipelines): Feed thousands of hours of rich, conversational podcast transcripts into your Retrieval-Augmented Generation pipelines or fine-tuning datasets.
Content Summarization Agents: Build automated workflows that grab video texts instantly and pass them to ChatGPT, Claude, or Gemini for rapid summarization and key-takeaway extraction.
Competitor & SEO Analysis: Extract text from competitor videos to analyze their spoken keywords, hooks, content structure, and pacing.
Content Repurposing: Instantly convert your own YouTube videos into blog posts, newsletters, or Twitter threads.

💰 Pricing & ROI

Pay-Per-Result: Only $1.50 per 1,000 videos. You get the full metadata PLUS the entire video transcript for a price no competitor can match. Your compute costs will remain near zero.

📥 Input Configuration

Field	Type	Description
`startUrls`	Array	A list of YouTube video URLs (e.g., `https://www.youtube.com/watch?v=dQw4w9WgXcQ`).
`proxyConfiguration`	Object	Standard Apify Datacenter proxies work flawlessly for this hidden API approach.

📤 Output Schema

For each video URL, the actor will produce a clean JSON object containing the metadata and transcript.

Field	Type	Description
`url`	String	The original YouTube video URL.
`videoId`	String	The unique 11-character YouTube video ID.
`title`	String	The title of the video.
`author`	String	The name of the channel/creator.
`views`	Number	Total view count.
`lengthSeconds`	Number	Duration of the video in seconds.
`thumbnail`	String	URL to the highest resolution thumbnail available.
`transcriptLanguage`	String	The detected language of the transcript (e.g., "English").
`transcript`	String	The full, cleaned text of the video's subtitles.
`scrapedAt`	String	ISO timestamp of when the extraction occurred.

💡 Output Example

{
  "url": "https://www.youtube.com/watch?v=M98G...",
  "videoId": "M98G...",
  "title": "Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI",
  "author": "Lex Fridman",
  "views": 5400231,
  "lengthSeconds": 8700,
  "thumbnail": "https://i.ytimg.com/vi/M98G.../maxresdefault.jpg",
  "transcriptLanguage": "English",
  "transcript": "Hello and welcome to the Lex Fridman podcast. Today my guest is Sam Altman. We discuss the future of artificial intelligence... (thousands of words of clean text)",
  "scrapedAt": "2026-04-30T18:00:00.000Z"
}

⚠️ Limitations & Good to Know

No Captions Available: If a video does not have auto-generated or manual captions enabled by the creator, the transcript field will return null.
Private/Age-Restricted Videos: Videos that require user login or age verification cannot be scraped by this actor.

Youtube Transcript Generator

quirky_neuron/youtube-transcript-generator

Instantly extract transcripts and subtitles from any YouTube video. Supports full URLs and Video IDs. Returns structured JSON data via a fast API integration. Perfect for AI analysis, content summarization, and SEO.

Shadow Dev

YouTube Video Transcript & Metadata Scraper

trisecode/yt-transcript

Fast & free YouTube scraper. Extract transcripts, subtitles, and detailed video metadata without an API key. Supports export to JSON, CSV.

Trisecode

Youtube Transcript Scraper

scrapier/youtube-transcript-scraper

Extract full transcripts from YouTube videos with the YouTube Transcript Scraper. Get precise timestamps, speaker names, and text for any video. Perfect for content analysis, SEO, research, and summarization. Fast, accurate, and easy to integrate into your workflow.

Scrapier

5.0

Youtube Transcript

canadesk/youtube-transcript

Extract transcripts (with timestamps) from YouTube videos.

Canadesk Support

Youtube Transcript Scraper

api-empire/youtube-transcript-scraper

Extract full YouTube video transcripts instantly with this Apify YouTube Transcript Scraper. Get accurate subtitles, timestamps, and speaker data for analysis, SEO, or research. Perfect for content creators, marketers, and data scientists. Fast, reliable, and easy to automate.

API Empire

Youtube Transcript Scraper

scraper-engine/youtube-transcript-scraper

YouTube Transcript Scraper extracts full transcripts from public YouTube videos with ease. Quickly retrieve spoken content for research, summarization, SEO, or accessibility—just enter a video URL and get clean, structured text. No login or API key required.

Scraper Engine

263

5.0

YouTube Transcripts Subtitles Captions Extractor. ⚡

lume/yt-transcripts

YouTube transcript extractor, subtitle downloader, captions scraper, and video transcript crawler. Extract, download, and save YouTube video transcripts, subtitles, and captions for one or many Youtube Videos.

Lume

311

5.0

Youtube Transcript Scraper

scrapapi/youtube-transcript-scraper

🎥 YouTube Transcript Scraper (youtube-transcript-scraper) extracts clean video transcripts & captions—timestamps, languages, and more. ⚡ Bulk scrape playlists/channels, export JSON/CSV for SEO, research, summarization & AI. 🔎 Perfect for repurposing and indexing.

ScrapAPI

Youtube Transcript Scraper

happitap/youtube-transcript-scraper

High-performance YouTube transcript scraper to fetch video transcripts, metadata, and channel content without an API key. Extract full transcripts, video metadata, and channel listings from YouTube with ease. Supports multiple languages, automatic fallback logic, and highconcurrency scraping.