🎬 YouTube Transcript & Metadata Extractor avatar
🎬 YouTube Transcript & Metadata Extractor

Pricing

Pay per event

Go to Apify Store
🎬 YouTube Transcript & Metadata Extractor

🎬 YouTube Transcript & Metadata Extractor

Developed by

dz_omar

dz_omar

Maintained by Community

πŸŽ₯Extract complete transcripts with precise timestamps ⏱️ and comprehensive video metadata from any YouTube video β€”> ⚑Fast, reliable, and ready to use.

4.9 (16)

Pricing

Pay per event

54

419

104

Issues response

5.2 hours

Last modified

3 days ago

Extract complete transcripts with precise timestamps and comprehensive video metadata from any YouTube video - fast, reliable, and built for scale.

This powerful Apify actor provides professional-grade extraction of YouTube transcripts and video information. Perfect for developers, researchers, content creators, and businesses who need accurate, structured video data for analysis, automation, or content repurposing.

YouTube Transcript Extractor


✨ Key Features

🎯 Complete Data Extraction

  • Full Transcripts with second-by-second precision timestamps
  • Rich Video Metadata including title, views, likes, and publication date
  • Detailed Channel Information with name, ID, subscribers, and verification status
  • Video Analytics including word count and estimated duration

⚑ Performance & Reliability

  • Lightning Fast - Optimized crawler for maximum speed
  • Smart Caching - Avoid re-processing with built-in cache system
  • Proxy Support - Optional proxy configuration for enhanced reliability
  • Batch Processing - Handle multiple videos in a single run

πŸ› οΈ Developer-Friendly

  • Clean JSON Output - Ready for any data pipeline or API integration
  • Flexible Configuration - Control timestamps, caching, and more
  • Multiple Views - Pre-configured data views for different use cases
  • Well-Documented - Clear examples and comprehensive documentation

How It Works

  1. Input YouTube URLs - Provide one or more video links
  2. Configure Options - Choose timestamp inclusion and caching preferences
  3. Run the Actor - Fast, automated extraction begins
  4. Get Structured Results - Download complete data in JSON, CSV, or Excel format

πŸ“₯ Input Configuration

Simple Input Example

{
"youtubeUrl": [
{ "url": "https://www.youtube.com/watch?v=nC8ilIMH8sk" },
{ "url": "https://youtu.be/tpHZYImuhZg" }
]
}

Input Parameters

ParameterTypeDefaultDescription
youtubeUrlarrayβ€”List of YouTube video URLs (supports all formats: youtube.com, youtu.be, shorts, live)
proxyConfigobjectβ€”Optional proxy configuration for enhanced reliability

Supported URL Formats

  • βœ… https://www.youtube.com/watch?v=VIDEO_ID
  • βœ… https://youtu.be/VIDEO_ID
  • βœ… https://www.youtube.com/shorts/VIDEO_ID
  • βœ… https://www.youtube.com/live/VIDEO_ID

πŸ“€ Complete Output Structure

Each video produces a comprehensive JSON object with all available data:

{
"videoId": "nC8ilIMH8sk",
"VideoURL": "https://youtu.be/nC8ilIMH8sk",
"embedUrl": "https://www.youtube.com/embed/nC8ilIMH8sk",
"Video_title": "New ways to monetize: Pay per event and tiered pricing",
"published_Date": "Streamed live on Mar 20, 2025",
"Views": "492 views",
"likes": "20",
"channel": {
"name": "Apify",
"id": "UCTgwcoeGGKmZ3zzCXN2qo_A",
"url": "https://www.youtube.com/channel/UCTgwcoeGGKmZ3zzCXN2qo_A",
"subscribers": "10.3K subscribers",
"verified": false
},
"thumbnail": "https://i.ytimg.com/vi/nC8ilIMH8sk/default.jpg",
"Description": "Join our webinar where we'll explore Apify's latest pricing updates...",
"hasTranscript": true,
"transcriptText": "Hello everyone. I think we have already live 10 viewers...",
"timestamps": [
{
"time": "2:28",
"text": "Hello everyone. I think we have already live 10 viewers. I might"
},
{
"time": "2:36",
"text": "wait for one or two more minutes before starting..."
}
],
"transcriptWordCount": 10080,
"transcriptDuration": "Approx 3045 minutes"
}

Output Fields Explained

Video Information

  • videoId - Unique YouTube video identifier
  • VideoURL - Direct link to the video
  • embedUrl - Embeddable video URL
  • Video_title - Full video title
  • published_Date - Publication or stream date
  • Views - View count
  • likes - Like count
  • thumbnail - Video thumbnail image URL
  • Description - Full video description

Channel Information

  • channel.name - Channel name
  • channel.id - Unique channel identifier
  • channel.url - Direct channel link
  • channel.subscribers - Subscriber count
  • channel.verified - Verification status badge

Transcript Data

  • hasTranscript - Boolean indicating transcript availability
  • transcriptText - Complete cleaned transcript as continuous text
  • timestamps - Array of time-stamped transcript segments
  • transcriptWordCount - Total word count in transcript
  • transcriptDuration - Estimated transcript duration

πŸ“Š Pre-Configured Data Views

1. πŸ“Š Full Video Metadata & Transcripts

Complete dataset with all fields - perfect for comprehensive analysis.

Fields: All video metadata, channel info, transcript, timestamps, analytics

2. 🎀 Pure Transcript Results

Focused view for transcript-only needs.

Fields: Thumbnail, URL, title, transcript text, timestamps, word count, duration

3. πŸ“Ί Channels Overview

Channel-focused view for creator analysis.

Fields: Video info, channel name, ID, URL, subscribers, verification status


πŸ’‘ Professional Use Cases

🎨 Content Creation & Marketing

  • Content Repurposing - Transform videos into blogs, social media posts, or newsletters
  • SEO Optimization - Extract text content for better search engine indexing
  • Subtitle Generation - Create or verify subtitle files for accessibility
  • Quote Extraction - Find and highlight key moments for marketing materials

πŸ“š Research & Education

  • Academic Research - Analyze lectures, interviews, and educational content at scale
  • Sentiment Analysis - Process video content for NLP and sentiment research
  • Language Learning - Create study materials from foreign language videos
  • Content Analysis - Study speaking patterns, topics, and engagement metrics

πŸ€– Automation & Integration

  • AI/ML Training Data - Feed structured transcripts into machine learning models
  • Chatbot Training - Use video content to train conversational AI
  • API Integration - Seamlessly integrate with your existing data pipelines
  • Automated Workflows - Trigger actions based on new video content

πŸ“ˆ Business Intelligence

  • Competitor Analysis - Monitor and analyze competitor video content
  • Brand Monitoring - Track mentions and sentiment in video content
  • Market Research - Extract insights from industry thought leaders
  • Customer Feedback - Analyze product reviews and testimonials from videos

βš™οΈ Technical Advantages

Built for Performance

  • Optimized Crawler - Uses CheerioCrawler for maximum speed and efficiency
  • Parallel Processing - Handle multiple videos simultaneously
  • Smart Deduplication - Automatically skip duplicate URLs
  • Efficient Memory Usage - Configurable memory limits (128MB - 512MB)

πŸ”’ Reliable & Robust

  • Error Handling - Graceful handling of missing transcripts or blocked videos
  • Retry Logic - Automatic retry on temporary failures
  • Cache System - Persistent storage of processed videos
  • Proxy Support - Optional proxy configuration for enhanced reliability

🌍 Universal Compatibility

  • Multi-language Support - Works with all languages supported by YouTube
  • All Video Types - Standard videos, shorts, live streams, premieres
  • Export Formats - JSON, CSV, Excel, or direct API access
  • Apify Ecosystem - Seamlessly integrates with other Apify actors

πŸ”§ Advanced Configuration

Using Proxy (Optional)

For enhanced reliability and to avoid rate limiting:

{
"youtubeUrl": [
{ "url": "https://www.youtube.com/watch?v=VIDEO_ID" }
],
"proxyConfig": {
"useApifyProxy": true
}
}

πŸ“ˆ Performance Metrics

  • Speed: Processes typical video in 5-10 seconds
  • Accuracy: Extracts official YouTube transcripts with 100% fidelity
  • Reliability: Built-in retry logic and error handling
  • Scalability: Handle from single videos to large batch operations

This actor extracts publicly available transcript and metadata from YouTube. All extracted data is information that YouTube makes accessible through normal viewing.

Important: Please ensure your use complies with:

  • YouTube's Terms of Service
  • Applicable copyright laws
  • Data protection regulations (GDPR, etc.)
  • Your specific jurisdiction's laws

🎯 Why Choose This Extractor?

βœ… Comprehensive Data - Get both transcripts AND complete metadata in one run
βœ… Production-Ready - Battle-tested reliability for professional applications
βœ… Developer-Friendly - Clean JSON output, multiple export formats, easy integration
βœ… Cost-Effective - Efficient processing minimizes compute costs
βœ… Well-Maintained - Regular updates and responsive support
βœ… Apify Platform - Leverage the full power of Apify's infrastructure


🚦 Getting Started

  1. Sign up for a free Apify account
  2. Configure input with your YouTube URLs
  3. Run the actor - It's that simple!
  4. Download results in your preferred format

No complex setup, no hidden fees, no surprises. Just reliable data extraction.


πŸ’¬ Support & Contact

Need help or have questions? We're here for you: