Instagram Youtube Transcripts With Speaker Labels

Pricing

Pay per usage

Generate transcripts with speaker diarization from Instagram Reels & YouTube videos. Automatically identifies speakers, outputs SRT/VTT subtitles, timestamps & full text. Perfect for podcasts, interviews & meetings. Bulk processing supported.

Developer: Transcript Downloader (maintained by Community)

Last modified: 4 days ago

🎙️ Transcript Downloader - Transcripts with Speaker Labels

Generate transcripts with automatic speaker diarization (speaker labels) from previously downloaded Instagram or YouTube audio using the Transcript Downloader API. Perfect for interviews, podcasts, meetings, and multi-speaker content.

📚 API Documentation

For complete API reference, endpoint details, and advanced usage examples, visit our official documentation:

Transcript Downloader API Documentation

Get Your API Key • API Pricing

⚠️ Prerequisites

This actor requires a transcript_speaker_id from a previously downloaded audio file.

You must first use one of these actors to download audio and obtain the ID:

Instagram Audio Scraper - For Instagram reels and posts
YouTube Audio Scraper - For YouTube videos

The transcript_speaker_id is included in the audio download response.

✨ Features

🎯 Speaker diarization - Automatically identifies and labels different speakers
📝 Multiple output formats - Full JSON, plain text, SRT, or VTT subtitles
⏱️ Timestamps included - Each segment includes start time and duration
🌍 Language detection - Automatically detects the spoken language
📊 Speaker count - Reports the number of unique speakers detected
🔄 Bulk processing - Process multiple transcripts in a single run
💾 Optional file storage - Save SRT/VTT files to Apify key-value store
🕒 Polling logic with automatic retries
🧠 Progress tracking and run logs
🔐 Secure API token-based authentication

📧 Input Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| `transcriptSpeakerIds` | array | ✅ Yes | (none) | List of `transcript_speaker_id` values from audio download responses |
| `apiToken` | string | ✅ Yes | (none) | Bearer token for Transcript Downloader API |
| `outputFormat` | string | No | `full` | Output format: `full`, `text_only`, `srt`, or `vtt` |
| `maxWaitTime` | number | No | `10` | Maximum time to wait for transcription, in minutes (range: 1–15) |
| `pollingInterval` | number | No | `30` | Interval between status polls, in seconds (range: 30–300) |

📥 Example Input

{
  "transcriptSpeakerIds": [
    "01KB21QX05P6B4JA7FJHTM7AWE",
    "01KB22YZ06Q7C5KB8GLIUN8BWF"
  ],
  "apiToken": "your-api-token",
  "outputFormat": "full",
  "maxWaitTime": 10,
  "pollingInterval": 30
}
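The parameter table above can be turned into a quick pre-flight check. The following is a minimal sketch, not the actor's own validation code: the function name `validate_input` is hypothetical, and only the parameter names, defaults, and ranges come from the table.

```python
from typing import Any

ALLOWED_FORMATS = {"full", "text_only", "srt", "vtt"}


def validate_input(actor_input: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the input with documented defaults applied.

    Raises ValueError when a value falls outside the documented ranges.
    Hypothetical helper; the actor may validate differently.
    """
    ids = actor_input.get("transcriptSpeakerIds")
    if not isinstance(ids, list) or not ids:
        raise ValueError("transcriptSpeakerIds must be a non-empty list")
    if not all(isinstance(i, str) and i.strip() for i in ids):
        raise ValueError("every transcript_speaker_id must be a non-empty string")
    if not actor_input.get("apiToken"):
        raise ValueError("apiToken is required")

    output_format = actor_input.get("outputFormat", "full")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"outputFormat must be one of {sorted(ALLOWED_FORMATS)}")

    max_wait = actor_input.get("maxWaitTime", 10)
    if not 1 <= max_wait <= 15:
        raise ValueError("maxWaitTime must be between 1 and 15 minutes")

    interval = actor_input.get("pollingInterval", 30)
    if not 30 <= interval <= 300:
        raise ValueError("pollingInterval must be between 30 and 300 seconds")

    return {
        **actor_input,
        "outputFormat": output_format,
        "maxWaitTime": max_wait,
        "pollingInterval": interval,
    }
```

Running the check locally before starting a run catches out-of-range values early instead of after a failed run.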

📤 Output Format

Each transcript_speaker_id produces one output record. The fields it contains depend on outputFormat:

Full JSON (default)

Complete transcript with all metadata:

{
  "transcriptSpeakerId": "01KB21QX05P6B4JA7FJHTM7AWE",
  "status": "success",
  "mediaId": "ABC123xyz",
  "language": "en",
  "duration": 30.0,
  "speakerCount": 2,
  "cost": "0.030",
  "format": "full",
  "segments": [
    {
      "text": "Hello everyone, welcome to the show.",
      "start": 0.0,
      "duration": 2.5,
      "speaker": "Speaker 1"
    },
    {
      "text": "Thanks for having me.",
      "start": 2.5,
      "duration": 1.8,
      "speaker": "Speaker 2"
    }
  ],
  "fullText": "Speaker 1: Hello everyone, welcome to the show.\nSpeaker 2: Thanks for having me."
}
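As an illustration of how the fields in the full record relate to each other, the sketch below derives each segment's end time and rebuilds `fullText` from `segments`. It assumes `fullText` is one "speaker: text" line per segment, which is what the example record suggests; the helper names are hypothetical.

```python
record = {
    "segments": [
        {"text": "Hello everyone, welcome to the show.", "start": 0.0,
         "duration": 2.5, "speaker": "Speaker 1"},
        {"text": "Thanks for having me.", "start": 2.5,
         "duration": 1.8, "speaker": "Speaker 2"},
    ],
    "fullText": "Speaker 1: Hello everyone, welcome to the show.\n"
                "Speaker 2: Thanks for having me.",
}


def segment_end(segment: dict) -> float:
    """End time in seconds; the record stores start and duration only."""
    return round(segment["start"] + segment["duration"], 3)


def build_full_text(segments: list[dict]) -> str:
    """Join segments as 'Speaker: text' lines (assumed layout of fullText)."""
    return "\n".join(f"{s['speaker']}: {s['text']}" for s in segments)


for seg in record["segments"]:
    print(f"{seg['start']:.1f}s to {segment_end(seg):.1f}s  {seg['speaker']}")
```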

Plain Text (`text_only`)

Readable transcript grouped by speaker:

{
  "transcriptSpeakerId": "01KB21QX05P6B4JA7FJHTM7AWE",
  "status": "success",
  "format": "text_only",
  "content": "Speaker 1: Hello everyone, welcome to the show.\n\nSpeaker 2: Thanks for having me. It's great to be here."
}
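The grouping by speaker can be reproduced from a `full` record when both forms are needed from one run. This is a sketch under the assumption that consecutive segments from the same speaker are merged into one paragraph and paragraphs are separated by a blank line, as the `content` value above suggests; `to_plain_text` is a hypothetical name.

```python
def to_plain_text(segments: list[dict]) -> str:
    """Merge consecutive same-speaker segments into 'Speaker: text' paragraphs."""
    turns: list[tuple[str, list[str]]] = []
    for seg in segments:
        if turns and turns[-1][0] == seg["speaker"]:
            # Same speaker keeps talking: extend the current turn.
            turns[-1][1].append(seg["text"])
        else:
            turns.append((seg["speaker"], [seg["text"]]))
    return "\n\n".join(f"{speaker}: {' '.join(texts)}" for speaker, texts in turns)


segments = [
    {"speaker": "Speaker 1", "text": "Hello everyone, welcome to the show."},
    {"speaker": "Speaker 2", "text": "Thanks for having me."},
    {"speaker": "Speaker 2", "text": "It's great to be here."},
]
print(to_plain_text(segments))
```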

SRT Format (`srt`)

Standard subtitle format with speaker labels:

1
00:00:00,000 --> 00:00:02,500
[Speaker 1] Hello everyone, welcome to the show.

2
00:00:02,500 --> 00:00:04,300
[Speaker 2] Thanks for having me.
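For readers who already hold a `full` record, the SRT layout shown above can be generated locally. The sketch below is an assumption about how the conversion might look, not the actor's implementation; it relies only on the `start`, `duration`, `speaker`, and `text` fields documented for segments.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm (SRT uses a comma before milliseconds)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: list[dict]) -> str:
    """Render numbered SRT cues with a [Speaker] prefix on each line."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = srt_timestamp(seg["start"])
        end = srt_timestamp(seg["start"] + seg["duration"])
        blocks.append(f"{index}\n{start} --> {end}\n[{seg['speaker']}] {seg['text']}")
    return "\n\n".join(blocks) + "\n"


segments = [
    {"text": "Hello everyone, welcome to the show.", "start": 0.0,
     "duration": 2.5, "speaker": "Speaker 1"},
    {"text": "Thanks for having me.", "start": 2.5,
     "duration": 1.8, "speaker": "Speaker 2"},
]
print(to_srt(segments))
```

Working in integer milliseconds avoids floating-point drift when the start and duration are added.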

VTT Format (`vtt`)

WebVTT subtitle format with voice tags:

WEBVTT

1
00:00:00.000 --> 00:00:02.500
<v Speaker 1>Hello everyone, welcome to the show.

2
00:00:02.500 --> 00:00:04.300
<v Speaker 2>Thanks for having me.
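When the VTT output is consumed by a script rather than a video player, the speaker can be read back from the voice tag. The parser below is a deliberately small sketch that handles only the cue shape shown above (optional cue number, timing line, one `<v ...>` payload); it is not a complete WebVTT parser, and `parse_vtt_cues` is a hypothetical name.

```python
import re

VOICE_TAG = re.compile(r"^<v ([^>]+)>(.*)$")


def parse_vtt_cues(vtt_text: str) -> list[dict]:
    """Extract start, end, speaker, and text from simple WebVTT cues."""
    cues = []
    for block in vtt_text.strip().split("\n\n"):
        lines = block.splitlines()
        timing_index = next((i for i, line in enumerate(lines) if "-->" in line), None)
        if timing_index is None:
            continue  # The WEBVTT header block has no timing line.
        start, end = (part.strip() for part in lines[timing_index].split("-->"))
        payload = " ".join(lines[timing_index + 1:])
        match = VOICE_TAG.match(payload)
        speaker, text = (match.group(1), match.group(2)) if match else (None, payload)
        cues.append({"start": start, "end": end, "speaker": speaker, "text": text})
    return cues


sample = (
    "WEBVTT\n\n"
    "1\n00:00:00.000 --> 00:00:02.500\n<v Speaker 1>Hello everyone, welcome to the show.\n\n"
    "2\n00:00:02.500 --> 00:00:04.300\n<v Speaker 2>Thanks for having me.\n"
)
for cue in parse_vtt_cues(sample):
    print(cue["speaker"], cue["text"])
```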

📊 Special Response Types

No Speech Detected

When audio contains no recognizable speech:

{
  "transcriptSpeakerId": "01KB21QX05P6B4JA7FJHTM7AWE",
  "status": "no_speech",
  "message": "No speech detected in audio",
  "mediaId": "ABC123xyz",
  "duration": 0,
  "cost": "0.030"
}

Failed Response

{
  "transcriptSpeakerId": "01KB21QX05P6B4JA7FJHTM7AWE",
  "status": "failed",
  "error": "Invalid transcript_speaker_id or audio file not found"
}
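Downstream code usually needs to branch on `status` before touching transcript fields, because `no_speech` and `failed` records omit them. A minimal sketch of such a branch follows; the function name and message wording are illustrative, and only the three status values and their field names come from the examples above.

```python
def describe_record(record: dict) -> str:
    """Return a one-line description for any of the documented record types."""
    status = record.get("status")
    tid = record.get("transcriptSpeakerId", "<unknown>")
    if status == "success":
        return f"{tid}: transcript ready"
    if status == "no_speech":
        return f"{tid}: skipped ({record.get('message', 'no speech')})"
    if status == "failed":
        return f"{tid}: failed ({record.get('error', 'no error message')})"
    # Anything else is outside the documented set; surface it rather than guess.
    return f"{tid}: unexpected status {status!r}"


print(describe_record({
    "transcriptSpeakerId": "01KB21QX05P6B4JA7FJHTM7AWE",
    "status": "failed",
    "error": "Invalid transcript_speaker_id or audio file not found",
}))
```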

🚀 How to Use

1. Get your API token from Transcript Downloader
2. Run the Instagram Audio Scraper or YouTube Audio Scraper actor first
3. Copy the transcript_speaker_id from the audio download response
4. Add the ID(s) to this actor's input
5. Run the actor and access results in the dataset or key-value store

Example Workflow

Step 1: Run Instagram Audio Scraper
        ↓
        Response includes: "transcript_speaker_id": "01KB21QX05P6B4JA7FJHTM7AWE"
        ↓
Step 2: Run this actor with that ID
        ↓
        Get transcript with speaker labels
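The hand-off between the two steps can be scripted. The sketch below assumes each audio scraper result is a dictionary with a top-level `transcript_speaker_id` key, which is all the workflow above states; any other fields in those results, and the helper name `build_actor_input`, are assumptions.

```python
def build_actor_input(audio_items: list[dict], api_token: str,
                      output_format: str = "full") -> dict:
    """Collect transcript_speaker_id values into an input for this actor."""
    ids: list[str] = []
    for item in audio_items:
        tid = item.get("transcript_speaker_id")
        if tid and tid not in ids:  # skip missing IDs and duplicates
            ids.append(tid)
    if not ids:
        raise ValueError("no transcript_speaker_id found in the audio results")
    return {
        "transcriptSpeakerIds": ids,
        "apiToken": api_token,
        "outputFormat": output_format,
    }


audio_items = [
    {"transcript_speaker_id": "01KB21QX05P6B4JA7FJHTM7AWE"},
    {"transcript_speaker_id": "01KB21QX05P6B4JA7FJHTM7AWE"},  # duplicate
    {"note": "download failed, no ID"},
]
print(build_actor_input(audio_items, api_token="your-api-token"))
```

The resulting dictionary has the same shape as the example input and can be pasted into the actor's input form or passed through an API client.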

❌ Error Handling

The actor handles the following API status codes:

| Status Code | Description |
| --- | --- |
| `400` | Audio processing failed; verify audio was downloaded successfully |
| `401` | Insufficient credits or invalid token; check credits and API token |
| `403` | Invalid API key; check or regenerate key |
| `404` | Invalid ID or audio file not found; verify transcript_speaker_id |
| `429` | Too many requests; reduce polling frequency |
| `503` | Transcript Downloader API under maintenance |

Failed items are captured in the dataset with detailed error information.
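A caller that talks to the API directly might map these codes to a decision as sketched below. The hints restate the table; treating `429` and `503` as retryable and the rest as permanent is an assumption of this sketch, not documented behaviour, and `handle_status` is a hypothetical name.

```python
ERROR_HINTS = {
    400: "Audio processing failed; verify the audio was downloaded successfully",
    401: "Insufficient credits or invalid token; check credits and API token",
    403: "Invalid API key; check or regenerate the key",
    404: "Invalid ID or audio file not found; verify transcript_speaker_id",
    429: "Too many requests; reduce polling frequency",
    503: "Transcript Downloader API under maintenance",
}

# Assumption: only throttling and maintenance are worth retrying.
RETRYABLE_CODES = {429, 503}


def handle_status(code: int) -> tuple[bool, str]:
    """Return (should_retry, hint) for an HTTP status code."""
    hint = ERROR_HINTS.get(code, "Unexpected status code; check the run logs")
    return code in RETRYABLE_CODES, hint


for code in (404, 429):
    retry, hint = handle_status(code)
    print(code, "retry" if retry else "give up", "-", hint)
```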

⚠️ Rate Limiting

🔄 Max 75 requests per minute
⏱️ Keep the polling interval at 30 seconds or higher to avoid throttling
📊 Default polling interval of 30 seconds is recommended
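To make the interplay of `pollingInterval` and `maxWaitTime` concrete, here is a sketch of a polling loop. The `fetch_status` callable and the in-progress status value `"processing"` are hypothetical stand-ins, since the source does not document the API's intermediate states; the sleep and clock functions are injectable so the loop can be exercised without waiting.

```python
import time
from typing import Callable


def poll_until_done(
    fetch_status: Callable[[], dict],
    max_wait_minutes: float = 10,
    polling_interval_seconds: float = 30,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> dict:
    """Call fetch_status until it stops reporting 'processing' or time runs out."""
    deadline = clock() + max_wait_minutes * 60
    while True:
        result = fetch_status()
        if result.get("status") != "processing":  # hypothetical in-progress value
            return result
        if clock() + polling_interval_seconds > deadline:
            # Another wait would overshoot maxWaitTime, so stop here.
            return {"status": "failed", "error": "Timed out waiting for transcription"}
        sleep(polling_interval_seconds)
```

At the default 30-second interval a single transcript generates two requests per minute, far below the 75 requests per minute limit.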

⏱️ Processing Time & Performance

📊 Estimated processing time per transcript:
- Short audio (< 1 minute): ~30-60 seconds
- Medium audio (1-5 minutes): ~1-3 minutes
- Long audio (5-15 minutes): ~3-8 minutes
- Very long audio (15+ minutes): ~8-15 minutes
🔄 Batch processing: Sequential processing with 30s polling interval
⚡ First-time vs cached: First transcription takes longer; subsequent requests may be faster if cached
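The estimates above can guide the choice of `maxWaitTime`. The following sketch encodes the four duration buckets and suggests a wait time clamped to the documented 1–15 minute range. Assigning boundary values (exactly 1, 5, or 15 minutes) to the longer bucket is an assumption, as the source ranges overlap at those points, and both function names are hypothetical.

```python
import math

# (upper bound of audio length in seconds, estimated processing range in seconds)
ESTIMATES = [
    (60, (30, 60)),      # short audio, under 1 minute
    (300, (60, 180)),    # medium audio, 1-5 minutes
    (900, (180, 480)),   # long audio, 5-15 minutes
]
VERY_LONG_ESTIMATE = (480, 900)  # 15+ minutes


def estimate_processing_seconds(audio_seconds: float) -> tuple[int, int]:
    """Return the (low, high) processing estimate for a given audio length."""
    for upper_bound, estimate in ESTIMATES:
        if audio_seconds < upper_bound:
            return estimate
    return VERY_LONG_ESTIMATE


def suggest_max_wait_minutes(audio_seconds: float) -> int:
    """Round the high estimate up to whole minutes, clamped to 1-15."""
    _, high = estimate_processing_seconds(audio_seconds)
    return max(1, min(15, math.ceil(high / 60)))


for length in (30, 120, 600, 1200):
    print(length, "s audio ->", suggest_max_wait_minutes(length), "min maxWaitTime")
```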

💡 Best Practices

✅ Ensure audio download is complete before requesting transcript
⏳ Use appropriate polling intervals (30s recommended)
🔐 Keep your apiToken secret (never log it)
📊 Monitor for no_speech status on music-only content
🎯 Use srt or vtt format for video subtitles
📝 Use text_only for readable documents
🧠 Monitor output for incomplete or failed transcriptions
🗂️ SRT/VTT files are automatically saved to key-value store

💰 Pricing & Billing

The Transcript Downloader API used by this actor requires a valid API token. API usage is billed separately:

Transcription with speaker labels: ~$0.03 per transcript
Cost displayed: Exact cost shown in each response

📊 Very cost-effective for speaker-labeled transcription. View full details and subscription plans on our pricing page.
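Because every response carries its own `cost` as a string, the spend for a run can be totalled from the dataset. The sketch below uses `Decimal` to avoid floating-point rounding; it assumes records without a `cost` field (such as the failed example) were not charged, which the source does not state explicitly.

```python
from decimal import Decimal


def total_cost(records: list[dict]) -> Decimal:
    """Sum the 'cost' strings of all records that carry one."""
    return sum((Decimal(r["cost"]) for r in records if "cost" in r), Decimal("0"))


records = [
    {"status": "success", "cost": "0.030"},
    {"status": "no_speech", "cost": "0.030"},  # the no-speech example also lists a cost
    {"status": "failed"},                      # no cost field in the failed example
]
print(f"Run cost: ${total_cost(records)}")
```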

🎯 Use Cases

🎙️ Podcast transcription - Multi-host shows with speaker identification
📹 Interview processing - Separate interviewer and interviewee
📋 Meeting notes - Identify who said what
📺 Video subtitles - Generate SRT/VTT files with speaker labels
📊 Content analysis - Analyze speaking patterns and participation
♿ Accessibility - Create accessible transcripts for people who are deaf or hard of hearing
📝 Content repurposing - Convert audio content to written format
🔍 Research - Analyze conversations and dialogues
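As one example of the content-analysis use case, speaking time per speaker can be computed from the `segments` of a `full` record. This is an illustrative sketch; `speaking_time` is a hypothetical helper and uses only the documented `speaker` and `duration` fields.

```python
from collections import defaultdict


def speaking_time(segments: list[dict]) -> dict[str, float]:
    """Total seconds spoken per speaker label."""
    totals: dict[str, float] = defaultdict(float)
    for seg in segments:
        totals[seg["speaker"]] += seg["duration"]
    return dict(totals)


segments = [
    {"speaker": "Speaker 1", "duration": 2.5},
    {"speaker": "Speaker 2", "duration": 1.8},
]
for speaker, seconds in speaking_time(segments).items():
    print(f"{speaker}: {seconds:.1f}s")
```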

🔄 Integration with Other Actors

This actor works with the Transcript Downloader suite:

Instagram Audio Scraper → Download audio, get transcript_speaker_id
YouTube Audio Scraper → Download audio, get transcript_speaker_id
Transcripts with Speaker Labels (this actor) → Generate diarized transcript

Complete Workflow:

Instagram/YouTube URL → Audio Scraper → transcript_speaker_id → This Actor → Transcript with Speakers

📈 Monitoring & Analytics

Track performance and usage with Apify tools:

Run history
Success/failure rates
Storage and resource usage
Output file availability

Example completion log:

Transcript with Speaker Labels Actor completed {
  totalProcessed: 10,
  successful: 8,
  noSpeech: 1,
  failed: 1,
  successRate: '80.0%'
}
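The same summary can be recomputed from the dataset records, for instance when combining several runs. The sketch below mirrors the field names in the log above; how the actor itself computes them is not documented, so the formula for `successRate` (successful divided by total) is inferred from the example figures.

```python
from collections import Counter


def summarize_run(records: list[dict]) -> dict:
    """Count records by status and compute the success rate as a percentage."""
    counts = Counter(r.get("status") for r in records)
    total = len(records)
    successful = counts["success"]
    rate = f"{successful / total * 100:.1f}%" if total else "0.0%"
    return {
        "totalProcessed": total,
        "successful": successful,
        "noSpeech": counts["no_speech"],
        "failed": counts["failed"],
        "successRate": rate,
    }


records = (
    [{"status": "success"}] * 8
    + [{"status": "no_speech"}]
    + [{"status": "failed"}]
)
print(summarize_run(records))
```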

🙋 Support

Need help? Visit Transcript Downloader Support. We respond within 24 business hours.

For technical issues with this actor, check the run logs for detailed error messages.

📄 License

This actor is provided under the ISC License.

Made with ❤️ by Transcript Downloader | Website | API Dashboard
