Pricing

from $5.00 / 1,000 results

Instagram Transcript Scraper

Extract transcripts from Instagram videos and reels using auto-generated captions or AI-powered speech-to-text. Returns clean, timestamped transcript segments with full video metadata.

Pricing

from $5.00 / 1,000 results

Rating

4.2

(4)

Developer

Crawler Bros

Actor stats

Bookmarked

720

Total users

166

Monthly active users

0.35 hours

Issues response

6 hours ago

Last modified

What This Actor Does

The Instagram Transcript Scraper navigates to an Instagram video URL, downloads the audio, and converts it to text using one of two methods:

Native Captions — Reads Instagram's auto-generated captions directly from the API. Instant and zero extra cost when available.
Whisper AI — Uses OpenAI's Whisper speech-to-text model locally on the container. Works on any video with speech, regardless of whether Instagram generated captions.

The auto mode (default) tries native captions first and only falls back to Whisper when needed — giving you the best coverage at the lowest processing time.

Input Parameters

Field	Type	Required	Default	Description
`videoUrls`	array	Yes	—	One or more Instagram video/reel URLs to transcribe
`transcriptionMethod`	string	No	`auto`	`auto`, `native`, or `whisper` — see Transcription Methods
`whisperModel`	string	No	`base`	Whisper model size: `tiny`, `base`, or `small`
`language`	string	No	auto-detect	Language code (e.g. `en`, `es`, `fr`). Leave empty for automatic detection
`includeSegments`	boolean	No	`false`	When `true`, adds a `segments` array to each result with per-phrase timestamps and text. Incurs an additional charge per run

Supported URL Formats

https://www.instagram.com/reel/ABC123xyz/
https://www.instagram.com/p/ABC123xyz/
https://www.instagram.com/tv/ABC123xyz/

Input Example

{
  "videoUrls": [
    "https://www.instagram.com/reel/DV29mBcMQwp/",
    "https://www.instagram.com/p/DULBkEngpxg/"
  ],
  "transcriptionMethod": "auto",
  "whisperModel": "base",
  "language": "",
  "includeSegments": false
}

Output Format

One dataset item is returned per video URL. Video metadata sits at the root level; all transcript segments are nested inside a segments array within that single item.

On error (private post, deleted video, unsupported media type), the actor emits a single item with postUrl, errMsg, and createdAt only — no empty fields.

Output Fields

Root level — always present

Field	Type	Description
`postUrl`	string	Canonical Instagram URL of the video
`shortCode`	string	Instagram shortcode (unique identifier)
`pk`	string	Instagram internal numeric media ID
`id`	string	Combined media ID (pk_userId format)
`postDescription`	string	Video caption text
`thumbnailUrl`	string	Thumbnail image URL
`videoUrl`	string	Direct URL to the highest-quality MP4
`pubDate`	string	Post creation time in UTC ISO 8601 format (e.g. `2025-05-10T14:32:03Z`)
`likeCount`	integer	Number of likes
`commentCount`	integer	Number of comments
`userId`	string	Creator's numeric Instagram user ID
`userName`	string	Creator's Instagram handle
`userFullName`	string	Creator's display name
`avatarUri`	string	Creator's profile picture URL
`fullText`	string	Complete transcript of the entire video as a single string
`transcriptionMethod`	string	`native` or `whisper`
`createdAt`	string	UTC ISO timestamp of when this record was scraped

Root level — conditionally present

Field	Type	Description
`audioUrl`	string	Direct URL to the audio-only track (omitted when unavailable)
`segments`	array	Timestamped transcript segments — only present when `includeSegments: true`
`errMsg`	string	Error description (only present on failed records)

Each object in segments (only when includeSegments: true)

Field	Type	Description
`index`	integer	Zero-based position of this segment
`start`	number	Segment start time in seconds
`end`	number	Segment end time in seconds
`text`	string	Transcript text for this segment

Output Example — Success (`includeSegments: false`)

{
  "postUrl": "https://www.instagram.com/p/DV29mBcMQwp/",
  "shortCode": "DV29mBcMQwp",
  "pk": "3852537424986049577",
  "id": "3852537424986049577_16278726",
  "postDescription": "On Friday, US President Donald Trump claimed Iran's air force is \"no longer\"...",
  "thumbnailUrl": "https://scontent-iad6-1.cdninstagram.com/...",
  "videoUrl": "https://scontent-iad6-1.cdninstagram.com/...mp4",
  "pubDate": "2025-05-11T05:12:03Z",
  "likeCount": 16799,
  "commentCount": 1159,
  "userId": "16278726",
  "userName": "bbcnews",
  "userFullName": "BBC News",
  "avatarUri": "https://scontent-iad3-1.cdninstagram.com/...",
  "fullText": "On Friday, U.S. President Donald Trump claimed Iran's Air Force is no longer...",
  "transcriptionMethod": "whisper",
  "createdAt": "2026-06-08T06:09:05.841Z"
}

Output Example — Success (`includeSegments: true`)

{
  "postUrl": "https://www.instagram.com/p/DV29mBcMQwp/",
  "shortCode": "DV29mBcMQwp",
  "fullText": "On Friday, U.S. President Donald Trump claimed Iran's Air Force is no longer...",
  "transcriptionMethod": "whisper",
  "createdAt": "2026-06-08T06:09:05.841Z",
  "segments": [
    {
      "index": 0,
      "start": 0,
      "end": 4.36,
      "text": "On Friday, U.S. President Donald Trump claimed Iran's Air Force is no longer, as a result"
    },
    {
      "index": 1,
      "start": 4.36,
      "end": 8.12,
      "text": "of military action. This follows video released by the U.S. on Thursday..."
    }
  ]
}

Output Example — Error

{
  "postUrl": "https://www.instagram.com/reel/DELETED123/",
  "errMsg": "Could not extract video data from page. The post may not exist or may not be a video.",
  "createdAt": "2026-06-08T06:09:15.168Z"
}

Transcription Methods

Auto (Recommended)

Tries Instagram's native captions first. Falls back to Whisper AI automatically when native captions are unavailable. Best balance of speed and coverage for most workloads.

Native

Only reads Instagram's built-in auto-generated captions. Fastest option — no audio download needed. May not be available on older posts, non-Reel content, or videos without speech.

Whisper AI

Always downloads the video and runs local AI speech-to-text. Consistent coverage for any video with speech, independent of Instagram's captioning availability.

Model comparison:

Model	Size	Speed	Accuracy	Best For
`tiny`	39 MB	Fastest	Basic	Quick previews, speed-critical pipelines
`base`	74 MB	Fast	Good	Most use cases
`small`	244 MB	Moderate	Very good	Accented speech, technical or specialized content

Use Cases

AI agents & LLM pipelines — Feed Instagram video speech into RAG systems, summarizers, or classifiers
Content research — Extract and analyze what creators are saying across a topic or niche
Social media monitoring — Capture spoken claims in video content for brand or news tracking
Subtitle generation — Generate timestamped captions for repurposed video content
Competitive intelligence — Batch-transcribe competitor or industry video content
Accessibility — Build searchable archives of spoken video content

Supported Languages

The Whisper model supports 99+ languages with automatic detection. For best accuracy on non-English content, set the language field explicitly. Supported dropdown options include:

English, Spanish, French, German, Italian, Portuguese, Russian, Chinese, Japanese, Korean, Arabic, Hindi, Dutch, Polish, Turkish, Swedish, Danish, Finnish, Norwegian, Czech, Romanian, Hungarian, Indonesian, Vietnamese, Thai, Ukrainian, Hebrew, Persian, Malay, and more.

Limitations

Public videos only — Private accounts, restricted posts, and login-gated content cannot be scraped.
Audio quality matters — Whisper accuracy degrades on videos with heavy background music, multiple overlapping speakers, or very low recording quality.
Native captions not always available — Instagram doesn't generate captions for all videos. Short clips, older posts, or posts with music-only audio may have no native captions; the actor falls back to Whisper automatically in auto mode.
Instagram CDN URLs expire — The videoUrl, audioUrl, thumbnailUrl, and avatarUri URLs in the output are time-limited CDN links. Download media promptly; do not rely on these URLs as permanent storage.
Rate limiting — Processing many videos in rapid succession may trigger temporary rate limits. The actor includes automatic delays between requests.

FAQ

Q: Does this work with Instagram Reels, regular video posts, and IGTV?
A: Yes. All three URL formats (/reel/, /p/, /tv/) are supported.

Q: What if the video has no speech (music only or silent)?
A: Whisper will return an empty or near-empty transcript. Native captions won't exist. The actor returns a single row with empty fullText and segmentText.

Q: Can I process multiple videos in one run?
A: Yes. Provide as many URLs as you need in the videoUrls array. The actor processes them sequentially with automatic pauses between requests.

Q: Why does my run return "errMsg": "Could not extract video data..."?
A: The post is likely private, deleted, or restricted for your region. Verify the post is publicly accessible in a browser without being logged in.

Q: How accurate is Whisper transcription?
A: The base model is accurate for clear, single-speaker English audio. For accented speech, fast speech, or specialized vocabulary, use small. For multilingual content, setting the language field explicitly improves results.

Q: Are the video/audio URLs in the output permanent?
A: No. Instagram CDN URLs are signed and expire within hours or days. Download the media during the run if you need to store it.

Q: What languages does auto-detection support?
A: Whisper's automatic language detection covers 99+ languages. Detection is most reliable for videos with at least 30 seconds of speech.

Q: Does this require Instagram login or cookies?
A: The actor uses a shared cookie pool for Instagram session access. No credentials are required from the user.

Other Instagram Scrapers

Want to get other data from Instagram? Check out our complete suite of Instagram scrapers:

Actor	Description
Instagram Profile Scraper	Extract profile data, bio, follower counts, and more
Instagram Followers & Following Scraper	Scrape followers and following lists from any profile
Instagram Tagged Posts Scraper	Collect posts where a user has been tagged
Instagram Hashtag Scraper	Scrape posts and profiles by hashtag
Instagram Story Downloader	Download stories from Instagram profiles
Instagram Downloader API	Download photos, videos, and reels from Instagram
Instagram Keyword Scraper	Search and scrape posts by keyword
Instagram Keyword Search Scraper	Search Instagram accounts and posts by keyword
Instagram Comment Scraper	Scrape comments and replies from Instagram posts

Instagram Reel AI Transcript Extractor

linen_snack/instagram-reel-transcript-ai-extractor

Extract word-perfect transcripts from Instagram Reels with AI-powered sentiment analysis, entity detection, SRT/VTT subtitle export, and full channel scraping. 10 free reels included.

ius iyb

256

Instagram Transcript API – AI Video to Text for Developers

apple_yang/instagram-transcripts-scraper

Instagram Reels Transcript API for converting video audio into accurate text using AI. Extract transcripts, spoken content, and metadata from public Reels and videos. Fast, reliable, and built for developers, AI agents, and automation workflows.

APISmith

1.1K

4.8

Instagram Transcript Extractor

bulletproof/instagram-transcript-extractor

📸 Convert any Instagram Reel, IGTV, or video post to text. Extract transcripts and subtitles with timestamps. Outputs JSON, SRT, or plain text. Auto-captions + speech-to-text fallback. 14+ languages. No login needed.

Zero Downtime

488

5.0

Video Transcript

agentx/video-transcript

Universal video-to-text API across YouTube, TikTok, Instagram, X, Facebook, Vimeo and 1000+ platforms. Returns the full transcript as timestamped segments with the source video metadata, optionally translated into 100+ target languages — one endpoint replacing per-platform transcription stacks.

AgentX

730

4.1

Instagram Reels Scraper

scrapio/instagram-reels-scraper

Scrapes Reels from Instagram by profile, hashtag, or explore feed, capturing video URLs, captions, hashtags, audio details, thumbnails, views, likes, comments, and timestamps. Ideal for trend research, influencer analysis, and large-scale Reels data extraction

Scrapio

Instagram Video Transcript

truefetch/instagram-video-transcript

AI-transcribe any Instagram reel, story, or video — timestamped captions, speaker diarization, and translation into 100+ languages from a single pasted link. $0.30 per video.

TrueFetch

167

5.0

Instagram AI Transcript Extractor

sian.agency/instagram-ai-transcript-extractor

Instagram Transcript Generator — 🎬 AI Reel Transcription | 🗣️ Speaker Diarization | 🌍 Language Detection | 📊 30+ Metrics | 💰 Best Price. Extract entire channels with word-perfect transcripts and speaker identification. Try 5 reels free!

SIÁN OÜ

2.4K

4.0

Instagram Reels Transcript Scraper (No Login)

makework36/instagram-reels-transcript-scraper

Scrape Instagram reels from any public profile and get AI transcripts — no login, no cookies. HTTP-pure via mirror + Groq Whisper.

deusex machine

Video Subtitle & Caption Extractor

khadinakbar/video-subtitle-extractor

Extract subtitles, captions, and AI transcripts from any video URL across 1000+ platforms (YouTube, Vimeo, TikTok, Instagram, X/Twitter, Facebook, Twitch, TED, Bilibili). Native captions first, Whisper AI fallback when none. JSON, SRT, VTT, text, or LLM-ready markdown.