Instagram Reels Transcript Scraper (No Login)
Pricing
Pay per event
Instagram Reels Transcript Scraper (No Login)
Scrape Instagram reels from any public profile and get AI transcripts — no login, no cookies. HTTP-pure via mirror + Groq Whisper.
Pricing
Pay per event
Rating
0.0
(0)
Developer
deusex machine
Maintained by CommunityActor stats
0
Bookmarked
4
Total users
2
Monthly active users
6 days ago
Last modified
Categories
Share
Instagram Reels Transcript Scraper — Captions, AI Transcripts, Video URLs & Metadata (No Login)
⭐ Useful? Leave a review — it takes 10 seconds and is the single biggest thing that helps other content researchers, social media marketers, agencies and short-form video tool builders find this Instagram reels transcript scraper.
A fast, production-grade Instagram reels scraper with built-in AI transcription. Give it any public Instagram username and it pulls the profile's newest reels and videos, then transcribes the spoken audio of each one into clean text. For every reel you get the post URL, direct MP4 video URL, thumbnail, caption, like and comment counts, post date — and a full AI-generated transcript with automatic language detection.
No login, no cookies, no Instagram account, no password. The scraper never touches your account, so there is zero risk of a ban or a lockout. Pass an array of usernames like ["natgeo", "nasa"] and get back clean JSON, CSV or Excel ready to drop into a content dashboard, a repurposing pipeline, a brand-monitoring tool or a research dataset.
Built for content researchers, social media marketers, agencies repurposing short-form video into blogs and newsletters, accessibility teams generating captions, competitive-intelligence analysts, creator-economy tools and anyone doing Instagram reels research at scale.
✨ Why use this scraper
Most Instagram scrapers require your session cookies or a logged-in account — which puts your real Instagram profile at risk of rate-limits, checkpoints and bans. This actor takes a completely different route: it reads public profile data through a no-login mirror and pulls the reel's MP4 straight from Instagram's open CDN, then runs the audio through Whisper (via Groq's whisper-large-v3-turbo) for a transcript.
That means:
- Zero account risk — nothing is ever logged in. Your Instagram profile is never used, so it can't be flagged.
- Spoken-word transcripts, not just captions — the post caption is rarely the same as what's actually said in the reel. This actor gives you both.
- Automatic language detection — Whisper detects the spoken language per reel, or you can force a language code.
- HTTP-pure enumeration — profile reels are pulled from a JSON endpoint, so listing is fast and cheap.
- Direct CDN downloads — video bytes come straight from Instagram's CDN (no proxy bottleneck), so transcription stays fast.
- Metadata-only mode — turn transcription off to get reels + video URLs + captions + engagement at a fraction of the cost.
Non-speech reels (pure music, ambient nature footage) are detected and returned with an empty transcript — and you are not charged for transcribing them.
📤 Output fields
One item per reel, flat JSON. Maps cleanly to a BigQuery schema, a Postgres table or a Google Sheet.
Reel metadata
| Field | Type | Description |
|---|---|---|
username | string | The profile the reel belongs to |
shortcode | string | Instagram post identifier |
url | string | Direct link to the post on instagram.com |
type | string | Media type reported by Instagram |
isVideo | boolean | true for reels / videos |
videoUrl | string | Direct MP4 URL on Instagram's CDN |
thumbnailUrl | string | Cover image URL |
caption | string | Post caption (HTML-decoded, emojis preserved) |
takenAt | string | Human-readable post date (e.g. "2 days ago") |
timestamp | integer | Unix timestamp of the post |
Engagement metrics
| Field | Type | Description |
|---|---|---|
likes | string/int | Like count (e.g. 15300 or "15.3k") |
comments | string/int | Comment count |
Transcript
| Field | Type | Description |
|---|---|---|
transcript | string | AI transcript of the spoken audio (null if the reel has no speech) |
transcriptLanguage | string | Detected language, e.g. "English" (null when no speech) |
transcriptError | string | Populated only if a reel failed to download or transcribe |
🎯 Use cases
- Content repurposing — turn a creator's reels into blog posts, newsletters, X threads or LinkedIn carousels. The transcript is your raw script.
- Competitive research — pull a competitor's reels weekly, read what they actually say, and benchmark caption vs spoken messaging.
- Accessibility & subtitles — generate transcripts and captions for reels that ship without them.
- SEO & content mining — extract spoken keywords and topics from a niche's top reels to inform your own content calendar.
- Sentiment & trend analysis — feed transcripts into OpenAI, Claude or a local Llama to cluster themes, hooks and messaging across a category.
- Training data — short-form spoken-language transcripts in many languages are valuable fine-tuning and search data.
- Brand monitoring — track what sponsored creators say about your brand in video, not just in the caption.
- Lead research for agencies — audit a prospect's Instagram presence (posting cadence, engagement, messaging) before a pitch.
🚀 How to use
Mode 1 — reels + transcripts from one profile
{"profiles": ["natgeo"],"maxPostsPerProfile": 30,"transcribe": true}
Mode 2 — metadata only (cheapest, no transcription)
{"profiles": ["nasa"],"maxPostsPerProfile": 50,"transcribe": false}
Mode 3 — multiple profiles in one run
{"profiles": ["natgeo", "nasa", "bbcearth"],"maxPostsPerProfile": 20,"transcribe": true}
Mode 4 — force a transcript language
{"profiles": ["selena"],"maxPostsPerProfile": 15,"transcribe": true,"language": "es"}
Mode 5 — a blocked profile via residential proxy
{"profiles": ["some_private_looking_profile"],"maxPostsPerProfile": 25,"transcribe": true,"proxyGroup": "residential"}
📥 Input
| Parameter | Type | Default | Description |
|---|---|---|---|
profiles | array | [] | Public usernames or profile URLs (natgeo, @nasa, https://www.instagram.com/instagram/) — required |
maxPostsPerProfile | integer | 30 | Max reels per profile, newest first (1–1000) |
onlyVideos | boolean | true | Skip photo posts; only return videos (required for transcripts) |
transcribe | boolean | true | Generate an AI transcript per reel. Turn off for metadata-only (cheaper, faster) |
language | string | — | ISO-639-1 code (en, es, pt…) to force the transcript language. Empty = auto-detect |
proxyGroup | string | datacenter | datacenter (default, cheapest) or residential for profiles that get blocked |
📋 Output example
{"username": "natgeo","shortcode": "C8xKq2pN3aZ","url": "https://www.instagram.com/p/C8xKq2pN3aZ/","type": "video","isVideo": true,"videoUrl": "https://scontent.cdninstagram.com/o1/v/t2/.../reel.mp4","thumbnailUrl": "https://scontent.cdninstagram.com/.../cover.jpg","caption": "Keep exploring! ABC News and National Geographic are taking you...","likes": "15.3k","comments": 210,"takenAt": "2 days ago","timestamp": 1749250000,"transcript": "Can you guess where we are? We're about to unveil the seven natural wonders of America. From a geologist's standpoint, globally, this is geology in action...","transcriptLanguage": "English","transcriptError": null}
Export the dataset as JSON, CSV, Excel, XML, JSONL or HTML table directly from the Apify console, or via the Apify API.
💻 Code examples
curl
curl -X POST 'https://api.apify.com/v2/acts/makework36~instagram-reels-transcript-scraper/run-sync-get-dataset-items?token=YOUR_TOKEN' \-H 'Content-Type: application/json' \-d '{"profiles": ["natgeo"],"maxPostsPerProfile": 30,"transcribe": true}'
Python (apify-client)
from apify_client import ApifyClientclient = ApifyClient("YOUR_APIFY_TOKEN")run = client.actor("makework36/instagram-reels-transcript-scraper").call(run_input={"profiles": ["natgeo", "nasa"],"maxPostsPerProfile": 30,"transcribe": True,})for reel in client.dataset(run["defaultDatasetId"]).iterate_items():if reel.get("transcript"):print(f"{reel['shortcode']} [{reel['transcriptLanguage']}] {reel['transcript'][:80]}")
Node.js (apify-client)
import { ApifyClient } from 'apify-client';const client = new ApifyClient({ token: 'YOUR_APIFY_TOKEN' });const run = await client.actor('makework36/instagram-reels-transcript-scraper').call({profiles: ['nasa'],maxPostsPerProfile: 20,transcribe: true,});const { items } = await client.dataset(run.defaultDatasetId).listItems();console.table(items.map((r) => ({code: r.shortcode, likes: r.likes, lang: r.transcriptLanguage,})));
Export to CSV
From the run page → Export → CSV. Or via API:
https://api.apify.com/v2/datasets/DATASET_ID/items?format=csv&token=YOUR_TOKEN
Repurpose a reel into a blog post
from apify_client import ApifyClientimport openaiclient = ApifyClient("YOUR_APIFY_TOKEN")oa = openai.OpenAI()run = client.actor("makework36/instagram-reels-transcript-scraper").call(run_input={"profiles": ["my_competitor"],"maxPostsPerProfile": 10,"transcribe": True,})for reel in client.dataset(run["defaultDatasetId"]).iterate_items():if not reel.get("transcript"):continueresp = oa.chat.completions.create(model="gpt-4o-mini",messages=[{"role": "user", "content": f"Turn this reel transcript into a 200-word blog intro:\n{reel['transcript']}"}],)print(reel["url"], "\n", resp.choices[0].message.content, "\n")
Zapier / Make.com no-code
Both Zapier and Make have an official Apify integration. Trigger on a schedule, action "Run Apify Actor", pass this actor's name and your input JSON, then send the output into a Google Sheet, Airtable, Slack alert or Notion row.
⚡ Performance
- ~1 second to download each reel's MP4 (direct from Instagram's CDN, no proxy)
- ~5–8 seconds per reel end-to-end with transcription (download + audio extraction + Whisper)
- Metadata-only mode lists ~12 reels per profile page in a single request — near-instant
- Up to 1,000 reels per profile — raise
maxPostsPerProfileas needed - Non-speech reels are detected and skipped for transcription automatically
📊 Comparison
Several Instagram scrapers live on the Apify Store. Here's how this one stacks up on what actually matters for short-form video research.
| Feature | This scraper | Alternative A | Alternative B | Alternative C |
|---|---|---|---|---|
| Needs login / cookies | No | Yes | Yes | Sometimes |
| Risk to your account | None | High | High | Medium |
| AI transcript of spoken audio | Built-in (Whisper) | No | Add-on ($) | No |
| Auto language detection | Yes | No | Partial | No |
| Direct MP4 video URL | Yes | Yes | No | Yes |
| Caption + likes + comments | Yes | Yes | Yes | Partial |
| Metadata-only mode (cheaper) | Yes | No | No | Partial |
| Multiple profiles per run | Yes (array) | Yes | Single | Yes |
| Max reels per profile | 1,000 | 200 | 100 | 500 |
| Code examples in README | Python / Node / curl / OpenAI | None | None | None |
The honest take: if you only need post URLs and never care about what's actually said in the video, a basic metadata scraper is fine. If you want the transcript — the real script of each reel — without ever logging in, this scraper bundles it all in one pass.
💵 Pricing
Pay-per-event — you only pay for what you pull, with no subscription and no monthly minimum.
| Event | Price | What you get |
|---|---|---|
| Actor start | $0.005 per run | Pipeline initialization (once per run) |
| Reel scraped | $0.003 per reel | Full metadata: caption, likes, comments, video URL, thumbnail |
| AI transcript | $0.012 per reel | Whisper transcript — only charged when the reel actually has speech |
A reel with a transcript = ~$0.015 all-in (scrape + transcript). Metadata-only (transcribe: false) = $0.003 per reel.
| Volume | Metadata only | With transcripts |
|---|---|---|
| 1,000 reels | ~$3 | ~$15 |
| 10,000 reels | ~$30 | ~$150 |
Apify gives every new user a $5 free trial credit — enough to transcribe ~330 reels while you evaluate. You're never charged for transcribing reels that have no speech (music-only / ambient).
❓ FAQ
Do I need an Instagram account or login? No. The scraper never logs in and never uses cookies, so your account is never at risk. It only works with public profiles.
Will this get my Instagram account banned? No — because your account is never used. Nothing about your real profile is touched.
How accurate are the transcripts?
Transcription uses OpenAI's Whisper (large-v3-turbo), one of the best speech-to-text models available. Accuracy depends on audio clarity. Reels with background music over speech transcribe less cleanly than clear narration.
Why is transcript null on some reels?
That reel has no spoken words — it's music-only or ambient footage. You are not charged for transcribing those.
Can I transcribe in a specific language?
Yes. Set language to an ISO code like es, pt or fr to force it. Leave it empty for automatic detection.
Can I get reels without transcribing them?
Yes. Set transcribe to false to get metadata, captions and video URLs only, at a much lower price.
Can I download the actual video files?
Each item includes a direct videoUrl to the MP4 on Instagram's CDN, which you can download yourself.
A profile returns nothing — what do I do?
Switch proxyGroup to residential. Some profiles or networks block datacenter traffic; residential IPs usually get through.
Is scraping Instagram legal? Scraping publicly available data is generally permissible in most jurisdictions. Respect Instagram's Terms of Use and privacy regulations (GDPR, CCPA, LGPD). Consult legal counsel for your specific use case.
Can I run this on a schedule? Yes. Apify's scheduler supports daily, weekly and custom cron expressions. Every run writes to a fresh dataset you can pull over the Apify API.
📝 Changelog
- v0.1 (2026-06-07) — Initial release: no-login profile reel enumeration, direct-CDN MP4 download, Whisper transcription with auto language detection, metadata-only mode, datacenter / residential proxy support.
🔗 Related actors
- YouTube Shorts Scraper — Shorts metadata, views, likes, dislikes, comments and music
- TikTok Scraper — TikTok videos and creator analytics
- Google Play Reviews Scraper — Android app reviews, ratings, locales
- Telegram Channel Scraper — public Telegram messages and metadata
- Reddit MCP Server — Reddit data as MCP tools for Claude and Cursor