Instagram To Text avatar

Instagram To Text

Pricing

Pay per event

Go to Apify Store
Instagram To Text

Instagram To Text

AI-powered video transcription and translation. Convert video speech to text with timestamped subtitles in 100+ languages

Pricing

Pay per event

Rating

0.0

(0)

Developer

CheapGET

CheapGET

Maintained by Community

Actor stats

1

Bookmarked

52

Total users

13

Monthly active users

5 hours ago

Last modified

Share

Support Rating Reliability Instagram To Text

Transform Instagram videos into searchable, accessible text with AI-powered transcription in 100+ languages.

Extract speech from Instagram Reels and Stories with professional-grade accuracy. Get timestamped subtitles, automatic language detection, and instant translation—perfect for accessibility compliance, SEO optimization, and global content localization.

🤝 Support & Community

📧 Support: Contact Us 💬 Community: Telegram Group

🏆 Key Features

🎯 AI-Powered Transcription

  • 🎙️ Speech Recognition: Advanced AI extracts speech from videos with 90-95% accuracy, handling multiple speakers and accents.
  • ⏱️ Timestamped Subtitles: Every word is time-coded with millisecond precision (HH:MM:SS.mmm format) for subtitle generation.
  • 🌍 100+ Language Detection: Automatic language identification and transcription in over 100 languages including English, Spanish, Chinese, Japanese, Arabic, and more.
  • 📝 Segment-Level Output: Organized transcript with individual segments for easy editing and subtitle file creation.

🔄 Translation & Localization

  • 🗣️ Multi-Language Translation: Translate transcripts to any of 100+ target languages while preserving timestamp accuracy.
  • 🎯 Context-Aware Translation: Each subtitle segment is translated independently, maintaining context and timing synchronization.
  • 🌐 Global Content Reach: Localize Instagram content for international audiences with professional-quality AI translation.
  • ⚡ Parallel Processing: Fast translation with concurrent processing for quick turnaround times.

📊 Video Metadata Extraction

  • 👤 Creator Information: Extract author name, ID, and profile URL for attribution and analysis.
  • 📈 Engagement Metrics: Capture view counts, likes, comments, and shares (when available from platform).
  • 🖼️ Thumbnail Images: Automatically download and store video thumbnails for previews and social sharing.
  • 📅 Publishing Data: Track content publication dates with ISO-formatted timestamps for temporal analysis.

💰 Pricing

ResourceCostDescription
Actor Usage$0.00001Charged for Actor runtime, proxy and storage. Cost depends on resource consumption during execution
Transcript$0.39Charged once per video. Includes AI speech recognition and subtitle generation with timestamps
Translation$0.13Charged once per video when translation is requested. Includes AI-powered text translation to target language

Example Cost Calculation:

  • Processing 10 Instagram Reels with transcription only

  • Cost: (10 × $0.39) = $3.90 + runtime fees

  • Processing 10 Instagram Reels with transcription and translation

  • Cost: (10 × $0.39) + (10 × $0.13) = $5.20 + runtime fees

🎯 Use Cases

Accessibility

Generate readable text outputs for captions, internal accessibility workflows, and searchable knowledge bases.

Content Repurposing

Turn spoken content into posts, newsletters, summaries, clips, and reusable editorial inputs.

Research Archives

Index transcripts so teams can search interviews, webinars, tutorials, and public media after collection.

Compliance Review

Scan spoken content for claims, mentions, policy-sensitive language, or audit trails.

Multilingual Workflows

Route transcripts into translation, localization, and documentation processes for broader distribution.

🌟 Why choose this Actor?

Built for content creators, marketers, and accessibility professionals, this Actor transforms Instagram videos into structured, searchable, and translatable text.

FeatureInstagram To TextManual TranscriptionProfessional ServicesRev.com
Pricing ModelPay per use❌ Time-consuming❌ $1-3 per minute⚠️ $1.50/min
SpeedReal-time❌ Hours/days❌ 24-48 hours⚠️ 12-24 hours
Accuracy90-95%✅ Human-level✅ Professional✅ 99%
Languages100+ langs❌ Limited⚠️ Major languages⚠️ 30+ langs
TimestampsAutomatic❌ Manual work✅ Included✅ Included
TranslationBuilt-in❌ Separate service❌ Extra cost❌ Not supported
Setup TimeInstant❌ Complex⚠️ Account required⚠️ Upload needed
Min. Cost$0.39❌ Free but slow❌ $10+ minimum⚠️ $1.50 minimum

💻 Input Parameters

Instagram To Text input configuration showing video_url field for Instagram Reels/Stories URL and translate dropdown for optional language selection

ParameterTypeRequiredDescription
video_urlstring✅ YesInstagram video URL to transcribe. Supports Instagram Reels and Stories. Format: https://www.instagram.com/reel/[VIDEO_ID] or https://www.instagram.com/stories/[USERNAME]/[STORY_ID]
translatestring❌ NoTarget language for translation (optional). Select from 100+ languages including English, Spanish, Chinese (Simplified), Chinese (Traditional), Japanese, Korean, Arabic, French, German, Portuguese, Russian, Hindi, and more

📝 Example Input

{
"video_url": "https://www.instagram.com/reel/DRfTMtyAZ_M",
"translate": "japanese"
}

📤 Output Structure

FieldTypeDescription
processorstringURL of the Apify actor that processed this data
processed_atstringISO 8601 timestamp when the data was processed (format: YYYY-MM-DDTHH:MM:SS+00:00)
platformstringSource platform name (Instagram)
thumbnailstringVideo thumbnail image URL stored in Apify key-value store
titlestringOriginal title of the video
descriptionstringVideo description/caption text
authorstringVideo creator or uploader username/name
author_idstringAuthor's Instagram user ID
author_urlstringURL to the author's Instagram profile page
durationnumberVideo duration in seconds
audio_titlestringTrack name if the video contains music metadata
audio_artiststringArtist name if the video contains music metadata
view_countintegerNumber of views on the video (when available)
like_countintegerNumber of likes on the video
shares_countintegerNumber of shares/reposts (when available)
dislike_countintegerNumber of dislikes on the video (when available)
comment_countintegerNumber of comments on the video
categoriesarrayVideo categories (when available)
tagsarrayVideo tags/hashtags (when available)
published_atstringISO 8601 timestamp when the video was published
transcriptobjectAI-generated transcript with speech recognition and timestamped subtitles in original language
translationobjectAI-translated transcript with timestamped subtitles in target language (when requested)

Transcript Object Fields

FieldTypeDescription
languagestringDetected language name (e.g., "English", "Spanish", "Japanese")
textstringFull transcript text concatenated from all segments
segmentsarrayArray of timestamped transcript segments with start, end, and text properties

Segment Fields

FieldTypeDescription
startstringSegment start time in HH:MM:SS.mmm format (e.g., "00:00:00.000")
endstringSegment end time in HH:MM:SS.mmm format (e.g., "00:00:06.560")
textstringTranscript text for this time segment

📤 Example Output

{
"processor": "https://apify.com/cheapget/instagram-to-text?fpr=aiagentapi",
"processed_at": "2025-12-25T13:30:38+00:00",
"platform": "Instagram",
"title": "Video by openai",
"description": "You can now use ChatGPT Voice right inside chat — no separate mode needed...",
"author": "OpenAI",
"author_id": "openai",
"author_url": null,
"duration": 62,
"audio_title": null,
"audio_artist": null,
"view_count": null,
"like_count": 7089,
"shares_count": null,
"dislike_count": null,
"comment_count": 246,
"categories": [],
"tags": [],
"published_at": "2025-11-25T18:13:03+00:00",
"thumbnail": "https://api.apify.com/v2/key-value-stores/UKonHLvjRbXHyC0Lu/records/DRfTMtyAZ_M.png",
"transcript": {
"language": "English",
"text": " Hey Rocky, great to have you here. Hey, so can you tell me what's new with Voice? Absolutely...",
"segments": [
{
"start": "00:00:00.000",
"end": "00:00:06.560",
"text": "Hey Rocky, great to have you here."
},
{
"start": "00:00:06.560",
"end": "00:00:09.759",
"text": "Hey, so can you tell me what's new with Voice?"
}
]
},
"translation": {
"language": "Japanese",
"text": " やあ、ロッキー、来てくれて嬉しいよ。 ねえ、Voice の新機能について教えてもらえますか?...",
"segments": [
{
"start": "00:00:00.000",
"end": "00:00:06.560",
"text": "やあ、ロッキー、来てくれて嬉しいよ。"
},
{
"start": "00:00:06.560",
"end": "00:00:09.759",
"text": "ねえ、Voice の新機能について教えてもらえますか?"
}
]
}
}

🔌 Integrations

Seamlessly connect this actor to your existing pipelines via the Apify API.

🐍 Python SDK

from apify_client import ApifyClient
client = ApifyClient("YOUR_APIFY_API_TOKEN")
run_input = {'video_url': 'https://www.instagram.com/reel/DRfTMtyAZ_M',
'translate': 'japanese'}
run = client.actor("cheapget/instagram-to-text").call(run_input=run_input)
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
print(item)

🟨 JavaScript / Node.js SDK

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_APIFY_API_TOKEN' });
const run = await client.actor('cheapget/instagram-to-text').call({
runInput: {
"video_url": "https://www.instagram.com/reel/DRfTMtyAZ_M",
"translate": "japanese"
}
});
for await (const item of client.dataset(run.defaultDatasetId).iterateItems()) {
console.log(item);
}

🌐 REST API (curl)

curl -X POST "https://api.apify.com/v2/acts/cheapget~instagram-to-text/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{"video_url": "https://www.instagram.com/reel/DRfTMtyAZ_M", "translate": "japanese"}'

Ⓜ️ Make.com Integration

Get Started with Make.com (1000 Free Credits) 🎁

Step 1: Add the Make.com module "Run an Actor"
Step 2: Enable mapping and paste Actor ID `DWlxiR8rGSilY8GHd`
Step 3: Paste the input JSON and run synchronously
Step 4: Add "Get Dataset Items" with `defaultDatasetId`

🎱 N8N.io Integration

Open Source Workflow Automation

Step 1: Add the Apify node "Run an Actor and get dataset"
Step 2: Set Selection Mode to By ID
Step 3: Paste Actor ID `DWlxiR8rGSilY8GHd`
Step 4: Add your input JSON and read items from `defaultDatasetId`

📚 API Documentation

  • MCP API - Model Context Protocol integration
  • Python API - Complete Python client documentation with examples
  • JavaScript API - Node.js and browser integration guide

🏗️ Metadata for Developers (JSON-LD)

{
"@context": "https://schema.org",
"@type": "SoftwareApplication",
"name": "Instagram To Text - Apify Actor",
"applicationCategory": "DeveloperApplication",
"applicationSubCategory": "Transcription and Subtitle Extraction",
"operatingSystem": "Cloud",
"offers": {
"@type": "Offer",
"price": "0.00",
"priceCurrency": "USD",
"priceValidUntil": "2099-12-31",
"availability": "https://schema.org/InStock"
},
"description": "Transform Instagram videos into searchable, accessible text with AI-powered transcription in 100+ languages.",
"featureList": [
"AI-Powered Transcription",
"Translation & Localization",
"Video Metadata Extraction"
],
"keywords": "instagram-to-text, structured data extraction, apify actor",
"author": {
"@type": "Organization",
"name": "cheapget",
"url": "https://apify.com/cheapget?fpr=aiagentapi"
},
"softwareVersion": "1.0.0",
"datePublished": "2024-01-01",
"dateModified": "2026-04-12"
}

🚀 Performance Tips

Optimize your transcription runs for speed, cost, and accuracy with these best practices:

💰 Cost Optimization

  • Test First: Start with a single short video to verify output quality before processing large batches
  • Skip Translation: If you only need transcription, leave the translate parameter empty to save $0.13 per video
  • Batch Processing: Process multiple videos in parallel to maximize efficiency and reduce overall runtime costs

⚡ Speed Optimization

  • Video Quality: Higher quality audio improves transcription accuracy and reduces processing time for corrections
  • Shorter Videos: Videos under 5 minutes process faster; consider splitting longer content if needed
  • Direct URLs: Use direct Instagram video URLs when possible to avoid additional redirect processing

🛡️ Data Quality Tips

  • Clear Audio: Videos with clear speech and minimal background noise produce the most accurate transcripts (90-95%)
  • Language Detection: Automatic detection works best for single-language content; mixed-language videos may have reduced accuracy
  • Speaker Consistency: Single-speaker videos transcribe more accurately than multi-speaker conversations
  • Background Noise: Excessive music or ambient noise can reduce transcription accuracy

📊 Best Practices

  • Transcription Accuracy: Expect 90-95% accuracy for clear audio with standard accents
  • Translation Quality: AI translation maintains context and meaning while preserving timestamp synchronization
  • Metadata Availability: Some Instagram videos may have limited metadata; private content cannot be accessed
  • Retry Logic: The Actor automatically retries failed downloads and transcriptions for reliability

❓ FAQ

What Instagram content types are supported?

This Actor supports Instagram Reels and Stories. Simply paste the Instagram video URL and the Actor will automatically download, transcribe, and optionally translate the content.

How accurate is the transcription?

Transcription accuracy typically ranges from 90-95% for clear audio with standard accents. Accuracy depends on audio quality, background noise, speaker clarity, accent complexity, and language. Videos with clear speech and minimal background noise produce the best results.

Which languages are supported?

The Actor supports automatic transcription and translation in 100+ languages including English, Spanish, Chinese (Simplified & Traditional), Japanese, Korean, Arabic, French, German, Portuguese, Russian, Hindi, Italian, Dutch, Turkish, Polish, Vietnamese, Thai, Indonesian, and many more.

Can I transcribe private Instagram videos?

No, the Actor can only process publicly accessible Instagram videos. Private or restricted content requires authentication, which is not supported to maintain security and privacy compliance.

How long does processing take?

Processing time depends on video length and complexity. Typically:

  • 1-minute video: 30-60 seconds for transcription
  • 1-minute video with translation: 45-90 seconds total
  • 5-minute video: 2-4 minutes for transcription
  • 5-minute video with translation: 3-6 minutes total

What output formats are available?

The Actor outputs data in JSON format by default. You can export results to CSV or Excel formats using Apify's dataset export features. The timestamped segments can be easily converted to SRT, VTT, or other subtitle formats.

Do I get timestamped subtitles?

Yes! Every transcript includes precise timestamps for each segment, showing start and end times in HH:MM:SS.mmm format (e.g., "00:00:06.560"). This makes it easy to generate SRT, VTT, WebVTT, or other subtitle file formats.

Can I use this for commercial purposes?

Yes, you can use the transcripts for commercial purposes including content creation, marketing, accessibility compliance, and SEO optimization. However, ensure you have rights to the original Instagram video content and comply with Instagram's terms of service.

What happens if transcription fails?

The Actor includes automatic retry logic with up to 8 attempts for transcription. If all attempts fail, you'll receive a clear error message indicating the issue (e.g., no speech detected, excessive background noise, or download failure).

🏷️ Instagram To Text

🔥 Search Terms: instagram transcription, instagram to text, video to text, AI subtitle generator, instagram reels transcription, instagram stories transcription, automatic subtitles, video translation, speech recognition, closed captions, social media transcription, AI transcription service, video to text converter, instagram caption generator, instagram subtitle generator, multilingual subtitles, video accessibility, content localization, video SEO optimization, automatic transcription, AI speech to text, video content extraction, social media accessibility, subtitle generator, transcript generator, video analysis tool, content repurposing, video marketing tool, influencer content analysis, social listening tool, instagram reels to text, instagram stories to text

💼 Use Case: content-creation social-media-marketing accessibility video-seo content-localization subtitle-generation video-transcription ai-transcription multilingual-content closed-captions video-translation content-repurposing influencer-marketing social-listening market-research competitive-analysis video-analytics content-analysis educational-content e-learning video-accessibility ada-compliance wcag-compliance seo-optimization keyword-research content-marketing video-editing automated-subtitles speech-to-text

This actor processes publicly available video content from Instagram. It does not bypass authentication, access private accounts, or violate platform terms of service. You are responsible for:

  • Content Rights: Ensuring you have permission to transcribe and use the video content
  • Privacy Compliance: Adhering to GDPR, CCPA, and other applicable privacy laws when processing personal data
  • Platform Terms: Respecting Instagram's terms of service and usage policies
  • Ethical Use: Using transcripts responsibly for legitimate purposes such as accessibility, research, or content creation
  • Best Job Search - Aggregates job listings from LinkedIn, Indeed, Glassdoor, and regional platforms with normalized output for sourcing, analytics, and job monitoring.
  • Glassdoor Job Search - Extracts Glassdoor job postings with salary ranges, company ratings, reviews, and listing details for employer and compensation research.
  • Indeed Job Search - Extracts Indeed job listings with titles, locations, salary disclosures, company details, and application links for hiring and market research.
  • LinkedIn Job Search - Extracts LinkedIn job postings with applicant counts, company information, seniority signals, and structured hiring data.
  • Best Video Downloader - Downloads videos or metadata from 1000+ platforms with quality control, thumbnail URLs, and optional comment extraction.
  • TikTok Video Downloader - Downloads TikTok videos without watermarks and returns metadata, audio details, comments, hashtags, and creator information.
  • YouTube Video Downloader - Downloads YouTube videos in multiple qualities and returns metadata, comments, thumbnails, and media file links.
  • Video Subtitles Downloader - Extracts subtitles from video URLs and returns structured subtitle data with downloadable text outputs.
  • TikTok Live Recorder - Records TikTok live streams and returns capture files with stream, creator, and engagement metadata.
  • TikTok Video Profile - Extracts rich TikTok video metadata including captions, hashtags, engagement stats, comments, and creator information.
  • YouTuber Profile - Extracts YouTube channel profile data including subscriber counts, descriptions, links, emails, and optional video catalogs.
  • Video To Text - Transcribes video and audio content into structured text with timestamps and language support for searchable downstream workflows.
  • Instagram To Text - Transcribes Instagram video content into text for captioning, repurposing, research, and searchable archives.
  • Social Media Marketing - Generates platform-ready marketing outputs from source content with multiple copy variations, tones, and distribution formats.
  • Reddit User Profile - Extracts Reddit user profile data, activity signals, karma metrics, trophies, and account-level metadata.
  • Reddit Community Profile - Extracts subreddit metadata including rules, wiki content, pinned posts, descriptions, and community-level details.
  • Reddit Community Post - Extracts posts from a subreddit with metadata, content, authors, engagement stats, and comment threads.
  • Reddit Post Search - Searches Reddit posts by keyword and returns structured post data with engagement signals and nested comments.
  • Reddit Community User - Extracts active users from a subreddit with karma, account status, trophies, moderated communities, and related profile data.
  • Telegram Group Member - Extracts Telegram group member profiles and community participant data from public groups.
  • Telegram Channel Message - Extracts Telegram channel messages with text, media, views, replies, forwards, and reaction metrics.
  • Telegram Profile - Extracts Telegram profiles for users, groups, bots, and channels with public metadata and identity details.
  • Google Business Profile - Extracts Google Business Profile listings with contact information, reviews, ratings, photos, categories, and location data.
  • X Community Profile - Extracts X community metadata including descriptions, membership signals, creator information, and activity details.
  • X User Profile - Extracts comprehensive X user profile data including verification, follower metrics, pinned content, and account metadata.
  • Zillow Real Estate - Extracts Zillow property listings with pricing, photos, valuations, unit details, address data, and listing metadata.
  • US Real Estate - Extracts U.S. property listings with pricing, specs, valuation fields, agent details, and market-ready real estate data.

Last Updated: April 12, 2026