All notable changes and improvements to TikTok AI Transcript Extractor.
- SRT & VTT Subtitles: Auto-generated SRT and WebVTT subtitle formats from transcript segments — ready for video editors and media players
- Speaker Diarization & Word-Level Timestamps: Segments now include speaker, language, and a words array with per-word timing, confidence scores, and speaker labels
- Creator Profile Data: Added followerCount, authorHeartCount, authorVideoCount, authorBio, and authorVerified fields
- Engagement Expansion: Added collectCount (bookmarks/saves) to engagement metrics
- Content Classification: Added diversificationLabels (content categories), textLanguage (detected language), and isAd (advertisement detection)
- Location Data: Added locationCreated (country code), poiName, and poiAddress for geo-tagged videos
- Video Metadata: Added fileSize and scrapingDate fields
- Schema Additions: coverImageUrl now properly defined in dataset schema fields
- Video Duration Fix: videoDuration now correctly reports seconds (e.g. 50); previously the value was divided by 1000 and showed 0.05
- Richer Analysis: 45+ data fields (up from 30+) for deeper content insights
- Speaker Identification: Know who said what in multi-speaker TikTok videos
- Creator Intelligence: Full creator profile data for influencer analysis and competitor monitoring
- Geo Insights: Location-based analysis for regional content trends
- Content Categorization: TikTok's own content labels for automated topic classification
- Timestamped Segments Field: Added segments array containing the transcript broken into timestamped segments, each with id, text, start, and end times, for precise content analysis
- Improved Input Schema: Enhanced formatting with better line breaks between Features and Use Cases sections
- Better UX: Most important data visible first in results (transcript → segments → cover image → metrics)
- Easier Onboarding: Clear, concise documentation without fluff
- Professional Output: Clean dataset structure optimized for analysis
- Precise Timestamps: Segments field enables exact time-based content navigation and analysis
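
For custom post-processing, the segments field can also be turned into SRT by hand. The following is a minimal sketch, not the extractor's own implementation: it assumes each segment is a dict with the text, start, and end keys listed above, and that start and end are expressed in seconds (the unit is an assumption here). The helper names format_srt_timestamp and segments_to_srt are hypothetical.

```python
def format_srt_timestamp(seconds: float) -> str:
    """Format a time offset as an SRT timestamp (HH:MM:SS,mmm).

    Assumes the input is in seconds; adjust if the dataset uses another unit.
    """
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Build SRT text from a list of segments with text, start, and end keys."""
    blocks = []
    # SRT cue numbers start at 1, so renumber instead of reusing the segment id.
    for index, seg in enumerate(segments, start=1):
        start = format_srt_timestamp(seg["start"])
        end = format_srt_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}")
    # Cues are separated by a blank line; the file ends with a newline.
    return "\n\n".join(blocks) + "\n"


# Illustrative data only, not real extractor output.
example_segments = [
    {"id": 0, "text": "Hello", "start": 0.0, "end": 2.5},
    {"id": 1, "text": " world ", "start": 2.5, "end": 4.0},
]
srt_text = segments_to_srt(example_segments)
```

Cue numbers are regenerated from 1 rather than taken from id, since the changelog does not state whether id is zero-based.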