TikTok Transcript Extractor Scraper
2 hours trial then $10.00/month - No credit card required now
Effortlessly extract TikTok video transcripts in WebVTT format with this reliable scraper. Save time, automate data collection, and customize settings like concurrency and proxies for seamless performance. Perfect for accessibility, analysis, or content repurposing!
The TikTok Transcript Extractor Scraper is a tool designed to automate the extraction of captions and transcripts from TikTok videos in WebVTT format. This scraper provides an easy way to retrieve video subtitles for accessibility, analysis, or repurposing.
Features
- Extract Transcripts: Automatically retrieve captions in WebVTT format from TikTok videos.
- Start with Specific URLs: Input TikTok video URLs to fetch transcripts from desired videos.
- Customizable Concurrency: Optimize the scraping speed with configurable concurrency settings.
- Retry Mechanism: Automatically retry failed requests so that transient errors do not leave videos without a transcript.
- Proxy Support: Use proxies for anonymity and uninterrupted operations.
Input Configuration
Below is the structure of the input configuration for the scraper:
1. Start URLs
- Title: Start URLs
- Type: Array
- Description: Specify the TikTok video URLs to scrape the transcript.
- Editor: requestListSources
- Example:

    {
      "url": "https://www.tiktok.com/@stoolpresidente/video/7451747413649263915"
    }

2. Max Concurrency
- Title: Max Concurrency
- Type: Integer
- Description: Maximum number of pages that can be processed simultaneously.
- Default: 10
3. Min Concurrency
- Title: Min Concurrency
- Type: Integer
- Description: Minimum number of pages that will be processed simultaneously.
- Default: 1
4. Max Request Retries
- Title: Max Request Retries
- Type: Integer
- Description: Number of retries for a failed request before the scraper gives up.
- Default: 100
5. Proxy Configuration
- Section Caption: Proxy Configuration
- Title: Proxy Configuration
- Type: Object
- Description: Configure proxy servers for secure and anonymous scraping.
- Default:

    {
      "useApifyProxy": true,
      "apifyProxyGroups": ["RESIDENTIAL"]
    }

- Details: For more information on proxy configuration, refer to Apify Proxy Configuration.
Example Input

    {
      "startUrls": [
        {
          "url": "https://www.tiktok.com/@stoolpresidente/video/7451747413649263915"
        }
      ],
      "maxConcurrency": 10,
      "minConcurrency": 1,
      "maxRequestRetries": 100,
      "proxy": {
        "useApifyProxy": true,
        "apifyProxyGroups": ["RESIDENTIAL"]
      }
    }

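For runs that are started programmatically, the same input can be assembled in code. The following is a minimal Python sketch that uses only the field names documented above; the helper name `build_run_input` and its validation rules (non-empty URL list, `minConcurrency` not above `maxConcurrency`, non-negative retries) are assumptions made for illustration, not checks the actor is known to perform.

```python
from typing import Any


def build_run_input(
    urls: list[str],
    max_concurrency: int = 10,
    min_concurrency: int = 1,
    max_request_retries: int = 100,
    use_residential_proxy: bool = True,
) -> dict[str, Any]:
    """Assemble an input object with the documented field names and defaults."""
    # The checks below are illustrative assumptions, not rules taken from the actor.
    if not urls:
        raise ValueError("at least one start URL is required")
    if min_concurrency < 1 or min_concurrency > max_concurrency:
        raise ValueError("expected 1 <= minConcurrency <= maxConcurrency")
    if max_request_retries < 0:
        raise ValueError("maxRequestRetries must not be negative")

    proxy: dict[str, Any] = {"useApifyProxy": True}
    if use_residential_proxy:
        proxy["apifyProxyGroups"] = ["RESIDENTIAL"]

    return {
        "startUrls": [{"url": url} for url in urls],
        "maxConcurrency": max_concurrency,
        "minConcurrency": min_concurrency,
        "maxRequestRetries": max_request_retries,
        "proxy": proxy,
    }
```

Passing the result through `json.dumps(..., indent=2)` yields a document with the same shape as the example input, ready to paste into the actor's input editor.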
Output
The scraper produces an output containing the transcript in WebVTT format. Here’s an example:
Output Example

    {
      "transcript": "WEBVTT\n\n\n00:00:00.260 --> 00:00:01.500\nWatch out for the snow storm,\n\n00:00:01.501 --> 00:00:02.621\npresident. Oh,\n\n00:00:02.622 --> 00:00:04.061\nhe said watch out for. No,\n\n00:00:04.062 --> 00:00:05.541\nI didn't know what the hell you were talking about.\n\n..."
    }

transcript: Contains the captions from the TikTok video in WebVTT format.
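Because the captions arrive as a single JSON string, a consumer usually has to decode the item before using them. The sketch below shows one way to do that in Python; the function name `extract_transcript` is hypothetical, and the check for a leading `WEBVTT` header is an assumption based on the example output rather than a documented guarantee.

```python
import json


def extract_transcript(item_json: str) -> str:
    """Return the WebVTT text stored in the `transcript` field of one output item."""
    item = json.loads(item_json)
    transcript = item.get("transcript")
    if not isinstance(transcript, str):
        raise KeyError("output item has no string 'transcript' field")
    # Assumption: a well-formed transcript starts with the WEBVTT header line.
    if not transcript.startswith("WEBVTT"):
        raise ValueError("transcript does not look like WebVTT")
    return transcript
```

The returned string can be written to disk unchanged, for example with `pathlib.Path("captions.vtt").write_text(text, encoding="utf-8")`, to produce a subtitle file.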
Transcript Format (WebVTT)
The captions are structured in WebVTT format with time codes and text:

    WEBVTT

    00:00:00.260 --> 00:00:01.500
    Watch out for the snow storm,

    00:00:01.501 --> 00:00:02.621
    president. Oh,

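For analysis or repurposing, it is often more convenient to work with individual cues than with the raw WebVTT text. The following is a minimal Python sketch of a parser for the layout shown above. It is an illustration rather than a full WebVTT implementation: it assumes `hh:mm:ss.mmm` timestamps as in the example and ignores cue settings, notes, and styling. The names `Cue` and `parse_webvtt` are hypothetical.

```python
import re
from dataclasses import dataclass

# Matches a timing line such as "00:00:00.260 --> 00:00:01.500".
TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2})\.(\d{3})\s+-->\s+(\d{2}):(\d{2}):(\d{2})\.(\d{3})"
)


@dataclass
class Cue:
    start: float  # seconds from the beginning of the video
    end: float  # seconds from the beginning of the video
    text: str


def _seconds(hours: str, minutes: str, seconds: str, millis: str) -> float:
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def parse_webvtt(transcript: str) -> list[Cue]:
    """Split WebVTT text into cues; lines outside a cue (such as the header) are skipped."""
    cues: list[Cue] = []
    current: Cue | None = None
    for raw_line in transcript.splitlines():
        line = raw_line.strip()
        match = TIMING.match(line)
        if match:
            parts = match.groups()
            current = Cue(_seconds(*parts[:4]), _seconds(*parts[4:]), "")
            cues.append(current)
        elif not line:
            current = None  # a blank line ends the current cue
        elif current is not None:
            # Join multi-line cue text with single spaces.
            current.text = f"{current.text} {line}".strip()
    return cues
```

Joining the cue texts, for instance with `" ".join(cue.text for cue in parse_webvtt(transcript))`, gives a plain-text transcript without timing information.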
Why Use This Scraper?
- Ease of Use: Just provide the TikTok video URLs and extract captions effortlessly.
- Customizable Settings: Adjust concurrency, retries, and proxy settings to fit your needs.
- Accurate and Reliable: Automatically retries failed requests to minimize data loss.
- Time-Saving: Automates the manual effort of retrieving TikTok captions for multiple videos.
Notes
- Ensure valid TikTok video URLs are provided in the startUrls field.
- Proxies are recommended for large-scale scraping to prevent rate-limiting or IP bans.
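A quick pre-flight check can catch malformed URLs before a run is started. The Python sketch below assumes that a valid video URL follows the pattern of the example in this document, `https://www.tiktok.com/@<user>/video/<numeric id>`. That pattern and the helper name `is_tiktok_video_url` are assumptions; the actor may accept other URL forms (such as short links) that this check would reject.

```python
import re
from urllib.parse import urlparse

# Assumed path shape, inferred from the example URL: /@<user>/video/<digits>
VIDEO_PATH = re.compile(r"^/@[^/]+/video/\d+/?$")


def is_tiktok_video_url(url: str) -> bool:
    """Return True if the URL looks like a direct link to a TikTok video."""
    parsed = urlparse(url)
    return (
        parsed.scheme == "https"
        and parsed.netloc in {"www.tiktok.com", "tiktok.com"}
        and VIDEO_PATH.match(parsed.path) is not None
    )
```

Filtering a list with `[u for u in urls if is_tiktok_video_url(u)]` before building the input keeps profile links and unrelated URLs out of the request queue.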
Explore More Scrapers
If you found this TikTok Transcript Extractor Scraper useful, be sure to check out our other scrapers and actors at memo23's Apify profile. We offer a wide range of tools for web scraping and automation across various platforms and use cases.
Support
- For issues or feature requests, please use the Issues section of this actor.
- If you need customization or have questions, feel free to contact the author:
- Author's website: https://muhamed-didovic.github.io/
- Email: muhamed.didovic@gmail.com
Additional Services
- To request a customized version of this actor, a complete dataset, or scraping of another site, email: muhamed.didovic@gmail.com
- For API services of this scraper (no Apify fee, just usage fee for the API), contact: muhamed.didovic@gmail.com
Actor Metrics
- 4 monthly users
- No stars yet
- >99% runs succeeded
- Created in Jan 2025
- Modified a day ago