TikTok Transcript Scraper avatar

TikTok Transcript Scraper

Pricing

from $3.99 / 1,000 results

Go to Apify Store
TikTok Transcript Scraper

TikTok Transcript Scraper

TikTok Transcript Scraper extracts captions, subtitles, and auto-generated transcripts from public TikTok videos. Ideal for content analysis, keyword research, trend tracking, accessibility, and marketing insights to better understand audience engagement and video performance.

Pricing

from $3.99 / 1,000 results

Rating

0.0

(0)

Developer

Scrape Bridge

Scrape Bridge

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Tiktok Transcript Scraper 🎯

Scraping TikTok captions one video at a time is slow, error-prone, and hard to scale when you need real text for analysis or repurposing. Tiktok Transcript Scraper pulls transcripts and key video metadata from TikTok URLs in bulk so you can move faster with a TikTok captions scraper or TikTok subtitle scraper workflow. It’s perfect for marketers, data analysts, and researchers who want TikTok transcript extraction tool output ready for downstream cleanup, indexing, or research. In one run, you can process multiple TikTok video URLs and save structured results to your dataset within seconds of starting.


See the Data: Sample Output

Here's a real record from a single run:

{
"url": "https://www.tiktok.com/@david.webdeveloper/video/7506519006854253847",
"id": "7506519006854253847",
"description": "Building useful web tools—one tip at a time.",
"created_time": 1717212345,
"transcript": [
{
"start": "00:00:00",
"end_time": "00:00:02",
"text": "Here’s a quick tip to improve your workflow."
},
{
"start": "00:00:02",
"end_time": "00:00:05",
"text": "Automate the boring parts and focus on results."
}
],
"video_height": "1920",
"video_width": "1080",
"video_duration": "00:00:45",
"cover_url": "https://example.com/cover.jpg",
"video_download_url": "https://example.com/download.mp4",
"stats_play": [1234, 0],
"authorStats": [
{ "key": "followers", "value": 48200 }
],
"status": "success"
}
FieldTypeWhat It Tells You
urlstringThe TikTok video URL you provided—easy traceability back to the input.
idstringA stable video identifier for matching, deduping, and joins in your data pipeline.
descriptionstringVideo description text that often complements captions for topic modeling and tagging.
created_timenumberWhen the video was created, useful for time-based analysis and trend comparisons.
transcriptarray | stringParsed caption timing + text (or "No Transcript Found" when nothing is available).
video_heighttextVideo resolution height metadata for media analytics and quality checks.
video_widthtextVideo resolution width metadata for media analytics and quality checks.
video_durationtextDuration string to help normalize pacing and compare content length.
cover_urllinkCover image URL for previewing content in dashboards or reports.
video_download_urllinkDownload link for the video asset when you need it alongside the transcript.
stats_playarray“Play” statistics payload for performance context (e.g., for correlating text themes to engagement).
authorStatsarrayCreator-level stats payload to enrich your dataset with audience context.
statusstringRun outcome at the record level; errors are handled so you can detect failed items cleanly.

Export your full dataset as JSON, CSV, or Excel from the Apify dashboard.


Setting It Up

Drop this into your input.json and you're ready to go:

{
"startUrls": [
{
"url": "https://www.tiktok.com/@david.webdeveloper/video/7506519006854253847"
},
{
"url": "https://www.tiktok.com/@example_user/video/1234567890123456789"
}
],
"proxyConfiguration": {
"useApifyProxy": false
}
}
ParameterRequiredWhat It Does
startUrlsA list of TikTok video URLs to scrape (you can submit one or many).
proxyConfigurationProxy settings for the run to help with reliable access to TikTok content.
proxy supportSet whether to route requests through Apify Proxy (defaults to false in the prefill).

What It Does

Tiktok Transcript Scraper takes a list of TikTok video URLs and outputs structured transcript data plus video and author metadata.

Extract TikTok transcripts with timing

For each provided URL, it returns a transcript field containing parsed subtitle segments with start, end_time, and text. This format is designed to be easy to analyze for themes, segments, and pacing—useful when you’re building workflows like a TikTok subtitle scraper or TikTok captions scraper.

Clean, integration-ready JSON output

Each result includes url, id, description, created_time, and video metadata such as video_height, video_width, and video_duration. The output also includes media links like cover_url and video_download_url, plus performance-style fields such as stats_play and authorStats.

Built-in proxy support for reliable scraping

The actor lets you configure proxy behavior via proxyConfiguration. If the selected proxy is rejected by TikTok, a residential proxy is used as a fallback to keep your runs moving.

Includes retries and fallbacks for resilience

When fetching transcript data, it uses a retry approach (up to 3 retries). If a transcript can’t be parsed or no subtitles are available, the actor returns "No Transcript Found" in the transcript field instead of breaking your pipeline.

Works well as a TikTok transcript extraction tool

Because results are pushed per-video into the dataset, you can run large batches of TikTok video transcript downloader jobs and then export for analysis, reporting, or enrichment—without manually copy-pasting captions.

Overall, Tiktok Transcript Scraper turns TikTok transcript API–style output into a ready-to-use dataset for your next step.


Why Tiktok Transcript Scraper?

There are plenty of ways to pull text from TikTok videos—here’s why Tiktok Transcript Scraper stands out.

Structured transcripts, not raw blobs

You get parsed caption segments with clear timing boundaries (start and end_time) and clean caption text (text). That makes TikTok caption extraction easier to plug into analysis, transcription review, or content research.

Reliable bulk processing with proxy fallback

If your initial proxy approach is rejected, the actor falls back to a residential proxy method to keep the run robust. This is especially helpful when you’re scraping captions from TikTok videos at scale.

Results written directly to your dataset

As each TikTok video is processed, the result is pushed to the dataset immediately. That means less waiting, better visibility, and easier partial-run recovery.


Real-World Use Cases

Here's how different teams put Tiktok Transcript Scraper to work:

Growth & Content Analysts
A marketing analyst collects a weekly set of TikTok captions scraper inputs to understand what messaging patterns correlate with performance. After the run, they export the dataset to analyze themes over time using created_time, and segment quotes using the transcript timing.

SEO & Research Teams
A researcher building a dataset of TikTok text from captions for qualitative coding runs TikTok subtitle scraper batches by video URL. They use transcript plus description to enrich each record, then attach the results to internal taxonomy.

Outreach & Community Builders
A community manager wants to identify talking points in creator content before reaching out. Using Tiktok closed captions scraper output, they quickly scan segments by timing and capture the exact caption wording alongside authorStats and stats_play context.

Automation & Data Engineers
A developer integrates a TikTok transcription scraper into an ETL pipeline: trigger the actor, then ingest the dataset into their warehouse. Because the output is consistent JSON with fields like id, created_time, and video_download_url, it’s straightforward to join with other content metadata.

Podcast & Repurposing Producers
A content team uses TikTok transcript extraction tool output to draft scripts and quotes. They pull transcript segments to preserve pacing, and fall back gracefully when "No Transcript Found" appears so their pipeline stays stable.


How to Run It

No code required. Here's how to get your first results in under 5 minutes:

  1. Open the actor on Apify
    Go to the actor page on console.apify.com and open Tiktok Transcript Scraper.

  2. Enter your inputs
    Paste one or more TikTok video URLs into startUrls (each item should include a url).

  3. Configure proxy settings (optional)
    If you have proxy preferences, set them under proxyConfiguration (for example proxy support).

  4. Start the run and watch the live log
    Launch the run and monitor progress in the Apify log output.

  5. Open the Dataset tab
    Your transcript results appear in the dataset as each video is processed.

  6. Export in your preferred format
    Download your dataset as JSON, CSV, or Excel from the Apify dashboard.

The whole setup takes under 5 minutes — results start appearing within seconds of launch.


Export & Integration Options

Once your data is collected, Tiktok Transcript Scraper fits directly into your existing workflow. You can export results from the Apify dataset tab as JSON, CSV, or Excel for analysis and reporting.

For automation, you can connect via Apify’s API access to pull results programmatically, or trigger downstream steps when a run completes using webhooks. No-code options like Zapier or Make can also help push extracted TikTok captions scraper output into your tools.

If you already have dashboards, you can schedule repeated runs and keep your transcript dataset fresh for ongoing research and trend tracking.


Pricing

Tiktok Transcript Scraper runs on Apify, which includes a free tier — no credit card needed to start. You’ll begin with a limited amount of platform credits to run several test jobs and validate results quickly.

After that, runs are billed based on Actor compute units (CU) with pay-as-you-go pricing, and Apify plans apply for larger workloads. Start free at apify.com — scale up when you need to.


Reliability & Limitations

What We HandleHow
Rate-limit style failuresUses multiple attempts (up to 3 retries).
Proxy blocksSupports proxy configuration and uses a residential proxy fallback when needed.
Partial data availabilityReturns "No Transcript Found" in transcript when subtitles aren’t available.
Per-video dataset outputPushes each successful result into the dataset as it’s processed.
Parsing errorsHandles parsing issues by logging errors and returning null results for problematic items.

Limitations: this actor is designed for publicly accessible video transcript data. If a video has no subtitles/captions available, the transcript field will be "No Transcript Found". Also, any login-gated content or restricted access scenarios are outside the scope of this actor.

For enterprise-scale needs or custom configurations, reach out and we'll help.


Frequently Asked Questions

Is there a free plan?

Yes. Apify offers a free tier with credits you can use to run Tiktok Transcript Scraper and verify outputs before scaling up.

Do I need to log in or create an account on TikTok?

No. This actor works from publicly available TikTok video content you provide via startUrls, without requiring a TikTok account login.

How accurate is the extracted data?

The actor returns parsed transcript segments when subtitles/captions are available, and otherwise sets transcript to "No Transcript Found". Accuracy depends on what captions exist on the target videos.

How many results can I get per run?

You can pass multiple items in startUrls. The actor processes the provided list and writes a result record per successfully scraped video into your dataset.

How fresh is the data?

The data freshness depends on when the captions exist on the target videos at the time you run the actor. Running again later will refresh the dataset for new or updated content.

This actor is intended for publicly available data. You’re responsible for ensuring your use complies with GDPR, CCPA, and TikTok’s Terms of Service for your specific use case.

Can I export to Google Sheets or Excel?

Yes. You can export your dataset from the Apify dashboard as JSON, CSV, or Excel, and then import into Google Sheets or other tools that accept those formats.

Can I schedule this to run automatically?

Yes. You can schedule Apify actors for recurring runs so your TikTok captions scraper dataset stays up to date over time.

Can I access results via the API?

Yes. You can retrieve results programmatically using Apify’s API capabilities for this actor.

What happens when the actor encounters an error?

The actor logs errors and continues processing other URLs in your startUrls. If a transcript can’t be processed for a specific video, you may see null results for that item, while successful items are still pushed into the dataset.


Get Help & Use Responsibly

Got a question about Tiktok Transcript Scraper or a feature you'd like added? Reach out at dataforleads@gmail.com. We welcome ideas like additional transcript formats, stronger dataset normalization fields, or export-friendly segment structures.

Disclaimer: This actor collects publicly available data from TikTok. It does not access private accounts, login-gated pages, or password-protected content. You are responsible for ensuring your use complies with GDPR, CCPA, and TikTok’s Terms of Service. For data-removal requests, contact dataforleads@gmail.com. Use responsibly, ethically, and only for lawful purposes.