Wistia Transcript Scraper — Captions to TXT, SRT & VTT
Pricing
from $1.00 / 1,000 per record returneds
Wistia Transcript Scraper — Captions to TXT, SRT & VTT
Extract any public Wistia video's transcript and captions — no login, no ASR. By hashedId or any page that embeds Wistia: full text, timestamped segments & SRT/VTT, plus title and duration, straight from Wistia's CDN. $2 per 1,000 videos.
Pricing
from $1.00 / 1,000 per record returneds
Rating
0.0
(0)
Developer
Scrapers Delight
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
🎥 Wistia Video Transcript & Caption Scraper
Get any public Wistia video's transcript — no login, no AI transcription. Wistia hosts every video on one CDN keyed by its hashedId, and this actor reads the video's own captions: full transcript text, timestamped segments, and SRT/VTT, plus title and duration. Pass hashedIds directly or point it at a page that embeds Wistia videos.
Because it reads Wistia's existing captions, there's no speech-to-text compute — it's fast and cheap.
What does it do?
For each Wistia video (by hashedId or harvested from an embedding page) it returns:
- 📝 Full transcript (plain text) — always included
- ⏲️ Timestamped segments —
{start, end, text} - 🎬 SRT / VTT subtitles
- 🏷️ Title, duration, media type, caption language
No ASR, no API key — it reads the captions the uploader added.
What data does it extract?
For every video: hashed_id, title, duration_sec, media_type, caption_language, transcript, segments[], srt, vtt, segment_count, is_new (monitor), scraped_at.
Who is it for?
- 📣 Marketing & demand-gen teams repurposing Wistia product/webinar videos into text.
- 🤖 AI / RAG builders turning hosted video into searchable content.
- ♿ Accessibility / SEO workflows needing captions and transcripts.
How to use it (step by step)
- Click Try for free.
- Paste Wistia hashedIds (the token after
/medias/) — or an embedding page URL. - (Optional) add
srt/vtt/segmentsformats. - Click Start, open the Dataset tab to view/export.
- (Optional) set monitorMode + a pageUrl + a Schedule for new videos.
Quick start
{ "hashedIds": ["z1fxq584qr"], "transcriptFormats": ["txt", "srt"] }
Input
| Field | What it does |
|---|---|
hashedIds | Wistia hashedIds or embed URLs |
pageUrl | a page embedding Wistia videos to harvest IDs from |
transcriptFormats | txt · segments · srt · vtt |
maxVideos | hard cap per run (0 = unlimited) |
monitorMode, alertOnNewVideo | recurring watcher + alerts |
webhookUrl, slackWebhookUrl, emailRecipients | alert channels |
proxyConfiguration, requestConcurrency | proxy + parallelism |
Output
Each video is one dataset record (fields above). Export to JSON, CSV, Excel, HTML, or RSS, or fetch via the Apify API. Videos without captions are returned with a note (not faked).
How much does it cost?
Pay-per-event — and with no transcription compute, it's cheap:
| Event | What it covers | Suggested price |
|---|---|---|
lot-scraped | each video returned | ~$0.004 / video |
lot-detail-enriched | each transcript fetched | ~$0.004 / video |
monitor-run-completed | each scheduled watch run | ~$0.05 / run |
new-lot-detected | each new video | ~$0.02 / video |
alert-delivered | each Slack/email/webhook push | ~$0.005 / alert |
(Final per-event prices are set on the actor's pricing page.)
How does it work without AI transcription?
Wistia stores each video's captions on its public CDN (fast.wistia.net), reachable by the video's hashedId. This actor reads those captions (inline transcript + a WEBVTT track) — it does not run speech-to-text, so there's no GPU/compute cost. Captions exist only if the uploader added or auto-generated them; videos without captions are reported as such.
Is it legal to scrape Wistia transcripts?
This actor reads publicly embeddable Wistia videos' own caption files. The content belongs to the publisher (copyrighted). Scraping public data is generally legal, but you are responsible for your use — review Wistia's and the publisher's terms; don't redistribute transcripts you're not licensed to.
FAQ
Where do I find the hashedId?
It's the token after /medias/ in a Wistia embed/URL (e.g. fast.wistia.net/embed/medias/z1fxq584qr → z1fxq584qr), or media-id="…" in the embed code. You can also pass a page URL and let the actor harvest them.
Is there a Whisper/ASR step? No — it reads Wistia's captions, so it's fast and cheap.
Can I get subtitles?
Yes — add srt and/or vtt to transcriptFormats.
How do I export? JSON, CSV, Excel, HTML, or RSS from the Dataset tab, or via the Apify API.
Feedback
Want multi-language caption pulls or project-level crawling? Open an issue on the actor.