Wistia Transcript Scraper — Captions to TXT, SRT & VTT avatar

Wistia Transcript Scraper — Captions to TXT, SRT & VTT

Pricing

from $1.00 / 1,000 per record returneds

Go to Apify Store
Wistia Transcript Scraper — Captions to TXT, SRT & VTT

Wistia Transcript Scraper — Captions to TXT, SRT & VTT

Extract any public Wistia video's transcript and captions — no login, no ASR. By hashedId or any page that embeds Wistia: full text, timestamped segments & SRT/VTT, plus title and duration, straight from Wistia's CDN. $2 per 1,000 videos.

Pricing

from $1.00 / 1,000 per record returneds

Rating

0.0

(0)

Developer

Scrapers Delight

Scrapers Delight

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

🎥 Wistia Video Transcript & Caption Scraper

Get any public Wistia video's transcript — no login, no AI transcription. Wistia hosts every video on one CDN keyed by its hashedId, and this actor reads the video's own captions: full transcript text, timestamped segments, and SRT/VTT, plus title and duration. Pass hashedIds directly or point it at a page that embeds Wistia videos.

Because it reads Wistia's existing captions, there's no speech-to-text compute — it's fast and cheap.


What does it do?

For each Wistia video (by hashedId or harvested from an embedding page) it returns:

  • 📝 Full transcript (plain text) — always included
  • ⏲️ Timestamped segments{start, end, text}
  • 🎬 SRT / VTT subtitles
  • 🏷️ Title, duration, media type, caption language

No ASR, no API key — it reads the captions the uploader added.


What data does it extract?

For every video: hashed_id, title, duration_sec, media_type, caption_language, transcript, segments[], srt, vtt, segment_count, is_new (monitor), scraped_at.


Who is it for?

  • 📣 Marketing & demand-gen teams repurposing Wistia product/webinar videos into text.
  • 🤖 AI / RAG builders turning hosted video into searchable content.
  • Accessibility / SEO workflows needing captions and transcripts.

How to use it (step by step)

  1. Click Try for free.
  2. Paste Wistia hashedIds (the token after /medias/) — or an embedding page URL.
  3. (Optional) add srt/vtt/segments formats.
  4. Click Start, open the Dataset tab to view/export.
  5. (Optional) set monitorMode + a pageUrl + a Schedule for new videos.

Quick start

{ "hashedIds": ["z1fxq584qr"], "transcriptFormats": ["txt", "srt"] }

Input

FieldWhat it does
hashedIdsWistia hashedIds or embed URLs
pageUrla page embedding Wistia videos to harvest IDs from
transcriptFormatstxt · segments · srt · vtt
maxVideoshard cap per run (0 = unlimited)
monitorMode, alertOnNewVideorecurring watcher + alerts
webhookUrl, slackWebhookUrl, emailRecipientsalert channels
proxyConfiguration, requestConcurrencyproxy + parallelism

Output

Each video is one dataset record (fields above). Export to JSON, CSV, Excel, HTML, or RSS, or fetch via the Apify API. Videos without captions are returned with a note (not faked).


How much does it cost?

Pay-per-event — and with no transcription compute, it's cheap:

EventWhat it coversSuggested price
lot-scrapedeach video returned~$0.004 / video
lot-detail-enrichedeach transcript fetched~$0.004 / video
monitor-run-completedeach scheduled watch run~$0.05 / run
new-lot-detectedeach new video~$0.02 / video
alert-deliveredeach Slack/email/webhook push~$0.005 / alert

(Final per-event prices are set on the actor's pricing page.)


How does it work without AI transcription?

Wistia stores each video's captions on its public CDN (fast.wistia.net), reachable by the video's hashedId. This actor reads those captions (inline transcript + a WEBVTT track) — it does not run speech-to-text, so there's no GPU/compute cost. Captions exist only if the uploader added or auto-generated them; videos without captions are reported as such.


This actor reads publicly embeddable Wistia videos' own caption files. The content belongs to the publisher (copyrighted). Scraping public data is generally legal, but you are responsible for your use — review Wistia's and the publisher's terms; don't redistribute transcripts you're not licensed to.


FAQ

Where do I find the hashedId? It's the token after /medias/ in a Wistia embed/URL (e.g. fast.wistia.net/embed/medias/z1fxq584qrz1fxq584qr), or media-id="…" in the embed code. You can also pass a page URL and let the actor harvest them.

Is there a Whisper/ASR step? No — it reads Wistia's captions, so it's fast and cheap.

Can I get subtitles? Yes — add srt and/or vtt to transcriptFormats.

How do I export? JSON, CSV, Excel, HTML, or RSS from the Dataset tab, or via the Apify API.


Feedback

Want multi-language caption pulls or project-level crawling? Open an issue on the actor.