Wisprs — AI Transcription & Subtitle Generator

Pricing

Pay per usage

Transcribe any YouTube video, podcast, TikTok, or audio/video URL to text. Export as SRT, VTT, TXT, JSON, Markdown, or DOCX. Generate AI summaries, chapters, show notes, and Twitter threads. 100+ languages. No Wisprs account needed.

Developer

Gitonga Mwaura

Maintained by Community

Wisprs — AI Transcription & Subtitle Generator (YouTube, Podcast, Audio)

Submit any YouTube video, podcast episode, or audio URL and get back a full text transcript, SRT/VTT subtitles, speaker-labeled segments, and structured content like show notes, chapters, or Twitter/X threads — saved to your Apify Dataset automatically.

The Wisprs Transcription Actor is powered by the Wisprs API, a production-grade async voice processing engine that transcribes audio from any publicly accessible URL. Caption-scraping actors fail on an estimated 30–40% of videos (music videos, Shorts, unlisted uploads) because those videos have no captions to scrape. Wisprs instead fetches the audio with yt-dlp and runs Whisper-based transcription on it, so it does not depend on captions being present. Accuracy is excellent on clear audio; results vary by language, accent, and recording quality.


What does this Actor do?

  1. Takes a list of YouTube, podcast, or audio/video URLs from your startUrls input
  2. Submits each URL to the Wisprs transcription API (async job queue — no timeouts)
  3. Polls until each job completes (typically 1–3 minutes per 15-minute video)
  4. Exports the transcript in your chosen formats: TXT, SRT, VTT, JSON, or Markdown
  5. Optionally generates structured content from the transcript via the Wisprs repurpose engine: a summary, podcast show notes, timestamped chapters, guest quotes, or a Twitter/X thread
  6. Saves one dataset row per URL — ready for downstream automation
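
The submit-then-poll pattern in steps 2 and 3 can be sketched as follows. This is an illustration only, not the Actor's actual code: `get_status` is a hypothetical callable standing in for whatever status request the Wisprs API exposes, the "processing" state and the shape of the timeout row are assumptions, and only the `completed`/`failed` statuses and the two polling parameters come from this page.

```python
import time
from typing import Callable, Dict


def wait_for_job(
    get_status: Callable[[], Dict],
    max_poll_seconds: int = 900,
    poll_interval_seconds: int = 10,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict:
    """Poll a job until it is completed or failed, or the time budget runs out.

    get_status is a hypothetical function returning a dict with a "status" key.
    """
    deadline = clock() + max_poll_seconds
    while True:
        job = get_status()
        # Terminal states documented for dataset rows.
        if job.get("status") in ("completed", "failed"):
            return job
        # Stop if the next wait would exceed the budget (assumed behaviour).
        if clock() + poll_interval_seconds > deadline:
            return {**job, "status": "failed", "errorMessage": "timed out while polling"}
        sleep(poll_interval_seconds)
```

Injecting `sleep` and `clock` keeps the loop testable without real waiting; in normal use the defaults apply.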

How do I use this Actor to transcribe a YouTube video?

Step 1 — Run the Actor

Set the following input:

{
  "startUrls": [{ "url": "https://www.youtube.com/watch?v=EXAMPLE" }],
  "language": "auto",
  "exportFormats": ["txt", "srt"],
  "repurposeMode": "none"
}
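
The same input can also be built and submitted from a script. The sketch below assumes the official `apify-client` Python package is installed and that you have an Apify API token; `ACTOR_ID` is a placeholder, since this page does not state the Actor's ID, and `build_run_input` and `run_actor` are hypothetical helper names.

```python
from typing import Dict, Iterable, List

# Placeholder: replace with the Actor ID shown on the Store page.
ACTOR_ID = "<username>/<actor-name>"


def build_run_input(
    urls: Iterable[str],
    language: str = "auto",
    export_formats: Iterable[str] = ("txt", "srt"),
    repurpose_mode: str = "none",
) -> Dict:
    """Build the Actor input using the field names from the Input table."""
    return {
        "startUrls": [{"url": u} for u in urls],
        "language": language,
        "exportFormats": list(export_formats),
        "repurposeMode": repurpose_mode,
    }


def run_actor(token: str, run_input: Dict) -> List[Dict]:
    """Start a run, wait for it to finish, and return the dataset rows."""
    # Imported lazily so build_run_input works without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return any run details")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

For example, `run_actor(token, build_run_input(["https://www.youtube.com/watch?v=EXAMPLE"]))` would submit the input shown above.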

Step 2 — Check your Dataset

When the run completes, your Dataset contains one row per URL with the full transcript, SRT subtitles, and any repurposed content you requested.


What data does the Actor extract?

Each dataset row includes:

| Field | Description |
| --- | --- |
| url | The submitted URL |
| jobId | Wisprs job identifier for tracking |
| transcriptionId | Transcription identifier for the Wisprs API |
| status | completed or failed |
| durationSeconds | Audio/video duration in seconds |
| detectedLanguage | ISO 639-1 code of the detected language |
| transcript_txt | Full plain-text transcript |
| transcript_srt | SRT subtitle file content |
| transcript_vtt | WebVTT subtitle file content |
| transcript_json | Word-level timestamps in JSON |
| transcript_md | Transcript in Markdown format |
| repurposed_show_notes | Structured show notes (summary, chapters, quotes) |
| repurposed_thread | Twitter/X thread array |
| repurposed_summary | 2–4 sentence summary |
| repurposed_chapters | Timestamped chapter markers |
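
Because each row carries the transcript content inline, turning rows into subtitle files is a small post-processing step. The sketch below is one possible approach, not part of the Actor: `save_transcripts` is a hypothetical helper, it names files after `jobId`, and it assumes `transcript_json` may arrive either as a string or as an already-parsed object.

```python
import json
from pathlib import Path
from typing import Dict, Iterable, List

# Maps a file extension to the dataset field that holds its content.
FORMAT_FIELDS = {
    "txt": "transcript_txt",
    "srt": "transcript_srt",
    "vtt": "transcript_vtt",
    "json": "transcript_json",
    "md": "transcript_md",
}


def save_transcripts(rows: Iterable[Dict], out_dir: str) -> List[Path]:
    """Write every available transcript format of each completed row to disk."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written: List[Path] = []
    for index, row in enumerate(rows):
        if row.get("status") != "completed":
            continue  # failed rows have no transcript to save
        stem = str(row.get("jobId") or f"row{index}")
        for ext, field in FORMAT_FIELDS.items():
            content = row.get(field)
            if not content:
                continue  # format was not requested for this run
            if not isinstance(content, str):
                content = json.dumps(content, ensure_ascii=False, indent=2)
            path = out / f"{stem}.{ext}"
            path.write_text(content, encoding="utf-8")
            written.append(path)
    return written
```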

How much will it cost to transcribe a 30-minute podcast?

Pricing is per-event:

  • $0.005 per transcription submitted (one-time per URL)
  • $0.015 per audio minute processed (e.g. a 30-minute episode = $0.45)
  • $0.075 per repurpose result generated (show notes, thread, etc.)

Example: 10 podcast episodes averaging 45 minutes each

  • Submission: 10 × $0.005 = $0.05
  • Audio minutes: 10 × 45 × $0.015 = $6.75
  • Show notes (optional): 10 × $0.075 = $0.75
  • Total: ~$7.55 for 10 complete episodes with show notes
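
The arithmetic above can be reproduced with a short estimator. The three prices are taken from this page; the function name is hypothetical, and billing fractional minutes proportionally is an assumption, since the page does not say how partial minutes are rounded.

```python
from decimal import Decimal
from typing import Sequence

# Per-event prices as listed above (USD).
SUBMISSION_FEE = Decimal("0.005")
PER_AUDIO_MINUTE = Decimal("0.015")
PER_REPURPOSE_RESULT = Decimal("0.075")


def estimate_cost(durations_minutes: Sequence[float], repurpose_results_per_url: int = 0) -> Decimal:
    """Estimate the run cost for a batch of URLs with the given durations."""
    url_count = len(durations_minutes)
    # Decimal(str(m)) avoids binary floating-point noise in the total.
    total_minutes = sum((Decimal(str(m)) for m in durations_minutes), Decimal("0"))
    return (
        url_count * SUBMISSION_FEE
        + total_minutes * PER_AUDIO_MINUTE
        + url_count * repurpose_results_per_url * PER_REPURPOSE_RESULT
    )
```

`estimate_cost([45] * 10, repurpose_results_per_url=1)` gives 7.55, matching the example, and a single 30-minute episode without repurposing, `estimate_cost([30])`, comes to 0.455.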

The Apify free plan includes $5 of monthly credits, enough for roughly 5 hours of audio (about 330 minutes at $0.015 per minute) at no cost, before any repurpose results.


Input

| Field | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| startUrls | Array | Yes | | YouTube, podcast, or audio/video URLs |
| language | String | No | auto | ISO 639-1 code or auto |
| diarize | Boolean | No | false | Label speakers (host vs guest) |
| exportFormats | Array | No | ["txt","srt"] | txt, srt, vtt, json, md |
| repurposeMode | String | No | none | none, summary, show_notes, thread, chapters, quotes |
| webhookUrl | String | No | | Callback URL per completed job |
| maxPollSeconds | Integer | No | 900 | Max wait per job (60–3600s) |
| pollIntervalSeconds | Integer | No | 10 | Status check interval (5–60s) |
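
If you generate inputs programmatically, a quick pre-flight check against the table above can catch mistakes before a run is billed. This is a hypothetical client-side helper; the allowed values, defaults, and ranges are copied from the table, and the Actor's own validation may differ.

```python
from typing import Dict, List

ALLOWED_FORMATS = {"txt", "srt", "vtt", "json", "md"}
ALLOWED_MODES = {"none", "summary", "show_notes", "thread", "chapters", "quotes"}


def validate_input(run_input: Dict) -> List[str]:
    """Return a list of problems found in the input; empty means it looks valid."""
    problems: List[str] = []
    if not run_input.get("startUrls"):
        problems.append("startUrls is required and must not be empty")
    bad_formats = set(run_input.get("exportFormats", ["txt", "srt"])) - ALLOWED_FORMATS
    if bad_formats:
        problems.append(f"unsupported exportFormats: {sorted(bad_formats)}")
    mode = run_input.get("repurposeMode", "none")
    if mode not in ALLOWED_MODES:
        problems.append(f"unsupported repurposeMode: {mode!r}")
    max_poll = run_input.get("maxPollSeconds", 900)
    if not isinstance(max_poll, int) or not 60 <= max_poll <= 3600:
        problems.append("maxPollSeconds must be an integer from 60 to 3600")
    interval = run_input.get("pollIntervalSeconds", 10)
    if not isinstance(interval, int) or not 5 <= interval <= 60:
        problems.append("pollIntervalSeconds must be an integer from 5 to 60")
    return problems
```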

Supported URLs

  • YouTube videos, Shorts, and playlist-linked videos
  • Direct audio: mp3, wav, m4a, ogg, flac
  • Direct video: mp4, webm, mov
  • Podcast RSS episode links
  • TikTok, Loom, Vimeo, and most public video/audio hosts

Language support

100+ languages with automatic detection. The detected language appears in each dataset row as detectedLanguage. Pass a specific ISO 639-1 code ("en", "es", "fr") to skip auto-detection and speed up processing slightly for known-language content.




FAQ

Do I need an OpenAI API key or a Wisprs account? No. No external API key or account required. The Actor handles authentication internally — you pay only via Apify credits.

Does it work for videos without captions? Yes. Unlike caption-scraping actors, Wisprs transcribes the audio directly using Whisper. It covers captionless videos, music videos, link-shared Loom recordings, and any other publicly accessible URL with an audio track.

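Is there a video length limit? No hard limit. The async job queue handles videos of any length. A 90-minute webinar typically completes in 5–10 minutes. For long recordings, raise maxPollSeconds (default 900, maximum 3600) so the Actor keeps polling until the job finishes.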

What if a job fails? The dataset row will have status: "failed" and an errorMessage explaining why (unsupported format, private video, no audio, etc.).
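
A downstream script can separate failed rows from completed ones using the `status` and `errorMessage` fields described above. The helper below is a hypothetical sketch; the "unknown error" fallback is an assumption for rows that lack a message.

```python
from typing import Dict, Iterable, List, Tuple


def failed_jobs(rows: Iterable[Dict]) -> List[Tuple[str, str]]:
    """Return (url, errorMessage) pairs for every row whose status is "failed"."""
    return [
        (row.get("url", ""), row.get("errorMessage", "unknown error"))
        for row in rows
        if row.get("status") == "failed"
    ]
```

The resulting URLs can be fed back into `startUrls` for a retry once the cause (for example, a private video) has been resolved.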


Submit a URL. Get subtitles, transcripts, and show notes. Done.