Podcast Show Notes Generator — AI Transcription & Chapters
Pricing
Pay per usage
Podcast Show Notes Generator — AI Transcription & Chapters
Transcribe any podcast episode and auto-generate show notes, timestamped chapters, and guest quotes. Accepts MP3, RSS feeds, M4A, Spotify embed URLs. Speaker diarization. 100+ languages. No Wisprs account needed.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Gitonga Mwaura
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
12 hours ago
Last modified
Categories
Share
Submit any podcast episode URL — mp3, m4a, RSS feed link — and get back a full transcript, structured show notes, timestamped chapters, speaker-attributed guest quotes, and SRT/VTT subtitle files saved to your Apify Dataset.
The Podcast Show Notes Generator uses the Wisprs API to transcribe audio directly from episode URLs using Whisper-based speech-to-text. Unlike tools that rely on pre-existing transcripts, Wisprs transcribes the actual audio — which means it works for every episode, whether or not the host has published a transcript. Accuracy is excellent on clear audio; results vary by language, accent, and recording quality.
What does this Actor do?
The dirty secret of podcasting is that the episode itself is the easy part. A typical post-production workflow takes 3–5 hours of mechanical work after recording. This Actor collapses that entire workflow into two API calls.
- Submit episode URLs from your input
- Queue each for async transcription (handles episodes of any length — no timeout)
- Generate structured show notes including:
- Summary — 2–4 sentences capturing the episode's core argument
- Chapters — timestamped chapter markers with titles and descriptions
- Quotes — top verbatim guest quotes with speaker attribution and timestamps
- Show notes markdown — ready to paste directly into your CMS or podcast host
- Export the transcript in your chosen formats: TXT, SRT, VTT, JSON, Markdown
- Save everything to your Apify Dataset — one row per episode
How do I use this Actor to generate podcast show notes?
Step 1 — Run the Actor
{"startUrls": [{ "url": "https://feeds.example.com/episodes/episode-42.mp3" }],"language": "auto","diarize": true,"exportFormats": ["txt", "srt"]}
Step 2 — Check your Dataset
Each episode produces one dataset row containing structured show notes, the full transcript, SRT subtitles, and speaker-labeled quotes.
What data does the Actor extract?
| Field | Description |
|---|---|
url | The submitted episode URL |
jobId | Wisprs job identifier |
transcriptionId | Transcription identifier |
status | completed or failed |
durationSeconds | Episode audio duration |
detectedLanguage | Detected language ISO code |
transcript_txt | Full plain-text transcript |
transcript_srt | SRT subtitle file |
transcript_vtt | WebVTT subtitle file |
repurposed_show_notes.summary | Episode summary paragraph |
repurposed_show_notes.chapters | Array of { title, startSeconds, description } |
repurposed_show_notes.quotes | Array of { speaker, text, startSeconds } |
repurposed_show_notes.showNotes | Formatted Markdown show notes ready to publish |
Show notes output example
{"summary": "Sarah Chen joins us to explain why her team consolidated 47 microservices back into a modular monolith — and what the deployment numbers looked like on the other side.","chapters": [{ "title": "Introduction", "startSeconds": 0, "description": "Host introduces Sarah Chen, VP Engineering at Acme Corp." },{ "title": "The Microservices Trap", "startSeconds": 312, "description": "How 47 services became impossible to debug." },{ "title": "The Great Consolidation", "startSeconds": 894, "description": "Moving back to a monolith in 6 weeks." }],"quotes": [{"speaker": "Sarah Chen","text": "We had 47 services and nobody could tell you what half of them did. That's not a microservices problem — that's an organizational problem we tried to solve with infrastructure.","startSeconds": 318}],"showNotes": "**Episode 42: Why We Killed Our Microservices**\n\nSarah Chen, VP of Engineering at Acme Corp, shares the real story...\n\n**Chapters:**\n- [0:00] Introduction\n- [5:12] The Microservices Trap\n- [14:54] The Great Consolidation"}
How much will it cost to process a 45-minute podcast episode?
Pricing is pay-per-event:
- $0.005 per episode submitted
- $0.015 per audio minute (45-min episode = $0.675)
- $0.075 per show notes result generated
Example: 10 × 45-minute episodes with show notes
- Submit: 10 × $0.005 = $0.05
- Audio: 10 × 45 × $0.015 = $6.75
- Show notes: 10 × $0.075 = $0.75
- Total: ~$7.55
The Apify free plan includes $5/month in credits — enough to test 5–6 episodes.
What can I automate with this?
RSS-to-CMS pipeline — monitor your podcast RSS feed, submit new episode URLs as they publish, and write the generated show notes directly back to your CMS via webhook. Every episode gets show notes on publish with zero manual work.
Guest quote library — the quotes array includes speaker attribution and timestamps for every notable thing every guest has ever said. Pipe this into a database and you have a searchable quote library across your entire back-catalog — perfect for social media clip selection.
Transcript SEO pages — publish the transcript_txt or transcript_md export as a dedicated page for each episode. Long-tail podcast transcripts drive search traffic without any additional writing effort.
Chapter markers for Podcast Namespace — the chapters array maps directly to the Podcasting 2.0 chapter format used by Fountain, Overcast, and Pocket Casts. Inject chapters automatically on every episode.
Multi-show network dashboard — submit episodes from all shows in your network concurrently. The async job model handles parallel processing — one run covers an entire network.
Supported URL formats
- Direct audio: mp3, wav, m4a, ogg, flac
- RSS feed episode URLs (standard podcast hosting platforms)
- Buzzsprout, Transistor, Simplecast, Anchor, Captivate, RSS.com, Podbean
- YouTube podcast episodes
- Spotify episode audio links (public)
Language support
100+ languages with automatic detection. Speaker diarization (host vs guest labeling) works best on clear two-speaker recordings. For episodes with more participants, speakers are labeled "Speaker 1", "Speaker 2", etc.
Related Actors
- Wisprs — Audio & Video Transcription — universal transcription for any URL
- Wisprs — YouTube Content Repurposer — YouTube → thread, blog, chapters
- Wisprs — Social Media Transcriber — TikTok, Reels, Shorts
FAQ
Does this work for podcasts without published transcripts? Yes. Wisprs transcribes the audio directly using Whisper — it does not rely on pre-existing transcript files or RSS transcript tags.
How long does a 60-minute episode take to process? Typically 3–8 minutes, depending on server load. The Actor polls automatically and saves results when complete.
Can I process an entire podcast back-catalog?
Yes. Add all episode URLs to startUrls. The Actor processes them sequentially, saving each result to the Dataset as it completes.
Does speaker diarization work in languages other than English? Speaker detection works across all supported languages. Accuracy on non-English content is excellent on clear recordings.
Support
- Documentation: wisprs.co/docs
- Email: tosh@belvadigital.com
Two API calls. One polished episode. Ship faster.