🎙️ Podcast Scraper - Episodes & Show Data

Pricing

from $5.00 / 1,000 results

Extract podcast episodes: titles, descriptions, durations, publish dates & audio URLs. Monitor trends, build content databases & analyze the podcast ecosystem. Pay per episode.

Rating: 0.0 (0)

Developer: Stephan Corbeil (maintained by Community)

Actor stats: 0 bookmarked, 8 total users, 2 monthly active users, last modified 5 hours ago

🎙️ Podcast Episodes Scraper - Show Metadata, Episode Feeds & Audio URLs

Bulk-extract podcast show and episode metadata from public RSS feeds and the major podcast indices: show title, description, host, and network; the episode list with audio_url, duration, published_at, episode descriptions, and season and episode numbers; transcripts where available; and chart rank where applicable. It is a pay-per-result alternative to the ListenNotes API ($16-200/mo), Podchaser Pro, Chartable, and Podtrac Enterprise, built for podcast networks doing competitive intel, ad buyers sizing show audiences, ML/NLP teams building speech corpora, and journalists mining episode catalogs.

Why Podcast Episodes Scraper Beats ListenNotes, Podchaser, Chartable & Podtrac

Feature | NexGenData Podcast Scraper | ListenNotes API | Podchaser Pro | Chartable | Podtrac Enterprise
Cost | $1 per 1K episodes, pay-per-event | $16-200 / month | $$$ / seat | $$$$ enterprise | $$$$ enterprise
Per-episode audio URLs | Yes, every enclosure URL | Yes (paid tier) | Yes | No | No
Show + episode metadata | Yes, full RSS + index merge | Yes | Yes | Limited | Limited
Chart rank where available | Yes, Apple Podcasts charts | Yes (paid tier) | Yes | Yes | No
Episode transcripts | Yes, where publicly available | Plan-gated | Plan-gated | No | No
Bulk export | JSON / CSV / Excel | Plan-gated | CSV (plan-gated) | CSV | Custom
Auth | Apify token | API key | Seat + license | Account + plan | Enterprise contract
Monthly minimum | None | $16+ | $$$ | $$$$ | Annual contract

Podcast intelligence teams typically pick this actor over the ListenNotes API for three reasons: the flat JSON output drops into downstream pipelines without reshaping, it is cheaper than Podchaser Pro for ad-prospecting workflows, and the audio URLs can be passed directly to Whisper or Deepgram for transcription without buying seats in between.

What You Get Per Episode

Each dataset item is a flat record:

  • show_title, show_description, show_author, network
  • show_categories[]: Apple Podcasts category taxonomy
  • show_rss_url, show_apple_id, show_spotify_id
  • episode_title, episode_description, episode_summary
  • audio_url: direct enclosure URL (mp3/m4a)
  • audio_size_bytes, audio_mime_type
  • duration_seconds
  • published_at: ISO 8601 timestamp
  • season, episode_number, episode_type (full, trailer, or bonus)
  • image_url: episode-level artwork if present
  • transcript_url, transcript_text: when publicly available
  • chart_rank, chart_country: if currently charting
  • language, explicit
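
As a rough illustration of the record shape listed above, a subset of the fields could be typed in Python as follows. The field names come from the list; the types, the choice of which fields may be missing, the helper name is_full_episode_with_audio, and the sample values are all assumptions made for this sketch.

```python
from typing import List, Optional, TypedDict


class EpisodeRecord(TypedDict, total=False):
    # Field names follow the list above; the types are assumptions.
    show_title: str
    show_categories: List[str]
    show_rss_url: str
    episode_title: str
    audio_url: str
    audio_mime_type: str
    duration_seconds: Optional[int]
    published_at: str               # ISO 8601 timestamp
    episode_type: str               # "full", "trailer", or "bonus"
    transcript_url: Optional[str]
    chart_rank: Optional[int]


def is_full_episode_with_audio(record: EpisodeRecord) -> bool:
    """Keep only full episodes that carry a direct enclosure URL."""
    return record.get("episode_type") == "full" and bool(record.get("audio_url"))


# Hypothetical sample record; the values are invented for illustration.
sample: EpisodeRecord = {
    "show_title": "Example Show",
    "episode_title": "Pilot",
    "audio_url": "https://example.com/audio/pilot.mp3",
    "duration_seconds": 1800,
    "published_at": "2024-01-15T08:00:00+00:00",
    "episode_type": "full",
}
print(is_full_episode_with_audio(sample))
```

A filter like this is a convenient first step before handing audio_url values to a transcription job, since trailers and records without an enclosure are skipped.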

Use Cases

  • Podcast networks: competitive intel across a peer set; track release cadence, episode length, and category positioning
  • Ad agencies: build a target list of shows by IAB category, language, network, and audience-size signal
  • ML / speech teams: assemble a domain-specific training corpus by pulling audio_url for every episode in a niche
  • Journalists / researchers: search episode descriptions across a topic catalog for sourcing and primary research
  • Producers: benchmark your own show against competitors on release frequency and average episode length
  • PR teams: find every podcast a brand has appeared on by searching show and episode description text
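
The benchmarking use cases above (release cadence and average episode length) could be sketched as a small aggregation over dataset items. This is a minimal sketch, not part of the actor: summarize_shows and the sample records are hypothetical, and it assumes every item carries show_title and an ISO 8601 published_at.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def parse_timestamp(value):
    # Older Python versions reject a trailing "Z", so normalise it first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def summarize_shows(items):
    """Return per-show episode count, average length, and average release gap."""
    by_show = defaultdict(list)
    for item in items:
        by_show[item["show_title"]].append(item)

    summary = {}
    for show, episodes in by_show.items():
        durations = [e["duration_seconds"] for e in episodes if e.get("duration_seconds")]
        dates = sorted(parse_timestamp(e["published_at"]) for e in episodes)
        # Whole days between consecutive releases.
        gaps = [(later - earlier).days for earlier, later in zip(dates, dates[1:])]
        summary[show] = {
            "episodes": len(episodes),
            "avg_minutes": round(mean(durations) / 60, 1) if durations else None,
            "avg_days_between": round(mean(gaps), 1) if gaps else None,
        }
    return summary


# Invented sample records for illustration only.
items = [
    {"show_title": "Show A", "published_at": "2024-01-01T00:00:00+00:00", "duration_seconds": 1800},
    {"show_title": "Show A", "published_at": "2024-01-08T00:00:00+00:00", "duration_seconds": 2400},
    {"show_title": "Show A", "published_at": "2024-01-15T00:00:00Z", "duration_seconds": 3000},
]
print(summarize_shows(items))
```

In practice the items list would come from iterating the run's dataset rather than from hand-written records.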

Quick Start

from apify_client import ApifyClient

# Authenticate with your Apify API token.
client = ApifyClient("YOUR_APIFY_TOKEN")

# Start the actor and wait for the run to finish.
run = client.actor("nexgendata/podcast-episodes-scraper").call(run_input={
    "shows": [
        {"apple_id": "1438054347"},
        {"rss_url": "https://feeds.megaphone.fm/the-tim-ferriss-show"},
    ],
    "maxEpisodesPerShow": 50,
})

# Each dataset item is one flat episode record.
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item["show_title"], item["episode_title"], item["duration_seconds"])
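
When the show list is assembled programmatically, it can help to validate it before starting a run. The sketch below builds an input dict in the same shape as the Quick Start; build_run_input is a hypothetical helper, and the rule that each show carries exactly one of rss_url, apple_id, or spotify_id is an assumption based on the FAQ, not a documented constraint of the actor.

```python
ALLOWED_KEYS = ("rss_url", "apple_id", "spotify_id")


def build_run_input(shows, max_episodes_per_show=50):
    """Validate show identifiers and return an input dict shaped like the Quick Start."""
    cleaned = []
    for show in shows:
        present = [key for key in ALLOWED_KEYS if show.get(key)]
        # Assumption: one identifier per show; reject empty or ambiguous entries.
        if len(present) != 1:
            raise ValueError(f"expected exactly one of {ALLOWED_KEYS}, got {sorted(show)}")
        key = present[0]
        cleaned.append({key: str(show[key])})
    return {"shows": cleaned, "maxEpisodesPerShow": int(max_episodes_per_show)}


run_input = build_run_input(
    [
        {"apple_id": 1438054347},
        {"rss_url": "https://feeds.megaphone.fm/the-tim-ferriss-show"},
    ],
    max_episodes_per_show=50,
)
print(run_input)
```

The resulting dict can be passed as run_input to the call shown in the Quick Start.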

Pricing

Pay-per-event:

  • Actor Start: small fixed charge per run (memory-scaled)
  • Per episode: $1 per 1,000 episodes returned

No subscription, no minimum, no per-seat fee.
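
For budgeting, the pay-per-event model above reduces to simple arithmetic. The sketch below is an estimate only: estimate_cost_usd is a hypothetical helper, the per-1,000 rate defaults to the $1 figure quoted in this section, and the Actor Start fee is not stated here, so it is left as a parameter that defaults to zero.

```python
def estimate_cost_usd(episodes, runs=1, start_fee_usd=0.0, price_per_1000_usd=1.0):
    """Estimate spend as (runs x start fee) + (episodes / 1,000 x rate)."""
    per_episode_total = episodes / 1000 * price_per_1000_usd
    return round(runs * start_fee_usd + per_episode_total, 4)


# 200 shows x 250 episodes each, pulled in a single run.
print(estimate_cost_usd(200 * 250))
```

Pass the rate shown on the actor's pricing tab as price_per_1000_usd if it differs from the default used here.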

Use case | Actor
YouTube channel + video metadata MCP | youtube-media-mcp-server
YouTube channel monetization detector | youtube-channel-monetization-detector
AI sentiment + theme analyzer | ai-sentiment-analyzer
Cross-source review intelligence MCP | review-intelligence-mcp-server
NPR news & podcast scraper | npr-scraper
Hacker News scraper | hacker-news-scraper
IMDb film & TV metadata | imdb-scraper
Reddit subreddit trend tracker | reddit-subreddit-trends

FAQ

Do you need a ListenNotes API key? No. This actor parses public RSS feeds + the Apple Podcasts public index. No third-party key required.

Can I pass an RSS URL directly? Yes. Each show can be identified by rss_url, apple_id, or spotify_id.

Are transcripts always returned? Only when the publisher includes a <podcast:transcript> tag in their RSS feed (per the Podcasting 2.0 spec) or links to a public transcript URL.
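
To check whether a given feed exposes transcripts at all, the tag can be read with the standard library. This is a minimal sketch of what the tag looks like in a feed, not the actor's own parser: transcript_urls and the sample feed are hypothetical, and the namespace URI is the one used by the Podcasting 2.0 podcast namespace.

```python
import xml.etree.ElementTree as ET

# Namespace URI of the Podcasting 2.0 "podcast" prefix.
PODCAST_NS = "https://podcastindex.org/namespace/1.0"

# Invented two-episode feed for illustration.
SAMPLE_FEED = """<rss version="2.0" xmlns:podcast="https://podcastindex.org/namespace/1.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Episode with transcript</title>
      <enclosure url="https://example.com/ep1.mp3" type="audio/mpeg" length="1234"/>
      <podcast:transcript url="https://example.com/ep1.vtt" type="text/vtt"/>
    </item>
    <item>
      <title>Episode without transcript</title>
      <enclosure url="https://example.com/ep2.mp3" type="audio/mpeg" length="5678"/>
    </item>
  </channel>
</rss>"""


def transcript_urls(feed_xml):
    """Map each episode title to its transcript URL, or None if the tag is absent."""
    root = ET.fromstring(feed_xml)
    result = {}
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        tag = item.find(f"{{{PODCAST_NS}}}transcript")
        result[title] = tag.get("url") if tag is not None else None
    return result


print(transcript_urls(SAMPLE_FEED))
```

Episodes that map to None here are the ones for which transcript_url and transcript_text would be expected to come back empty.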

Output formats? JSON, CSV, Excel, and the Apify dataset API.

Is this legal? Yes. RSS feeds are the public, intentional distribution channel for podcasts.

About NexGenData

NexGenData publishes 260+ buyer-intent actors covering SEC filings, YC alumni, lead generation, competitive intelligence, stock fundamentals across 30+ exchanges, and more. All pay-per-result. Browse the full catalog at https://apify.com/nexgendata?fpr=2ayu9b


How NexGenData Pricing Works

Every NexGenData actor uses pay-per-event pricing: you only pay for results that actually land in your dataset. No monthly minimum, no seat fees, no surprise overage bills.

  • Actor Start: a single-event charge each time you spin the actor up (scaled to memory size)
  • Result / item: charged per item written to the default dataset
  • No charge for retries, internal proxy rotation, or failed sub-requests; those are absorbed by the platform

Apify Platform Bonus

New to Apify? Sign up with the NexGenData referral link to get free platform credits on signup (enough for several thousand free results) and help fund the maintenance of this actor fleet.

Integration Surface

Every actor in the NexGenData catalog can be triggered from:

  • Apify console: point-and-click run
  • Apify API: REST + webhooks
  • Apify Python / JS SDKs: programmatic batch
  • Zapier, Make.com, n8n: official integrations
  • MCP: many actors are exposed as MCP tools for Claude / ChatGPT / Cursor agents
  • Schedules: built-in cron for daily / weekly / monthly runs
  • Webhooks: POST results to any HTTPS endpoint on dataset write
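
For the REST route in the list above, a run can be started without the SDK. The sketch below only builds the request so it can be inspected before sending; build_run_request is a hypothetical helper, and it assumes the Apify API v2 run endpoint, the "username~actor-name" form of the actor ID in the URL path, and bearer-token authentication.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id, token, run_input):
    """Build a POST request that starts an actor run with the given input."""
    # The REST API addresses actors as "username~actor-name" in the URL path.
    path_id = actor_id.replace("/", "~")
    return urllib.request.Request(
        f"{API_BASE}/acts/{path_id}/runs",
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


request = build_run_request(
    "nexgendata/podcast-episodes-scraper",
    "YOUR_APIFY_TOKEN",
    {"shows": [{"apple_id": "1438054347"}], "maxEpisodesPerShow": 10},
)
print(request.get_method(), request.full_url)

# To actually start the run (requires a valid token and network access):
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```

Keeping request construction separate from sending makes the call easy to log or unit-test before any credits are spent.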

Support

NexGenData maintains 260+ Apify actors and ships updates regularly. Bug reports via the Apify console issues tab get a response within 24 hours. Roadmap requests are welcome; high-demand features ship in the next version.

Home: thenextgennexus.com
Full catalog: apify.com/nexgendata