Podcast Transcript Search avatar

Podcast Transcript Search

Pricing

Pay per usage

Go to Apify Store
Podcast Transcript Search

Podcast Transcript Search

Podcast Transcript Search. Search and discover data across multiple sources with structured output. Fast, reliable, and cost-effective.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Donny

Donny

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Categories

Share

What does Podcast Transcript Search do?

Podcast Transcript Search is an Apify actor that crawls podcast transcript databases and directories to find episodes containing specific keywords and topics. It returns structured results with the podcast name, episode title, matched text excerpts, timestamps, speaker names, and direct links. This actor is perfect for researchers, journalists, content creators, and marketers who need to discover relevant podcast discussions without manually listening to hours of audio content. The actor leverages CheerioCrawler for efficient HTML parsing across multiple transcript sources.

The podcast ecosystem has exploded with millions of episodes across every conceivable topic. Finding the specific episodes that discuss your area of interest is nearly impossible through manual browsing. This actor automates the search process by crawling publicly available transcript databases and extracting relevant mentions. Instead of spending hours searching through individual podcast platforms, you get a consolidated dataset of every mention of your keywords along with contextual information to evaluate relevance. The structured output makes it easy to filter, sort, and analyze results in spreadsheets or data pipelines.

  1. Open the actor on the Apify platform.
  2. Enter your search keywords in the searchQuery field. Use specific phrases for targeted results.
  3. Set maxResults to control the volume of data returned.
  4. Click Start to begin the search.
  5. Review results in the Dataset tab once the run completes.
  6. Export to JSON, CSV, or Excel for further analysis.

Input Parameters

ParameterTypeDescriptionDefault
searchQuerystringKeywords or topics to search for"artificial intelligence"
maxResultsintegerMaximum transcript matches to return30

Output

Each result in the dataset contains:

FieldDescription
podcastNameName of the podcast show
episodeTitleTitle of the specific episode
matchedTextText excerpt containing the keyword match
timestampTimestamp within the episode (when available)
speakerNameName of the speaker (when available)
urlDirect link to the transcript or episode

Cost Estimate

Running this actor on the Apify platform typically costs between $0.005 and $0.02 per run depending on the number of results requested. The actor uses 1024 MB of memory by default and processes pages efficiently using Cheerio rather than a full browser. Most searches complete within 1-3 minutes. Increasing maxResults may extend the run time and cost proportionally.

Tips and Best Practices

  • Use specific multi-word phrases for more relevant results. For example, "machine learning ethics" will yield better matches than just "machine learning."
  • Combine results from multiple runs with different keywords to build comprehensive topic research.
  • The matchedText field provides context around each keyword hit so you can evaluate relevance without visiting every link.
  • Schedule regular runs to discover new episodes as they are published and transcribed.
  • For related content aggregation, check out the RSS Feed Aggregator actor to monitor podcast RSS feeds directly.
  • Some transcript databases may have rate limits. The actor handles retries automatically, but very large searches may take longer.
  • Filter your exported dataset by podcastName to focus on specific shows that consistently cover your topic.