Get started
Product
Back
Start here!
Get data with ready-made web scrapers for popular websites
Browse 15,337 Actors
Apify platform
Apify Store
Pre-built web scraping tools
Actors
Build and run serverless programs
Integrations
Connect with apps and services
MCP
Give your AI access to Actors
Anti-blocking
Scrape without getting blocked
Proxy
Rotate scraper IP addresses
Open source
Crawlee
Web scraping and crawling library
Solutions
MCP server configuration
Configure your Apify MCP server with Actors and tools for seamless integration with MCP clients.
Start building
Web data for
Enterprise
Startups
Universities
Nonprofits
Use cases
Data for generative AI
Data for AI agents
Lead generation
Market research
View more →
Consulting
Apify Professional Services
Apify Partners
Developers
Documentation
Full reference for the Apify platform
Code templates
Python, JavaScript, and TypeScript
Web scraping academy
Courses for beginners and experts
Monetize your code
Publish your scrapers and get paid
Learn
API reference
CLI
SDK
Earn from your code
$596k paid out in December. Many developers earn $3k+ every month.
Start earning now
Resources
Help and support
Advice and answers about Apify
Actor ideas
Get inspired to build Actors
Changelog
See what’s new on Apify
Customer stories
Find out how others use Apify
Company
About Apify
Contact us
Blog
Live events
Partners
Jobs
We're hiring!
Join our Discord
Talk to scraping experts
Pricing
Contact sales
Pay per usage
jupri/google-speech
Use free Google Text to Speech to translate text into voice
Rating
0.0
(0)
Developer
cat
Actor stats
9
Bookmarked
199
Total users
1
Monthly active users
9 months ago
Last modified
Categories
AI
Automation
Videos
Share
vivid_astronaut/speech-to-text
Convert speech to text with high accuracy using Azure AI. Supports 100+ languages, speaker detection, and timestamps. Perfect for transcription, subtitles, and voice-to-text applications.
Fabio Suizu
6
cheapget/instagram-to-text
AI-powered video transcription and translation. Convert video speech to text with timestamped subtitles in 100+ languages
CheapGET
21
5.0
aizen0/video-to-text-transcription
Convert video speech to text in bulk. Supports Only Twitter/Instagram, auto-detects languages, handles large files automatically. Uses OpenAI Whisper for high accuracy.
Aizen
39
futurizerush/youtube-subtitle-generator
Generate subtitles from YouTube videos using OpenAI's AI models for speech-to-text transcription and translation to 15+ languages. Outputs SRT, TXT, and JSON formats. Note: Requires fresh cookies for each run (expire within minutes) - not suitable for automation.
Futurize Rush
3
donjuan_mime/audio-video-to-text
Transcribes video and audio files into plain text and subtitle formats (TXT, SRT, VTT, TSV, JSON) using OpenAI's Whisper model. Supports preloaded tiny, base, and small models.
Donjuan
74
tictechid/vanzi-universal-transcriber
Transcribe audio from videos stored on Google Drive, Dropbox, GitHub raw, OneDrive, Box, iCloud, AWS S3, GCS, Azure Blob, and Backblaze B2. Convert share links to direct downloads for fast, accurate transcripts with timestamps and easy API integration.
TicTech
67
vittuhy/audio-and-video-transcript
This Actor transcribes audio or video files from publicly accessible URLs using OpenAI's Whisper API. To use this Actor, you'll need to provide your own OpenAI API key. It supports multiple languages and highly customizable parameters, enabling precise control over the transcription process.
Vít Tuhý
79
1.8
stanvanrooy6/audio-video-transcriber
Downloads videos from public URLs, extracts audio, and transcribes them using OpenAI
Stan Van Rooy
46
parseforge/audio-transcriber
Automates audio transcription from multiple sources (files or links). Normalizes input format to ensure optimal processing. Generates word-for-word transcriptions maintaining references to source audio, perfect for datasets requiring traceability and regulatory compliance.
ParseForge
16
invideoiq/video-transcriber
Retrieves transcripts from online video content from multiple plateforms (Instagram, X, ..) using speech-to-text models. It delivers outputs in JSON and LLM-ready formats, making it ideal for analytics, and AI-based applications. Perfect for research and building intelligent conversational agents
InVideoIQ
222
4.8