Pricing

from $1.00 / 1,000 per record returneds

MIT OpenCourseWare Transcript Scraper — Lectures to Text

Extract MIT OpenCourseWare video-lecture transcripts — no login, no ASR. Give it a course (crawls every lecture) or specific lecture URLs: full transcript text, timestamped segments & SRT/VTT, plus course and lecture titles. Creative-Commons content. $2 per 1,000 lectures.

Pricing

from $1.00 / 1,000 per record returneds

Rating

0.0

(0)

Developer

Scrapers Delight

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

🎓 MIT OpenCourseWare Lecture Transcript Scraper

Pull MIT OpenCourseWare video-lecture transcripts — no login, no AI transcription. MIT OCW publishes a transcript for every lecture, and this actor reads it: full text, timestamped segments, and SRT/VTT, plus course and lecture titles. Give it a course (it crawls every lecture) or specific lecture URLs.

It reads OCW's own captions, so there's no speech-to-text compute — fast and cheap. (MIT OCW is free, Creative-Commons educational content.)

What does it do?

For each lecture (from a course crawl or direct URLs) it returns:

📝 Full transcript (plain text) — always included
⏲️ Timestamped segments — {start, end, text}
🎬 SRT / VTT subtitles
🏷️ Course title + lecture title

No ASR, no API key — it reads the published .vtt caption track.

What data does it extract?

For every lecture: url, course_title, lecture_title, transcript, segments[], srt, vtt, segment_count, is_new (monitor), scraped_at.

Who is it for?

🎓 Learners & educators turning lectures into searchable notes and study guides.
🤖 AI / RAG builders — rigorous, structured lecture content is excellent training/retrieval data.
🌍 Localization / accessibility workflows.

How to use it (step by step)

Click Try for free.
Paste a course URL (https://ocw.mit.edu/courses/{slug}/) — or specific lecture URLs.
(Optional) add srt/vtt/segments formats.
Click Start, open the Dataset tab to view/export.
(Optional) set monitorMode + a Schedule to capture lectures as courses update.

Quick start

{ "courseUrls": ["https://ocw.mit.edu/courses/6-0001-introduction-to-computer-science-and-programming-in-python-fall-2016/"], "transcriptFormats": ["txt", "srt"] }

Input

Field	What it does
`courseUrls`	OCW course URLs (crawls each course's lectures)
`lectureUrls`	specific lecture resource URLs
`transcriptFormats`	`txt` · `segments` · `srt` · `vtt`
`maxLectures`	hard cap per run (0 = all)
`monitorMode`, `alertOnNewLecture`	recurring watcher + alerts
`webhookUrl`, `slackWebhookUrl`, `emailRecipients`	alert channels
`proxyConfiguration`, `requestConcurrency`	proxy + parallelism

Output

Each lecture is one dataset record (fields above). Export to JSON, CSV, Excel, HTML, or RSS, or fetch via the Apify API.

How much does it cost?

Pay-per-event — and with no transcription compute, it's cheap:

Event	What it covers	Suggested price
`lot-scraped`	each lecture returned	~$0.003 / lecture
`lot-detail-enriched`	each transcript fetched	~$0.003 / lecture
`monitor-run-completed`	each scheduled watch run	~$0.05 / run
`new-lot-detected`	each new lecture	~$0.02 / lecture
`alert-delivered`	each Slack/email/webhook push	~$0.005 / alert

(Final per-event prices are set on the actor's pricing page.)

Is it legal to scrape OCW transcripts?

MIT OpenCourseWare is published free to the public under a Creative Commons BY-NC-SA license. This actor reads those public transcripts. You must comply with the CC BY-NC-SA terms — attribute MIT OCW, non-commercial use, share-alike — and review OCW's site terms. You are responsible for your use.

FAQ

Does it crawl a whole course? Yes — give a course URL and it finds + transcribes every video lecture.

Is there a Whisper/ASR step? No — it reads OCW's .vtt captions, so it's fast and cheap.

Can I get subtitles? Yes — add srt and/or vtt to transcriptFormats.

How do I export? JSON, CSV, Excel, HTML, or RSS from the Dataset tab, or via the Apify API.

Feedback

Want PDF-notes extraction or per-department crawling? Open an issue on the actor.

Coursera Transcript Scraper — Lecture Subtitles (No Login)

scrapersdelight/coursera-transcript-scraper

Extract Coursera lecture transcripts from the course's own subtitle tracks — no login, no ASR. By course slug: each open lecture's transcript as text, timestamped segments & SRT/VTT, in 30+ languages. Gated lectures are flagged, not faked. $2 per 1,000 lectures.

Scrapers Delight

MIT OpenCourseWare Scraper | Free MIT Course Data

parseforge/mit-ocw-scraper

Pull MIT OpenCourseWare courses with title, instructor, department, level, semester, syllabus, lecture notes, problem sets, exams, and video URLs. Build free education datasets, study tools, and AI training corpora using world-class material from MIT, all openly licensed.

ParseForge

MIT OpenCourseWare Scraper

crawlerbros/mit-open-course-ware-scraper

Scrape MIT OpenCourseWare (ocw.mit.edu) - 2,500+ free MIT courses with full metadata: title, department, level, instructors, topics, resource types, descriptions, and image URLs. Search by keyword, browse by department or level, or fetch a single course by URL.

Crawler Bros

Udemy Courses Scraper - Low-cost💲🔥🎓🔍

delectable_incubator/udemy-courses-scraper-low-cost

🎓 Scrape Udemy course search results and extract course titles, instructors, ratings, review counts, durations, lecture counts, course levels, languages, badges, course URLs, and more. Perfect for market research, competitor analysis, education insights, trend monitoring, and AI datasets. 🚀📊

Prime Scrape

Udemy Scraper | $2 / 1k | All In One

fatihtahta/udemy-scraper

Scrape Udemy into clean, structured course, review and instructor data. $4 per 1,000 results. Capture titles, pricing and discounts, ratings, popularity, lecture counts, levels, languages, images, and profiles. Ideal for course market research, competitor analysis, and building targeted lead lists.

Fatih Tahta

Coursera Scraper | All In One | $0.8 / 1k

fatihtahta/coursera-scraper

Scrape Coursera into clean, structured course and review data. Get titles, pricing and discounts, ratings, popularity, lecture counts, levels, languages, images and more. Ideal for course market research, competitor analysis, and building targeted lead lists.

Fatih Tahta

Dailymotion Transcript Scraper — Subtitles to TXT, SRT, VTT

scrapersdelight/dailymotion-transcript-scraper

Extract any public Dailymotion video's subtitle transcript — no login, no ASR. By video URL/ID or a search query: full text, timestamped segments & SRT/VTT, plus title, owner and duration, from Dailymotion's own subtitle tracks. $2 per 1,000 videos.

Scrapers Delight

Vimeo Transcript Scraper — Captions to TXT, SRT & VTT

scrapersdelight/vimeo-transcript-scraper

Extract any public Vimeo video's captions and transcript — no login, no ASR. By video URL/ID or a page that links Vimeo videos: transcript text, timestamped segments & SRT/VTT, plus title, owner and duration, from Vimeo's own caption tracks. $2 per 1,000 videos.

Scrapers Delight

Udemy Course Scraper - Cheap & Advanced 🔎🎓

scrapestorm/udemy-course-scraper-cheap-advanced

Gather Udemy course search results 🎓📚, including free & paid courses, ratings ⭐, instructors 👨‍🏫, course duration ⏱️, lecture counts, badges 🏅, pricing, and more. Search millions of courses using one or multiple keywords, export structured datasets 📊 or integrate with your automation workflows

Storm_Scraper

4.3

TikTok Transcript Scraper - JSON, SRT, VTT

jamhimself/tiktok-transcript-scraper

Extracts TikTok video transcripts from native captions (no AI transcription). Input: video URLs or IDs. Output: timestamped JSON segments, plain text, SRT, VTT, or RAG chunks + metadata. $0.003 per video with a transcript; no-caption videos free.