TED Talks Scraper avatar

TED Talks Scraper

Pricing

from $3.00 / 1,000 results

Go to Apify Store
TED Talks Scraper

TED Talks Scraper

Scrape TED.com talks with title, speaker, duration, view count, publish/record dates, topics, language, description, thumbnail. Two modes: fetch specific talks by URL/slug, or browse all talks in a topic. Pure HTTP, no auth needed.

Pricing

from $3.00 / 1,000 results

Rating

5.0

(21)

Developer

Crawler Bros

Crawler Bros

Maintained by Community

Actor stats

22

Bookmarked

2

Total users

1

Monthly active users

7 days ago

Last modified

Share

Scrape TED.com talks — title, speaker, duration, view count, publish/record dates, topics, language, description, and thumbnail. Two modes: fetch specific talks by URL/slug, or browse all talks in a topic.

Pure HTTP, no auth, no proxy required. TED.com works from datacenter IPs.

What you get

Talk records (recordType=talk)

FieldDescription
idTED talk ID
slugURL slug (e.g. sir_ken_robinson_do_schools_kill_creativity)
urlCanonical talk URL
titleTalk title
speakerPresenter name (presenterDisplayName)
partnerNameTEDx / TED Conference / Independent / etc.
descriptionPlain-text summary
socialDescriptionOptional alt summary used on social embeds (only when different)
durationSecondsTalk length in seconds
durationFormattedHuman-readable MM:SS or H:MM:SS
viewedCountAll-time view count
publishedAtISO 8601 timestamp the talk was published on TED.com
recordedOnISO date the talk was recorded
languageISO 639-1 language code
featuredtrue for featured talks
curatorApprovedtrue for curator-approved talks
hasTranslationstrue if subtitle translations exist
topicsArray of topic names (e.g. creativity, business, psychology)
thumbnailUrlWidescreen thumbnail URL
relatedTalkSlugsArray of related talk slugs
scrapedAtISO 8601 UTC timestamp

Empty fields are dropped from every record at every depth.

Input

ParameterTypeDefaultDescription
modeEnumbyUrlsbyUrls / byTopic / bySearch / bySpeaker / byPlaylist / browse
talkUrlsArray["sir_ken_robinson_do_schools_kill_creativity"]Talk URLs or slugs (mode=byUrls)
topicEnumcreativityCurated TED topic slug from a 50-item dropdown (mode=byTopic)
searchQueryStringFree-text search query (mode=bySearch)
speakerStringTED speaker slug or URL (mode=bySpeaker)
playlistStringTED playlist numeric ID or full URL (mode=byPlaylist)
sortEnumpopularpopular / newest / oldest for mode=browse
minViewsIntegerDrop talks with fewer views
minDurationSeconds / maxDurationSecondsIntegerFilter by talk length
languageEnum(no filter)ISO 639-1 dropdown of TED's top-30 languages
maxItemsInteger25Hard cap (1-1000)

Example input — single talk

{
"mode": "byUrls",
"talkUrls": ["sir_ken_robinson_do_schools_kill_creativity"]
}

Example input — multiple talks

{
"mode": "byUrls",
"talkUrls": [
"https://www.ted.com/talks/brene_brown_the_power_of_vulnerability",
"https://www.ted.com/talks/simon_sinek_how_great_leaders_inspire_action",
"do_schools_kill_creativity"
]
}

Example input — browse a topic

{
"mode": "byTopic",
"topic": "creativity",
"minViews": 1000000,
"maxItems": 50
}

Example input — short English talks

{
"mode": "byTopic",
"topic": "psychology",
"language": "en",
"maxDurationSeconds": 600,
"maxItems": 25
}
{
"mode": "bySearch",
"searchQuery": "quantum computing",
"maxItems": 30
}

Example input — by speaker

{
"mode": "bySpeaker",
"speaker": "sir_ken_robinson"
}
{
"mode": "byPlaylist",
"playlist": "171",
"maxItems": 25
}
{
"mode": "browse",
"sort": "popular",
"maxItems": 25
}

Example output

{
"recordType": "talk",
"id": "66",
"slug": "sir_ken_robinson_do_schools_kill_creativity",
"url": "https://www.ted.com/talks/sir_ken_robinson_do_schools_kill_creativity",
"title": "Do schools kill creativity?",
"speaker": "Sir Ken Robinson",
"description": "Sir Ken Robinson makes an entertaining and profoundly moving case for creating an education system that nurtures (rather than undermines) creativity.",
"durationSeconds": 1151,
"durationFormatted": "19:11",
"viewedCount": 80129749,
"publishedAt": "2006-06-27T00:11:00Z",
"recordedOn": "2006-02-25",
"language": "en",
"curatorApproved": true,
"hasTranslations": true,
"topics": ["education", "creativity", "psychology"],
"thumbnailUrl": "https://pi.tedcdn.com/r/talkstar-photos.s3.amazonaws.com/uploads/...",
"scrapedAt": "2026-05-06T10:42:18Z"
}

Use cases

  • Educational platform content discovery — Build curated talk libraries by topic.
  • Corporate training catalogs — Index TED talks by length, speaker, or topic for L&D programs.
  • Content recommendation engines — Match TED talks to user interests via topics + view counts.
  • Speaker / influencer research — Track TED appearances of public figures.
  • Academic research — Snapshot communication / public-speaking dataset.

FAQ

Do I need a TED account or API key? No. TED.com pages embed full talk metadata in their __NEXT_DATA__ JSON blob; the actor reads that directly.

How do I find a topic slug? Browse https://www.ted.com/topics — every topic page URL ends in the slug (e.g. /topics/creativitycreativity). Common slugs: creativity, business, education, psychology, technology, science, health, culture, art.

Are transcripts included? Not in this version. Talk transcripts live on a separate /transcript URL and are translated into many languages; capturing them requires an additional fetch per talk and per language.

Why does mode=byTopic make N+1 requests? TED's topic listing returns lighter data per talk. To get full metadata (views, duration, language, etc.), the actor visits each talk's individual URL. Use mode=byUrls if you already have the slugs and want fewer round-trips.

How current is the data? Live — every run hits TED.com at request time. Schedule the actor for daily / weekly refreshes to track view-count growth.

Do I need a proxy? No. TED.com accepts datacenter IPs without restriction.

Limitations

  • TED's topic pages return ~16-20 talks per topic; large catalogs need multiple topic queries.
  • Transcripts are not yet captured.
  • Per-talk comments / reactions are not exposed in the public data.
  • Some old talks have sparse metadata (no language, no recordedOn, etc.).