Youtube Metadata Scraper (Transcripts Included 😋) avatar
Youtube Metadata Scraper (Transcripts Included 😋)

Pricing

$5.00/month + usage

Go to Store
Youtube Metadata Scraper (Transcripts Included 😋)

Youtube Metadata Scraper (Transcripts Included 😋)

Developed by

tolu.

tolu.

Maintained by Community

Introducing the most comprehensive and robust YouTube Metadata Scraper on Apify, built for videos and shorts. Get detailed metadata, including title, description, video length, tags, like count, view count, comment count, and even full transcripts.

0.0 (0)

Pricing

$5.00/month + usage

0

Total users

9

Monthly users

6

Runs succeeded

>99%

Last modified

12 days ago

📝 Youtube Metadata Scraper (Transcripts Included 😋) 📝

Introducing the most comprehensive and robust Youtube Metadata web scraper on Apify. Get video details, channel details, engagement statistics, transcripts and more from Youtube videos and shorts via a single interface.

😋 Features 😋

  • Extract complete video metadata including video title, description, channel details, engagement stats and full transcripts.
  • Seamlessly handle both videos and shorts URL through a single interface.
  • Easily customize transcript options in the input settings including transcript formats and languages.
  • Access detailed timestamps, including start times and time ranges for each transcript segment.

⚡ Use Cases ⚡

Market Research & Competitor Analysis

  • Track trends through titles, descriptions, and tags.
  • Monitor competitor video performance through likes, views, and comment counts.
  • Identify high-performing keywords and optimize SEO strategies.

Content Creation & Strategy

  • Analyze successful videos to refine content ideas.
  • Extract transcripts for repurposing into blog posts, captions, or summaries.
  • Gather engagement insights to understand audience preferences.

Academic & Sentiment Analysis

  • Collect video transcripts for natural language processing (NLP) or sentiment analysis.
  • Study trends in discussions across different topics.
  • Monitor misinformation and content moderation studies.

Social Media & Brand Monitoring

  • Track mentions of brands or topics in video descriptions and transcripts.
  • Measure audience sentiment and engagement around specific products or campaigns.
  • Identify influencers and assess their impact.

👩‍🍳 Input Parameters 👩‍🍳

ParameterTypeDescriptionDefault Value
startURLsarrayAt least one Youtube URL (Video and Shorts URL are supported)-
extractTranscriptbooleanIf selected, includes transcript in the resulttrue
transcriptFormatraw|timestampIf raw, return transcript as a string without timestamp information. If timestamp, include timestamp information for each transcript segmenttimestamp
includeEnglishAGbooleanIf selected, includes English (auto-generated) option in transcriptfalse
includeNonEnglishbooleanIf selected, includes non-English languages in transcriptsfalse
proxyobjectApify's proxy configuration. Choose RESIDENTIAL proxies for reliable runsRESIDENTIAL

🍖 Output Example 🍖

includeEnglishAG and includeNonEnglish was set to False while transcriptFormat was set to raw:

{
"id": "yy16KFzM4XU",
"url": "https://www.youtube.com/watch?v=yy16KFzM4XU",
"title": "Why sci-fi alien planets all look the same",
"description": "There's a reason that a lot of planets in American science fiction look the same: they're all filmed in the same places. But why those particular locations? It's about money, about union rules, and about the thirty-mile zone -- or as it's otherwise known, the TMZ.\n\nWikipedia on Vasquez Rocks: https://en.wikipedia.org/wiki/List_of_productions_using_the_Vasquez_Rocks_as_a_filming_location\n\nCamera: Matt Gray http://www.mattg.co.uk/\n\n🟥 MORE FROM TOM: https://www.tomscott.com/\n(you can find contact details and social links there too)\n\n📰 WEEKLY NEWSLETTER with good stuff from the rest of the internet: https://www.tomscott.com/newsletter/\n❓ LATERAL, free weekly podcast: https://lateralcast.com/ https://youtube.com/lateralcast/\n➕ TOM SCOTT PLUS: https://youtube.com/tomscottplus\n👥 THE TECHNICAL DIFFICULTIES: https://youtube.com/techdif",
"lengthInSeconds": 128,
"uploadDatetime": "2017-05-01T15:00:05+00:00",
"category": "Education",
"tags": [],
"channelID": "UCBa659QWEk1AI4Tg--mrJ2A",
"channelURL": "http://www.youtube.com/@TomScottGo",
"channelUsername": "TomScottGo",
"channelDisplayName": "Tom Scott",
"channelSubscribers": "6.54M",
"viewCount": 4057772,
"likeCount": 106903,
"commentCount": 1819,
"transcripts": [
{
"language": "English",
"content": "I am exactly thirty miles away from the intersection of Beverly Boulevard\nand North La Cienega Boulevard in downtown Los Angeles. This is part of the boundary\nof the studio zone, a thirty-mile circle that\nsweeps through southern California, and which explains why a lot of planets in\nscience fiction look kind of the same. See, if I were filming a movie and I asked\nmy cast and crew to meet me here, outside the zone, I'd have to pay them for their time and mileage\ngetting to the location. But if I just walk a few paces this way,\ninside the studio zone, then cast and crew get paid from when they\narrive here, from their call time. Filming inside the zone is significantly cheaper. The headquarters of pretty much all the movie\nstudios are inside the zone, and so are a lot of places that you'll have\nseen in film and television. Griffith Park and Bronson Canyon\nare well within the zone, close to the centre of Hollywood,\nso that's where Adam West's Batcave was, along with where the walls fell for\nCaptain Picard, and along with, well, pretty much any American TV show that's ever\nneeded that sort of landscape. And Vasquez Rocks, which have been used in\nso many TV shows, films and music videos that there is a separate\nWikipedia page just listing them, is conveniently twenty-nine miles from the\ncentre of the zone. If those rocks were slightly further away\nfrom Hollywood, then who knows? Perhaps Bill and Ted would have been killed\nby their evil robot doubles somewhere else. Why thirty miles? Union rules. Hollywood is a union town, and long ago,\nit was negotiated that thirty miles was \"local\"\nand anything outside wasn't, presumably after many, many disagreements\ngoing back and forth between union reps and the studios. Now, exceptions have been added over time:\nthe rules are complicated, and I couldn't explain them all if I tried. Production coordinators generally have that job. But in summary:\nthis thirty-mile zone, this TMZ, and yes, that's where the gossip site got\nits name from, this border is why, sometimes, it looks like\nthe crew of the Enterprise are boldly going where quite a lot of people\nhave gone before."
}
]
}

Also, here is an excerpt of the transcript when transcriptFormat is set to timestamp:

{
...,
"transcripts": [
{
"language": "English",
"content": [
{
"startMs": 60,
"endMs": 1920,
"startTime": "0:00",
"text": "I am exactly thirty miles away"
},
{
"startMs": 1920,
"endMs": 5720,
"startTime": "0:01",
"text": "from the intersection of Beverly Boulevard\nand North La Cienega Boulevard"
},
{
"startMs": 5720,
"endMs": 7588,
"startTime": "0:05",
"text": "in downtown Los Angeles."
},
...,
]
}
]
}

⚠ Limitations ⚠

  • Very likely to fail if the scraper is not ran with RESIDENTIAL proxies.

This scraper is under active development and suggestions or feature requests will be greatly appreciated. If you have suggestions, feature requests, or encounter any issues, feel free to: