YouTube Metadata Scraper – Full Video Info & Formats

Pricing

from $7.00 / 1,000 successful video scrapes

Extract complete YouTube video metadata: title, description, views, likes, comments, all formats (URLs), subtitles, chapters, heatmap, thumbnails, channel stats, Open Graph data. No API key. Pay per result. Perfect for analytics and archiving.

Developer: Scrape Pilot (maintained by Community)

Actor stats: 0 bookmarks, 2 total users, 1 monthly active user, last modified 4 days ago


📺 YouTube Metadata Scraper – Full Video Info, Formats, Subtitles & More

Extract complete YouTube video metadata – including all available formats, thumbnails, subtitles, chapters, heatmap, likes/comments, channel info, and Open Graph data. Powered by yt‑dlp with browser impersonation via curl_cffi. Perfect for video archiving, analytics, content research, and AI training datasets.


💡 What is the YouTube Metadata Scraper?

The YouTube Metadata Scraper is a professional Apify actor that retrieves every piece of public information from any YouTube video. It combines:

  • yt‑dlp (the most advanced video extraction tool) to fetch all metadata fields, including:
    • Video title, description, uploader, channel, upload date
    • View count, like count, comment count
    • All available video/audio formats (resolution, codec, bitrate, file size)
    • Thumbnails (all sizes, including maxresdefault)
    • Chapters, heatmap (engagement over time), subtitles (manual and auto‑generated)
    • Live status, age restriction, language, availability
  • curl_cffi + BeautifulSoup to extract additional HTML meta tags:
    • Open Graph (og:title, og:image, etc.)
    • Twitter Card data
    • JSON‑LD structured data (including first comment)
    • Canonical URL and page title

The output is a single, rich JSON object per video, pushed directly to the Apify dataset. No API key required – everything is public.
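The HTML half of that pipeline can be illustrated with a small sketch. The actor itself is described as using curl_cffi and BeautifulSoup; the version below swaps in Python's standard-library html.parser so it runs without dependencies, and the names MetaTagParser and extract_html_meta are hypothetical, not taken from the actor's code. It collects Open Graph and Twitter Card tags, the canonical URL, the page title, and any JSON-LD blocks from an HTML string.

```python
import json
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collects og:/twitter: meta tags, canonical URL, page title and JSON-LD."""

    def __init__(self):
        super().__init__()
        self.meta = {}
        self.ld_json = []
        self._in_title = False
        self._in_ld_json = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            key = attrs.get("property") or attrs.get("name")
            if key and key.startswith(("og:", "twitter:")) and "content" in attrs:
                self.meta[key] = attrs["content"]
        elif tag == "link" and attrs.get("rel") == "canonical":
            self.meta["canonical_url"] = attrs.get("href")
        elif tag == "title":
            self._in_title = True
            self._buffer = []
        elif tag == "script" and attrs.get("type") == "application/ld+json":
            self._in_ld_json = True
            self._buffer = []

    def handle_data(self, data):
        # Only buffer text while inside <title> or a JSON-LD <script>.
        if self._in_title or self._in_ld_json:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self.meta["page_title"] = "".join(self._buffer).strip()
            self._in_title = False
        elif tag == "script" and self._in_ld_json:
            try:
                self.ld_json.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed JSON-LD blocks
            self._in_ld_json = False


def extract_html_meta(html):
    """Return a dict shaped loosely like the __html_meta__ field (assumed layout)."""
    parser = MetaTagParser()
    parser.feed(html)
    parser.close()
    result = dict(parser.meta)
    result["__ld_json__"] = parser.ld_json
    return result
```

Calling extract_html_meta(page_html) on a downloaded watch page would return a flat dict of tag names to values plus a __ld_json__ list; the exact key layout the actor produces may differ.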


🚀 Key Features

| Feature | Description |
|---|---|
| Complete metadata | Title, description, uploader, channel, upload date, duration, language, age limit, live status. |
| Engagement stats | View count, like count, comment count (as reported by YouTube). |
| All formats | Every available resolution and codec (including audio‑only) with file size, bitrate, URL. |
| Thumbnails | All thumbnail sizes (from default to maxresdefault) with resolution and preference. |
| Subtitles & captions | Manual and auto‑generated subtitles for all languages (URLs to .vtt, .srt, .json3). |
| Chapters & heatmap | Video chapters (if any) and engagement heatmap (segmented viewer activity). |
| Rich HTML metadata | Open Graph, Twitter Card, JSON‑LD structured data, canonical URL. |
| No login required | Uses only public YouTube pages and APIs. |
| Residential proxy ready | Helps avoid rate limits on large runs (optional; most runs work without a proxy). |
| Clean JSON output | Every field is properly typed; missing fields are omitted. |
| Apify native integration | Seamless dataset storage, scheduling, and API access. |

📥 Input Parameters

The actor accepts a JSON object with the following fields:

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| startUrls | array of objects | Yes | none | List of YouTube video URLs (e.g., [{"url": "https://www.youtube.com/watch?v=..."}]). |
| proxyConfiguration | object | No | none | Apify proxy configuration (residential recommended for large volumes). |

Example Input

{
  "startUrls": [
    {"url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}
  ],
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}
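For longer lists it may be easier to generate this input than to write it by hand. The sketch below builds the same structure from a list of video IDs. The helper name build_input is hypothetical, and the 11-character ID pattern is an assumption based on how YouTube IDs usually look, not something the actor documents.

```python
import json
import re

# Assumption: video IDs are 11 characters from [A-Za-z0-9_-].
VIDEO_ID = re.compile(r"[A-Za-z0-9_-]{11}")


def build_input(video_ids, use_residential_proxy=False):
    """Build the actor input object from a list of YouTube video IDs."""
    start_urls = []
    for video_id in video_ids:
        if not VIDEO_ID.fullmatch(video_id):
            raise ValueError(f"not a plausible video ID: {video_id!r}")
        start_urls.append({"url": f"https://www.youtube.com/watch?v={video_id}"})

    actor_input = {"startUrls": start_urls}
    if use_residential_proxy:
        actor_input["proxyConfiguration"] = {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        }
    return actor_input


if __name__ == "__main__":
    print(json.dumps(build_input(["dQw4w9WgXcQ"], use_residential_proxy=True), indent=2))
```

Run as a script, this prints an input equivalent to the example above.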

📤 Output Fields

The actor pushes one dataset item per video URL. The output is a superset of yt‑dlp’s info dict plus an extra __html_meta__ field containing HTML‑parsed data. Below are the most important fields (the actual output is extremely comprehensive – see example output for a taste).

| Field | Type | Description |
|---|---|---|
| id | string | YouTube video ID (e.g., dQw4w9WgXcQ). |
| title | string | Full video title. |
| fulltitle | string | Same as title (alias). |
| description | string | Video description (plain text). |
| duration | integer | Duration in seconds. |
| duration_string | string | Human‑readable duration (e.g., 3:33). |
| view_count | integer | Number of views. |
| like_count | integer | Number of likes. |
| comment_count | integer | Number of comments. |
| upload_date | string | Upload date in YYYYMMDD format. |
| timestamp | integer | Unix timestamp of upload. |
| channel | string | Channel name. |
| channel_id | string | YouTube channel ID. |
| channel_url | string | URL of the channel page. |
| channel_follower_count | integer | Number of subscribers. |
| channel_is_verified | boolean | Whether the channel has a verified badge. |
| uploader | string | Uploader name (usually the same as channel). |
| uploader_id | string | Channel handle (e.g., @RickAstleyYT). |
| uploader_url | string | Channel URL. |
| availability | string | public, unlisted, private, etc. |
| age_limit | integer | Age restriction (0 if none). |
| categories | array | Video categories (e.g., ["Music"]). |
| tags | array | Video tags. |
| is_live | boolean | Currently live. |
| was_live | boolean | Previously live. |
| live_status | string | not_live, is_live, is_upcoming, was_live, or post_live. |
| thumbnails | array | List of thumbnail objects (each with url, width, height, preference, id). |
| thumbnail | string | Best‑quality thumbnail URL (usually maxresdefault). |
| formats | array | All available video/audio formats. Each has format_id, ext, width, height, fps, vcodec, acodec, filesize (or filesize_approx), tbr, url, etc. |
| chapters | array | If present: start_time, end_time, title. |
| heatmap | array | Engagement over time (each segment: start_time, end_time, value 0–1). |
| subtitles | object | Manual subtitles: language → list of formats (url, ext). |
| automatic_captions | object | Auto‑generated captions (same structure). |
| webpage_url | string | Canonical YouTube URL. |
| extractor | string | Always "youtube". |
| extractor_key | string | Always "Youtube". |
| __html_meta__ | object | Extra HTML metadata: Open Graph, Twitter Card, JSON‑LD, canonical_url, page_title. |
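A few of these fields usually need light conversion before analysis. The following sketch turns upload_date into an ISO date, duration into minutes, and derives a like rate. The function name summarize and the like_rate metric are illustrative choices, not part of the actor's output. Missing fields are handled with .get() because the actor omits fields it cannot fill.

```python
from datetime import datetime


def summarize(item):
    """Reduce one dataset item to a few analysis-friendly values."""
    raw_date = item.get("upload_date")  # "YYYYMMDD" string
    duration = item.get("duration")     # seconds
    views = item.get("view_count")
    likes = item.get("like_count")
    return {
        "id": item.get("id"),
        "uploaded": (
            datetime.strptime(raw_date, "%Y%m%d").date().isoformat()
            if raw_date else None
        ),
        "minutes": round(duration / 60, 2) if duration else None,
        # Likes per view; None when either count is missing or views is zero.
        "like_rate": likes / views if views and likes is not None else None,
    }


if __name__ == "__main__":
    print(summarize({
        "id": "dQw4w9WgXcQ",
        "upload_date": "20091025",
        "duration": 213,
        "view_count": 1781039352,
        "like_count": 19147046,
    }))
```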

Example Output (truncated)

[
  {
    "id": "dQw4w9WgXcQ",
    "title": "Rick Astley - Never Gonna Give You Up (Official Video) (4K Remaster)",
    "view_count": 1781039352,
    "like_count": 19147046,
    "comment_count": 2400000,
    "upload_date": "20091025",
    "duration": 213,
    "channel": "Rick Astley",
    "channel_follower_count": 4510000,
    "channel_is_verified": true,
    "thumbnails": [
      {"url": "https://i.ytimg.com/vi/dQw4w9WgXcQ/maxresdefault.jpg", "width": 1920, "height": 1080}
    ],
    "formats": [
      {
        "format_id": "18",
        "ext": "mp4",
        "width": 640,
        "height": 360,
        "fps": 25,
        "vcodec": "avc1.42001E",
        "acodec": "mp4a.40.2",
        "filesize_approx": 11832459,
        "url": "https://rr1---sn-p5qs7nd7.googlevideo.com/..."
      }
    ],
    "subtitles": {
      "en": [
        {"ext": "vtt", "url": "https://www.youtube.com/api/timedtext?..."}
      ]
    },
    "__html_meta__": {
      "og:title": "Rick Astley - Never Gonna Give You Up (Official Video) (4K Remaster)",
      "og:image": "https://i.ytimg.com/vi/dQw4w9WgXcQ/maxresdefault.jpg",
      "twitter:card": "player",
      "page_title": "Rick Astley - Never Gonna Give You Up (Official Video) (4K Remaster) - YouTube",
      "__ld_json__": [
        {
          "@type": "VideoObject",
          "name": "Rick Astley - Never Gonna Give You Up (Official Video) (4K Remaster)",
          "comment": [
            {"text": "scanned a damn qr code", "author": {"name": "@vinsbdfs"}, "upvoteCount": 1578}
          ]
        }
      ]
    }
  }
]
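Given an item shaped like the one above, two common lookups are the best format that carries both video and audio, and a subtitle URL for a given language. The sketch below shows one way to do both. The helper names are hypothetical, and it relies on the yt-dlp convention that a missing video or audio stream is marked with the string "none" in vcodec or acodec.

```python
def best_muxed_format(formats):
    """Return the tallest format with both video and audio, or None."""
    muxed = [
        f for f in formats
        if (f.get("vcodec") or "none") != "none"
        and (f.get("acodec") or "none") != "none"
    ]
    return max(muxed, key=lambda f: f.get("height") or 0, default=None)


def subtitle_url(item, lang="en", ext="vtt"):
    """Prefer manual subtitles, then fall back to automatic captions."""
    for source in ("subtitles", "automatic_captions"):
        for track in (item.get(source) or {}).get(lang, []):
            if track.get("ext") == ext:
                return track.get("url")
    return None


if __name__ == "__main__":
    item = {
        "formats": [
            {"format_id": "140", "vcodec": "none", "acodec": "mp4a.40.2"},
            {"format_id": "18", "height": 360,
             "vcodec": "avc1.42001E", "acodec": "mp4a.40.2"},
        ],
        "subtitles": {"en": [{"ext": "vtt", "url": "https://www.youtube.com/api/timedtext?..."}]},
    }
    print(best_muxed_format(item["formats"])["format_id"])
    print(subtitle_url(item))
```

Higher resolutions on YouTube are often split into separate video-only and audio-only formats, so the best combined format may be lower quality than the best video-only entry.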

🛠 How to Use on Apify

  1. Create a task with this actor.
  2. Provide one or more YouTube video URLs in the startUrls field.
  3. (Optional) Enable residential proxies – recommended for bulk scraping (100+ videos) to avoid rate limiting.
  4. Run – the actor will fetch metadata for each URL and push one item per video to the Dataset.
  5. Export – download results as JSON, CSV, or Excel.

Running via API

curl -X POST "https://api.apify.com/v2/acts/your-username~youtube-metadata-scraper/runs" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -d '{
    "startUrls": [{"url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}]
  }'
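The same request can be prepared from Python with only the standard library. This is a minimal sketch that mirrors the curl call above; build_run_request is a hypothetical helper, and the actor ID and token are the same placeholders used in the curl example. The function only constructs the request, so nothing is sent until it is passed to urllib.request.urlopen.

```python
import json
import urllib.request


def build_run_request(actor_id, token, actor_input):
    """Prepare (but do not send) a POST request that starts an actor run."""
    url = f"https://api.apify.com/v2/acts/{actor_id}/runs"
    body = json.dumps(actor_input).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


if __name__ == "__main__":
    request = build_run_request(
        "your-username~youtube-metadata-scraper",  # placeholder actor ID
        "YOUR_API_TOKEN",                          # placeholder token
        {"startUrls": [{"url": "https://www.youtube.com/watch?v=dQw4w9WgXcQ"}]},
    )
    print(request.get_method(), request.full_url)
    # To actually start the run: urllib.request.urlopen(request)
```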

💰 Pricing

| Model | Price |
|---|---|
| Pay‑per‑result | From $0.007 per successfully scraped video (from $7.00 per 1,000) |

Actual pricing may vary based on Apify’s consumption model. This actor respects the pay‑per‑event system; you are charged only for successfully scraped videos.
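As a rough budgeting aid, the listed rate can be turned into a quick estimate. The sketch below assumes the starting rate of $7.00 per 1,000 successful scrapes applies flat and that failed scrapes cost nothing; platform usage or plan-specific pricing is not modelled, and estimate_cost is a hypothetical helper.

```python
from decimal import Decimal

# Assumption: the listed starting rate, applied flat to every successful scrape.
PRICE_PER_1000 = Decimal("7.00")


def estimate_cost(successful_videos):
    """Estimated charge in USD for a number of successfully scraped videos."""
    cost = PRICE_PER_1000 * successful_videos / 1000
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    for count in (250, 1000, 2500):
        print(count, estimate_cost(count))
```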


🎯 Use Cases

  • Video analytics – Track engagement (views, likes, comments) over time.
  • Content archiving – Store complete metadata and format URLs for backup.
  • Subtitle extraction – Get subtitle files for language learning or transcription.
  • Research & AI training – Build datasets of video titles, descriptions, tags, and engagement.
  • Competitive monitoring – Watch competitor channels’ uploads and performance.
  • Media forensics – Retrieve heatmaps and chapter data for detailed analysis.

❓ Frequently Asked Questions

1. Do I need a YouTube API key?
No. The actor uses yt-dlp and public web pages – no API key required.

2. Can I download the actual video file?
This actor returns only metadata and format URLs. To download a video, fetch the url from any format entry with your own HTTP client (e.g., wget). The URLs are temporary and expire after a few hours.

3. How fresh is the data?
The actor fetches the latest available information at the time of the run. View and like counts reflect what YouTube reports at that moment.

4. Why do I need residential proxies?
YouTube may throttle datacenter IPs after many requests. For bulk scraping (>100 videos), residential proxies reduce the chance of 429 errors.

5. What does the heatmap field represent?
It indicates relative viewer engagement across the video timeline. Higher values mean more rewatching or activity in that segment. Values are normalized between 0 and 1.

6. Are subtitles always available?
Only if the video has manual subtitles or auto‑captions enabled. The actor returns whatever YouTube provides.

7. Does it work for age‑restricted videos?
Yes, for most public age‑restricted videos. However, some may require a logged‑in session (not supported).

8. Can I scrape playlists or channels?
This version handles single video URLs only. Playlists and channels require a separate actor.
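Returning to the heatmap described in question 5: since each segment carries start_time, end_time, and a normalized value, finding the most replayed part of a video is a short computation. The sketch below uses made-up sample segments, and the helper names and the 0.8 threshold are illustrative choices.

```python
def peak_segment(heatmap):
    """Return the segment with the highest engagement value, or None if empty."""
    if not heatmap:
        return None
    return max(heatmap, key=lambda seg: seg["value"])


def hot_segments(heatmap, threshold=0.8):
    """Return (start_time, end_time) pairs whose value meets the threshold."""
    return [
        (seg["start_time"], seg["end_time"])
        for seg in heatmap
        if seg["value"] >= threshold
    ]


if __name__ == "__main__":
    # Hypothetical sample data, not taken from a real video.
    sample = [
        {"start_time": 0.0, "end_time": 2.13, "value": 1.0},
        {"start_time": 2.13, "end_time": 4.26, "value": 0.42},
        {"start_time": 4.26, "end_time": 6.39, "value": 0.85},
    ]
    print(peak_segment(sample))
    print(hot_segments(sample))
```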



🔍 SEO Keywords

YouTube metadata scraper, YouTube video info extractor, yt-dlp Apify, YouTube subtitles downloader, YouTube format URLs, YouTube engagement analytics, video metadata API, YouTube chapters heatmap, YouTube comment count, Apify YouTube actor




Start extracting complete YouTube metadata today – pay only for successful results.