All notable, user-facing capabilities of the YouTube Data Scraper. This is a
feature log rather than a strict per-release history; it summarizes what the
actor can do.
- Search — find videos by keyword, with each query run as its own task.
- Channel — collect channel profiles (subscriber / video / view counts,
country, join date, links, banner) from channel URLs, @handles, or channel
search results.
- Video — rich per-video metadata and statistics.
- Comments — fetch comments with author flags, like counts, and optional
nested replies.
- Transcripts — download full captions in multiple formats.
- Trending — surface the most-viewed recent videos for a topic.
- Flexible targeting: paste video / channel / playlist URLs, bare video IDs, or
bare @handles; or run keyword searches. Typed URL lists (
videoUrls,
channelUrls, playlistUrls) merge with startUrls.
- Friendly input aliases so common alternate field names are accepted
(e.g.
query / q / keywords, urls, limit / maxItems,
lang, gl / region, proxy).
- Search filters: sort by relevance / date / view count / rating; upload-date
window; video duration; result type (videos / channels / playlists / any).
- Feature filters: HD, 4K, HDR, 360°, subtitles/CC, Creative Commons, live.
- Channel options: choose the Videos / Shorts / Live tab and the sort order;
optionally fetch the About panel for join date, country, total views, links.
- Shorts handling: include, exclude, cap, or return only Shorts.
- Post-fetch filters across all modes: include/exclude keywords, published-date
range (
fromDate / toDate, absolute or relative), max age in days, and a
custom filter expression.
- Output as timed segments (JSON), plain text, SRT, WebVTT, or raw XML.
- Choose a transcript language, prefer auto-generated tracks, and optionally
translate the transcript into another language.
- Emit only a chosen subset of transcript fields.
- Engagement metrics: like-to-view and comment-to-view ratios, and a combined
engagement rate.
- Extracted hashtags, description URLs, and social-profile links.
- Parsed chapters, advertised caption languages, and high-resolution thumbnails.
- Channel-level derived metrics: average views per video and estimated uploads
per month.
- Description rendering in plain text, HTML, Markdown, or all formats.
- Optional translated titles and descriptions via YouTube's own localization.
- Canonical, embed, and short URLs on every video.
- Per-field metadata surfaced in the Apify console, plus tabular dataset views
for Videos, Channels, Comments, Transcripts, and Playlist Videos.
- Global result cap, per-query/per-target cap, adjustable concurrency, and an
optional soft run-time limit that saves partial results gracefully.
- Detail levels from Basic (listing fields only) to Complete (all fields).
- Locale controls for country and interface language.
- Optional run metadata (
runId, actorId, startedAt) on every item, and an
option to strip empty fields for compact output.
- Track seen items across runs and emit only new and changed items on repeat
runs, keyed by a stable state key.
- Send a run summary to Telegram, Discord, Slack, WhatsApp, or a custom webhook,
with a configurable item limit and an option to notify only on new/updated
items.
- Invalid locale codes and out-of-range numeric inputs are normalized to safe
defaults instead of failing the run.
- Optional proxy routing (enabled by default).
- Results are written to the dataset in bounded batches as the run progresses,
so a timeout or interruption keeps the data already collected and memory
stays bounded on large runs.
- Saving each batch is retried automatically on a transient storage hiccup; a
batch that still fails is skipped with a warning rather than aborting the run.