# Labeled Reddit Video Dataset for ML Training

**Use case:** ML engineers can assemble thousands of short Reddit clips, each labeled by its subreddit and filtered to between 3 and 30 seconds, into a dataset for training action-recognition models.

## Input

```json
{
  "sources": [
    "r/nextfuckinglevel",
    "r/sports",
    "r/gifs",
    "r/combatfootage"
  ],
  "sort": "top",
  "time": "all",
  "maxPostsPerSource": 50,
  "downloadVideos": true,
  "minDurationSeconds": 3,
  "maxDurationSeconds": 30,
  "dedupeAcrossRuns": true,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ]
  }
}
```
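
A run with this input could be started from Python roughly as follows. This is a minimal sketch, assuming the `apify-client` package is installed and an API token is available in an `APIFY_TOKEN` environment variable. The helper names `build_input` and `run_scraper` are hypothetical and not part of the Actor; the Actor ID is taken from the link in the About section.

```python
import os


def build_input(sources, min_seconds=3, max_seconds=30, max_posts=50):
    """Build the run input shown above (hypothetical helper, not part of the Actor)."""
    if min_seconds > max_seconds:
        raise ValueError("min_seconds must not exceed max_seconds")
    return {
        "sources": list(sources),
        "sort": "top",
        "time": "all",
        "maxPostsPerSource": max_posts,
        "downloadVideos": True,
        "minDurationSeconds": min_seconds,
        "maxDurationSeconds": max_seconds,
        "dedupeAcrossRuns": True,
        "proxyConfiguration": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
        },
    }


def run_scraper(run_input):
    """Start the Actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so build_input() works without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(os.environ["APIFY_TOKEN"])  # assumed environment variable
    run = client.actor("dami_studio/reddit-video-scraper").call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

Calling `run_scraper(build_input(["r/sports", "r/gifs"]))` should then return one dict per scraped post, provided the token is valid and the run succeeds.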

## Output

Each dataset item exposes the fields below; the JSON lists each field's display label and format.

```json
{
  "subreddit": {
    "label": "Sub",
    "format": "text"
  },
  "title": {
    "label": "Title",
    "format": "text"
  },
  "resolution": {
    "label": "Res",
    "format": "text"
  },
  "durationSeconds": {
    "label": "Dur (s)",
    "format": "number"
  },
  "hasAudio": {
    "label": "Audio",
    "format": "boolean"
  },
  "videoUrl": {
    "label": "Video",
    "format": "link"
  },
  "url": {
    "label": "Post",
    "format": "link"
  }
}
```
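
For training, each record's `subreddit` value can serve as the class label. The sketch below assumes dataset items are plain dicts keyed by the field names above. It maps subreddits to integer class IDs and keeps only clips inside the duration window; `build_manifest` is a hypothetical helper, not part of the Actor.

```python
def build_manifest(items, min_seconds=3, max_seconds=30):
    """Return (rows, label_map) for clips whose duration is within the window."""
    label_map = {}
    rows = []
    for item in items:
        duration = item.get("durationSeconds")
        video_url = item.get("videoUrl")
        subreddit = item.get("subreddit")
        # Skip records that lack a field we need.
        if duration is None or not video_url or not subreddit:
            continue
        # Re-check the duration window in case items come from mixed runs.
        if not (min_seconds <= duration <= max_seconds):
            continue
        # Assign class IDs in order of first appearance.
        label = label_map.setdefault(subreddit, len(label_map))
        rows.append(
            {"videoUrl": video_url, "label": label, "durationSeconds": duration}
        )
    return rows, label_map
```

The returned `rows` can be written to a CSV or JSON manifest for a data loader, and `label_map` records which integer corresponds to which subreddit.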

## About this Actor

This example demonstrates how to use [Reddit Video Scraper & Downloader](https://apify.com/dami_studio/reddit-video-scraper) with a specific input configuration. Visit the [Actor detail page](https://apify.com/dami_studio/reddit-video-scraper) to learn more, explore other use cases, and run it yourself.