# Build a Reddit thread dataset for AI

**Use case:** Create structured Reddit comment data with stable thread paths for AI agents, RAG, and LLM analysis.

## Input

```json
{
  "postUrls": [
    "https://www.reddit.com/r/ProductManagement/comments/14g8f2h/do_you_consider_reddit_posts_in_your_user/"
  ],
  "maxComments": 5,
  "includeReplies": true,
  "sortBy": "top",
  "maxConcurrency": 1,
  "proxy": {
    "useApifyProxy": true,
    "apifyProxyGroups": [
      "RESIDENTIAL"
    ],
    "apifyProxyCountry": "US"
  },
  "mcpOutputConnectors": [],
  "mcpToolArguments": {},
  "failOnMcpDeliveryError": false
}
```
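The input above can also be submitted programmatically. The following is a minimal sketch, assuming the `apify-client` Python package is installed and that an API token is available in an `APIFY_TOKEN` environment variable (the variable name is an assumption of this example). The Actor ID is taken from the Actor URL on this page; `build_run_input` and `run_actor` are illustrative helper names, not part of the Actor.

```python
import os

# Actor ID as it appears in the Actor URL on this page.
ACTOR_ID = "newbs/reddit-comment-scraper"


def build_run_input(post_urls, max_comments=5):
    """Return the input shown above, with the post URLs and comment cap swapped in."""
    return {
        "postUrls": list(post_urls),
        "maxComments": max_comments,
        "includeReplies": True,
        "sortBy": "top",
        "maxConcurrency": 1,
        "proxy": {
            "useApifyProxy": True,
            "apifyProxyGroups": ["RESIDENTIAL"],
            "apifyProxyCountry": "US",
        },
        "mcpOutputConnectors": [],
        "mcpToolArguments": {},
        "failOnMcpDeliveryError": False,
    }


def run_actor(run_input, token):
    """Start the Actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so build_run_input works without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


# Only contacts Apify when a token is present (APIFY_TOKEN is an assumed name).
if __name__ == "__main__" and os.environ.get("APIFY_TOKEN"):
    url = (
        "https://www.reddit.com/r/ProductManagement/comments/14g8f2h/"
        "do_you_consider_reddit_posts_in_your_user/"
    )
    items = run_actor(build_run_input([url]), os.environ["APIFY_TOKEN"])
    print(f"Fetched {len(items)} comment records")
```

Keeping the input in a small builder function makes it easy to reuse the same proxy and sorting settings across several posts while changing only `postUrls` and `maxComments`.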

## Output

Each record in the output dataset contains the following fields:

```json
{
  "postUrl": {
    "label": "Post URL",
    "format": "string"
  },
  "postTitle": {
    "label": "Post title",
    "format": "string"
  },
  "postAuthor": {
    "label": "Post author",
    "format": "string"
  },
  "subreddit": {
    "label": "Subreddit",
    "format": "string"
  },
  "commentId": {
    "label": "Comment ID",
    "format": "string"
  },
  "commentAuthor": {
    "label": "Comment author",
    "format": "string"
  },
  "commentText": {
    "label": "Comment text",
    "format": "string"
  },
  "commentTimestamp": {
    "label": "Comment timestamp",
    "format": "string"
  },
  "commentDepth": {
    "label": "Comment depth",
    "format": "integer"
  },
  "commentPath": {
    "label": "Comment path",
    "format": "string"
  },
  "parentPath": {
    "label": "Parent path",
    "format": "string"
  },
  "replyCount": {
    "label": "Reply count",
    "format": "integer"
  },
  "scrapedAt": {
    "label": "Scraped at",
    "format": "string"
  }
}
```
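Because each record carries both `commentPath` and `parentPath`, flat records can be nested back into a reply tree before being passed to an LLM or a RAG indexer. The sketch below assumes that a reply's `parentPath` equals its parent's `commentPath` and that a top-level comment has an empty or missing `parentPath`; the exact path format is not documented here, so the sample paths (`"0"`, `"0/0"`) are hypothetical. `build_thread_tree` is an illustrative helper, not part of the Actor.

```python
from collections import defaultdict


def build_thread_tree(rows):
    """Nest flat comment rows into a tree using commentPath and parentPath.

    Assumption: parentPath of a reply equals commentPath of its parent, and
    top-level comments have an empty or missing parentPath.
    """
    known_paths = {row["commentPath"] for row in rows}
    children = defaultdict(list)

    for row in rows:
        parent = row.get("parentPath") or None
        # A reply whose parent was not scraped (for example, cut off by
        # maxComments) or that points at itself is treated as a root.
        if parent not in known_paths or parent == row["commentPath"]:
            parent = None
        children[parent].append(row)

    def attach(row):
        # Copy the row so the input records are left unmodified.
        node = dict(row)
        node["replies"] = [attach(c) for c in children.get(row["commentPath"], [])]
        return node

    return [attach(row) for row in children.get(None, [])]


# Hypothetical sample rows; real records also carry the other fields listed above.
sample = [
    {"commentId": "a", "commentPath": "0", "parentPath": "", "commentText": "top"},
    {"commentId": "b", "commentPath": "0/0", "parentPath": "0", "commentText": "reply"},
    {"commentId": "c", "commentPath": "1", "parentPath": None, "commentText": "second"},
]
tree = build_thread_tree(sample)
```

Here `tree` holds two root comments, and the first root has the record with `commentId` `"b"` in its `replies` list. Input order is preserved at every level, so sorting the rows by `commentPath` beforehand gives a stable traversal order.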

## About this Actor

This example demonstrates how to use [Reddit Comment Scraper Pro — Updated & Reliable](https://apify.com/newbs/reddit-comment-scraper) with a specific input configuration. Visit the [Actor detail page](https://apify.com/newbs/reddit-comment-scraper) to learn more, explore other use cases, and run it yourself.