Reddit Scraper avatar

Reddit Scraper

Pricing

Pay per usage

Go to Apify Store
Reddit Scraper

Reddit Scraper

Scrape Reddit posts, comments, and subreddit data. Full nested comment threads, search queries, user profiles.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Fulcria Labs

Fulcria Labs

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

3 days ago

Last modified

Categories

Share

Scrape Reddit posts, comments, and subreddit data. Extract posts by subreddit, search query, or direct URL with full comment threads and nested replies.

Features

  • Subreddit scraping - Browse any subreddit sorted by hot, new, top, or rising
  • Reddit search - Search across all of Reddit or within specific subreddits
  • Direct post scraping - Scrape any Reddit post URL with full comments
  • Nested comments - Extract full comment threads with configurable depth
  • Flexible output - Posts with comments, posts only, comments only, or flat format
  • Score filtering - Filter posts by minimum upvote score
  • Polite scraping - Built-in rate limiting to respect Reddit's servers

Input

ParameterTypeDefaultDescription
subredditsstring[][]Subreddit names to scrape (without r/ prefix)
searchQueriesstring[][]Search terms to find posts
postUrlsstring[][]Direct Reddit post URLs
sortstring"hot"Sort order: hot, new, top, rising
timeFilterstring"week"Time range for Top sort: hour, day, week, month, year, all
maxPostsinteger25Max posts per subreddit/query (1-500)
includeCommentsbooleantrueWhether to scrape comments
maxCommentsinteger50Max top-level comments per post (0 = all)
includeRepliesbooleantrueInclude nested comment replies
maxReplyDepthinteger3Max reply nesting depth (0-10)
minScoreinteger0Minimum post score filter
outputFormatstring"posts_with_comments"Output format (see below)

Output Formats

posts_with_comments (default)

Each dataset item is a post with embedded comments array:

{
"type": "post",
"id": "1abc123",
"subreddit": "python",
"title": "What's the best Python web framework in 2026?",
"author": "dev_user",
"score": 542,
"upvoteRatio": 0.95,
"numComments": 187,
"url": "https://www.reddit.com/r/python/comments/1abc123/...",
"selfText": "I've been using Flask but wondering...",
"createdAt": "2026-02-20T15:30:00+00:00",
"flair": "Discussion",
"comments": [
{
"type": "comment",
"id": "comment1",
"author": "expert_dev",
"body": "FastAPI is the way to go...",
"score": 89,
"depth": 0,
"replies": [
{
"type": "comment",
"body": "Agreed, especially for APIs...",
"depth": 1,
"replies": []
}
]
}
]
}

posts_only

Posts without comments (faster scraping).

comments_only

Each comment as a separate dataset item, enriched with post title and subreddit.

flat

One item per post and one item per comment (no nesting). Useful for data analysis and CSV export.

Example Usage

Scrape top posts from r/python

{
"subreddits": ["python"],
"sort": "top",
"timeFilter": "month",
"maxPosts": 50,
"includeComments": true
}

Search Reddit for a topic

{
"searchQueries": ["web scraping best practices"],
"maxPosts": 25,
"outputFormat": "flat"
}

Scrape specific posts

{
"postUrls": [
"https://www.reddit.com/r/python/comments/abc123/example_post/"
],
"maxComments": 100,
"maxReplyDepth": 5
}

Multiple subreddits, posts only (fast)

{
"subreddits": ["machinelearning", "datascience", "artificial"],
"sort": "hot",
"maxPosts": 100,
"includeComments": false,
"outputFormat": "posts_only"
}

How It Works

This actor uses Reddit's public JSON API (appending .json to Reddit URLs) to extract data. No Reddit API credentials are required.

Rate limiting: The actor includes built-in delays between requests (1.5 seconds) to be respectful of Reddit's servers. Using Apify proxy is recommended for larger scraping jobs.

Comment threading: Comments are extracted with their full reply tree structure. The maxReplyDepth parameter controls how deep into reply chains the scraper goes. Replies are nested in a replies array within each comment.

Use Cases

  • Market research - Monitor product discussions and sentiment across subreddits
  • Content analysis - Analyze trending topics, keywords, and engagement patterns
  • Competitive intelligence - Track mentions of brands, products, or competitors
  • Academic research - Collect Reddit data for NLP, sentiment analysis, or social studies
  • SEO and content - Find popular questions and topics for content creation
  • Community monitoring - Track discussions in specific subreddits over time

Limitations

  • Reddit may rate-limit requests; use proxy configuration for large jobs
  • Some subreddits may be private or restricted
  • Deleted posts/comments show as [deleted]
  • Maximum ~1000 posts per subreddit listing (Reddit API limitation)