Reddit Thread Scraper

Pricing

from $2.00 / 1,000 results

Extract posts and top comments from any Reddit thread or subreddit. Returns post title, author, score, URL, body text, and top-voted comments with full metadata. Ideal for sentiment analysis, research, AI training datasets, and community monitoring. No API key required.

Rating: 0.0 (0)

Developer: Sheshinmcfly (Maintained by Community)

Actor stats: 1 bookmarked, 2 total users, 0 monthly active users, last modified 2 hours ago


Extract posts and comments from any subreddit via Reddit's official public JSON API. No authentication required. Filter by sort order, time range, and number of comments.

Perfect for AI training datasets, sentiment analysis, market research, and trend monitoring.


What data does it extract?

Posts

| Field       | Description             | Example                     |
|-------------|-------------------------|-----------------------------|
| type        | Record type             | "post"                      |
| id          | Reddit post ID          | "1sa4rlx"                   |
| subreddit   | Subreddit name          | "MachineLearning"           |
| title       | Post title              | "New paper on LLM reasoning"|
| author      | Username                | "researcher123"             |
| score       | Upvotes minus downvotes | 1420                        |
| upvoteRatio | Upvote ratio            | 0.97                        |
| numComments | Total comment count     | 83                          |
| selftext    | Post body text          | "We propose a new..."       |
| url         | Link URL                | "https://arxiv.org/..."     |
| permalink   | Reddit post URL         | "https://reddit.com/r/..."  |
| flair       | Post flair label        | "Research"                  |
| createdAt   | Post creation time      | "2026-04-21T10:00:00Z"      |
| extractedAt | Extraction timestamp    | "2026-04-21T12:00:00Z"      |
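
To make the field list concrete, here is a minimal sketch of how a raw post object from Reddit's JSON could be mapped onto these fields. The function `to_post_record` is hypothetical and is not the actor's actual code; the raw keys used (`upvote_ratio`, `num_comments`, `link_flair_text`, `created_utc`) are the names Reddit uses in its listing responses, and the timestamp format is assumed from the examples in the table.

```python
from datetime import datetime, timezone

ISO_FORMAT = "%Y-%m-%dT%H:%M:%SZ"  # assumed from the table examples


def to_post_record(raw: dict, extracted_at: datetime) -> dict:
    """Map the `data` object of one Reddit listing child to the documented post fields."""
    created = datetime.fromtimestamp(raw["created_utc"], tz=timezone.utc)
    return {
        "type": "post",
        "id": raw["id"],
        "subreddit": raw["subreddit"],
        "title": raw["title"],
        "author": raw["author"],
        "score": raw["score"],
        "upvoteRatio": raw["upvote_ratio"],
        "numComments": raw["num_comments"],
        "selftext": raw["selftext"],
        "url": raw["url"],
        # Reddit returns a site-relative permalink, so the host is prefixed here.
        "permalink": "https://www.reddit.com" + raw["permalink"],
        "flair": raw.get("link_flair_text"),
        "createdAt": created.strftime(ISO_FORMAT),
        "extractedAt": extracted_at.astimezone(timezone.utc).strftime(ISO_FORMAT),
    }


# Illustrative raw object; the values are made up to match the table examples.
raw_post = {
    "id": "1sa4rlx", "subreddit": "MachineLearning",
    "title": "New paper on LLM reasoning", "author": "researcher123",
    "score": 1420, "upvote_ratio": 0.97, "num_comments": 83,
    "selftext": "We propose a new...", "url": "https://arxiv.org/...",
    "permalink": "/r/MachineLearning/comments/1sa4rlx/",
    "link_flair_text": "Research", "created_utc": 1776765600,
}
record = to_post_record(raw_post, datetime(2026, 4, 21, 12, 0, tzinfo=timezone.utc))
print(record["createdAt"], record["permalink"])
```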

Comments

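| Field       | Description                   | Example                  |
|-------------|-------------------------------|--------------------------|
| type        | Record type                   | "comment"                |
| id          | Comment ID                    | "abc123"                 |
| postId      | Parent post ID                | "1sa4rlx"                |
| subreddit   | Subreddit name                | "MachineLearning"        |
| author      | Username                      | "user456"                |
| body        | Comment text                  | "Great work, but..."     |
| score       | Upvotes minus downvotes       | 342                      |
| depth       | Nesting level (0 = top-level) | 0                        |
| permalink   | Direct link to comment        | "https://reddit.com/..." |
| createdAt   | Comment creation time         | "2026-04-21T10:05:00Z"   |
| extractedAt | Extraction timestamp          | "2026-04-21T12:00:00Z"   |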
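
The `depth` field reflects where a comment sits in the reply tree. The sketch below shows one way such a value could be derived by walking Reddit's nested comment JSON; `iter_comments` is a hypothetical helper, not the actor's implementation. It relies on two properties of Reddit's response format: comments have `kind` equal to `"t1"`, and `replies` is an empty string when a comment has no replies.

```python
from typing import Iterator


def iter_comments(children: list, post_id: str, depth: int = 0) -> Iterator[dict]:
    """Yield flat comment records, depth-first, from a Reddit comment listing."""
    for child in children:
        if child.get("kind") != "t1":  # skip "more" placeholders
            continue
        data = child["data"]
        yield {
            "type": "comment",
            "id": data["id"],
            "postId": post_id,
            "author": data["author"],
            "body": data["body"],
            "score": data["score"],
            "depth": depth,
        }
        replies = data.get("replies")
        # A dict means a nested Listing; an empty string means no replies.
        if isinstance(replies, dict):
            yield from iter_comments(replies["data"]["children"], post_id, depth + 1)


# Illustrative input with one top-level comment, one reply, and one "more" stub.
sample_children = [
    {"kind": "t1", "data": {
        "id": "c1", "author": "user456", "body": "Great work, but...", "score": 342,
        "replies": {"kind": "Listing", "data": {"children": [
            {"kind": "t1", "data": {"id": "c2", "author": "op", "body": "Thanks!",
                                    "score": 12, "replies": ""}},
            {"kind": "more", "data": {"children": ["c9"]}},
        ]}},
    }},
]

flat = list(iter_comments(sample_children, post_id="1sa4rlx"))
for comment in flat:
    print(comment["depth"], comment["id"], comment["score"])
```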

Use cases

  • AI training data: Clean text from expert communities for LLM fine-tuning
  • Sentiment analysis: Monitor brand mentions and user opinions
  • Market research: Track trends and discussions in niche communities
  • Competitive intelligence: See what problems users are discussing
  • RAG pipelines: Feed domain-specific knowledge into retrieval systems
  • Content research: Find top-performing posts for content strategy

How to use

  1. Open the actor and configure:
    • Subreddits: List subreddit names (e.g. MachineLearning, investing, python)
    • Sort: hot, new, top, or rising
    • Time filter: day, week, month, year, or all (applies only to the "top" sort)
    • Max posts: Maximum number of posts to extract per subreddit
    • Include comments: Whether to also extract each post's top comments
  2. Click Start
  3. Download results as JSON, CSV, or Excel
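
For programmatic runs, the same options can be expressed as a JSON input object. The sketch below assembles one; the key names (`subreddits`, `sort`, `timeFilter`, `maxPosts`, `includeComments`) are assumptions derived from the labels above, so check the actor's input schema for the real names before relying on them.

```python
import json


def build_run_input(subreddits, sort="hot", time_filter=None,
                    max_posts=25, include_comments=True) -> dict:
    """Assemble a run input dict. All key names here are hypothetical."""
    run_input = {
        "subreddits": list(subreddits),
        "sort": sort,
        "maxPosts": max_posts,
        "includeComments": include_comments,
    }
    # The time filter is only meaningful for the "top" sort.
    if sort == "top" and time_filter:
        run_input["timeFilter"] = time_filter
    return run_input


run_input = build_run_input(["MachineLearning", "python"], sort="top", time_filter="week")
print(json.dumps(run_input, indent=2))
```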

Example output (JSON)

[
  {
    "type": "post",
    "id": "1sa4rlx",
    "subreddit": "MachineLearning",
    "title": "[D] New method achieves SOTA on reasoning benchmarks",
    "author": "ml_researcher",
    "score": 1420,
    "upvoteRatio": 0.97,
    "numComments": 83,
    "selftext": "We introduce a novel approach...",
    "url": "https://arxiv.org/abs/2504.12345",
    "permalink": "https://www.reddit.com/r/MachineLearning/comments/1sa4rlx/",
    "flair": "Research",
    "createdAt": "2026-04-21T10:00:00.000Z",
    "extractedAt": "2026-04-21T12:00:00.000Z"
  },
  {
    "type": "comment",
    "id": "kxyz789",
    "postId": "1sa4rlx",
    "subreddit": "MachineLearning",
    "author": "deep_learner",
    "body": "Impressive results. Did you test on out-of-distribution benchmarks?",
    "score": 342,
    "depth": 0,
    "permalink": "https://www.reddit.com/r/MachineLearning/comments/1sa4rlx/comment/kxyz789/",
    "createdAt": "2026-04-21T10:05:00.000Z",
    "extractedAt": "2026-04-21T12:00:00.000Z"
  }
]
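
Because posts and comments arrive in one flat list, downstream code usually needs to re-attach comments to their posts. A minimal sketch, assuming the output shape shown above (the `group_by_post` helper and the `comments` key it adds are illustrative, not part of the actor's output):

```python
import json


def group_by_post(items: list) -> list:
    """Nest comment records under their parent post via `postId`."""
    posts = {
        item["id"]: {**item, "comments": []}
        for item in items
        if item["type"] == "post"
    }
    for item in items:
        if item["type"] == "comment" and item["postId"] in posts:
            posts[item["postId"]]["comments"].append(item)
    return list(posts.values())


# A trimmed version of the example output, inlined so the sketch is self-contained.
exported = json.loads("""
[
  {"type": "post", "id": "1sa4rlx", "title": "[D] New method achieves SOTA on reasoning benchmarks", "score": 1420},
  {"type": "comment", "id": "kxyz789", "postId": "1sa4rlx", "body": "Impressive results.", "score": 342, "depth": 0}
]
""")

threads = group_by_post(exported)
for thread in threads:
    print(thread["title"], "has", len(thread["comments"]), "comment(s)")
```

In practice the list would come from the downloaded JSON file, for example via `json.load(open(path))`.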

Pricing

This actor charges $0.002 USD per extracted item; each post and each comment counts as one item. For example, extracting 100 posts with 10 comments each yields 100 + 1,000 = 1,100 items, which costs 1,100 × $0.002 = $2.20 USD.
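
The arithmetic above can be wrapped in a small estimator for budgeting a run. This is a sketch based only on the per-item price stated here; the function name is illustrative, and it assumes every post yields exactly the requested number of comments, which real threads may not.

```python
from decimal import Decimal

PRICE_PER_ITEM_USD = Decimal("0.002")  # per-item price stated on this page


def estimate_cost(posts: int, comments_per_post: int = 0) -> Decimal:
    """Estimate the cost in USD; each post and each comment counts as one item."""
    items = posts + posts * comments_per_post
    return items * PRICE_PER_ITEM_USD


print(estimate_cost(100, 10))  # 2.200
```

`Decimal` is used instead of `float` so that the result is an exact currency amount rather than a binary approximation.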


Keywords

reddit scraper, subreddit posts extractor, reddit comments scraper, reddit data for AI, reddit sentiment analysis, reddit thread extractor, social media scraper, reddit API scraper, NLP training data, reddit market research


This actor extracts only publicly available data from Reddit, using Reddit's official public JSON API (reddit.com/r/{subreddit}.json), in compliance with Chilean Law 19.628 on the Protection of Private Life (Ley 19.628 sobre Protección de la Vida Privada).

What this actor does NOT collect:

  • Private messages or non-public posts
  • Email addresses or personal contact information
  • Data from private or restricted subreddits
  • Any data not freely visible to anonymous visitors

What this actor collects:

  • Post titles, body text, and metadata (public content)
  • Publicly visible usernames and comment text
  • Engagement metrics (score, upvotes, comment counts)

All data is publicly accessible without authentication via Reddit's JSON API. Users are solely responsible for ensuring their use of this data complies with applicable laws and Reddit's terms of service.