Reddit Thread Scraper
Pricing
from $2.00 / 1,000 results
Reddit Thread Scraper
Extract posts and top comments from any Reddit thread or subreddit. Returns post title, author, score, URL, body text, and top-voted comments with full metadata. Ideal for sentiment analysis, research, AI training datasets, and community monitoring. No API key required.
Pricing
from $2.00 / 1,000 results
Rating
0.0
(0)
Developer
Sheshinmcfly
Actor stats
1
Bookmarked
2
Total users
0
Monthly active users
2 hours ago
Last modified
Categories
Share
Extract posts and comments from any subreddit via Reddit's official public JSON API. No authentication required. Filter by sort order, time range, and number of comments.
Perfect for AI training datasets, sentiment analysis, market research, and trend monitoring.
What data does it extract?
Posts
| Field | Description | Example |
|---|---|---|
type | Record type | "post" |
id | Reddit post ID | "1sa4rlx" |
subreddit | Subreddit name | "MachineLearning" |
title | Post title | "New paper on LLM reasoning" |
author | Username | "researcher123" |
score | Upvotes - downvotes | 1420 |
upvoteRatio | Upvote ratio | 0.97 |
numComments | Total comment count | 83 |
selftext | Post body text | "We propose a new..." |
url | Link URL | "https://arxiv.org/..." |
permalink | Reddit post URL | "https://reddit.com/r/..." |
flair | Post flair label | "Research" |
createdAt | Post creation time | "2026-04-21T10:00:00Z" |
extractedAt | Extraction timestamp | "2026-04-21T12:00:00Z" |
Comments
| Field | Description | Example |
|---|---|---|
type | Record type | "comment" |
id | Comment ID | "abc123" |
postId | Parent post ID | "1sa4rlx" |
author | Username | "user456" |
body | Comment text | "Great work, but..." |
score | Upvotes - downvotes | 342 |
depth | Nesting level (0 = top-level) | 0 |
permalink | Direct link to comment | "https://reddit.com/..." |
createdAt | Comment creation time | "2026-04-21T10:05:00Z" |
Use cases
- AI training data: Clean text from expert communities for LLM fine-tuning
- Sentiment analysis: Monitor brand mentions and user opinions
- Market research: Track trends and discussions in niche communities
- Competitive intelligence: See what problems users are discussing
- RAG pipelines: Feed domain-specific knowledge into retrieval systems
- Content research: Find top-performing posts for content strategy
How to use
- Open the actor and configure:
- Subreddits: List subreddit names (e.g.
MachineLearning,investing,python) - Sort: hot, new, top, or rising
- Time filter: For "top" sort — day, week, month, year, all
- Max posts: Cap per subreddit
- Include comments: Also extract top comments
- Subreddits: List subreddit names (e.g.
- Click Start
- Download results as JSON, CSV, or Excel
Example output (JSON)
[{"type": "post","id": "1sa4rlx","subreddit": "MachineLearning","title": "[D] New method achieves SOTA on reasoning benchmarks","author": "ml_researcher","score": 1420,"upvoteRatio": 0.97,"numComments": 83,"selftext": "We introduce a novel approach...","url": "https://arxiv.org/abs/2504.12345","permalink": "https://www.reddit.com/r/MachineLearning/comments/1sa4rlx/","flair": "Research","createdAt": "2026-04-21T10:00:00.000Z","extractedAt": "2026-04-21T12:00:00.000Z"},{"type": "comment","id": "kxyz789","postId": "1sa4rlx","subreddit": "MachineLearning","author": "deep_learner","body": "Impressive results. Did you test on out-of-distribution benchmarks?","score": 342,"depth": 0,"permalink": "https://www.reddit.com/r/MachineLearning/comments/1sa4rlx/comment/kxyz789/","createdAt": "2026-04-21T10:05:00.000Z","extractedAt": "2026-04-21T12:00:00.000Z"}]
Pricing
This actor charges $0.002 USD per item extracted (posts and comments each count as one item). Extracting 100 posts with 10 comments each = 1,100 items ≈ $2.20 USD.
Keywords
reddit scraper, subreddit posts extractor, reddit comments scraper, reddit data for AI, reddit sentiment analysis, reddit thread extractor, social media scraper, reddit API scraper, NLP training data, reddit market research
Legal Disclaimer
This actor extracts publicly available data only from Reddit using Reddit's official public JSON API (reddit.com/r/{subreddit}.json), in compliance with Chilean Law 19.628 on the Protection of Private Life (Ley 19.628 sobre Protección de la Vida Privada).
What this actor does NOT collect:
- Private messages or non-public posts
- Email addresses or personal contact information
- Data from private or restricted subreddits
- Any data not freely visible to anonymous visitors
What this actor collects:
- Post titles, body text, and metadata (public content)
- Publicly visible usernames and comment text
- Engagement metrics (score, upvotes, comment counts)
All data is publicly accessible without authentication via Reddit's JSON API. Users are solely responsible for ensuring their use of this data complies with applicable laws and Reddit's terms of service.