👾 Reddit Data Extractor
Scrape Reddit data to train AI models or build NLP datasets. Extract posts, comments, and user details via public API endpoints with no browser required.
Pricing: Pay per event
Developer: Taro Yamada
💬 Reddit Scraper
Scrape Reddit at scale to build high-quality datasets for AI training, machine learning, and NLP applications. This developer-focused Reddit Data Extractor bypasses the overhead of a headless web browser, extracting unstructured community conversations and turning them into clean, structured data using public API endpoints. If you need to gather millions of words for text analysis or train large language models, this tool lets you extract posts, nested comments, and thread details with incredible speed.
Data scientists and AI engineers run this scraper to compile extensive linguistic datasets, analyze user sentiment across specific pages, and track digital subcultures. Instead of struggling with rate limits or complex authentication tools, you can seamlessly integrate this scraper into your existing data pipelines. Schedule it to run nightly to capture the newest discussions, or use search filters to scrape historical top posts for comprehensive analysis.
The extracted results include rich metadata essential for advanced processing. Every run yields precise details such as created_utc, score, author, selftext, and full URL links. Once your scraped data is ready, you can export the results via API to seamlessly feed your vector databases or analytical models.
Store Quickstart
Start with the Quickstart template (1 subreddit, hot, 25 posts). For sentiment analysis, use Deep Scrape with comments enabled.
Key Features
- 💬 Official Reddit JSON API — Uses old.reddit.com/r/{sub}/{sort}.json
- 🔀 Multiple sort modes — hot, new, top, rising with time filters
- 💭 Comments included — Optional nested comment extraction
- 📊 Post metadata — Score, author, subreddit, created_utc, num_comments
- 🧩 Self + link posts — Both text posts and URL submissions
- 🔑 No API key needed — Uses public JSON endpoints
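The endpoint pattern from the feature list can be sketched in a few lines of Python. The URL shape and the `data.children[].data` listing structure are the standard public Reddit JSON format; the actor's internal fetch logic is not shown here, so treat `build_listing_url` and `extract_posts` as illustrative helpers rather than the actor's actual code:

```python
def build_listing_url(subreddit: str, sort: str = "hot", limit: int = 25) -> str:
    """Build the public listing URL, e.g. old.reddit.com/r/programming/hot.json."""
    return f"https://old.reddit.com/r/{subreddit}/{sort}.json?limit={limit}"


def extract_posts(listing: dict) -> list[dict]:
    """Pull post objects out of a Reddit listing response
    (listings nest posts under data.children[].data)."""
    return [child["data"] for child in listing.get("data", {}).get("children", [])]


url = build_listing_url("programming", "hot", 50)
print(url)  # https://old.reddit.com/r/programming/hot.json?limit=50
```

No authentication header is needed for these endpoints, which is why the actor can run without an API key.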
Use Cases
| Who | Why |
|---|---|
| Market researchers | Analyze consumer sentiment on brand/product subreddits |
| Crisis monitoring | Track negative mentions in real-time |
| Content marketers | Discover trending topics and user pain points |
| Gaming/media analysts | Monitor fan community reactions |
| Academic researchers | Collect Reddit datasets for NLP research |
Input
| Field | Type | Default | Description |
|---|---|---|---|
| subreddits | string[] | (required) | Subreddit names (max 20) |
| sort | string | hot | hot, new, top, rising |
| maxItems | integer | 25 | Max posts per subreddit (1-500) |
| includeComments | boolean | false | Include nested comments |
Input Example
{
  "subreddits": ["programming", "technology"],
  "sort": "hot",
  "maxItems": 50,
  "includeComments": true
}
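A client-side sanity check that mirrors the constraints in the input table (1-20 subreddits, maxItems between 1 and 500, one of four sort modes) can catch mistakes before you pay for a run. The actor presumably validates input itself; `validate_input` is a hypothetical helper for your own pipeline, not part of the actor:

```python
VALID_SORTS = {"hot", "new", "top", "rising"}


def validate_input(run_input: dict) -> dict:
    """Check a run input against the constraints from the input table
    and fill in the documented defaults."""
    subs = run_input.get("subreddits") or []
    if not 1 <= len(subs) <= 20:
        raise ValueError("subreddits must contain 1-20 names")
    sort = run_input.get("sort", "hot")
    if sort not in VALID_SORTS:
        raise ValueError(f"sort must be one of {sorted(VALID_SORTS)}")
    max_items = run_input.get("maxItems", 25)
    if not 1 <= max_items <= 500:
        raise ValueError("maxItems must be between 1 and 500")
    return {
        "subreddits": subs,
        "sort": sort,
        "maxItems": max_items,
        "includeComments": bool(run_input.get("includeComments", False)),
    }
```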
Output
| Field | Type | Description |
|---|---|---|
| id | string | Reddit post ID |
| title | string | Post title |
| author | string | Username of poster |
| subreddit | string | Subreddit name |
| url | string | Permalink to post |
| score | integer | Upvote score |
| numComments | integer | Comment count |
| createdAt | string | ISO timestamp |
| selftext | string | Post body (for text posts) |
| comments | object[] | Top comments (if includeComments enabled) |
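Downstream NLP or embedding pipelines usually want flat text rows rather than nested post objects. A minimal flattening sketch, assuming the field names from the output table above (`to_text_records` is a hypothetical helper, not part of the actor's output):

```python
def to_text_records(items: list[dict]) -> list[dict]:
    """Flatten scraped posts (and optional nested comments) into
    plain-text rows suitable for sentiment analysis or embeddings."""
    records = []
    for post in items:
        text = f"{post.get('title', '')}\n{post.get('selftext', '')}".strip()
        records.append({"kind": "post", "subreddit": post.get("subreddit"), "text": text})
        for comment in post.get("comments") or []:
            records.append({
                "kind": "comment",
                "subreddit": post.get("subreddit"),
                "text": comment.get("body", ""),
            })
    return records
```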
Output Example
{
  "title": "New JavaScript framework released",
  "author": "dev_user",
  "score": 1250,
  "url": "https://example.com/framework",
  "selftext": "Detailed writeup inside...",
  "subreddit": "programming",
  "createdUtc": 1712345678,
  "numComments": 342,
  "comments": [{"author": "...", "body": "..."}]
}
API Usage
Run this actor programmatically using the Apify API. Replace YOUR_API_TOKEN with your token from Apify Console → Settings → Integrations.
cURL
curl -X POST "https://api.apify.com/v2/acts/taroyamada~reddit-data-scraper/run-sync-get-dataset-items?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"subreddits": ["programming", "technology"], "sort": "hot", "maxItems": 50, "includeComments": true}'
Python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("taroyamada/reddit-data-scraper").call(run_input={
    "subreddits": ["programming", "technology"],
    "sort": "hot",
    "maxItems": 50,
    "includeComments": True,
})
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)
JavaScript / Node.js
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor('taroyamada/reddit-data-scraper').call({
    subreddits: ['programming', 'technology'],
    sort: 'hot',
    maxItems: 50,
    includeComments: true,
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);
Tips & Limitations
⚠️ Proxy Required on Apify Datacenter
Reddit blocks the majority of Apify's shared datacenter IPs. Without a proxy:
- Runs on Apify infrastructure will fail with runStatus: all_blocked and exit code 1.
- The output meta.subredditResults shows which subreddits were blocked vs. successful.
To fix: In the actor's .env, set APIFY_USE_APIFY_PROXY=true and APIFY_PROXY_GROUPS=RESIDENTIAL before running npm run apify:cloud:setup, or set PROXY_URL to your own residential proxy.
Local / home ISP runs work without a proxy; the block only affects datacenter IPs.
If you bootstrap recurring cloud tasks with npm run apify:cloud:setup, set APIFY_USE_APIFY_PROXY=true, APIFY_PROXY_GROUPS=RESIDENTIAL, and APIFY_RESTART_ON_ERROR=false in .env so the cloud run uses the internal Apify residential proxy path and does not auto-retry four identical blocked runs.
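Putting those variables together, a minimal .env for proxied scheduled cloud runs would look like this (values taken directly from the settings above):

```shell
APIFY_USE_APIFY_PROXY=true
APIFY_PROXY_GROUPS=RESIDENTIAL
APIFY_RESTART_ON_ERROR=false
```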
Other Tips
- Use sort: "top" with a time filter for high-quality content discovery.
- Set includeComments: true for sentiment analysis workflows.
- Track subreddits in your industry to spot trends and customer pain points.
FAQ
Does Reddit block this?
Yes — Reddit blocks most Apify datacenter IPs. On Apify infrastructure, runs without a proxy
will fail with runStatus: all_blocked (exit code 1) and 0 posts. Configure a residential
proxy in the actor's Proxy tab or via PROXY_URL env var. Runs from a home/ISP IP work fine.
For scheduled cloud runs, set APIFY_RESTART_ON_ERROR=false to avoid repeated retries after a
known block.
What is runStatus in the output?
| Value | Meaning |
|---|---|
| ok | All subreddits fetched successfully |
| partial | Some subreddits succeeded; others were blocked or errored |
| all_blocked | Every subreddit was blocked — no posts collected (exit code 1) |
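In a pipeline you would branch on these values rather than eyeball them. A sketch of that branching, assuming meta.subredditResults maps each subreddit name to a per-subreddit status string with "ok" for success (that shape is an assumption; only the three runStatus values above are documented):

```python
def summarize_run(meta: dict) -> str:
    """Map the actor's runStatus to an operator-facing message.
    ASSUMPTION: meta["subredditResults"] maps subreddit -> status string."""
    status = meta.get("runStatus")
    if status == "ok":
        return "all subreddits fetched"
    if status == "all_blocked":
        return "all subreddits blocked - configure a residential proxy"
    if status == "partial":
        results = meta.get("subredditResults") or {}
        blocked = sorted(s for s, r in results.items() if r != "ok")
        return f"partial run; blocked or errored: {blocked}"
    return f"unknown runStatus: {status!r}"
```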
Can I scrape private subreddits?
No. Only public subreddits accessible to unauthenticated users.
How many comments per post?
Top-level comments only, limited to ~200 per post (Reddit API default).
What's the difference from Apify's reddit-scraper-lite?
No DOM dependency, cleaner output schema, proxy fallback built-in, and honest degraded-path reporting when requests are blocked.
Related Actors
DevOps & Tech Intel cluster — explore related Apify tools:
- 🌐 DNS Propagation Checker — Check DNS propagation across 8 global resolvers (Google, Cloudflare, Quad9, OpenDNS).
- 🔍 Subdomain Finder — Discover subdomains for any domain using Certificate Transparency logs (crt.
- 🧹 CSV Data Cleaner — Clean CSV data: trim whitespace, remove empty rows, deduplicate by columns, sort.
- 📦 NPM Package Analyzer — Analyze npm packages: download stats, dependencies, licenses, deprecation status.
- GitHub Release & Changelog Monitor API — Track GitHub releases, tags, release notes, and changelog drift over time with one summary-first repository row per repo.
- Docs & Changelog Drift Monitor API — Monitor release notes, changelog pages, migration guides, and key docs pages with one summary-first target row per monitored repo, SDK, or product.
- Tech Events Calendar API | Conferences + CFP — Aggregate tech conferences and CFPs across multiple sources into a deduplicated event calendar for DevRel and recruiting workflows.
- 🔒 OSS Vulnerability Monitor — Monitor open-source packages for known security vulnerabilities using OSV and GitHub Security Advisories.
Cost
Pay Per Event:
- actor-start: $0.01 (flat fee per run)
- dataset-item: $0.003 per output item
Example: 1,000 items = $0.01 + (1,000 × $0.003) = $3.01
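The arithmetic above is simple enough to fold into a budget check before launching large runs; a one-function sketch using the documented fees:

```python
def estimate_cost(items: int, start_fee: float = 0.01, per_item: float = 0.003) -> float:
    """Pay-per-event cost: flat actor-start fee plus a per-dataset-item fee."""
    return round(start_fee + items * per_item, 2)


print(estimate_cost(1000))  # 3.01
```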
No subscription required — you only pay for what you use.
⭐ Was this helpful?
If this actor saved you time, please leave a ★ rating on Apify Store. It takes 10 seconds, helps other developers discover it, and keeps updates free.
Bug report or feature request? Open an issue on the Issues tab of this actor.
