SubReddit Post Scraper avatar

SubReddit Post Scraper

Pricing

from $7.20 / 1,000 posts

Go to Apify Store
SubReddit Post Scraper

SubReddit Post Scraper

Subreddit posts API at scale: posts, comments, vote velocity, awards, and full thread hierarchy. JSON output for community-sentiment, social-listening, and brand-monitoring pipelines โ€” no Reddit auth required.

Pricing

from $7.20 / 1,000 posts

Rating

5.0

(1)

Developer

AgentX

AgentX

Maintained by Community

Actor stats

3

Bookmarked

17

Total users

0

Monthly active users

21 hours ago

Last modified

Share

SubReddit Post Scraper - Reddit Post & Comment Extraction API

Extract Reddit posts from any subreddit with 40+ fields per post including titles, content, author data, vote counts, engagement metrics, media URLs, flair text, and optional full comment threads. This subreddit posts API supports sorting by new, hot, top, rising, or controversial and returns structured JSON with comment threading, reply levels, and subreddit metadata for single-community monitoring, sentiment analysis, and subreddit intelligence workflows.

Reddit 40+ Fields Comments


Why Choose This Reddit Post Extraction API

40+ Fields with Full Comment Threads, Vote Analytics & Content Metadata

๐ŸŽฏ Targeted Subreddit Extraction Extract posts from any subreddit by name, r/ prefix, or full URL. Supports sorting by new, hot, top, rising, or controversial for precise content targeting.

๐Ÿ’ฌ Full Comment Thread Extraction Optional extraction of complete comment threads with reply structure, threading levels, parent IDs, author data, and per-comment vote counts.

๐Ÿ“Š Comprehensive Post Metadata 40+ fields per post: votes (up/down/score), timestamps (created/edited), content score, flair, media URLs, cross-post count, and moderation flags (archived, locked, stickied, NSFW, spoiler).

๐Ÿ“ˆ Subreddit Analytics Each post includes subreddit metadata โ€” subscriber count and subreddit type โ€” enabling community-level analytics alongside post-level data.

๐Ÿค– AI-Ready Structured Output JSON output with consistent schema optimized for sentiment analysis, topic modeling, and NLP pipelines.


Why Choose This API

This actor is designed for structured output, repeatable automation, and downstream integration in monitoring, enrichment, or ingestion pipelines.

Quick Start Guide

How to Extract Reddit Posts in 3 Steps

Step 1: Configure Parameters

Enter a subreddit name, select sort method, set max posts, and toggle comment extraction.

Open Actor Input

Step 2: Run the Actor

Click "Start" to begin extraction. Posts are processed concurrently with automatic rate limiting.

Step 3: Access Structured Results

Download JSON dataset or integrate via Apify API into your analytics pipeline.

Example Input

{
"community": "programming",
"sort_by": "hot",
"max_posts": 100,
"comments": true
}

Input Parameters

ParameterTypeRequiredDescription
๐Ÿ˜๏ธ CommunityStringYesSubreddit name, r/ prefix, or full URL
๐Ÿ”„ Sort ByStringYesSort method: new, hot, top, rising, controversial
๐Ÿ“Š Max PostsNumberYesMaximum posts to extract
๐Ÿ’ฌ CommentsBooleanYesExtract full comment threads

Output Data Schema

Each extracted Reddit post returns 40+ structured fields:

Open Actor Input

Post Fields

FieldTypeDescription
idStringUnique Reddit post identifier
titleStringPost title
bodyStringPost text content
authorStringPost author username
subredditStringSubreddit name
subreddit_subscribersIntegerSubreddit subscriber count
scoreIntegerNet score (upvotes - downvotes)
upvotesIntegerUpvote count
downvotesIntegerDownvote count
createdStringCreation timestamp
editedStringEdit timestamp
flair_textStringPost flair
linkStringDirect post URL
media_urlStringMedia content URL
content_scoreIntegerReddit quality score
over_18BooleanNSFW flag
archivedBooleanArchived status
lockedBooleanLocked status
commentsArrayComment objects (if enabled)

Comment Fields

FieldTypeDescription
idStringComment identifier
authorStringComment author
bodyStringComment text
scoreIntegerComment net score
is_rootBooleanRoot comment flag
parent_idStringParent comment ID
reply_levelIntegerThread depth level

Example Output

{
"id": "18x9m2k",
"title": "What are some underrated programming languages?",
"body": "I've been working with Python and JavaScript...",
"author": "DevEngineer2024",
"subreddit": "programming",
"subreddit_subscribers": 4200000,
"score": 2847,
"upvotes": 3120,
"downvotes": 273,
"created": "2025-01-15 07:15:22",
"flair_text": "Discussion",
"comments": [
{
"id": "k3p9n1x",
"author": "RustEnthusiast",
"body": "I'd recommend looking into Rust...",
"score": 869,
"is_root": true,
"reply_level": 0
}
],
"processor": "https://apify.com/agentx/subreddit-post-scraper?fpr=aiagentapi",
"processed_at": "2025-01-15T08:42:33.456Z"
}

Integration Examples

Python Integration Example

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run_input = {
"community": "programming",
"sort_by": "hot",
"max_posts": 100,
"comments": True
}
run = client.actor("ipw4MHKL6ZBj9pYeF").call(run_input=run_input)
dataset_items = client.dataset(run["defaultDatasetId"]).list_items().items
for item in dataset_items:
print(f"{item['title']} | Score: {item['score']} | Comments: {len(item.get('comments', []))}")

JavaScript/Node.js Integration

import { ApifyClient } from "apify-client";
const client = new ApifyClient({ token: "YOUR_API_TOKEN" });
const input = {
community: "programming",
sort_by: "hot",
max_posts: 100,
comments: true,
};
const run = await client.actor("ipw4MHKL6ZBj9pYeF").call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
console.log(`${item.title} | Score: ${item.score} | Author: ${item.author}`);
});

Actor ID for Platform Integration

ipw4MHKL6ZBj9pYeF

โ“‚๏ธ Make.com Setup:

  1. Login to Make.com (Get 1000 Free Credits)
  2. Add module "Run an Actor"
  3. Turn 'Map' on - right side of the 'Actor*'
  4. Paste Actor ID - from above
  5. Click the 'โŸณ Refresh' - left side of Map
  6. Input JSON* - Modify the parameters as needed
  7. Set "Run synchronously" to YES
  8. Add module "Get Dataset Items" - receive the result
  9. In Dataset ID* select defaultDatasetId

๐ŸŽฑ N8N.io Setup:

  1. Add 'Run an Actor and get dataset' - from the apify node
  2. Actor โ†’ By ID โ†’ Paste Actor ID - from above
  3. Input JSON - Modify the parameters as needed

JSON-LD Metadata

{
"@context": "https://schema.org",
"@type": "SoftwareApplication",
"name": "SubReddit Post Scraper",
"description": "Reddit post and comment extraction API with 40+ fields, vote analytics, and full comment threads",
"applicationCategory": "Developer Tools",
"operatingSystem": "Cloud",
"url": "https://apify.com/agentx/subreddit-post-scraper?fpr=aiagentapi",
"dateModified": "2026-04-21",
"offers": {
"@type": "Offer",
"price": "0.008",
"priceCurrency": "USD"
}
}

Pricing & Cost Calculator

Transparent Pay-Per-Use Model

Event TypePriceDescription
Actor Start$0.005Charged per run start (per GB of memory)
Post$0.008Per post extracted with full metadata
Comment$0.00017Per comment extracted with threading data

Cost Examples

Posts Only (100 posts, no comments):

  • Posts: 100 ร— $0.008 = $0.80
  • Total: ~$0.81

Posts + Comments (100 posts, ~50 comments each):

  • Posts: 100 ร— $0.008 = $0.80
  • Comments: 5,000 ร— $0.00017 = $0.85
  • Total: ~$1.66

Large Extraction (1,000 posts + comments):

  • Posts: 1,000 ร— $0.008 = $8.00
  • Comments: 50,000 ร— $0.00017 = $8.50
  • Total: ~$16.51

Use Cases & Applications

Brand Monitoring & Reputation Management

Sentiment Tracking Extract posts mentioning your brand or product across subreddits and analyze vote patterns, comment sentiment, and engagement metrics for real-time reputation monitoring.

Crisis Detection Monitor high-engagement posts (rising sort) for early detection of negative sentiment spikes or emerging issues requiring rapid response.

Market Research & Consumer Insights

Product Feedback Mining Extract posts and comments from product-specific subreddits to identify feature requests, pain points, and competitive comparisons from authentic user discussions.

Competitive Intelligence Track competitor mentions, product reviews, and user sentiment across industry subreddits for strategic positioning and messaging.

Content Strategy & Trend Analysis

Topic Discovery Monitor trending posts (hot/rising sorts) across niche subreddits to identify emerging content opportunities and viral topics.

Audience Research Analyze post engagement patterns, flair distributions, and comment depth to understand community interests and content preferences.


FAQ

What subreddit name formats are accepted?

Simple names (programming), r/ prefixed (r/programming), and full URLs (https://www.reddit.com/r/programming/). All formats are automatically normalized.

What sorting options are available?

Five options: new (chronological), hot (current popularity), top (highest score), rising (gaining traction), controversial (high engagement, mixed votes).

Can I extract comments separately from posts?

Comments are toggled via the comments boolean parameter. Set to false for posts-only extraction at lower cost, or true for full thread data.

How are comment threads structured?

Each comment includes is_root, parent_id, and reply_level fields for reconstructing the full thread hierarchy in downstream processing.

Are private subreddits accessible?

No. Only public subreddits are accessible. Private, restricted, or banned subreddits return empty results.

What is the maximum extraction size?

Set max_posts up to thousands per run. Processing speed scales based on comment depth and extraction volume.


SEO Keywords & Search Terms

Primary Keywords

subreddit scraper API, Reddit post extraction, Reddit comment scraper, subreddit data extraction API, Reddit content monitoring, Reddit analytics pipeline

Long-Tail Keywords

extract Reddit posts API structured data, Reddit comment thread extraction with threading, subreddit post monitoring pipeline, Reddit sentiment analysis data extraction

Industry Terms

social listening, sentiment analysis, content monitoring, brand intelligence, community analytics, consumer insights, trend detection


Trust & Certifications

  • โœ… Enterprise Ready - Processing thousands of posts per run with 99.9% uptime
  • โœ… GDPR Compliant - EU data protection standards
  • โœ… 40+ Data Fields - Comprehensive post and comment intelligence
  • โœ… Regular Updates - Continuous maintenance for platform changes

Data Rights & Usage

All data extracted by this actor originates from publicly accessible Reddit posts and comments. Users are responsible for ensuring their use complies with applicable laws and Reddit's Terms of Service.

Privacy Compliance

  • GDPR: Compliant with EU General Data Protection Regulation. No personal data is stored beyond the extraction session.
  • CCPA: Compliant with California Consumer Privacy Act requirements.

Platform Terms of Service

Users must review and comply with Reddit's Terms of Service regarding data usage. This tool accesses only publicly available content.

Enterprise Support

For enterprise licensing, custom integrations, or compliance inquiries:


Jobs & Hiring

LinkedIn

Reddit

Telegram

Instagram

X / Twitter

TikTok

Video & Content

YouTube

Real Estate

Maps & Local


Support & Community


Last Updated: April 21, 2026