SubReddit Post Scraper

Pricing

from $7.20 / 1,000 posts

Bulk-extract up to 10,000 posts from any subreddit, sorted by new, top, hot, rising, or controversial, with each post returning title, body, score, upvote ratio, flair, author, and permalink. Optionally include up to 10,000 comments per post for full-thread sentiment and discussion mining.

Rating: 5.0 (1)

Developer: AgentX (Maintained by Community)

Actor stats: 3 bookmarked, 17 total users, 1 monthly active user, last modified 3 days ago

SubReddit Post Scraper - SubReddit Post Data Extraction API

SubReddit Post Scraper is a subreddit post intelligence API that extracts structured Reddit post and comment threads, engagement scoring, flair taxonomy, and moderation flags from any subreddit in a single subreddit-name-driven run. SubReddit Post Scraper returns structured records per post, including post title, body text, score, upvotes, downvotes, upvote ratio, crosspost count, flair text and color, author username, post date, post URL, image and video URLs, archived/locked/stickied/pinned/hidden moderation flags, NSFW flag, and full comment threads with author, body, score, and parent linkage. Coverage spans all public Reddit communities globally with five sort modes (new/top/hot/rising/controversial). Built for social listening, brand monitoring, community sentiment analysis, NLP training-data construction, viral-content discovery, ad-creative research, and Reddit-content-intelligence pipelines. Per-post pay-per-result pricing at $0.00773 with no monthly minimum.


Why Choose This API

Structured SubReddit Post Data for B2B Social Intelligence Pipelines

📊 Engagement Signal Extraction: The score, upvotes, downvotes, and crossposts fields deliver granular engagement signals per post, enabling vote velocity analysis, content impact scoring, and engagement quality assessment for social listening and brand intelligence pipelines.

🔄 Sort Mode Flexibility: Five sort modes (new, top, hot, rising, and controversial) enable precision-targeted extraction: emerging signals, high-impact content, trending topics, gaining-momentum posts, or polarizing sentiment for specific research workflows.

💬 Full Comment Thread Extraction: Configure max_comments to extract complete discussion threads per post, enabling deep audience sentiment analysis, topic clustering, discussion network analysis, and community voice capture for qualitative research pipelines.

🏷️ Flair & Community Context: The flair_text, subreddit, subreddit_type, and subreddit_subscribers fields provide rich community context, enabling subreddit audience segmentation, content category classification, and topic taxonomy mapping.

🔒 Moderation Intelligence: The archived, locked, stickied, pinned, and hidden flags reveal post lifecycle states, enabling content moderation research, post longevity scoring, and community governance intelligence.


Quick Start Guide

How to Extract SubReddit Posts in 3 Steps

Step 1: Choose the Target Subreddit

Enter the subreddit name with or without the r/ prefix, or as a full Reddit URL (case-insensitive).
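The accepted input forms can be illustrated with a small normalization helper. This is only a sketch of how such values might be reduced to a bare subreddit name: `normalize_subreddit` is a hypothetical helper that is not part of the actor, and lowercasing the result is an assumption about what "case-insensitive" means here. The actor's own parsing may differ.

```python
import re


def normalize_subreddit(value: str) -> str:
    """Hypothetical helper: reduce a name, r/ name, or Reddit URL to a bare name."""
    text = value.strip()
    # Full URL form: take the path segment that follows "/r/".
    match = re.search(r"reddit\.com/r/([A-Za-z0-9_]+)", text, re.IGNORECASE)
    if match:
        return match.group(1).lower()
    # Short form: drop an optional leading "/" and an optional "r/" prefix.
    match = re.fullmatch(r"/?(?:r/)?([A-Za-z0-9_]+)/?", text, re.IGNORECASE)
    if match:
        return match.group(1).lower()
    raise ValueError(f"not a recognizable subreddit: {value!r}")
```

All three documented forms reduce to the same value, so `normalize_subreddit("r/DataScience")` and `normalize_subreddit("https://www.reddit.com/r/datascience/")` both return the bare lowercase name.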

Step 2: Configure Sort Mode and Volume

Select sort_by mode (new/top/hot/rising/controversial) and set max_posts and max_comments per post.

Step 3: Download Structured Results

Export results as JSON or CSV, or consume via the Apify dataset API.
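For the dataset API route, the sketch below builds a URL for the Apify "get dataset items" endpoint. `dataset_items_url` is a hypothetical helper name; the `format` and `limit` query parameters belong to the Apify API, but which formats suit your workflow, and how you authenticate for non-public datasets, are left as assumptions to verify against the Apify documentation.

```python
from typing import Optional
from urllib.parse import urlencode


def dataset_items_url(dataset_id: str, fmt: str = "json",
                      limit: Optional[int] = None) -> str:
    """Hypothetical helper: build the dataset items URL for a finished run."""
    params = {"format": fmt}
    if limit is not None:
        # Cap the number of records returned by a single request.
        params["limit"] = limit
    query = urlencode(params)
    return f"https://api.apify.com/v2/datasets/{dataset_id}/items?{query}"
```

The `dataset_id` would typically be the `defaultDatasetId` of a finished run.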


Input Parameters

Configuration Fields

| Parameter | Type | Required | Description | Example Values |
| --- | --- | --- | --- | --- |
| community | string | ✅ | Subreddit name, with/without r/ prefix, or full URL | "datascience", "r/investing" |
| sort_by | select | ✅ | Sort mode: new / top / hot / rising / controversial | "top", "hot", "new" |
| max_posts | integer | ✅ | Maximum posts to extract (1–10,000) | 100, 500, 1000 |
| max_comments | integer | ✅ | Maximum comments per post (0 = disabled) | 0, 20, 100 |

Example Input Configuration

{
  "community": "datascience",
  "sort_by": "top",
  "max_posts": 500,
  "max_comments": 20
}
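Checking an input locally before starting a run can catch typos early. The following is a minimal sketch based only on the constraints listed in the parameter table (five sort modes, 1 to 10,000 posts, non-negative comment limit). `validate_input` is a hypothetical helper, and the actor may apply additional or different checks.

```python
ALLOWED_SORTS = {"new", "top", "hot", "rising", "controversial"}


def validate_input(run_input: dict) -> dict:
    """Hypothetical pre-flight check; returns the input or raises ValueError."""
    community = run_input.get("community")
    if not isinstance(community, str) or not community.strip():
        raise ValueError("community must be a non-empty string")
    if run_input.get("sort_by") not in ALLOWED_SORTS:
        raise ValueError(f"sort_by must be one of {sorted(ALLOWED_SORTS)}")
    max_posts = run_input.get("max_posts")
    if not isinstance(max_posts, int) or not 1 <= max_posts <= 10_000:
        raise ValueError("max_posts must be an integer from 1 to 10,000")
    max_comments = run_input.get("max_comments")
    if not isinstance(max_comments, int) or max_comments < 0:
        raise ValueError("max_comments must be a non-negative integer")
    return run_input
```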

Output Data Schema

Complete Post Record Structure

Each extracted subreddit post record contains the following fields:

Core Post Fields

| Field | Type | Description |
| --- | --- | --- |
| processor | string | Apify actor URL that processed this record |
| processed_at | string | ISO 8601 timestamp (UTC) when scraped |
| subreddit | string | Subreddit community name |
| subreddit_type | string | Access level (public/private/restricted) |
| subreddit_subscribers | integer | Total subscribers of the subreddit |
| id | string | Unique Reddit post ID |
| author | string | Reddit username of post author |
| created | string | Post creation timestamp |
| edited | string | Last edit timestamp (null if not edited) |
| link | string | Permalink to the post |
| flair_text | string | Post flair/category tag |
| title | string | Post headline text |
| body | string | Full selftext content |
| score | integer | Net karma score (upvotes minus downvotes) |
| upvotes | integer | Total positive votes |
| downvotes | integer | Total negative votes |
| crossposts | integer | Number of crossposts |
| archived | boolean | Archived status |
| locked | boolean | Moderator lock status |
| stickied | boolean | Sticky/pinned by moderators |
| pinned | boolean | User profile pinned status |
| hidden | boolean | Post hidden status |

Example JSON Output

{
  "processor": "https://apify.com/agentx/subreddit-post-scraper?fpr=aiagentapi",
  "processed_at": "2026-05-01T10:30:00.000Z",
  "subreddit": "datascience",
  "subreddit_type": "public",
  "subreddit_subscribers": 1100000,
  "id": "1abc23",
  "author": "ml_researcher",
  "created": "2026-04-28T14:22:00",
  "link": "https://www.reddit.com/r/datascience/comments/1abc23/",
  "flair_text": "Discussion",
  "title": "What tools are you using for real-time ML feature stores?",
  "score": 842,
  "upvotes": 890,
  "downvotes": 48,
  "locked": false,
  "stickied": false
}
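The schema describes score as upvotes minus downvotes, and the example record is consistent with that (890 - 48 = 842). A small sketch of how a consumer might check that relationship and derive an upvote share from the same fields follows. Both helpers are hypothetical, and whether every live record satisfies the identity exactly is not something the source confirms.

```python
from typing import Optional


def score_is_consistent(record: dict) -> bool:
    """True when score equals upvotes minus downvotes, as the schema describes."""
    return record["score"] == record["upvotes"] - record["downvotes"]


def upvote_share(record: dict) -> Optional[float]:
    """Fraction of all votes that are upvotes, or None when there are no votes."""
    total = record["upvotes"] + record["downvotes"]
    return record["upvotes"] / total if total else None


# Vote fields copied from the example record.
sample = {"score": 842, "upvotes": 890, "downvotes": 48}
```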

Export Formats

  • JSON - Complete structured data with all engagement metadata
  • CSV - Spreadsheet-compatible for BI and analytics tools
  • API Access - Programmatic access via Apify Client SDK
  • Cloud Storage - Automatic upload to Apify Dataset
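If you fetch records as JSON but still want a spreadsheet-friendly file, a local conversion is straightforward. The sketch below uses only the Python standard library. `items_to_csv` is a hypothetical helper, and the column list is whatever subset of output fields you choose to keep.

```python
import csv
import io


def items_to_csv(items, columns):
    """Hypothetical helper: serialize selected fields of each record as CSV text."""
    buffer = io.StringIO()
    # extrasaction="ignore" drops fields not listed in columns;
    # restval="" leaves a blank cell when a record lacks a field.
    writer = csv.DictWriter(buffer, fieldnames=columns,
                            extrasaction="ignore", restval="")
    writer.writeheader()
    for item in items:
        writer.writerow(item)
    return buffer.getvalue()
```

Calling `items_to_csv(items, ["id", "title", "score"])` keeps three columns and quotes any title that contains a comma.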

Integration Examples

Actor ID for Platform Integration

ipw4MHKL6ZBj9pYeF

Ⓜ️ Make.com Setup:

  1. Login to Make.com (Get 1000 Free Credits)
  2. Add module "Run an Actor"
  3. Turn 'Map' on - right side of the 'Actor*'
  4. Paste Actor ID - from above
  5. Click the '⟳ Refresh' - left side of Map
  6. Input JSON* - Modify the parameters as needed
  7. Set "Run synchronously" to YES
  8. Add module "Get Dataset Items" - receive the result
  9. In Dataset ID* select defaultDatasetId

🎱 N8N.io Setup:

  1. Add 'Run an Actor and get dataset' - from the Apify node
  2. Actor → By ID → Paste Actor ID - from above
  3. Input JSON - Modify the parameters as needed

Python Integration Example

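from apify_client import ApifyClient

# Authenticate with your Apify API token.
client = ApifyClient("YOUR_API_TOKEN")

# Actor input: same fields as the example input configuration.
run_input = {
    "community": "datascience",
    "sort_by": "top",
    "max_posts": 500,
    "max_comments": 20,
}

# Start the actor and wait for the run to finish.
run = client.actor("ipw4MHKL6ZBj9pYeF").call(run_input=run_input)

# Stream every record from the run's default dataset.
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)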

JavaScript/Node.js Integration

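import { ApifyClient } from "apify-client";

// Authenticate with your Apify API token.
const client = new ApifyClient({ token: "YOUR_API_TOKEN" });

// Posts only: max_comments of 0 disables comment extraction.
const input = {
  community: "investing",
  sort_by: "hot",
  max_posts: 200,
  max_comments: 0,
};

// Start the actor and wait for the run to finish.
const run = await client.actor("ipw4MHKL6ZBj9pYeF").call(input);

// Fetch the records from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => console.log(item));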

JSON-LD Metadata

{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "SoftwareApplication",
      "@id": "https://apify.com/agentx/subreddit-post-scraper#software",
      "name": "SubReddit Post Scraper",
      "description": "SubReddit Post Scraper is a SubReddit post intelligence API extracting structured post and comment data with engagement scoring, flair taxonomy, author attribution, moderation flags, and full comment threads for social listening and community sentiment analysis.",
      "applicationCategory": "BusinessApplication",
      "applicationSubCategory": "Web Scraping API",
      "operatingSystem": "Web, Cloud",
      "url": "https://apify.com/agentx/subreddit-post-scraper?fpr=aiagentapi",
      "softwareVersion": "1.0.0",
      "datePublished": "2024-08-01",
      "dateModified": "2026-05-01",
      "featureList": [
        "Title, body, score, upvotes, downvotes",
        "Crosspost count and upvote ratio",
        "Flair text/color and author attribution",
        "Five sort modes: new/top/hot/rising/controversial",
        "Configurable max_comments per post with full threads",
        "Moderation flags: archived/locked/stickied/pinned/hidden",
        "Per-post pay-per-result at $0.00773",
        "Native integrations with Make.com, n8n, LangChain, and CrewAI"
      ],
      "offers": {
        "@type": "Offer",
        "price": "0.00773",
        "priceCurrency": "USD",
        "availability": "https://schema.org/InStock"
      },
      "author": { "@id": "https://apify.com/agentx#person" },
      "publisher": { "@id": "https://apify.com#organization" }
    },
    {
      "@type": "Person",
      "@id": "https://apify.com/agentx#person",
      "name": "AgentX",
      "url": "https://apify.com/agentx",
      "sameAs": [
        "https://apify.com/agentx",
        "https://t.me/AiAgentApi",
        "https://t.me/Apify_Actor"
      ],
      "knowsAbout": [
        "web scraping",
        "Reddit",
        "subreddit posts",
        "social listening",
        "community sentiment"
      ]
    },
    {
      "@type": "BreadcrumbList",
      "itemListElement": [
        {
          "@type": "ListItem",
          "position": 1,
          "name": "Apify",
          "item": "https://apify.com"
        },
        {
          "@type": "ListItem",
          "position": 2,
          "name": "AgentX",
          "item": "https://apify.com/agentx"
        },
        {
          "@type": "ListItem",
          "position": 3,
          "name": "SubReddit Post Scraper",
          "item": "https://apify.com/agentx/subreddit-post-scraper"
        }
      ]
    }
  ]
}

Pricing & Cost Calculator

Pay-Per-Result Pricing

| Event | Rate |
| --- | --- |
| Actor Start | $0.005 per GB memory |
| Post (per post) | $0.00773 (BRONZE) |
| Comment (per comment) | $0.00016 (BRONZE) |

Cost Calculator Examples

| Posts | Comments/Post | Estimated Cost |
| --- | --- | --- |
| 100 posts | 0 comments | ~$0.78 |
| 500 posts | 0 comments | ~$3.87 |
| 1,000 posts | 0 comments | ~$7.73 |
| 500 posts | 20 comments | ~$5.47 |
| 1,000 posts | 50 comments | ~$15.73 |

Set max_comments: 0 to extract post metadata only and minimize cost per run.
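The estimates in the cost table can be approximated from the listed BRONZE rates. The sketch below is an upper-bound calculation under two assumptions that the source does not state: the run starts with 1 GB of memory, and every post yields the full max_comments. `estimate_cost` is a hypothetical helper, and actual billing may differ by tier and run configuration.

```python
POST_RATE = 0.00773         # USD per post (BRONZE rate from the pricing table)
COMMENT_RATE = 0.00016      # USD per comment (BRONZE rate)
START_RATE_PER_GB = 0.005   # USD per GB of memory at actor start


def estimate_cost(posts: int, comments_per_post: int = 0,
                  memory_gb: float = 1.0) -> float:
    """Upper-bound estimate: assumes each post returns the full comment limit."""
    comments = posts * comments_per_post
    return (START_RATE_PER_GB * memory_gb
            + posts * POST_RATE
            + comments * COMMENT_RATE)
```

For example, 500 posts with 20 comments each gives 0.005 + 3.865 + 1.60 = 5.47, which agrees with the table's ~$5.47.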


Use Cases & Applications

Social Listening & Brand Monitoring

Community Sentiment Analysis: Extract posts from brand-relevant subreddits to build structured community sentiment datasets, audience feedback corpora, and product discussion archives for brand monitoring and social listening pipelines.

Competitive Intelligence: Monitor subreddits discussing competitor products or industries to capture user pain points, feature requests, competitor weaknesses, and market sentiment for product intelligence research.

NLP Training Data Collection: Extract high-volume post and comment text from topic-specific subreddits to build domain-specific NLP training datasets for sentiment classification, topic modeling, and text generation model fine-tuning.

Content & Community Research

Community Content Archiving: Use sort_by: "top" to extract a subreddit's highest-engagement content and build content libraries, research archives, and knowledge base datasets for specific industry communities.

Trend Detection Within Subreddits: Use sort_by: "rising" or sort_by: "new" with time-based collection to detect emerging topics, growing discussions, and community trend signals within specific Reddit communities.

Engagement Benchmarking: Extract score, upvotes, and crossposts at scale to build per-community content engagement benchmarks for social media analytics and community performance research.
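Time-based collection with sort_by: "new" will usually return some posts that an earlier run already captured. One way to keep only unseen posts is to track the id field across runs, as in this sketch. `new_posts` is a hypothetical helper, and persisting `seen_ids` between runs (a file, a database, or a key-value store) is left to the surrounding pipeline.

```python
def new_posts(items, seen_ids):
    """Return records whose id is not yet in seen_ids, and remember their ids."""
    fresh = []
    for item in items:
        post_id = item["id"]
        if post_id not in seen_ids:
            seen_ids.add(post_id)
            fresh.append(item)
    return fresh
```

Calling it once per run with the same `seen_ids` set yields only the posts that appeared since the previous run.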


FAQ

How does this differ from Reddit Viral Scraper?

Reddit Viral Scraper performs cross-subreddit keyword search. This actor extracts posts from a specific subreddit, which is ideal for deep community analysis, brand subreddit monitoring, and community sentiment research.

Can I disable comment extraction?

Yes. Set max_comments: 0 to extract post metadata only. This significantly reduces cost per run.

What does sort_by: "controversial" return?

Posts with high vote counts but roughly equal upvotes and downvotes, surfacing polarizing or debate-generating content within the community.
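To make "high vote counts but roughly equal upvotes and downvotes" concrete, the sketch below scores extracted records locally using the upvotes and downvotes fields. The formula is modeled on the one in Reddit's historical open-source code and is shown for illustration only. It is an assumption that it resembles what Reddit uses today, and the actor itself simply requests the controversial sort from Reddit.

```python
def controversy(upvotes: int, downvotes: int) -> float:
    """Illustrative score: grows with total votes and with how evenly they split."""
    if upvotes <= 0 or downvotes <= 0:
        return 0.0
    magnitude = upvotes + downvotes
    # balance lies in (0, 1] and reaches 1 only for an exact tie.
    if upvotes > downvotes:
        balance = downvotes / upvotes
    else:
        balance = upvotes / downvotes
    return float(magnitude ** balance)


def rank_by_controversy(records):
    """Sort records from most to least controversial under the score above."""
    return sorted(records,
                  key=lambda r: controversy(r["upvotes"], r["downvotes"]),
                  reverse=True)
```

Under this score an evenly split post with 200 votes outranks a lopsided post with 938 votes, which matches the intuition in the answer above.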

How do I monitor a brand's subreddit?

Enter the brand's subreddit name (e.g., r/apple), set sort_by: "new", and schedule recurring runs to build a continuous brand community monitoring feed.


SEO Keywords & Search Terms

Primary Keywords

subreddit post scraper API, Reddit post extraction API, subreddit content intelligence, Reddit post data API, subreddit social listening API, Reddit community post scraper, subreddit sentiment data API, Reddit brand monitoring API, subreddit post dataset, Reddit NLP training data API

Long-Tail Keywords

how to extract Reddit posts from subreddit programmatically, subreddit post engagement data API, Reddit community content extractor, subreddit post comment scraper, Reddit post score data API, subreddit brand monitoring pipeline, Reddit post data collection, subreddit NLP dataset builder, Reddit community intelligence API, subreddit post analytics extractor

Industry Terms

subreddit post intelligence API, community sentiment analysis, social listening pipeline, Reddit content extraction, brand monitoring data, NLP training corpus, community engagement signals, Reddit community analytics, subreddit content archiving, social media intelligence API


Trust & Certifications

  • ✅ Production-Grade Infrastructure: runs on the Apify cloud platform with managed proxy rotation and automatic retries
  • ✅ GDPR & CCPA-Region Aligned: extracts only publicly available subreddit posts and comments; no personal contact data retained beyond the run session
  • ✅ Pay-Per-Result Billing: transparent $0.00773 per post with no monthly minimum or seat fees
  • ✅ Continuously Maintained: schema and selector updates released as Reddit evolves

Data Rights & Usage

All data extracted by this actor originates from publicly available Reddit posts and communities. Users are responsible for ensuring their use of extracted data complies with applicable laws and Reddit terms of service.

Privacy Compliance

  • GDPR: Compliant with EU GDPR for data processing workflows.
  • CCPA: Compliant with California Consumer Privacy Act requirements.

Platform Terms of Service

Users must review and comply with Reddit's developer terms of service when using extracted data.

Enterprise Support

For enterprise licensing, custom integrations, or compliance inquiries:



Support & Community


Last Updated: May 01, 2026