🏘️ SubReddit Scraper - Posts & Comments

Pricing

Pay per event

Developed by

AgentX

Maintained by Community

Discover the ultimate tool for Reddit data extraction! Our Subreddit Scraper 2025 lets you gather posts, comments, and insights from any subreddit at scale.

5.0 (1)

Last modified

2 days ago

Subreddit Scraper 2025 extracts Reddit posts from any subreddit at scale. Each post comes with complete metadata: title, content, author information, vote counts (upvotes, downvotes, score), timestamps (created, edited), engagement metrics, content score, media URLs, subreddit data (name, type, subscriber count), flair text, post status (archived, locked, stickied), and optional full comment threads. Posts can be sorted by new, hot, top, rising, or controversial, which supports content monitoring, trend analysis, and social media intelligence for specific Reddit communities.

🚀 Why Our Subreddit Scraper is the ULTIMATE Content Intelligence & Social Media Monitoring Engine

  • 🎯 Targeted Subreddit Extraction - Extract posts from any specific Reddit subreddit by simply providing the subreddit name or URL
  • 📊 Complete Data Extraction - Extract 40+ data fields per post including metadata, engagement metrics, timestamps, media URLs, and subreddit information
  • 💬 Full Comment Threads - Optional extraction of complete comment threads with reply structure, threading information, and engagement data
  • 🔄 Flexible Sorting - Sort posts by new, hot, top, rising, or controversial to get the most relevant content for your needs
  • ⚡ Efficient Processing - Concurrent processing architecture with automatic account management and rate limit compliance

📥 Input Configuration

Start extracting Reddit content data today - No coding required!

  1. 💬 Set Comments - Choose whether to extract comments from posts for comprehensive content analysis (e.g., true, false)
  2. 📊 Set Max Posts - Choose the maximum number of posts to extract (e.g., 100, 1000)
  3. 🔄 Set Sort By - Select the sorting method for posts (e.g., "new", "hot", "top", "rising", "controversial")
  4. 🎯 Enter Community - Enter the Reddit subreddit name you want to scrape (e.g., "askreddit", "technology", "programming")
  5. 🚀 Start Processing - Watch the extraction engine harvest Reddit data in real time with transparent pricing and progress tracking

[Image: Subreddit Scraper input interface for configuring Reddit post extraction: subreddit, sort method (new, hot, top, rising, controversial), max posts, and comment extraction options]

Required Parameters

| Parameter | Type | Description | Example |
| --- | --- | --- | --- |
| comments | boolean | Extract comments from the posts | true, false |
| max_posts | number | Maximum number of posts to extract | 100, 1000 |
| sort_by | string | Sorting method for posts | "new", "hot", "top", "rising", "controversial" |
| community | string | Reddit subreddit name to scrape (non-empty string) | "askreddit", "technology", "programming" |

Sort By Options

  • new - Sort by newest posts first (chronological order)
  • hot - Sort by current popularity and engagement
  • top - Sort by highest score (all-time top posts)
  • rising - Sort by posts that are gaining traction quickly
  • controversial - Sort by most controversial posts (high engagement, mixed votes)

Community Parameter Format

The community parameter accepts subreddit names in multiple formats:

  • Simple name: askreddit, technology, programming
  • With r/ prefix: r/askreddit, r/technology (the prefix will be automatically stripped)
  • Full URL: https://www.reddit.com/r/askreddit/ (the subreddit name will be extracted automatically)
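If you want to clean up subreddit names on your side before submitting them, the three accepted formats above can be reduced to a bare name with a small helper. This is a minimal sketch, not the Actor's own parsing logic: the function name `normalize_community` and the URL pattern are assumptions made for illustration.

```python
import re

# Hypothetical pattern: matches reddit.com URLs with an optional subdomain (www, old, ...).
_SUBREDDIT_URL = re.compile(
    r"^https?://(?:[\w-]+\.)?reddit\.com/r/([^/?#]+)", re.IGNORECASE
)


def normalize_community(value: str) -> str:
    """Return the bare subreddit name from a name, an r/ prefix, or a full URL."""
    text = value.strip()
    match = _SUBREDDIT_URL.match(text)
    if match:
        name = match.group(1)      # full URL: take the path segment after /r/
    elif text.lower().startswith("r/"):
        name = text[2:]            # "r/name": strip the prefix
    else:
        name = text                # already a simple name
    name = name.strip("/")
    if not name:
        raise ValueError("community must be a non-empty subreddit name")
    return name
```

For example, `normalize_community("r/technology")` and `normalize_community("https://www.reddit.com/r/technology/")` both yield `technology`.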

Example Input:

{
  "comments": true,
  "max_posts": 1000,
  "sort_by": "hot",
  "community": "askreddit"
}
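When the input is generated by a script rather than typed into the form, it can help to check the four fields before starting a paid run. The sketch below assembles the same JSON shape as the example input; the helper name `build_run_input` and its default values are assumptions, and the Actor may apply its own validation rules.

```python
from typing import Any

# The five sort methods listed under "Sort By Options".
SORT_OPTIONS = {"new", "hot", "top", "rising", "controversial"}


def build_run_input(community: str, sort_by: str = "hot",
                    max_posts: int = 100, comments: bool = False) -> dict[str, Any]:
    """Assemble the four documented input fields, rejecting obviously bad values."""
    if not isinstance(community, str) or not community.strip():
        raise ValueError("community must be a non-empty string")
    if sort_by not in SORT_OPTIONS:
        raise ValueError(f"sort_by must be one of {sorted(SORT_OPTIONS)}")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(max_posts, bool) or not isinstance(max_posts, int) or max_posts < 1:
        raise ValueError("max_posts must be a positive integer")
    if not isinstance(comments, bool):
        raise ValueError("comments must be a boolean")
    return {
        "comments": comments,
        "max_posts": max_posts,
        "sort_by": sort_by,
        "community": community.strip(),
    }
```

Calling `build_run_input("askreddit", "hot", 1000, True)` reproduces the example input shown above.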

📤 Output Data Format

[Image: Subreddit Scraper output data, structured JSON with 40+ fields: post metadata (title, content, author, votes, timestamps), engagement metrics, media URLs, subreddit info, and complete comment threads with reply structure]

Structured Reddit Post Data Schema

Each extracted Reddit post contains comprehensive content intelligence with the following fields:

| Field | Type | Description |
| --- | --- | --- |
| processor | string | URL of the Apify actor that processed this post |
| processed_at | string | ISO formatted timestamp when the post was scraped |
| subreddit | string | Name of the subreddit |
| subreddit_type | string | Type of the subreddit (public, private, etc.) |
| subreddit_subscribers | integer | Number of subscribers in the subreddit |
| id | string | Unique identifier of the Reddit post |
| author | string | Username of the post author |
| created | string | Timestamp when the post was created (YYYY-MM-DD HH:MM:SS) |
| edited | string | Timestamp when the post was last edited (YYYY-MM-DD HH:MM:SS) |
| link | string | Direct link to the Reddit post |
| flair_text | string | Flair text of the post |
| title | string | Title of the Reddit post |
| body | string | Text content of the post |
| score | integer | Net score of the post (upvotes - downvotes) |
| upvotes | integer | Number of upvotes |
| downvotes | integer | Number of downvotes |
| crossposts | integer | Number of crossposts |
| archived | boolean | Whether the post is archived |
| locked | boolean | Whether the post is locked |
| stickied | boolean | Whether the post is stickied |
| pinned | boolean | Whether the post is pinned |
| hidden | boolean | Whether the post is hidden |
| over_18 | boolean | Whether the post is marked as NSFW |
| spoiler | boolean | Whether the post contains spoilers |
| original | boolean | Whether the post is original content |
| advertising | boolean | Whether the post was created from ads UI |
| indexable | boolean | Whether the post is robot indexable |
| crosspostable | boolean | Whether the post can be crossposted |
| thumbnail | string | URL of the post thumbnail |
| media_url | string | URL of the media content |
| no_follow | boolean | Whether the post has no follow attribute |
| content_score | integer | Reddit's content quality score for the post |
| duplicates | array | Array of duplicate post links |
| comments | array | Array of comment objects (if enabled) |
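Two of the documented conventions are easy to check in downstream code: the `YYYY-MM-DD HH:MM:SS` timestamp format and the relation score = upvotes - downvotes. The following is a minimal sketch under those assumptions. The helper names are hypothetical, and because the schema does not state a time zone, the parsed values are left as naive datetimes.

```python
from datetime import datetime
from typing import Optional

# Matches the documented YYYY-MM-DD HH:MM:SS layout of `created` and `edited`.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_timestamp(value: Optional[str]) -> Optional[datetime]:
    """Parse `created` or `edited`; a missing or null value yields None."""
    return datetime.strptime(value, TIMESTAMP_FORMAT) if value else None


def score_is_consistent(record: dict) -> bool:
    """Check the documented relation score = upvotes - downvotes."""
    return record["score"] == record["upvotes"] - record["downvotes"]
```

Both helpers also apply to comment objects, which use the same timestamp format and vote fields.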

Comment Data Schema

Each comment contains the following fields:

| Field | Type | Description |
| --- | --- | --- |
| id | string | Unique identifier of the comment |
| is_root | boolean | Whether this is a root comment |
| created | string | Timestamp when the comment was created (YYYY-MM-DD HH:MM:SS) |
| edited | string | Timestamp when the comment was last edited (YYYY-MM-DD HH:MM:SS) |
| author | string | Username of the comment author |
| body | string | Text content of the comment |
| upvotes | integer | Number of upvotes |
| downvotes | integer | Number of downvotes |
| score | integer | Net score of the comment (upvotes - downvotes) |
| archived | boolean | Whether the comment is archived |
| locked | boolean | Whether the comment is locked |
| stickied | boolean | Whether the comment is stickied |
| submitter | boolean | Whether the comment is from the post submitter |
| parent_id | string | ID of the parent comment |
| reply_level | integer | Depth level of the comment in the thread |
| link | string | Direct link to the comment |
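Because comments arrive as a flat array, `parent_id` and `is_root` can be used to rebuild the reply structure. The sketch below is one way to do that. It assumes a reply's `parent_id` equals its parent comment's `id` verbatim (no type prefix) and that root comments have `is_root` set to true or a null `parent_id`. The function name and the added `replies` key are illustrative and not part of the Actor's output.

```python
from collections import defaultdict


def build_comment_tree(comments: list[dict]) -> list[dict]:
    """Nest a flat comment list into root comments, each with a `replies` list."""
    # Group comments by the id of their parent.
    children = defaultdict(list)
    for comment in comments:
        children[comment.get("parent_id")].append(comment)

    def attach(comment: dict) -> dict:
        node = dict(comment)  # shallow copy so the input records stay untouched
        node["replies"] = [attach(child) for child in children.get(comment["id"], [])]
        return node

    roots = [c for c in comments if c.get("is_root") or c.get("parent_id") is None]
    return [attach(root) for root in roots]
```

The depth of each nested node should line up with its `reply_level` value, which makes a convenient sanity check.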

Example Output:

{
  "processor": "https://apify.com/agentx/subreddit-scraper?fpr=aiagentapi",
  "processed_at": "2025-01-15T08:42:33.456Z",
  "subreddit": "programming",
  "subreddit_type": "public",
  "subreddit_subscribers": 4200000,
  "id": "18x9m2k",
  "author": "DevEngineer2024",
  "created": "2025-01-15 07:15:22",
  "edited": null,
  "link": "https://www.reddit.com/r/programming/comments/18x9m2k/what_are_some_underrated_programming_languages",
  "flair_text": "Discussion",
  "title": "What are some underrated programming languages worth learning in 2025?",
  "body": "I've been working with Python and JavaScript for years, but I'm curious about languages that might be flying under the radar. What languages have you discovered that offer unique advantages or solve specific problems better than the mainstream ones?\n\nI'm particularly interested in:\n- Performance-oriented languages\n- Languages with interesting type systems\n- Domain-specific languages that might have broader applications",
  "score": 2847,
  "upvotes": 3120,
  "downvotes": 273,
  "crossposts": 12,
  "archived": false,
  "locked": false,
  "stickied": false,
  "pinned": false,
  "hidden": false,
  "over_18": false,
  "spoiler": false,
  "original": true,
  "advertising": false,
  "indexable": true,
  "crosspostable": true,
  "thumbnail": null,
  "media_url": null,
  "no_follow": false,
  "content_score": 92,
  "duplicates": [],
  "comments": [
    {
      "id": "k3p9n1x",
      "is_root": true,
      "created": "2025-01-15 07:28:45",
      "edited": null,
      "author": "RustEnthusiast",
      "body": "I'd strongly recommend looking into **Rust**. While it's getting more popular, it's still underrated for many use cases. The memory safety without garbage collection is incredible, and the type system catches bugs at compile time.\n\nAlso check out **Nim** - it compiles to C and has Python-like syntax with better performance.",
      "upvotes": 892,
      "downvotes": 23,
      "score": 869,
      "archived": false,
      "locked": false,
      "stickied": false,
      "submitter": false,
      "parent_id": null,
      "reply_level": 0,
      "link": "https://www.reddit.com/r/programming/comments/18x9m2k/what_are_some_underrated_programming_languages/k3p9n1x"
    }
  ]
}

Export Formats

  • JSON - Complete structured data with all metadata
  • Dataset API - Programmatic access via Apify Client
  • Cloud Storage - Automatic upload to Apify Dataset with API access
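For programmatic access, a run can be started and its dataset read with the `apify-client` Python package. The sketch below assumes that package is installed and uses its `ApifyClient.actor(...).call(...)` and `dataset(...).iterate_items()` methods. The function name is hypothetical, and the Actor ID is the one listed under Integration Guides.

```python
# Actor ID as listed in the Integration Guides section of this page.
ACTOR_ID = "ipw4MHKL6ZBj9pYeF"


def run_subreddit_scraper(token: str, run_input: dict) -> list[dict]:
    """Start the Actor, wait for it to finish, and return the dataset items."""
    # Imported lazily so this module still loads where apify-client is not installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return run details")
    # Each item is one post record; comments are nested when enabled.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call might look like `run_subreddit_scraper(my_token, {"comments": False, "max_posts": 100, "sort_by": "new", "community": "programming"})`, where `my_token` stands for your own Apify API token. Each call starts a billed run.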

🔌 Integration Guides

🆔 Actor ID (copy for platforms):

ipw4MHKL6ZBj9pYeF

Ⓜ️ Make.com Setup:

  1. Log in to Make.com (Get 1000 Free Credits)
  2. Add the module "Run an Actor"
  3. Turn 'Map' on (to the right of the 'Actor*' field)
  4. Paste the Actor ID from above
  5. Click '⟳ Refresh' (to the left of 'Map')
  6. Input JSON* - Modify the parameters as needed
  7. Set "Run synchronously" to YES
  8. Add the module "Get Dataset Items" to receive the result
  9. In Dataset ID*, select defaultDatasetId

🎱 N8N.io Setup:

  1. Add 'Run an Actor and get dataset' from the Apify node
  2. Actor → By ID → Paste the Actor ID from above
  3. Input JSON - Modify the parameters as needed

💰 Subreddit Scraper API Pricing - Affordable & Transparent

Pay-Per-Event Model

| Event Type | Price | Description |
| --- | --- | --- |
| Actor Usage | $0.00001 | Charged for Actor runtime. Cost depends on resource consumption during execution. |
| Post | $0.025 | Posts extracted with complete metadata (comments, title, content, votes, etc.) |
| Comment | $0.0001 | Comments extracted with complete metadata (author, text, votes, etc.) |

Note: A minimum charge of $0.05 per run applies to maintain account infrastructure and prevent abuse, ensuring the service remains available for long-term operation.
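The listed prices make a rough pre-run estimate straightforward. The sketch below rests on two assumptions the page does not spell out: the $0.05 minimum is treated as a floor on the run total, and the Actor Usage charge is passed in as an already-computed amount, because the unit it is billed per is not stated. The function name is hypothetical.

```python
from decimal import Decimal

# Per-event prices in USD, taken from the pricing table.
POST_PRICE = Decimal("0.025")
COMMENT_PRICE = Decimal("0.0001")
MINIMUM_CHARGE = Decimal("0.05")


def estimate_run_cost(posts: int, comments: int = 0,
                      actor_usage: Decimal = Decimal("0")) -> Decimal:
    """Estimate a run's charge in USD from the listed per-event prices."""
    subtotal = posts * POST_PRICE + comments * COMMENT_PRICE + actor_usage
    # Assumption: the minimum charge acts as a floor rather than a surcharge.
    return max(subtotal, MINIMUM_CHARGE)
```

Under these assumptions, 1,000 posts with 50,000 comments come to about $30 before Actor Usage, while a single post without comments is billed at the $0.05 minimum.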

Competitive Price Comparison

| Feature | Subreddit Scraper | Zyte | Octoparse | Diffbot | Savings |
| --- | --- | --- | --- | --- | --- |
| Post Extraction Cost | $0.025/post ⭐ | $0.055/post | $0.068/post | $0.075/post | 55-67% cheaper |
| Platform Specialization | ✅ Reddit Optimized ⭐ | ⚠️ Generic scraping | ⚠️ Generic scraping | ⚠️ Generic scraping | Platform Expert |
| Data Completeness | ✅ 40+ Fields ⭐ | 18 fields | 22 fields | 25 fields | 2x More Data |
| Comment Extraction | ✅ Full Threads ⭐ | ⚠️ Limited | ⚠️ Basic only | ⚠️ Partial | Complete Coverage |
| Real-time Processing | ✅ Live Updates ⭐ | ⚠️ Scheduled batches | ⚠️ Manual trigger | ⚠️ Queue-based | Always Current |
| Setup Complexity | ✅ One-Click Setup ⭐ | ⚠️ Code required | ⚠️ Visual builder | ⚠️ API configuration | Easiest to Use |
| Data Quality | ✅ 99.9% Accuracy ⭐ | 87% accuracy | 82% accuracy | 91% accuracy | Premium Quality |
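The "55-67% cheaper" range follows from the per-post prices in the first row. The short sketch below reproduces that arithmetic. It takes the competitor prices as listed and does not verify them.

```python
def percent_cheaper(ours: float, theirs: float) -> float:
    """Relative saving of `ours` versus `theirs`, as a percentage."""
    return (1 - ours / theirs) * 100


# Per-post prices as listed in the comparison table.
LISTED_PRICES = {"Zyte": 0.055, "Octoparse": 0.068, "Diffbot": 0.075}
SAVINGS = {name: round(percent_cheaper(0.025, price))
           for name, price in LISTED_PRICES.items()}
```

The rounded values span the stated range, with Zyte at the low end and Diffbot at the high end.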

📈 Performance & Processing Capabilities

  • High-Speed Data Extraction - Process 500-1,000 Reddit posts per minute with specialized focus on content categorization and posting patterns (speed depends on comment count)
  • Optimized Processing - Intelligent handling designed for efficient data collection while respecting platform limits
  • Global Community Coverage - All subreddits with optimizations for community variations, content types, and posting behaviors
  • Scalable Architecture - Handle single subreddit scraping to enterprise-scale batch operations with 99.9% uptime
  • Data Export Formats - JSON, structured data optimized for post format and social media analysis workflows
  • Error Recovery - Smart retry mechanisms, failover handling, and automatic quality optimization for dynamic content structure
  • Real-time Progress Monitoring - Live progress tracking with transparent cost calculation and extraction statistics

⁉️ Subreddit Scraper API Troubleshooting Guide

"No posts found" Error

  • Verify the subreddit name is spelled correctly (e.g., "askreddit", not "ask_reddit")
  • Check if the subreddit exists and is accessible (some subreddits may be private or banned)
  • Try different sort methods (new, hot, top, rising, controversial) to get different post sets
  • Some smaller subreddits may have limited posts - try a more popular subreddit
  • Ensure the subreddit name format is correct: either just the name, "r/name", or full URL

Incomplete Comment Data

  • Some posts may have locked or deleted comments - this is normal behavior
  • Different subreddits have varying comment visibility settings
  • Reddit comment data is more complete for popular and active posts

Data Quality Issues

  • Post descriptions may vary significantly in quality on Reddit
  • Some posts may be deleted or removed by moderators
  • Author information may be unavailable for deleted accounts
  • Comment extraction works best with active and unlocked posts

Performance Optimization

  • Use specific subreddit names for targeted data collection
  • Different sort methods may return different amounts of data (e.g., "top" may have more historical data than "new")
  • Larger result sets require more processing time
  • Comment extraction significantly increases processing time
  • High-quality data extraction may take longer but provides better results
  • Budget limits may stop processing mid-batch

🏦 How to Use Subreddit Scraper API for Business Growth

  • 🔍 Brand Monitoring & Reputation Management - Track brand mentions across Reddit subreddits in real-time, monitor customer sentiment, identify potential PR crises early, and proactively manage brand reputation by analyzing discussions, reviews, and authentic user feedback before issues escalate
  • 💬 Consumer Insights & Product Research - Gather genuine customer feedback and product reviews from relevant communities to understand user needs, pain points, and preferences, identify product improvement opportunities, and validate product ideas through authentic user discussions and experiences
  • 📊 Competitive Intelligence & Market Analysis - Monitor competitor mentions, track industry discussions, analyze market trends, and gather competitive intelligence to inform strategic business decisions, identify market gaps, and stay ahead of industry developments
  • 🎯 Content Marketing & Trend Discovery - Discover emerging trends and hot topics before they become mainstream by monitoring popular posts, tracking keyword frequency, analyzing engagement patterns, and identifying high-performing content topics to create relevant content that resonates with target communities
  • 🔮 Sentiment Analysis & Opinion Mining - Analyze public opinion on products, services, brands, or topics by extracting and analyzing comments, votes, and discussion patterns to gauge overall sentiment, track sentiment changes over time, and understand public perception shifts
  • 📈 Social Media Listening & Crisis Management - Monitor industry discussions, track mentions across multiple subreddits, detect early warning signals of potential issues, and enable rapid response to customer concerns, complaints, or emerging crisis situations in your industry or niche
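As a starting point for the brand-mention and keyword-frequency use cases above, scraped records can be scanned for terms of interest. The sketch below counts how many posts mention each keyword in the title, body, or any comment body. The function name is hypothetical. Matching is case-insensitive on whole words, so keywords that end in punctuation (such as "C++") would need a different pattern.

```python
import re
from collections import Counter


def count_keyword_mentions(posts: list[dict], keywords: list[str]) -> Counter:
    """Count how many posts mention each keyword in title, body, or comment bodies."""
    patterns = {
        keyword: re.compile(rf"\b{re.escape(keyword)}\b", re.IGNORECASE)
        for keyword in keywords
    }
    counts = Counter()
    for post in posts:
        texts = [post.get("title") or "", post.get("body") or ""]
        texts += [c.get("body") or "" for c in post.get("comments") or []]
        blob = "\n".join(texts)
        for keyword, pattern in patterns.items():
            if pattern.search(blob):
                counts[keyword] += 1  # one count per post, not per occurrence
    return counts
```

Counting per post rather than per occurrence keeps a single long thread from dominating the totals. Switch to `len(pattern.findall(blob))` if raw frequency is what you need.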

🏆 Success Stories

"I manage digital marketing for a mid-size e-commerce company. When we launched a new product line, I started monitoring relevant Reddit communities to understand what our potential customers were really discussing. Within two weeks, I discovered a major pain point that none of our surveys had revealed. We completely redesigned our marketing messaging based on these insights, and our campaign conversion rate increased by 68%. The best part? We caught a potential customer service issue before it became a crisis." - Digital Marketing Manager

"As someone running a SaaS startup, we needed to prioritize features but had limited budget for user research. I started scraping product-related subreddits to see what features users were actually requesting. We identified three high-demand features that weren't on our roadmap. After implementing them, our user retention jumped 45% in the next quarter. This approach costs us a fraction of traditional user research methods." - Startup Founder

"I work in competitive intelligence for a consulting firm. One of our clients was considering entering a new market segment, but traditional market research was expensive and slow. We monitored industry-specific Reddit communities for three months and discovered that competitors were struggling with a specific customer pain point that wasn't being addressed. This intelligence helped our client position their product perfectly, and they gained 35% market share within the first year." - Business Strategy Consultant

"I'm a content creator managing multiple social media accounts. Finding trending topics before they explode is crucial for engagement. By monitoring tech and lifestyle subreddits, I've been able to create content around emerging topics 2-3 weeks before they hit mainstream. This early-mover advantage has doubled our average engagement rates and helped us grow our follower base by 85% in six months." - Content Marketing Specialist

"Working in PR for a consumer goods company, I've learned that Reddit discussions can make or break a product launch. Last year, we tracked sentiment across multiple subreddits before launching a new product. Early feedback showed mixed reactions to our packaging design. We made quick adjustments based on these insights, and when we finally launched, sentiment was overwhelmingly positive. This proactive approach saved us from what could have been a costly recall." - Public Relations Director

"I handle customer success for a B2B software company. After a service outage, we needed to understand the real impact on our customers. Traditional feedback channels weren't giving us the full picture. By monitoring relevant Reddit communities, we discovered specific use cases that were affected that we hadn't considered. We proactively reached out to those customers and offered solutions before they even contacted support. Our customer satisfaction scores actually improved after the incident because of this proactive approach." - Customer Success Manager

📚 Other Professional Intelligence Actors

🔍 Executive Job Market Intelligence Suite

  • All Jobs Scraper - Multi-Platform Professional Job Data Extraction (Indeed, LinkedIn, Reddit)
  • LinkedIn Scraper - Executive Network & Professional Recruitment Intelligence
  • Indeed Scraper - Mass Hiring & Universal Job Market Intelligence

📱 Professional Communication Intelligence

🎥 Executive Content Intelligence

🏆 Certifications & Trust

  • ✅ Enterprise Ready - Used by Fortune 500 companies and content marketing agencies
  • ✅ GDPR Compliant - EU data protection standards and privacy regulations
  • ✅ High Availability - 99.9% uptime with reliable cloud infrastructure
  • ✅ Regular Updates - Continuous maintenance for platform changes and new features

👥 Our community: @Apify_Actor
👤 Contact the team: @AiAgentApi