# Reddit Scraper | Posts, Search, Comments & User Data

Extract structured Reddit data from subreddits, search results, single posts, and user profiles. Get titles, text, scores, upvote ratios, comments, authors, flairs, timestamps, and more in clean JSON. Built for research, monitoring, trend tracking, and automation.

- **Pricing:** $5.99/month + usage
- **Developer:** Scrape Pilot
# Reddit Posts & Comments Scraper
## Table of Contents
- About
- Features
- Installation
- Quick Start
- Configuration
- Input/Output Format
- API Reference
- Examples
- Proxy Support
- Rate Limiting
- Troubleshooting
- Contributing
- License
- FAQ
## About

The Reddit Posts & Comments Scraper is a professional-grade tool designed to efficiently extract public posts, comments, and metadata from Reddit subreddits. Whether you're conducting market research, running sentiment analysis, or building data-driven applications, this scraper provides reliable, structured data extraction.

It is built with scalability and compliance in mind: it respects Reddit's API guidelines while delivering high-performance data extraction for developers, researchers, and businesses.
## Features

| Feature | Description |
|---|---|
| Targeted Scraping | Extract posts from specific subreddits with custom filters |
| Comment Extraction | Optional comment scraping for deeper insights |
| Proxy Support | Residential and datacenter proxy configuration included |
| Rich Metadata | Get scores, upvote ratios, authors, flairs, and more |
| Multiple Sort Options | Sort by hot, new, top, rising, or controversial |
| Time Filtering | Filter posts by hour, day, week, month, year, or all time |
| Multiple Formats | Export data in JSON, CSV, or XML |
| High Performance | Optimized for large-scale data extraction |
| Rate Limiting | Built-in rate limiting to avoid IP bans |
| Detailed Logging | Comprehensive logging for debugging and monitoring |
## Quick Start

### Basic Usage

```python
from reddit_scraper import RedditScraper

# Initialize the scraper
scraper = RedditScraper()

# Define your configuration
config = {
    "subreddit": "technology",
    "include_comments": False,
    "sort": "hot",
    "time_filter": "all",
    "max_results": 25,
}

# Run the scraper
results = scraper.scrape(config)

# Export to JSON
scraper.export_to_json(results, "output.json")

# Export to CSV
scraper.export_to_csv(results, "output.csv")
```
### Command Line Usage

```bash
# Basic scrape
python reddit_scraper.py --subreddit technology --max-results 25

# With comments
python reddit_scraper.py --subreddit technology --include-comments --max-results 50

# With custom sort and time filter
python reddit_scraper.py --subreddit technology --sort top --time-filter week --max-results 100

# With proxy configuration
python reddit_scraper.py --subreddit technology --use-proxy --proxy-group RESIDENTIAL
```
## Configuration

### Input Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| `subreddit` | string | Yes | - | Target subreddit name (e.g., "technology") |
| `include_comments` | boolean | No | `false` | Whether to scrape comments for each post |
| `sort` | string | No | `"hot"` | Sort order: `hot`, `new`, `top`, `rising`, `controversial` |
| `time_filter` | string | No | `"all"` | Time range: `hour`, `day`, `week`, `month`, `year`, `all` |
| `max_results` | integer | No | `25` | Maximum number of posts to scrape (1-1000) |
| `proxyConfiguration.useApifyProxy` | boolean | No | `false` | Enable the Apify proxy service |
| `proxyConfiguration.apifyProxyGroups` | array | No | `[]` | Proxy groups: `RESIDENTIAL`, `DATACENTER` |

### Example Configuration

```json
{
  "include_comments": false,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  },
  "subreddit": "technology",
  "sort": "hot",
  "time_filter": "all",
  "max_results": 25
}
```
## Input/Output Format

### Input Example

```json
{
  "include_comments": false,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  },
  "subreddit": "technology",
  "sort": "hot",
  "time_filter": "all",
  "max_results": 25
}
```

### Output Example

```json
[
  {
    "post_id": "1rt52qa",
    "title": "Meta planning sweeping layoffs as AI costs mount",
    "text": null,
    "score": 4506,
    "upvote_ratio": 0.97,
    "url": "https://www.reuters.com/business/world-at-work/meta-planning-sweeping-layoffs-ai-costs-mount-2026-03-14/",
    "permalink": "https://www.reddit.com/r/technology/comments/1rt52qa/meta_planning_sweeping_layoffs_as_ai_costs_mount/",
    "author": "joe4942",
    "subreddit": "technology",
    "flair": "Business",
    "num_comments": 569,
    "awards": 0,
    "is_video": false,
    "domain": "reuters.com",
    "thumbnail": "https://external-preview.redd.it/...",
    "created_at": "1773448769"
  }
]
```
### Output Fields Description

| Field | Type | Description |
|---|---|---|
| `post_id` | string | Unique Reddit post identifier |
| `title` | string | Post title |
| `text` | string/null | Self-post text content (null for link posts) |
| `score` | integer | Total upvotes minus downvotes |
| `upvote_ratio` | float | Fraction of votes that are upvotes (0.0 - 1.0) |
| `url` | string | Original link URL (for link posts) |
| `permalink` | string | Reddit post permalink |
| `author` | string | Post author username |
| `subreddit` | string | Subreddit name |
| `flair` | string/null | Post flair text |
| `num_comments` | integer | Number of comments on the post |
| `awards` | integer | Total awards received |
| `is_video` | boolean | Whether the post is a video |
| `domain` | string/null | Domain of the linked content |
| `thumbnail` | string/null | Thumbnail image URL |
| `created_at` | string | Unix timestamp of post creation |
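To show how these fields are typically consumed, here is a small, self-contained sketch that filters records shaped like the output above by `upvote_ratio` and converts `created_at` to a datetime. The sample values are illustrative, not real scrape results:

```python
from datetime import datetime, timezone

# Sample records shaped like the scraper's documented output.
posts = [
    {"post_id": "abc123", "title": "Example A", "score": 4506,
     "upvote_ratio": 0.97, "num_comments": 569, "created_at": "1773448769"},
    {"post_id": "def456", "title": "Example B", "score": 120,
     "upvote_ratio": 0.81, "num_comments": 14, "created_at": "1700000000"},
]

# Keep only well-received posts and convert the Unix timestamp to a datetime.
popular = [p for p in posts if p["upvote_ratio"] >= 0.9]
for p in popular:
    p["created_dt"] = datetime.fromtimestamp(int(p["created_at"]), tz=timezone.utc)

print([p["post_id"] for p in popular])
```

Note that `created_at` is a string in the output, so it must be cast to `int` before conversion.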
## API Reference

### Class: `RedditScraper`

#### Constructor

```python
scraper = RedditScraper(api_credentials=None, rate_limit=True)
```

| Parameter | Type | Default | Description |
|---|---|---|---|
| `api_credentials` | dict | `None` | Reddit API credentials (`client_id`, `client_secret`) |
| `rate_limit` | boolean | `True` | Enable automatic rate limiting |
#### Methods

| Method | Parameters | Returns | Description |
|---|---|---|---|
| `scrape(config)` | `config: dict` | `list` | Main scraping method |
| `export_to_json(data, filename)` | `data: list`, `filename: str` | `bool` | Export data to a JSON file |
| `export_to_csv(data, filename)` | `data: list`, `filename: str` | `bool` | Export data to a CSV file |
| `export_to_xml(data, filename)` | `data: list`, `filename: str` | `bool` | Export data to an XML file |
| `validate_config(config)` | `config: dict` | `bool` | Validate configuration parameters |
| `get_subreddit_info(name)` | `name: str` | `dict` | Get subreddit metadata |
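The kind of checks `validate_config` performs can be illustrated with a standalone sketch. This is not the library's actual implementation, just a minimal validator enforcing the documented parameter constraints:

```python
# Allowed values taken from the Input Parameters table above.
VALID_SORTS = {"hot", "new", "top", "rising", "controversial"}
VALID_TIME_FILTERS = {"hour", "day", "week", "month", "year", "all"}

def validate_config(config: dict) -> bool:
    """Return True if config satisfies the documented parameter rules."""
    if not config.get("subreddit"):          # subreddit is required
        return False
    if config.get("sort", "hot") not in VALID_SORTS:
        return False
    if config.get("time_filter", "all") not in VALID_TIME_FILTERS:
        return False
    max_results = config.get("max_results", 25)
    return isinstance(max_results, int) and 1 <= max_results <= 1000

print(validate_config({"subreddit": "technology"}))       # → True
print(validate_config({"subreddit": "", "sort": "hot"}))  # → False
```

Running validation before `scrape()` surfaces bad input early instead of failing mid-run.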
## Examples

### Example 1: Scrape Top Posts from r/technology

```python
config = {
    "subreddit": "technology",
    "sort": "top",
    "time_filter": "week",
    "max_results": 50,
}
results = scraper.scrape(config)
print(f"Scraped {len(results)} posts")
```
### Example 2: Scrape with Comments

```python
config = {
    "subreddit": "programming",
    "include_comments": True,
    "sort": "hot",
    "max_results": 10,
}
results = scraper.scrape(config)
for post in results:
    print(f"Post: {post['title']}")
    print(f"Comments: {len(post.get('comments', []))}")
```
### Example 3: Multiple Subreddits

```python
subreddits = ["technology", "programming", "artificial"]
for subreddit in subreddits:
    config = {"subreddit": subreddit, "max_results": 25}
    results = scraper.scrape(config)
    scraper.export_to_json(results, f"{subreddit}_posts.json")
```
### Example 4: With Proxy Configuration

```python
config = {
    "subreddit": "technology",
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
    "max_results": 100,
}
results = scraper.scrape(config)
```
## Proxy Support

This reddit scraper supports advanced proxy configurations to avoid rate limiting and IP bans.

### Supported Proxy Types

| Proxy Type | Description | Best For |
|---|---|---|
| `RESIDENTIAL` | Real user IP addresses | High-volume scraping |
| `DATACENTER` | Datacenter IP addresses | Fast, cost-effective scraping |

### Proxy Configuration

```json
{
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"],
    "apifyProxyCountry": "US"
  }
}
```
### Environment Variables

```bash
# .env file
APIFY_API_TOKEN=your_apify_token_here
PROXY_ENABLED=true
PROXY_GROUP=RESIDENTIAL
```
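A common pattern is to translate these environment variables into the `proxyConfiguration` input at runtime. The helper below is a hypothetical sketch (not part of the scraper's API) using only the standard library:

```python
import os

def load_proxy_settings() -> dict:
    """Build a proxyConfiguration dict from environment variables,
    falling back to the defaults documented above."""
    return {
        "useApifyProxy": os.environ.get("PROXY_ENABLED", "false").lower() == "true",
        "apifyProxyGroups": [os.environ.get("PROXY_GROUP", "DATACENTER")],
    }

# Simulate the .env values shown above.
os.environ["PROXY_ENABLED"] = "true"
os.environ["PROXY_GROUP"] = "RESIDENTIAL"
print(load_proxy_settings())
```

In practice you would load the `.env` file first (e.g., with `python-dotenv`) so these variables are present in `os.environ`.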
## Rate Limiting
To ensure responsible usage and avoid bans, this reddit scraper includes built-in rate limiting:
| Action | Rate Limit | Recommendation |
|---|---|---|
| API Requests | 60/minute | Use proxy for higher limits |
| Post Scraping | 100/minute | Enable delays between requests |
| Comment Scraping | 50/minute | Use residential proxies |
### Rate Limit Configuration

```python
scraper = RedditScraper(
    rate_limit=True,
    rate_limit_delay=1.0,  # seconds between requests
    max_retries=3,
)
```
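The effect of `rate_limit_delay` can be sketched with a minimal delay-based limiter. This illustrates the idea only; it is not the scraper's internal implementation:

```python
import time

class RateLimiter:
    """Minimal limiter: enforce a fixed gap between consecutive requests."""

    def __init__(self, delay: float = 1.0):
        self.delay = delay
        self._last = 0.0

    def wait(self) -> None:
        # Sleep just long enough that `delay` seconds separate requests.
        elapsed = time.monotonic() - self._last
        if elapsed < self.delay:
            time.sleep(self.delay - elapsed)
        self._last = time.monotonic()

limiter = RateLimiter(delay=0.1)
start = time.monotonic()
for _ in range(3):
    limiter.wait()  # each call after the first blocks until 0.1 s has passed
elapsed = time.monotonic() - start
```

With `delay=1.0` this caps throughput at roughly 60 requests/minute, matching the API limit in the table above.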
## Troubleshooting

### Common Issues
| Issue | Solution |
|---|---|
| 429 Too Many Requests | Enable proxy, increase delay between requests |
| 403 Forbidden | Check subreddit privacy settings, use API credentials |
| Empty Results | Verify subreddit name, check sort/time_filter values |
| Connection Timeout | Enable proxy, check network connection |
| Invalid JSON Output | Validate input configuration format |
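For the `429 Too Many Requests` case, a common remedy besides proxies is exponential backoff. Below is a self-contained sketch; the `fetch` callable is a stand-in for whatever request function you use:

```python
import time

def fetch_with_backoff(fetch, max_retries: int = 3, base_delay: float = 1.0):
    """Retry `fetch` on a 429 status, doubling the delay each attempt."""
    for attempt in range(max_retries):
        status, body = fetch()
        if status != 429:
            return body
        time.sleep(base_delay * (2 ** attempt))  # 1s, 2s, 4s, ...
    raise RuntimeError("rate limited after retries")

# Fake fetcher for demonstration: returns 429 twice, then succeeds.
calls = {"n": 0}
def fake_fetch():
    calls["n"] += 1
    return (429, None) if calls["n"] < 3 else (200, "ok")

result = fetch_with_backoff(fake_fetch, base_delay=0.01)
print(result)  # → ok
```

Combined with residential proxies, backoff usually clears transient 429s without manual intervention.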
### Debug Mode

```bash
# Enable verbose logging
python reddit_scraper.py --subreddit technology --debug

# Check API status
python reddit_scraper.py --status-check
```
### Log Files

Logs are saved to `./logs/scraper.log` by default. To configure the log level:

```python
import logging

logging.basicConfig(level=logging.DEBUG)
```
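If you want full DEBUG detail in the log file but only INFO on the console, a standard two-handler setup works. This is a generic `logging` recipe, not scraper-specific code:

```python
import logging
import os

os.makedirs("logs", exist_ok=True)

logger = logging.getLogger("reddit_scraper")
logger.setLevel(logging.DEBUG)

# File handler captures everything; console handler stays at INFO.
file_handler = logging.FileHandler("logs/scraper.log")
file_handler.setLevel(logging.DEBUG)
console = logging.StreamHandler()
console.setLevel(logging.INFO)

fmt = logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
for handler in (file_handler, console):
    handler.setFormatter(fmt)
    logger.addHandler(handler)

logger.debug("debug detail goes to the file only")
logger.info("info goes to both file and console")
```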
## Contributing

We welcome contributions! Here's how you can help:

1. Fork the repository
2. Create a feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'Add amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
### Development Setup

```bash
# Clone your fork
git clone https://github.com/yourusername/reddit-scraper.git

# Install dev dependencies
pip install -r requirements-dev.txt

# Run tests
pytest tests/

# Run linting
flake8 .
black .
```
### Code Style
- Follow PEP 8 guidelines
- Add docstrings for all functions
- Write unit tests for new features
- Update documentation for changes