Reddit Comments Scraper

Pricing: Pay per usage

Extract detailed comments and discussion threads from Reddit instantly. Perfect for sentiment analysis, market research, and community monitoring. Get structured data from any post URL efficiently. Residential proxies are recommended for high-volume scraping stability.

Rating: 0.0 (0)

Developer: Shahid Irfan

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 10 days ago


Scrape all comments from any Reddit post using the official Reddit JSON API. Extract structured comment data including author, body, score, timestamps, and nested replies.

Features

  • Extracts all comments and nested replies from Reddit posts
  • Uses official Reddit JSON API for reliable data access
  • Handles comment threading and parent-child relationships
  • Includes comment scores, timestamps, and permalinks
  • Lightweight HTTP-based scraping with no browser required
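
As a rough illustration of the features above: Reddit exposes a JSON view of a public post, usually reached by appending `.json` to the post URL, and nested replies arrive as a tree. The sketch below shows one way such a tree could be flattened into records with `parent_id` references. It is not the actor's actual code; the function names `to_json_url` and `flatten_comments` are hypothetical, and the depth-first ordering is an assumption.

```python
from urllib.parse import urlsplit, urlunsplit


def to_json_url(post_url: str) -> str:
    """Return the .json variant of a Reddit post URL (query string dropped)."""
    parts = urlsplit(post_url)
    path = parts.path.rstrip("/") + ".json"
    return urlunsplit((parts.scheme, parts.netloc, path, "", ""))


def flatten_comments(children: list, parent_id=None) -> list:
    """Walk a Reddit-style comment listing depth-first into flat records."""
    records = []
    for child in children:
        # kind "t1" marks a comment; "more" placeholders are skipped here
        if child.get("kind") != "t1":
            continue
        data = child["data"]
        records.append({
            "id": data["id"],
            "author": data.get("author"),
            "body": data.get("body"),
            "score": data.get("score"),
            "created_utc": data.get("created_utc"),
            "parent_id": parent_id,  # None for top-level comments
            "permalink": data.get("permalink"),
        })
        replies = data.get("replies")
        # Reddit sends an empty string when a comment has no replies
        if isinstance(replies, dict):
            records.extend(
                flatten_comments(replies["data"]["children"], data["id"])
            )
    return records
```

In this sketch, a request to `to_json_url(start_url)` would return a two-element list, with the comment listing in the second element's `data.children`.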

Use Cases

  • Social media analysis and sentiment tracking
  • Community discussion monitoring
  • Content moderation and spam detection
  • Research on user engagement patterns
  • Data collection for NLP and machine learning projects

Input Parameters

| Parameter | Type | Description | Default |
|---|---|---|---|
| startUrl | string | The URL of the Reddit post to scrape comments from | Required |
| results_wanted | integer | Maximum number of comments to collect | 100 |
| proxyConfiguration | object | Proxy settings for reliable scraping | Residential proxy |
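
To make the defaults above concrete, here is a minimal sketch of how an input object could be validated before a run. The helper name `normalize_input` is hypothetical, and the rule that `results_wanted` must be a positive integer is an assumption rather than documented behavior. The shape of the proxy object is not specified here, so the sketch passes it through untouched.

```python
DEFAULT_RESULTS_WANTED = 100  # default listed in the parameter table


def normalize_input(raw: dict) -> dict:
    """Validate an input object and fill in the documented default."""
    start_url = raw.get("startUrl")
    if not isinstance(start_url, str) or not start_url.strip():
        raise ValueError("startUrl is required")

    wanted = raw.get("results_wanted", DEFAULT_RESULTS_WANTED)
    # bool is a subclass of int in Python, so reject it explicitly
    if isinstance(wanted, bool) or not isinstance(wanted, int) or wanted < 1:
        raise ValueError("results_wanted must be a positive integer")

    return {
        "startUrl": start_url.strip(),
        "results_wanted": wanted,
        # passed through as given; None means "use the actor's default"
        "proxyConfiguration": raw.get("proxyConfiguration"),
    }
```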

Output Data

| Field | Type | Description |
|---|---|---|
| id | string | Unique comment identifier |
| author | string | Reddit username of the commenter |
| body | string | Full text content of the comment |
| score | number | Upvote score of the comment |
| created_utc | number | Unix timestamp of comment creation |
| parent_id | string | ID of parent comment (null for top-level) |
| permalink | string | Direct link to the comment |
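
For downstream code, the fields above could be modeled as a small typed record. The following is a sketch only: the class name `Comment` and the helper properties `created_at` and `url` are hypothetical, and the URL helper assumes `permalink` is a site-relative path that needs the `https://www.reddit.com` prefix.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass(frozen=True)
class Comment:
    id: str
    author: str
    body: str
    score: int
    created_utc: float
    parent_id: Optional[str]  # None for top-level comments
    permalink: str

    @property
    def created_at(self) -> datetime:
        """created_utc converted to a timezone-aware UTC datetime."""
        return datetime.fromtimestamp(self.created_utc, tz=timezone.utc)

    @property
    def url(self) -> str:
        """Absolute link, assuming permalink is a site-relative path."""
        return "https://www.reddit.com" + self.permalink
```

A record from the dataset can then be loaded with `Comment(**item)`, provided the item contains exactly these seven keys.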

Usage Examples

Basic Usage

{
  "startUrl": "https://www.reddit.com/r/webscraping/comments/1qs66k0/couldnt_find_proxy_directory_with_filters_so/"
}

With Custom Limits

{
  "startUrl": "https://www.reddit.com/r/webscraping/comments/1qs66k0/couldnt_find_proxy_directory_with_filters_so/",
  "results_wanted": 50
}

Sample Output

{
  "id": "abc123",
  "author": "webscraper_pro",
  "body": "Great discussion on proxy configurations!",
  "score": 15,
  "created_utc": 1703123456,
  "parent_id": null,
  "permalink": "/r/webscraping/comments/1qs66k0/couldnt_find_proxy_directory_with_filters_so/abc123/"
}

Tips

  • Use residential proxies for best results with Reddit
  • Comments are returned in chronological order
  • Nested replies are flattened with parent_id references
  • Large threads may have thousands of comments
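
Because replies are flattened with `parent_id` references, the original thread shape can be rebuilt on the consumer side. The sketch below shows one way to do that. The function name `build_thread` and the `replies` key are hypothetical. It also assumes that a comment whose parent is missing from the result set (for example, cut off by `results_wanted`) should be treated as a top-level entry.

```python
from collections import defaultdict


def build_thread(records: list) -> list:
    """Nest flat comment records under their parents via parent_id."""
    known_ids = {rec["id"] for rec in records}
    children = defaultdict(list)
    for rec in records:
        parent = rec.get("parent_id")
        # Orphans (parent not in the result set) are promoted to top level
        children[parent if parent in known_ids else None].append(rec)

    def attach(parent_id):
        # Copy each record and add its replies, preserving input order
        return [
            {**rec, "replies": attach(rec["id"])}
            for rec in children.get(parent_id, [])
        ]

    return attach(None)
```

The recursion depth follows the depth of the thread, so very deep reply chains may call for an iterative version.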

Integrations

  • Export data to CSV/JSON for analysis
  • Connect with data processing pipelines
  • Integrate with NLP tools for sentiment analysis
  • Use with business intelligence platforms
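
As one example of the CSV route, the output records can be written with Python's standard `csv` module. This is a minimal sketch; the function name `comments_to_csv` is hypothetical, and the column order simply follows the Output Data section.

```python
import csv
import io

FIELDS = ["id", "author", "body", "score", "created_utc", "parent_id", "permalink"]


def comments_to_csv(records) -> str:
    """Serialize comment records to CSV text with a header row."""
    buffer = io.StringIO(newline="")
    # extrasaction="ignore" drops any keys that are not in FIELDS
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    for rec in records:
        writer.writerow(rec)  # None values are written as empty cells
    return buffer.getvalue()
```

Top-level comments end up with an empty `parent_id` cell, which most spreadsheet and BI tools read as a missing value.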

FAQ

Q: Does this work with private subreddits?
A: No, this scraper only works with public Reddit posts.

Q: Are deleted comments included?
A: Deleted comments are not returned by the Reddit API.

Q: What's the rate limit?
A: Reddit's API enforces rate limits. Use proxies and add reasonable delays between requests.
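
For callers who fetch Reddit data themselves, "reasonable delays" is often implemented as exponential backoff on HTTP 429 (Too Many Requests) responses. The sketch below is an assumption about how a client might do this, not a description of the actor's internals. The name `fetch_with_backoff` is hypothetical, and `fetch` stands for any callable that returns a `(status, body)` pair.

```python
import time


def fetch_with_backoff(fetch, url, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call fetch(url), retrying with doubling delays while it returns 429."""
    for attempt in range(max_attempts):
        status, body = fetch(url)
        if status != 429:
            return status, body
        # Wait 1x, 2x, 4x, ... base_delay before the next attempt
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"still rate limited after {max_attempts} attempts")
```

The `sleep` parameter is injectable so the retry logic can be exercised in tests without waiting.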

This scraper is for educational and research purposes only. Respect Reddit's Terms of Service and robots.txt. Do not use for spam, harassment, or unauthorized data collection. Ensure compliance with local laws and regulations.