Reddit Scraper — Posts, Comments, Any Subreddit, JSON

Under maintenance

Pricing

Pay per usage

Scrape Reddit posts and comments from any subreddit. Get titles, authors, scores, comments, and URLs. Export to JSON or CSV. No Reddit API key needed. Useful for market research, sentiment analysis, and trend monitoring. Email: spinov001@gmail.com. Tips: t.me/scraping_ai

Rating: 0.0 (0)

Developer: Alex

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 3
  • Monthly active users: 1
  • Last modified: 14 hours ago

Reddit Scraper Pro — API-Based, Never Breaks on Redesigns

A Reddit scraper built for reliability. It uses Reddit's native JSON API instead of HTML parsing, so changes to Reddit's UI do not break it.

Why This Scraper?

Most Reddit scrapers rely on HTML/CSS selectors, which break whenever Reddit changes its design. This scraper uses Reddit's official JSON endpoint (/r/subreddit.json), which returns the same data format Reddit's own apps use. This means:

  • Never breaks on redesigns — JSON API is separate from the UI
  • Complete data — 20+ fields per post, full comment trees
  • Structured output — clean JSON, no HTML parsing artifacts
  • No login required — public data, no credentials needed
  • Built-in rate limiting — respects Reddit's API limits, won't get you banned
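To make the JSON-endpoint approach concrete, here is a minimal Python sketch of how a listing URL could be built and how posts could be pulled out of the response. It is not the Actor's actual code: the host `www.reddit.com`, the helper names `listing_url` and `posts_from_listing`, and the default `limit` are assumptions for illustration. Only the `/r/subreddit.json` path comes from the description above.

```python
from urllib.parse import urlencode

# Assumed host; the description above only names the /r/<subreddit>.json path.
BASE_URL = "https://www.reddit.com"


def listing_url(subreddit, sort="hot", limit=25, time_filter=None, after=None):
    """Build the URL of a subreddit's JSON listing (hypothetical helper)."""
    params = {"limit": limit}
    if time_filter:
        params["t"] = time_filter   # time window for sorted listings, e.g. "week"
    if after:
        params["after"] = after     # pagination cursor from the previous page
    return f"{BASE_URL}/r/{subreddit}/{sort}.json?{urlencode(params)}"


def posts_from_listing(payload):
    """Return the raw post dicts from a Reddit 'Listing' response.

    A listing wraps each item as {"kind": ..., "data": {...}};
    posts carry the kind "t3".
    """
    children = payload["data"]["children"]
    return [child["data"] for child in children if child["kind"] == "t3"]
```

Because the parser reads keys from a JSON document rather than walking page markup, a change to Reddit's HTML or CSS does not affect it.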

Features

  • 20+ data fields per post: title, author, score, upvote ratio, comment count, flair, awards, URL, self text, link URL, domain, NSFW flag, stickied status, and more
  • Full comment threads: nested comments with author, score, depth level, and creation date
  • Multiple subreddits: scrape r/programming, r/datascience, and r/Entrepreneur in one run
  • Reddit-wide search: find posts by keyword across all of Reddit
  • Flexible sorting: hot, new, top, or rising, with time filters (hour/day/week/month/year)
  • Automatic pagination: follows Reddit's cursor-based pagination to collect up to 500 posts per subreddit or query
  • Proxy support: uses Apify Proxy (residential) for reliable access
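The nested comment threads mentioned above could be flattened into a list with a depth level roughly as follows. This is a sketch, not the Actor's implementation: the function name `flatten_comments` and its `limit` argument are hypothetical. The input shape follows Reddit's comment listings, where comments have kind "t1", "more" entries are placeholders, and `replies` is either a nested listing or an empty string.

```python
from datetime import datetime, timezone


def flatten_comments(listing, depth=0, limit=None, out=None):
    """Walk a Reddit comment listing depth-first and return flat records."""
    out = [] if out is None else out
    for child in listing.get("data", {}).get("children", []):
        if limit is not None and len(out) >= limit:
            break
        if child.get("kind") != "t1":       # skip "more" placeholders
            continue
        c = child["data"]
        created = datetime.fromtimestamp(c["created_utc"], tz=timezone.utc)
        out.append({
            "id": c["id"],
            "author": c.get("author"),
            "body": c.get("body", ""),
            "score": c.get("score", 0),
            "createdUtc": created.isoformat(timespec="milliseconds").replace("+00:00", "Z"),
            "depth": depth,
        })
        replies = c.get("replies")
        if isinstance(replies, dict):       # "" means the comment has no replies
            flatten_comments(replies, depth + 1, limit, out)
    return out
```

A depth-first walk keeps each reply directly after its parent, so the `depth` value is enough to rebuild the thread structure later.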

Output Data (20+ fields)

{
  "id": "1b2c3d4",
  "title": "What tools do you use for market research?",
  "author": "startup_founder",
  "subreddit": "Entrepreneur",
  "score": 847,
  "upvoteRatio": 0.94,
  "numComments": 234,
  "createdUtc": "2026-03-17T15:30:00.000Z",
  "url": "https://reddit.com/r/Entrepreneur/comments/...",
  "selfText": "I've been looking for affordable tools...",
  "linkUrl": "https://example.com/article",
  "flair": "Discussion",
  "awards": 3,
  "isNSFW": false,
  "isStickied": false,
  "domain": "self.Entrepreneur",
  "thumbnail": "https://...",
  "comments": [
    {
      "id": "abc123",
      "author": "data_analyst",
      "body": "I use a combination of...",
      "score": 156,
      "createdUtc": "2026-03-17T16:00:00.000Z",
      "depth": 0
    }
  ]
}
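For readers who want to see how a record like the one above might be derived, the following sketch maps a raw Reddit post object to the output field names. The raw keys (`upvote_ratio`, `num_comments`, `created_utc`, `over_18`, and so on) are the ones Reddit's JSON uses; which raw key feeds which output field is an assumption, and `to_output_record` is a hypothetical name, not the Actor's own function.

```python
from datetime import datetime, timezone


def to_iso(epoch_seconds):
    """Convert Reddit's epoch seconds to an ISO 8601 UTC string."""
    dt = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return dt.isoformat(timespec="milliseconds").replace("+00:00", "Z")


def to_output_record(raw):
    """Map a raw post dict to the output schema (assumed mapping)."""
    return {
        "id": raw["id"],
        "title": raw.get("title", ""),
        "author": raw.get("author"),
        "subreddit": raw.get("subreddit"),
        "score": raw.get("score", 0),
        "upvoteRatio": raw.get("upvote_ratio"),
        "numComments": raw.get("num_comments", 0),
        "createdUtc": to_iso(raw["created_utc"]),
        "url": "https://reddit.com" + raw.get("permalink", ""),
        "selfText": raw.get("selftext", ""),
        "linkUrl": raw.get("url"),
        "flair": raw.get("link_flair_text"),
        "awards": raw.get("total_awards_received", 0),
        "isNSFW": bool(raw.get("over_18", False)),
        "isStickied": bool(raw.get("stickied", False)),
        "domain": raw.get("domain"),
        "thumbnail": raw.get("thumbnail"),
    }
```

Using `.get` with defaults keeps the record shape stable when Reddit omits a field, which matters when the output is exported to CSV.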

Use Cases

  • Market research — discover what people say about your product, brand, or industry
  • Sentiment analysis — collect posts and comments for NLP models
  • AI training data — build datasets from Reddit discussions for LLM fine-tuning
  • Trend monitoring — track emerging topics and viral content in real-time
  • Competitive intelligence — monitor competitor mentions and complaints
  • Content research — find top questions and topics your audience cares about
  • Lead generation — identify users asking for your type of product/service
  • Academic research — gather social media data for papers and studies

Input Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| subreddits | Array | [] | Subreddit names (e.g., ["technology", "startups"]) |
| searchQueries | Array | [] | Search terms across all of Reddit |
| maxPostsPerSource | Number | 50 | Max posts per subreddit/query (1-500) |
| includeComments | Boolean | true | Extract comment threads |
| maxCommentsPerPost | Number | 20 | Max comments per post |
| sortBy | String | "hot" | Sort: hot, new, top, rising |
| timeFilter | String | "week" | Time filter: hour, day, week, month, year |
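As an illustration of the parameters above, this Python sketch assembles a run input from the documented defaults and checks the documented ranges and options before a run is started. The helper `build_input` and its error messages are hypothetical; the Actor may validate its input differently.

```python
import copy

# Defaults as documented in the parameter list above.
DEFAULTS = {
    "subreddits": [],
    "searchQueries": [],
    "maxPostsPerSource": 50,
    "includeComments": True,
    "maxCommentsPerPost": 20,
    "sortBy": "hot",
    "timeFilter": "week",
}
SORT_OPTIONS = ("hot", "new", "top", "rising")
TIME_FILTERS = ("hour", "day", "week", "month", "year")


def build_input(**overrides):
    """Merge overrides into the defaults and validate the result."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    run_input = copy.deepcopy(DEFAULTS)   # avoid sharing the default lists
    run_input.update(overrides)
    if not 1 <= run_input["maxPostsPerSource"] <= 500:
        raise ValueError("maxPostsPerSource must be between 1 and 500")
    if run_input["sortBy"] not in SORT_OPTIONS:
        raise ValueError(f"sortBy must be one of {SORT_OPTIONS}")
    if run_input["timeFilter"] not in TIME_FILTERS:
        raise ValueError(f"timeFilter must be one of {TIME_FILTERS}")
    return run_input


example = build_input(subreddits=["technology", "startups"], sortBy="top", timeFilter="month")
```

The resulting dictionary can be serialized to JSON and pasted into the Actor's input form or passed to an API client.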

Technical Details

  • Method: Reddit JSON API (/r/subreddit.json, /search.json)
  • Proxy: Apify residential proxy for reliable access
  • Rate limiting: Built-in delays between requests (2-3 seconds)
  • Pagination: Cursor-based (Reddit's after parameter)
  • Error handling: Graceful handling of 403/429 errors with retry logic
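The pagination, delay, and retry behavior listed above can be sketched together in a few lines of Python. This is an illustration under assumptions, not the Actor's code: `fetch_page` is a hypothetical callable that takes the `after` cursor and returns a parsed listing, `RetryableError` is a hypothetical exception it raises on HTTP 403/429, and the exponential backoff schedule is a swapped-in choice, since the source only says "retry logic".

```python
import random
import time


class RetryableError(Exception):
    """Hypothetical error a fetcher raises for HTTP 403 or 429."""

    def __init__(self, status):
        super().__init__(f"HTTP {status}")
        self.status = status


def fetch_with_retry(fetch_page, after, max_retries=3, sleep=time.sleep):
    """Call fetch_page(after), backing off 1s, 2s, 4s on retryable errors."""
    for attempt in range(max_retries + 1):
        try:
            return fetch_page(after)
        except RetryableError:
            if attempt == max_retries:
                raise
            sleep(2 ** attempt)


def paginate(fetch_page, max_posts, sleep=time.sleep):
    """Follow Reddit's `after` cursor until max_posts are collected."""
    if max_posts <= 0:
        return []
    posts, after = [], None
    while True:
        payload = fetch_with_retry(fetch_page, after, sleep=sleep)
        children = payload["data"]["children"]
        posts.extend(child["data"] for child in children)
        after = payload["data"]["after"]    # None on the last page
        if len(posts) >= max_posts or not after or not children:
            break
        sleep(random.uniform(2, 3))         # 2-3 second delay between requests
    return posts[:max_posts]
```

Passing `sleep` as a parameter keeps the loop testable without real waiting; in production the default `time.sleep` applies.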

Cost Estimation

  • ~$0.50 per 100 posts without comments
  • ~$1.00 per 100 posts with full comment threads
  • Free tier available with Apify free plan
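The approximate rates above translate into a simple estimate. The sketch below assumes cost scales linearly with the number of posts, which the listing does not state explicitly, and `estimate_cost` is a hypothetical helper name.

```python
def estimate_cost(posts, include_comments=True):
    """Rough cost in USD, assuming the approximate per-100-post rates above."""
    rate_per_100 = 1.00 if include_comments else 0.50
    return round(posts / 100 * rate_per_100, 2)


# Three subreddits at 500 posts each, with comment threads.
print(estimate_cost(3 * 500))
```

Under that assumption, a run over three subreddits at the 500-post maximum with comments would come to roughly $15, and the same run without comments to roughly $7.50.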

FAQ

Q: Why JSON API instead of HTML scraping? A: HTML scrapers break whenever Reddit updates its design. The JSON API returns structured data in a format that hasn't changed in years. It's the same API Reddit's mobile app uses.

Q: Can I scrape private subreddits? A: No — only publicly accessible subreddits. This scraper uses public endpoints.

Q: Does it need my Reddit credentials? A: No. All data is fetched from public JSON endpoints.

Q: How many posts can I get per run? A: Up to 500 posts per subreddit with pagination. Multiple subreddits can be scraped in one run.


Part of 60+ data tools by knotless_cadence on Apify.