Reddit Scraper - Posts, Comments & Sentiment Data avatar

Reddit Scraper - Posts, Comments & Sentiment Data

Pricing

from $2.00 / 1,000 results

Go to Apify Store
Reddit Scraper - Posts, Comments & Sentiment Data

Reddit Scraper - Posts, Comments & Sentiment Data

Scrape Reddit subreddits, search results, and comment threads. Extract post titles, authors, scores, comment counts, body text, and full comment trees with nested replies. Sort by hot, new, top, or rising. Perfect for market research and sentiment analysis.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Renzo Madueno

Renzo Madueno

Maintained by Community

Actor stats

0

Bookmarked

301

Total users

25

Monthly active users

3.7 days

Issues response

5 days ago

Last modified

Share

Reddit Posts & Comments Scraper

Extract Posts, Comments, and Community Insights from Any Subreddit

One brand-mention found in r/SaaS converted to a $9K enterprise pilot for a customer of mine last month. Reddit comments are unfiltered, dated, and ranked by people who actually use the product. This actor pulls them in JSON. No Reddit API key, no rate limits to manage, no auth dance.

⏱️ Most runs finish in under 90 seconds. Recently migrated to old.reddit.com endpoints + jittered residential proxies (v0.1.18, May 2026) — handles previously-blocked subreddits.

⭐️ Solo dev competing with bigger Apify publishers — a 30-second review is the single thing that lifts ranking. Thank you in advance.

📩 Want this delivered to your inbox daily? $9/mo curated digest — Stripe. Pick subreddit + keywords, get a clean email each morning. No actor runs to manage.

Turn Reddit into a structured dataset. This scraper collects posts and comments from subreddits, search results, and individual threads -- delivering titles, authors, upvote scores, comment text, timestamps, and flair data in a clean, export-ready format.

Reddit is one of the richest sources of authentic consumer opinion on the internet. Whether you are running market research, building a sentiment analysis pipeline, scouting content ideas, or monitoring brand mentions, this actor gives you programmatic access to the conversations that matter -- without rate limits or API key management.

What data can you extract?

Post data

FieldTypeDescription
dataTypestringRecord type identifier (post)
postTitlestringTitle of the Reddit post
authorstringUsername of the post author
subredditstringSubreddit where the post was published
scoreintegerNet upvote score (upvotes minus downvotes)
commentCountintegerTotal number of comments on the post
bodystringSelf-text content of the post
urlstringLink URL (external link posts) or post permalink
postUrlstringDirect Reddit permalink to the post
createdUtcstringUTC timestamp of when the post was created
flairstringPost flair or tag assigned by the author or moderators
isStickiedbooleanWhether the post is pinned by moderators
scrapedAtstringISO 8601 timestamp of when the data was collected

Comment data

FieldTypeDescription
dataTypestringRecord type identifier (comment)
authorstringUsername of the comment author
bodystringFull text of the comment
scoreintegerNet upvote score for the comment
depthintegerNesting level (0 = top-level reply, 1 = reply to a reply, etc.)
createdUtcstringUTC timestamp of when the comment was posted
postTitlestringTitle of the parent post
postUrlstringPermalink to the parent post
scrapedAtstringISO 8601 timestamp of when the data was collected

Use cases

  • Market research -- Understand what real users say about your product, your competitors, or your industry by mining relevant subreddits and search queries.
  • Sentiment analysis -- Feed Reddit post and comment data into NLP pipelines to measure community sentiment around brands, product launches, or events.
  • Content ideation -- Discover what questions people ask, what problems they face, and what topics generate the most engagement in your niche.
  • Competitive intelligence -- Monitor competitor mentions across Reddit to track public perception and identify strengths or weaknesses in their offering.
  • Academic research -- Collect structured datasets of online discussions for social science, linguistics, or communication studies.

Input parameters

ParameterTypeRequiredDefaultDescription
subredditsarrayNo[]Subreddit names to scrape (without r/), e.g., "technology", "programming"
searchQueriesarrayNo[]Keywords to search across all of Reddit
postUrlsarrayNo[]Direct Reddit post URLs to scrape comments from
maxPostsPerSubredditintegerNo100Maximum posts to extract per subreddit or search query (1--500)
includeCommentsbooleanNofalseWhether to extract comments from each post
maxCommentsPerPostintegerNo100Maximum comments to extract per post (1--500)
sortBystringNo"hot"How to sort posts: hot, new, top, or rising
timeFilterstringNo"week"Time period when sorting by top: hour, day, week, month, year, all
maxConcurrencyintegerNo3Number of browser pages to run in parallel (1--10)

At least one subreddit, search query, or post URL must be provided.

Example output

{
"dataType": "post",
"postTitle": "What's the best programming language to learn in 2026?",
"author": "dev_user123",
"subreddit": "programming",
"score": 1547,
"commentCount": 342,
"body": "I've been thinking about switching careers into software development. I have some experience with Python from college, but I'm wondering what the job market actually wants right now. Any advice?",
"url": "https://old.reddit.com/r/programming/comments/abc123/whats_the_best_programming_language/",
"postUrl": "https://old.reddit.com/r/programming/comments/abc123/whats_the_best_programming_language/",
"createdUtc": "2026-02-28T15:30:00Z",
"flair": "Discussion",
"isStickied": false,
"scrapedAt": "2026-03-01T12:00:00.000Z"
}
{
"dataType": "comment",
"author": "senior_eng_42",
"body": "Honestly, it depends on what you want to do. For web development, JavaScript/TypeScript is still king. For data science and ML, Python is non-negotiable. For enterprise backend work, Java and Go are in very high demand.",
"score": 892,
"depth": 0,
"createdUtc": "2026-02-28T16:15:00Z",
"postTitle": "What's the best programming language to learn in 2026?",
"postUrl": "https://old.reddit.com/r/programming/comments/abc123/whats_the_best_programming_language/",
"scrapedAt": "2026-03-01T12:00:00.000Z"
}

How much does it cost?

This actor uses a pay-per-result pricing model. You are charged for each data record (post or comment) that is successfully extracted.

  • $0.002 per result -- that is $2.00 per 1,000 records.
  • Scraping 100 posts from a subreddit (without comments) costs approximately $0.20.
  • Scraping 1,000 posts (without comments) costs approximately $2.00.
  • Scraping 100 posts with 20 comments each (2,100 results) costs approximately $4.20.

There are no monthly subscriptions or minimum commitments. You only pay for the data you extract.

Tips and tricks

  • Use search queries for broad research. Subreddit scraping is best for focused communities. If you want to find every mention of a brand or topic across all of Reddit, use the searchQueries input instead.
  • Enable comments selectively. Comments multiply the number of results quickly. If you only need post-level data (titles, scores, dates), leave includeComments set to false to keep costs low and runs fast.
  • Sort by "top" for high-signal data. When researching popular opinions or frequently recommended products, sort by top with a time filter of month or year to surface the most upvoted content.
  • NSFW subreddits are supported. The scraper handles Reddit's age verification prompts automatically, so you can scrape any publicly accessible subreddit.
  • YouTube Channel & Video Scraper -- Extract YouTube channel statistics and video metadata to combine with Reddit community insights for a complete content research workflow.
  • TikTok Profile & Post Scraper -- Collect TikTok creator profiles and video data to complement Reddit sentiment analysis with social media performance metrics.

Posting AI-generated content to Reddit?

If you scrape Reddit to inform your own posts and write them with Claude/ChatGPT, Reddit mods are catching on. Em-dashes, "delve", parallel bullets, and 9 other patterns get accounts publicly called out for "all comments AI generated" (my own account got 2 callouts in 24h before I built this).

aitells.vercel.app is a free detector + paid rewriter. Paste your AI text, see every fingerprint highlighted, get it rewritten in your voice. Free first rewrite. $19 lifetime.

Found this useful?

If this scraper saved you time, please consider leaving a quick review on the Reviews tab above. Reviews help small developers like me get visibility on the Apify Store, and your feedback is genuinely valuable for improving the actor. Thanks!


More from Renzo

If this actor was useful, you might like these too:

Open source family: github.com/Perufitlife