Reddit Comment Scraper - Export Comments & Replies ($1.5/1k) avatar

Reddit Comment Scraper - Export Comments & Replies ($1.5/1k)

Pricing

from $1.50 / 1,000 result saveds

Go to Apify Store
Reddit Comment Scraper - Export Comments & Replies ($1.5/1k)

Reddit Comment Scraper - Export Comments & Replies ($1.5/1k)

Scrape Reddit comments without the API — every comment and nested reply from any post URL, 'load more comments' expanded automatically. Export to CSV, Excel, or JSON for sentiment analysis, AI training data, or research. No login, no rate limits. From $1.50 per 1,000 comments.

Pricing

from $1.50 / 1,000 result saveds

Rating

5.0

(2)

Developer

Harsh Maur

Harsh Maur

Maintained by Community

Actor stats

2

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Reddit Comments Scraper — Download Every Comment From Any Reddit Post (No API)

Paste a Reddit post URL, get the post plus its entire comment tree — nested replies included, "load more comments" expanded automatically. No API key, no login. From $1.50 per 1,000 results.

Try on Apify Input Schema API Docs

What it doesHow to useInputOutputUse casesPricingFAQ


What does Reddit Comments Scraper do? {#what-does-reddit-comments-scraper-do}

Reddit Comments Scraper is a purpose-built reddit comment scraper: give it one or more post URLs (comment permalinks work too) and it returns the parent post record plus every comment in the thread, including nested replies. Where Reddit's website stops at roughly 500 comments per page, this actor keeps going — it expands every "load more comments" placeholder through Reddit's morechildren API, routinely pulling 5,000–6,000 comments from a single post (configurable up to 10,000 via maxCommentsPerPost).

There is nothing to configure beyond the URLs. No OAuth app, no Reddit account, no cookies — and none of the official API's 600-requests-per-10-minutes ceiling.

Why this actor instead of a general Reddit scraper:

  • 💬 Deep threads, fully expanded — nested replies and "load more" branches that page-level scrapers miss
  • 🧵 Thread structure preserved — every comment carries parentId, depth, and isSubmitter, so you can rebuild the reply tree
  • 🔓 Zero authentication — scrape reddit comments without API keys, logins, or rate limits
  • 📤 Export anywhere — JSON, CSV, Excel, XML, HTML · n8n, Zapier, Make, REST API, MCP
  • 💰 Pay per result — from $1.50 per 1,000 comments, no monthly fees

How to scrape Reddit comments {#how-to-scrape-reddit-comments}

Downloading Reddit comments from a post takes about a minute of setup:

  1. Sign up for a free Apify account (or log in)
  2. Open Reddit Comments Scraper
  3. Paste one or more Reddit post URLs into the Reddit post URLs field
  4. Optionally adjust Max comments per post (default 100, up to 10,000)
  5. Click Start and watch comments stream into the dataset
  6. Download the results as JSON, CSV, Excel, XML, or HTML

💡 Tip: A comment permalink (a link to a single comment inside a thread) is also accepted — the scraper resolves it to the parent post and collects the thread from there.

Inputs at a glance

FieldWhat it does
postUrlsOne or more Reddit post URLs (or comment permalinks) to collect comments from. Required.
maxCommentsPerPostCap per post, nested replies included. Default 100, maximum 10000.
includeNSFWSet true to include posts and comments flagged 18+. Default false.
proxyProxy settings. Apify residential proxy is preconfigured and recommended.

That's the entire schema — see the Input Schema tab for full details.


Input example {#input-example}

This configuration pulls up to 2,000 comments from each of two posts, skipping NSFW content:

{
"postUrls": [
{
"url": "https://www.reddit.com/r/generativeAI/comments/1l64st7/have_we_reached_a_point_where_aigenerated_video/"
},
{
"url": "https://www.reddit.com/r/technology/comments/1example/another_post_to_analyze/"
}
],
"maxCommentsPerPost": 2000,
"includeNSFW": false,
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Output example {#output-example}

Each run produces one post item (the parent post) followed by one item per comment. Comment items carry the full thread structure — parentId points to the parent comment or post, depth tells you how deep in the reply tree the comment sits, and isSubmitter flags replies written by the original poster.

Here's a realistic nested-reply comment item (note depth: 1 and a parentId referencing another comment, t1_...):

{
"dataType": "comment",
"id": "t1_kx9d2fe",
"parsedId": "kx9d2fe",
"body": "Disagree — the uncanny valley is still obvious in any clip longer than ten seconds.",
"bodyHtml": "<p>Disagree — the uncanny valley is still obvious in any clip longer than ten seconds.</p>",
"authorId": "t2_8fk3la",
"parsedAuthorId": "8fk3la",
"authorName": "vfx_skeptic",
"authorFullname": "t2_8fk3la",
"authorFlairText": null,
"authorPremium": false,
"communityName": "r/generativeAI",
"parsedCommunityName": "generativeAI",
"subredditName": "generativeAI",
"postId": "t3_1l64st7",
"parsedPostId": "1l64st7",
"postTitle": "Have we reached a point where AI-generated video is indistinguishable?",
"postUrl": "https://www.reddit.com/r/generativeAI/comments/1l64st7/",
"postUpVotes": 873,
"postCommentsCount": 412,
"parentId": "t1_kx9bq1a",
"parsedParentId": "kx9bq1a",
"parentKind": "comment",
"depth": 1,
"isSubmitter": false,
"url": "https://www.reddit.com/r/generativeAI/comments/1l64st7/comment/kx9d2fe/",
"score": 156,
"commentUpVotes": 156,
"controversiality": 0,
"totalAwardsReceived": 0,
"gilded": 0,
"distinguished": null,
"stickied": false,
"scoreHidden": false,
"collapsed": false,
"collapsedReason": null,
"edited": false,
"editedAt": null,
"ageHours": 18.4,
"scorePerHour": 8.48,
"bodyLength": 84,
"wordCount": 16,
"commentCreatedAt": "2025-06-08T11:24:00.000Z",
"postCreatedAt": "2025-06-08T07:02:00.000Z",
"crawledAt": "2025-06-09T05:48:00.000Z"
}

Top-level comments have depth: 0 and a parentId equal to the post's t3_ ID with parentKind: "post". The parent post item itself ships with the full post field set — title, body, score, upvoteRatio, commentsCount, flair, flags, media fields, and derived analytics like engagementTotal and commentsPerHour.


What can you do with scraped Reddit comments? {#use-cases}

  • 🎯 Sentiment analysis — turn a product launch thread or an AMA into thousands of labeled-ready opinions. Reply depth lets you separate hot takes from the debates underneath them.
  • 🤖 AI training data — comment trees are natural conversation data. The parentId/depth structure makes it trivial to reconstruct dialogue pairs for fine-tuning or RAG corpora.
  • 📢 Brand monitoring — when your company gets mentioned in a viral thread, pull the entire discussion (not just the visible page) and quantify what people actually said.
  • 🎓 Academic research — collect complete, reproducible discussion datasets with timestamps, scores, controversiality, and structural metadata intact for citation and replication.

How much does it cost to scrape Reddit comments? {#pricing}

Reddit Comments Scraper is pay-per-result: you're billed per comment stored, starting at $1.50 per 1,000 results, with no monthly subscription and no platform fees. A 5,000-comment megathread costs about $7.50; a quick 200-comment pull costs pennies.

This model fits comment scraping well because thread sizes vary wildly — you pay for the 6,000-comment AMA when you scrape it, and nothing the rest of the month.

💼 Scraping comments continuously at scale? Reddit Scraper Pro offers unlimited results on a flat monthly subscription using the same engine.


Export Reddit comments to CSV, Excel, or JSON

Once a run finishes, download the dataset in JSON, CSV, Excel (XLSX), XML, or HTML straight from the Apify Console — Reddit comments to CSV is one click. For automated pipelines:

  • REST APIPOST https://api.apify.com/v2/acts/harshmaur~reddit-comments-scraper/run-sync-get-dataset-items?token=YOUR_TOKEN with the input JSON as the body
  • n8n / Zapier / Make — trigger runs and route comment data into sheets, databases, or Slack
  • MCP — expose the actor as a tool via the Apify MCP server, so Claude, ChatGPT, Cursor, or any MCP client can fetch a thread's comments mid-conversation

See the API tab for ready-made code snippets in Node.js, Python, and curl.


FAQ {#faq}

Comments on public Reddit threads are publicly accessible data, and scraping public data is generally permitted — but you're responsible for using it lawfully. Keep request volumes reasonable, respect Reddit's terms where applicable, handle personal data in line with GDPR/CCPA, and only collect public content. Apify's guide on the enforceability of website terms is a good primer.

Can I scrape Reddit comments without the API?

Yes — that's the whole point of this actor. It reads publicly available thread data directly, so there's no OAuth application, no API key, and no 600-requests-per-10-minutes quota. You don't even need a Reddit account.

How many comments can I scrape from one post?

Up to 10,000 per post (maxCommentsPerPost). Reddit's own interface serves only ~500 comments per page; this actor automatically expands the "load more comments" placeholders via Reddit's morechildren endpoint, so very large threads typically yield 5,000–6,000 comments without you doing anything.

Does it capture nested replies, or just top-level comments?

The full tree. Every reply at every level is collected, and each item records its parentId, parentKind, and depth so you can reconstruct the exact thread hierarchy in your own analysis.

Can I scrape multiple posts in one run?

Yes. Add as many URLs to postUrls as you like — each post is processed with its own maxCommentsPerPost budget, and every output item carries postId and postTitle so mixed datasets stay easy to split.

Do I need a proxy?

Yes — proxies keep large comment pulls reliable and prevent IP blocks. The actor comes preconfigured with Apify's residential proxy group, which is the recommended setting; you can also plug in your own proxies via the proxy input.

What input does it accept — post URLs only?

Standard post URLs (https://www.reddit.com/r/subreddit/comments/abc123/title/) and comment permalinks. A permalink is resolved to its parent post and the thread is scraped from there. Subreddit pages, user profiles, and search URLs are not accepted — for those, use one of the other Reddit scrapers below.

Can I export the comments to a spreadsheet?

Yes. Every dataset can be downloaded as CSV or Excel (plus JSON, XML, and HTML) directly from the platform, or fetched in any of those formats through the API for automated exports.


More Reddit scrapers {#more-reddit-scrapers}

This actor does one thing deeply. If your job is broader, pick the right tool:

ActorUse it when
Reddit ScraperYou want the all-in-one: posts, comments, subreddits, search, and user profiles in a single configurable actor.
Reddit Search ScraperYou're starting from keywords, not URLs — find posts and discussions across all of Reddit by search term.
Reddit User ScraperYou need a specific user's profile, post history, and comment history rather than a thread.
Subreddit ScraperYou want to pull an entire community's posts in bulk instead of drilling into individual threads.

Try Reddit Comments Scraper