Reddit Comments Scraper avatar

Reddit Comments Scraper

Pricing

from $3.50 / 1,000 comments

Go to Apify Store
Reddit Comments Scraper

Reddit Comments Scraper

The most powerful Reddit comments on Apify. Get comprehensive comments data from direct urls, post urls, or user profiles. Power your analysis with comment replies from post urls up to any depth. This tools is your best bet to unlock Reddit's ecosystem.

Pricing

from $3.50 / 1,000 comments

Rating

0.0

(0)

Developer

tolu.

tolu.

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

4 days ago

Last modified

Share

🚀 Reddit Comments Scraper

The most powerful Reddit comments on Apify. Get comprehensive comments data from direct urls, post urls, or user profiles. Power your analysis with comment replies from post urls up to any depth. This tools is your best bet to unlock Reddit's ecosystem.

💎 Why use this actor?

  • Comprehensive comments data from direct urls, post urls, or user profiles.
  • Complete post comments and replies.
  • Traverse post comments by depth or breadth.
  • Maximum depth and comment options to limit results.

⚙️ Input

ParameterTypeDescriptionDefault Value
startUrlsarrayEnter post, direct comment, or user urls.-
Post Settings
postSortByconfidence|top|new|controversial|old|qaChoose filter to sort post comments.confidence
postTraversalStrategydepth|breadthChoose the strategy for traversing the comment tree.depth
postMaxDepthintegerSet the maximum depth to reach in the comment tree. Use -1 to reach the maximum depth possible.-1
postMaxCommentsintegerSet maximum number of comments to scrape. Use 0 to scrape all comments.100
postIncludeModThreadsbooleanClick to include moderator threads in the output.true
postIncludeUnavailableThreadsbooleanClick to include unavailable (deleted or removed) threads in the output.true
User Settings
userSortByhot|new|topChoose filter to sort user comments.new
userSortTimePeriodhour|day|week|month|year|allChoose time period for user sort setting.day
userMaxCommentsintegerSet maximum number of comments to scrape. Use 0 to scrape all comments.100

📊 Output

Fields

  • input: Input direct, post, or user url.
  • category: Comment category. One of direct, post, or user.
  • rank: Rank of the comment based on the sort order. Valid for post and user category.
  • id: Comment id.
  • permalink: Permanent link to the comment.
  • postId: Id of the post where the comment was sent.
  • postPermalink: Permanant link of the post where the comment was sent.
  • parentId: Id of the parent comment to this comment. If null, it is a top-level comment.
  • createdAt: Datetime when the comment was created.
  • editedAt: Datetime when the comment was last edited. If null, the comment has never been edited.
  • content: Content of the comment. If null, the comment has been deleted or removed.
  • isAvailable: True if the comment is available i.e. it has not been deleted or removed.
  • unavailableReason: Reason why the comment is unavailable. If the comment is available, it will be null. Otherwise, it will be one of removed, deleted, or removed by reddit.
  • subreddit: Comment subreddit information.
    • subreddit.id: Subreddit id.
    • subreddit.name: Subreddit name.
    • subreddit.url: Subreddit url.
  • author: Comment author information.
    • author.id: Author id. If null, the author has been deleted or removed.
    • author.name: Author name. If null, the author has been deleted or removed.
    • author.url: Author url. If null, the author has been deleted or removed.
    • author.isDeleted: True if the author is deleted or removed.
    • author.isOp: True if the author is the original poster (OP) of the post the comment is in.
    • author.isMod: True if the author is a moderator of the subreddit the comment is in.
  • depth: The comment depth. 0 means it is a top-level comment, 1 means it is a reply to a top-level comment. 2 means it is a reply to a reply to a top-level comment, and so on. Note that while every comment has a depth, the field is only available for the post category i.e. only for post url inputs. For the direct and user category, the value is null.
  • controversiality: The controversiality of the comment. From my research, it seems that the values are only 0 and 1, therefore it could be thought of as a flag.
  • score: Score of the post i.e. the difference between the number of upvotes and the number of downvotes.

Examples

{
"input": "https://www.reddit.com/user/Curious_Coder5445/",
"category": "user",
"rank": 2,
"id": "t1_odrn8wd",
"permalink": "https://www.reddit.com/r/b2b_sales/comments/1s9uncf/i_built_a_massive_list_of_local_businesses_with/odrn8wd/",
"postId": "t3_1s9uncf",
"postPermalink": "https://www.reddit.com/r/b2b_sales/comments/1s9uncf/i_built_a_massive_list_of_local_businesses_with/",
"parentId": "t1_odrg3wh",
"createdAt": "2026-04-01T21:03:33+00:00",
"editedAt": null,
"content": "Haha fair enough mate. Didn't expect to sell it directly from here, but my DMs are open if anyone needs a look",
"isAvailable": true,
"unavailableReason": null,
"subreddit": {
"id": "t5_3bphg",
"name": "r/b2b_sales",
"url": "https://www.reddit.com/r/b2b_sales/"
},
"author": {
"id": "t2_1a0azx17aj",
"name": "u/Curious_Coder5445",
"url": "https://www.reddit.com/user/Curious_Coder5445/",
"isDeleted": false,
"isOp": true,
"isMod": false
},
"depth": null,
"controversiality": 0,
"score": 1
}
{
"input": "https://www.reddit.com/r/webscraping/comments/1sjd609/stop_defaulting_to_seleniumplaywright_check_the/",
"category": "post",
"rank": 3,
"id": "t1_ofrqm9n",
"permalink": "https://www.reddit.com/r/webscraping/comments/1sjd609/stop_defaulting_to_seleniumplaywright_check_the/ofrqm9n/",
"postId": "t3_1sjd609",
"postPermalink": "https://www.reddit.com/r/webscraping/comments/1sjd609/stop_defaulting_to_seleniumplaywright_check_the/",
"parentId": "t1_ofqqt4m",
"createdAt": "2026-04-12T15:45:21+00:00",
"editedAt": null,
"content": "This is assumes you are using some kind of proxy rotation right? Otherwise it seems you are constantly flagging your own ip doing every iteration test or am I wrong? Is it sticky sessions or just get it and get out single internal api requests?\n\nSorry for the nooby question xD",
"isAvailable": true,
"unavailableReason": null,
"subreddit": {
"id": "t5_318ly",
"name": "r/webscraping",
"url": "https://www.reddit.com/r/webscraping/"
},
"author": {
"id": "t2_5iltyxdn",
"name": "u/pck91999",
"url": "https://www.reddit.com/user/pck91999/",
"isDeleted": false,
"isOp": false,
"isMod": false
},
"depth": 1,
"controversiality": 0,
"score": 2
}
{
"input": "https://www.reddit.com/r/webscraping/comments/1sjd609/stop_defaulting_to_seleniumplaywright_check_the/ofrtmnr/",
"category": "direct",
"rank": null,
"id": "t1_ofrtmnr",
"permalink": "https://www.reddit.com/r/webscraping/comments/1sjd609/stop_defaulting_to_seleniumplaywright_check_the/ofrtmnr/",
"postId": "t3_1sjd609",
"postPermalink": "https://www.reddit.com/r/webscraping/comments/1sjd609/stop_defaulting_to_seleniumplaywright_check_the/",
"parentId": "t1_ofrqm9n",
"createdAt": "2026-04-12T15:59:06+00:00",
"editedAt": null,
"content": "Good point, I'd probably use a VPN during experimentation. Better yet, a remote VM as you can even be fingerprinted through proxies and VPNs",
"isAvailable": true,
"unavailableReason": null,
"subreddit": {
"id": "t5_318ly",
"name": "r/webscraping",
"url": "https://www.reddit.com/r/webscraping/"
},
"author": {
"id": "t2_vep5j",
"name": "u/matty_fu",
"url": "https://www.reddit.com/user/matty_fu/",
"isDeleted": false,
"isOp": false,
"isMod": false
},
"depth": null,
"controversiality": 0,
"score": 4
}