Reddit Post Scraper
Pricing
from $4.99 / 1,000 results
Reddit Post Scraper
Scrape specific Reddit post URLs for full post details, body text, images, videos, gallery, awards, metadata, and optional comments with nested replies.
Pricing
from $4.99 / 1,000 results
Rating
0.0
(0)
Developer
codingfrontend
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
0
Monthly active users
5 days ago
Last modified
Categories
Share
Scrape specific Reddit post URLs for complete post data including title, body text, images, videos, gallery content, awards, crosspost details, and optional top comments with nested replies — using Reddit's official JSON API.
Ideal for monitoring specific posts, ingesting post data for NLP/analysis, or building content pipelines from curated Reddit URLs.
Features
- Batch input — Provide a list of Reddit post URLs and scrape them all
- Full post details — Title, selftext, author, score, flair, domain, awards, media, and 50+ fields
- Video extraction — HLS (streaming), DASH, and fallback video URLs with dimensions and duration
- Gallery support — All gallery images in order with captions and MP4 alternatives
- Comments — Optional top-20 comments with up to 5 nested reply levels
- Parallel execution — Processes 2 posts concurrently for efficiency
- Awards breakdown — Individual award names and gilding tiers
- Stealth mode — Random user agents, headers, and fingerprinting to avoid rate limits
Input Options
| Field | Type | Default | Description |
|---|---|---|---|
postUrls | Array | required | List of Reddit post URLs to scrape |
includeComments | Boolean | true | Fetch top 20 comments with nested replies |
headless | Boolean | true | Run browser in headless mode |
proxyConfiguration | Object | Apify Residential | Proxy settings |
Example Inputs
Scrape a single post with comments
{"postUrls": ["https://www.reddit.com/r/javascript/comments/1t9jt9w/bluejs_compile_javascript_to_12mb_native_binaries/"],"includeComments": true}
Scrape multiple posts, no comments
{"postUrls": ["https://www.reddit.com/r/programming/comments/abc123/post_one/","https://www.reddit.com/r/webdev/comments/def456/post_two/","https://www.reddit.com/r/reactjs/comments/ghi789/post_three/"],"includeComments": false}
Output Dataset Fields
| Field | Type | Description |
|---|---|---|
id | String | Reddit post ID |
title | String | Post title |
url | String | External link URL (or Reddit permalink for self posts) |
permalink | String | Full Reddit URL |
subreddit | String | Subreddit name |
subredditId | String | Subreddit fullname (t5_xxx) |
subredditType | String | public, private, restricted |
subredditSubscribers | Number | Subscriber count |
author | String | Post author username |
authorFullname | String | Author fullname (t2_xxx) |
authorFlair | String | Author flair text |
authorPremium | Boolean | Whether author has Reddit Premium |
score | Number | Upvotes minus downvotes |
upvoteRatio | Number | Fraction of upvotes (0.0–1.0) |
numComments | Number | Total comment count |
createdAt | String | ISO 8601 creation timestamp |
editedAt | String/Boolean | ISO 8601 edit timestamp or false |
selfText | String | Full post body text (self posts) |
postType | String | self · link · image · video · gallery · crosspost |
isVideo | Boolean | Whether post contains hosted video |
isSelf | Boolean | Whether it's a text/self post |
isGallery | Boolean | Whether it's a gallery post |
isOC | Boolean | Original content flag |
thumbnail | String | Thumbnail image URL |
flair | String | Post flair text |
flairBackgroundColor | String | Flair background hex color |
domain | String | Link domain |
nsfw | Boolean | NSFW flag |
spoiler | Boolean | Spoiler flag |
locked | Boolean | Whether post is locked |
pinned | Boolean | Whether post is pinned |
stickied | Boolean | Whether post is stickied |
archived | Boolean | Whether post is archived |
distinguished | String | moderator, admin, or null |
suggestedSort | String | Suggested comment sort |
numCrossposts | Number | Cross-post count |
postHint | String | self · link · image · rich:video · hosted:video |
isCrossPost | Boolean | Whether it's a cross-post |
crosspostFrom | String | Original subreddit if cross-posted |
crosspostOriginal | Object | Full original post data if cross-posted |
awardsCount | Number | Total awards received |
gilded | Number | Number of times gilded |
gildings | Object | Breakdown by award tier |
awards | Array | Award names and counts |
previewImages | Array | Preview image objects with resolutions |
videoData | Object | Video URLs (HLS, DASH, fallback), dimensions, duration |
galleryImages | Array | Gallery images with URL, caption, mp4 |
mediaEmbed | Object | Embedded media HTML/dimensions |
comments | Array | Top comments (if includeComments=true) |
scrapedAt | String | ISO 8601 scrape timestamp |
Comment Object
| Field | Type | Description |
|---|---|---|
id | String | Comment ID |
author | String | Comment author |
body | String | Comment text |
score | Number | Comment score |
createdAt | String | ISO 8601 timestamp |
edited | String/Boolean | Edit timestamp or false |
flair | String | Author flair |
distinguished | String | moderator, admin, or null |
stickied | Boolean | Whether comment is stickied |
replies | Array | Nested replies (up to 5 deep) |
Example Output
{"id": "1t9jt9w","title": "BlueJS: Compile JavaScript to 12MB native binaries","url": "https://github.com/example/bluejs","permalink": "https://www.reddit.com/r/javascript/comments/1t9jt9w/...","subreddit": "javascript","subredditSubscribers": 2400000,"author": "dev_user","score": 1842,"upvoteRatio": 0.97,"numComments": 143,"selfText": "","postType": "link","domain": "github.com","awardsCount": 3,"comments": [{"id": "cmt123","author": "another_user","body": "This is really impressive!","score": 245,"createdAt": "2025-06-01T11:00:00.000Z","replies": []}],"scrapedAt": "2025-06-01T12:00:00.000Z"}
Related Actors
- Reddit Subreddit Scraper — Scrape new/hot/top posts from a specific subreddit
- Reddit Search Scraper — Search Reddit by keyword across all subreddits
Author
Built by Kumar Lakshmanan · lakshmanan.w3dev@gmail.com