Reddit Post Scraper

Pricing

from $4.99 / 1,000 results

Scrape specific Reddit post URLs for full post details, body text, images, videos, gallery, awards, metadata, and optional comments with nested replies.

Rating: 0.0 (0)

Developer: codingfrontend (Maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 0
  • Last modified: 5 days ago

Scrape specific Reddit post URLs for complete post data, including title, body text, images, videos, gallery content, awards, crosspost details, and optional top comments with nested replies. All data is retrieved through Reddit's official JSON API.

Ideal for monitoring specific posts, ingesting post data for NLP/analysis, or building content pipelines from curated Reddit URLs.

Features

  • Batch input — Provide a list of Reddit post URLs and scrape them all
  • Full post details — Title, selftext, author, score, flair, domain, awards, media, and 50+ fields
  • Video extraction — HLS (streaming), DASH, and fallback video URLs with dimensions and duration
  • Gallery support — All gallery images in order with captions and MP4 alternatives
  • Comments — Optional top-20 comments with up to 5 nested reply levels
  • Parallel execution — Processes 2 posts concurrently for efficiency
  • Awards breakdown — Individual award names and gilding tiers
  • Stealth mode — Random user agents, headers, and fingerprinting to avoid rate limits
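
The concurrency limit in the feature list can be illustrated with a small sketch. This is not the Actor's own code: `fetch_post` is a hypothetical stand-in that sleeps instead of calling Reddit, and using a semaphore is only one assumed way to cap work at 2 posts at a time.

```python
from __future__ import annotations

import asyncio

MAX_CONCURRENCY = 2  # the "2 posts concurrently" figure from the feature list


async def fetch_post(url: str) -> dict:
    """Hypothetical placeholder: pretend to scrape one post."""
    await asyncio.sleep(0.01)  # stands in for network latency
    return {"permalink": url}


async def scrape_all(urls: list[str], limit: int = MAX_CONCURRENCY) -> list[dict]:
    """Scrape every URL, never running more than `limit` fetches at once."""
    semaphore = asyncio.Semaphore(limit)

    async def bounded(url: str) -> dict:
        async with semaphore:
            return await fetch_post(url)

    # gather() returns results in the same order as the input URLs.
    return await asyncio.gather(*(bounded(u) for u in urls))


if __name__ == "__main__":
    demo_urls = [
        f"https://www.reddit.com/r/example/comments/id{i}/post/" for i in range(5)
    ]
    results = asyncio.run(scrape_all(demo_urls))
    print(len(results))
```

A low limit like this trades throughput for a smaller request footprint, which fits the rate-limit concern the feature list mentions.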

Input Options

| Field | Type | Default | Description |
|---|---|---|---|
| postUrls | Array | required | List of Reddit post URLs to scrape |
| includeComments | Boolean | true | Fetch top 20 comments with nested replies |
| headless | Boolean | true | Run browser in headless mode |
| proxyConfiguration | Object | Apify Residential | Proxy settings |

Example Inputs

Scrape a single post with comments

{
  "postUrls": [
    "https://www.reddit.com/r/javascript/comments/1t9jt9w/bluejs_compile_javascript_to_12mb_native_binaries/"
  ],
  "includeComments": true
}

Scrape multiple posts, no comments

{
  "postUrls": [
    "https://www.reddit.com/r/programming/comments/abc123/post_one/",
    "https://www.reddit.com/r/webdev/comments/def456/post_two/",
    "https://www.reddit.com/r/reactjs/comments/ghi789/post_three/"
  ],
  "includeComments": false
}
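
Input objects like the ones above can be built and sanity-checked in code before a run. The sketch below is illustrative only: `build_input` and `POST_URL_RE` are hypothetical names, and the URL pattern is an assumption based on the `/r/<subreddit>/comments/<id>/` shape of the example URLs, not a rule documented by the Actor.

```python
import json
import re

# Assumed URL shape, based on the example inputs: /r/<subreddit>/comments/<post id>/...
POST_URL_RE = re.compile(
    r"^https?://(?:www\.)?reddit\.com/r/[A-Za-z0-9_]+/comments/[a-z0-9]+(?:/|$)"
)


def build_input(post_urls, include_comments=True):
    """Return an input dict using the field names from the Input Options table."""
    cleaned = []
    for url in post_urls:
        url = url.strip()
        if not POST_URL_RE.match(url):
            raise ValueError(f"Not a Reddit post URL: {url!r}")
        if url not in cleaned:  # drop duplicates, keep first-seen order
            cleaned.append(url)
    if not cleaned:
        raise ValueError("postUrls is required and must not be empty")
    return {"postUrls": cleaned, "includeComments": include_comments}


if __name__ == "__main__":
    run_input = build_input(
        [
            "https://www.reddit.com/r/programming/comments/abc123/post_one/",
            "https://www.reddit.com/r/webdev/comments/def456/post_two/",
        ],
        include_comments=False,
    )
    print(json.dumps(run_input, indent=2))
```

Rejecting malformed URLs up front avoids paying for a run that would fail on the first item.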

Output Dataset Fields

| Field | Type | Description |
|---|---|---|
| id | String | Reddit post ID |
| title | String | Post title |
| url | String | External link URL (or Reddit permalink for self posts) |
| permalink | String | Full Reddit URL |
| subreddit | String | Subreddit name |
| subredditId | String | Subreddit fullname (t5_xxx) |
| subredditType | String | public, private, restricted |
| subredditSubscribers | Number | Subscriber count |
| author | String | Post author username |
| authorFullname | String | Author fullname (t2_xxx) |
| authorFlair | String | Author flair text |
| authorPremium | Boolean | Whether author has Reddit Premium |
| score | Number | Upvotes minus downvotes |
| upvoteRatio | Number | Fraction of upvotes (0.0–1.0) |
| numComments | Number | Total comment count |
| createdAt | String | ISO 8601 creation timestamp |
| editedAt | String/Boolean | ISO 8601 edit timestamp or false |
| selfText | String | Full post body text (self posts) |
| postType | String | self · link · image · video · gallery · crosspost |
| isVideo | Boolean | Whether post contains hosted video |
| isSelf | Boolean | Whether it's a text/self post |
| isGallery | Boolean | Whether it's a gallery post |
| isOC | Boolean | Original content flag |
| thumbnail | String | Thumbnail image URL |
| flair | String | Post flair text |
| flairBackgroundColor | String | Flair background hex color |
| domain | String | Link domain |
| nsfw | Boolean | NSFW flag |
| spoiler | Boolean | Spoiler flag |
| locked | Boolean | Whether post is locked |
| pinned | Boolean | Whether post is pinned |
| stickied | Boolean | Whether post is stickied |
| archived | Boolean | Whether post is archived |
| distinguished | String | moderator, admin, or null |
| suggestedSort | String | Suggested comment sort |
| numCrossposts | Number | Cross-post count |
| postHint | String | self · link · image · rich:video · hosted:video |
| isCrossPost | Boolean | Whether it's a cross-post |
| crosspostFrom | String | Original subreddit if cross-posted |
| crosspostOriginal | Object | Full original post data if cross-posted |
| awardsCount | Number | Total awards received |
| gilded | Number | Number of times gilded |
| gildings | Object | Breakdown by award tier |
| awards | Array | Award names and counts |
| previewImages | Array | Preview image objects with resolutions |
| videoData | Object | Video URLs (HLS, DASH, fallback), dimensions, duration |
| galleryImages | Array | Gallery images with URL, caption, mp4 |
| mediaEmbed | Object | Embedded media HTML/dimensions |
| comments | Array | Top comments (if includeComments=true) |
| scrapedAt | String | ISO 8601 scrape timestamp |
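
Because `editedAt` is either a timestamp string or `false`, code that consumes the dataset has to branch on its type. The following is a minimal sketch, assuming timestamps look like the ones in the example output (for instance `2025-06-01T12:00:00.000Z`); `parse_timestamp` and `edit_delay_seconds` are hypothetical helper names, not part of the Actor.

```python
from datetime import datetime
from typing import Optional


def parse_timestamp(value) -> Optional[datetime]:
    """Parse an ISO 8601 string; return None for false, None, or empty values."""
    if not value or not isinstance(value, str):
        return None
    # Older Python versions reject a trailing "Z", so spell out the UTC offset.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def edit_delay_seconds(item: dict) -> Optional[float]:
    """Seconds between creation and last edit, or None if the post was never edited."""
    created = parse_timestamp(item.get("createdAt"))
    edited = parse_timestamp(item.get("editedAt"))
    if created is None or edited is None:
        return None
    return (edited - created).total_seconds()


if __name__ == "__main__":
    unedited = {"createdAt": "2025-06-01T11:00:00.000Z", "editedAt": False}
    print(edit_delay_seconds(unedited))
```

Returning `None` for unedited posts keeps the "never edited" case distinct from an edit made zero seconds after creation.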

Comment Object

| Field | Type | Description |
|---|---|---|
| id | String | Comment ID |
| author | String | Comment author |
| body | String | Comment text |
| score | Number | Comment score |
| createdAt | String | ISO 8601 timestamp |
| edited | String/Boolean | Edit timestamp or false |
| flair | String | Author flair |
| distinguished | String | moderator, admin, or null |
| stickied | Boolean | Whether comment is stickied |
| replies | Array | Nested replies (up to 5 deep) |
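
Since each comment carries its own `replies` array, a comment tree is easiest to process recursively. The sketch below flattens a tree into `(depth, comment)` pairs; `walk_comments` is a hypothetical helper, and the sample data is made up for illustration using only field names from the table above.

```python
from typing import Iterator, Tuple


def walk_comments(comments, depth: int = 0) -> Iterator[Tuple[int, dict]]:
    """Yield (depth, comment) pairs in depth-first order."""
    for comment in comments or []:
        yield depth, comment
        # `replies` may be missing or empty on leaf comments.
        yield from walk_comments(comment.get("replies"), depth + 1)


if __name__ == "__main__":
    # Made-up sample shaped like the Comment Object table.
    sample = [
        {
            "id": "c1",
            "author": "user_a",
            "body": "Top-level comment",
            "replies": [
                {"id": "c2", "author": "user_b", "body": "A reply", "replies": []}
            ],
        },
        {"id": "c3", "author": "user_c", "body": "Another top-level comment"},
    ]
    for depth, c in walk_comments(sample):
        print("  " * depth + f"{c['author']}: {c['body']}")
```

The depth value makes it straightforward to indent replies for display or to filter out deeply nested threads.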

Example Output

{
  "id": "1t9jt9w",
  "title": "BlueJS: Compile JavaScript to 12MB native binaries",
  "url": "https://github.com/example/bluejs",
  "permalink": "https://www.reddit.com/r/javascript/comments/1t9jt9w/...",
  "subreddit": "javascript",
  "subredditSubscribers": 2400000,
  "author": "dev_user",
  "score": 1842,
  "upvoteRatio": 0.97,
  "numComments": 143,
  "selfText": "",
  "postType": "link",
  "domain": "github.com",
  "awardsCount": 3,
  "comments": [
    {
      "id": "cmt123",
      "author": "another_user",
      "body": "This is really impressive!",
      "score": 245,
      "createdAt": "2025-06-01T11:00:00.000Z",
      "replies": []
    }
  ],
  "scrapedAt": "2025-06-01T12:00:00.000Z"
}

Related Actors

  • Reddit Subreddit Scraper — Scrape new/hot/top posts from a specific subreddit
  • Reddit Search Scraper — Search Reddit by keyword across all subreddits

Author

Built by Kumar Lakshmanan · lakshmanan.w3dev@gmail.com