Reddit Subreddit Scraper avatar

Reddit Subreddit Scraper

Pricing

from $4.99 / 1,000 results

Go to Apify Store
Reddit Subreddit Scraper

Reddit Subreddit Scraper

Scrape the newest, hottest, top, or rising posts from any Reddit subreddit with full metadata, images, videos, awards, and optional comments.

Pricing

from $4.99 / 1,000 results

Rating

0.0

(0)

Developer

codingfrontend

codingfrontend

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

3 days ago

Last modified

Share

Scrape the newest, hottest, top, or rising posts from any Reddit subreddit with full post metadata, images, videos, gallery content, awards, and optional comments — using Reddit's official JSON API for reliable, structured data extraction.

Features

  • Sort modes — Scrape by New, Hot, Top, Rising, or Controversial
  • Time filters — For Top/Controversial: Past Hour, Today, This Week, This Month, This Year, All Time
  • Deep scraping — Opens each post in a parallel browser tab for complete fields (self-text, images, video, gallery, awards, crosspost data)
  • Parallel tabs — Processes 3 posts simultaneously for fast extraction
  • Pagination — Automatically follows after cursor to fetch all pages
  • Comments — Optional top-20 comments with up to 5 nested replies
  • 50+ fields — Author info, flair, media, video HLS/DASH/fallback URLs, gallery images with captions, awards breakdown, moderation metadata, and more
  • Stealth mode — Random user agents, headers, and fingerprinting to avoid rate limits

Input Options

FieldTypeDefaultDescription
subredditStringrequiredSubreddit name (without r/). E.g. javascript, worldnews
sortByEnumnewnew · hot · top · rising · controversial
topTimeEnumweekFor Top/Controversial: hour · day · week · month · year · all
maxItemsNumber50Maximum posts to scrape (1–1000)
deepScrapingBooleantrueOpen each post for full metadata (recommended)
includeCommentsBooleanfalseInclude top 20 comments with nested replies
headlessBooleantrueRun browser in headless mode
proxyConfigurationObjectApify ResidentialProxy settings

Example Inputs

Scrape newest posts from r/javascript

{
"subreddit": "javascript",
"sortBy": "new",
"maxItems": 100,
"deepScraping": true,
"includeComments": false
}

Top posts from r/worldnews this month

{
"subreddit": "worldnews",
"sortBy": "top",
"topTime": "month",
"maxItems": 50,
"deepScraping": true,
"includeComments": true
}

Hot posts from r/AskReddit

{
"subreddit": "AskReddit",
"sortBy": "hot",
"maxItems": 25,
"deepScraping": true,
"includeComments": true
}

Output Dataset Fields

FieldTypeDescription
idStringReddit post ID
titleStringPost title
urlStringExternal link URL (or Reddit permalink for self posts)
permalinkStringFull Reddit URL
subredditStringSubreddit name
subredditIdStringSubreddit fullname (t5_xxx)
subredditTypeStringpublic, private, restricted
subredditSubscribersNumberSubscriber count
authorStringPost author username
authorFullnameStringAuthor fullname (t2_xxx)
authorFlairStringAuthor flair text
authorPremiumBooleanWhether author has Reddit Premium
scoreNumberUpvotes minus downvotes
upvoteRatioNumberFraction of upvotes (0.0–1.0)
numCommentsNumberTotal comment count
createdAtStringISO 8601 creation timestamp
editedAtString/BooleanISO 8601 edit timestamp or false
selfTextStringPost body text (self posts)
postTypeStringself · link · image · video · gallery · crosspost
isVideoBooleanWhether post contains hosted video
isSelfBooleanWhether it's a text/self post
isGalleryBooleanWhether it's a gallery post
isOCBooleanOriginal content flag
thumbnailStringThumbnail image URL
flairStringPost flair text
flairBackgroundColorStringFlair background hex color
domainStringLink domain
nsfwBooleanNSFW flag
spoilerBooleanSpoiler flag
lockedBooleanWhether post is locked
pinnedBooleanWhether post is pinned
stickiedBooleanWhether post is stickied
archivedBooleanWhether post is archived
distinguishedStringmoderator, admin, or null
suggestedSortStringSuggested comment sort
numCrosspostsNumberCross-post count
postHintStringself · link · image · rich:video · hosted:video
isCrossPostBooleanWhether it's a cross-post
crosspostFromStringOriginal subreddit if cross-posted
crosspostOriginalObjectFull original post object if cross-posted
awardsCountNumberTotal awards received
gildedNumberNumber of times gilded
gildingsObjectBreakdown by award tier
awardsArrayAward names and counts
previewImagesArrayPreview image objects with resolutions
videoDataObjectVideo URLs (HLS, DASH, fallback), dimensions, duration
galleryImagesArrayGallery images with URL, caption, mp4
mediaEmbedObjectEmbedded media HTML/dimensions
commentsArrayTop comments (if includeComments=true)
scrapedAtStringISO 8601 scrape timestamp

Example Output

{
"id": "1t9jt9w",
"title": "BlueJS: Compile JavaScript to 12MB native binaries",
"url": "https://github.com/example/bluejs",
"permalink": "https://www.reddit.com/r/javascript/comments/1t9jt9w/...",
"subreddit": "javascript",
"subredditSubscribers": 2400000,
"author": "dev_user",
"score": 1842,
"upvoteRatio": 0.97,
"numComments": 143,
"createdAt": "2025-06-01T10:30:00.000Z",
"postType": "link",
"domain": "github.com",
"nsfw": false,
"awardsCount": 3,
"previewImages": [{ "url": "https://...", "width": 1200, "height": 630 }],
"scrapedAt": "2025-06-01T12:00:00.000Z"
}

  • Reddit Search Scraper — Search Reddit by keyword across all subreddits
  • Reddit Post Scraper — Scrape specific Reddit post URLs for full details

Author

Built by Kumar Lakshmanan · lakshmanan.w3dev@gmail.com