Reddit Search Scraper

Pricing

from $4.99 / 1,000 results

Search Reddit by keyword and scrape matching posts with full metadata, images, videos, awards, and optional comments. Optionally restrict search to a single subreddit.

Developer

codingfrontend

Maintained by Community

Search Reddit by keyword or phrase and scrape all matching posts with full metadata, images, videos, gallery content, awards, and optional comments. The Actor uses Reddit's official JSON search API for reliable, structured data extraction.

Optionally restrict your search to a specific subreddit for targeted research.

Features

  • Keyword search: Search all of Reddit or within a specific subreddit
  • Sort modes: Relevance, New, Hot, Top, or Most Comments
  • Time filters: Past Hour, Today, This Week, This Month, This Year, or All Time (applies to the Top sort)
  • Deep scraping: Opens each post in its own browser tab to collect complete fields (self-text, images, video, gallery, awards, crosspost data)
  • Parallel tabs: Processes 3 posts at a time for faster extraction
  • Pagination: Automatically follows Reddit's after cursor to fetch all result pages
  • Comments: Optional top 20 comments with up to 5 nested replies
  • 50+ fields: Author info, flair, media, video HLS/DASH/fallback URLs, gallery images with captions, awards breakdown, moderation metadata, and more
  • Stealth mode: Randomized user agents, headers, and fingerprints to reduce the chance of rate limiting
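
The pagination behavior listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the Actor's actual code: `iter_search_results` and the `fetch_page` callable are hypothetical names, and the only thing assumed about Reddit is the standard listing shape, where posts sit under `data.children` and the next-page cursor under `data.after`.

```python
from typing import Callable, Iterator, Optional


def iter_search_results(
    fetch_page: Callable[[Optional[str]], dict],
    max_items: int = 50,
) -> Iterator[dict]:
    """Yield post dicts page by page until max_items is hit or the cursor runs out.

    fetch_page is a hypothetical callable: it receives the current `after`
    cursor (None for the first page) and returns one parsed listing response.
    """
    after: Optional[str] = None
    yielded = 0
    while yielded < max_items:
        listing = fetch_page(after)["data"]
        children = listing.get("children", [])
        for child in children:
            if yielded >= max_items:
                return
            yield child["data"]
            yielded += 1
        # A missing cursor (or an empty page) means there is nothing more to fetch.
        after = listing.get("after")
        if not after or not children:
            return
```

Passing the fetcher in as a callable keeps the cursor loop independent of how pages are actually retrieved (HTTP client, browser tab, or a stub in tests).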

Input Options

| Field | Type | Default | Description |
|---|---|---|---|
| searchQuery | String | required | Keyword or phrase to search for |
| subreddit | String | "" | Restrict the search to one subreddit (optional, without r/) |
| sortBy | Enum | relevance | relevance · new · hot · top · comments |
| topTime | Enum | all | Time filter for the Top sort: hour · day · week · month · year · all |
| maxItems | Number | 50 | Maximum posts to scrape (1–1000) |
| deepScraping | Boolean | true | Open each post for full metadata (recommended) |
| includeComments | Boolean | false | Include top 20 comments with nested replies |
| headless | Boolean | true | Run browser in headless mode |
| proxyConfiguration | Object | Apify Residential | Proxy settings |
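
As a rough illustration of the defaults and ranges in the table above, the input could be checked before a run with something like the sketch below. The `SearchInput` class and `parse_input` function are hypothetical helpers, not part of the Actor; the field names, defaults, and allowed values are taken from the table, and `proxyConfiguration` is left out for brevity.

```python
from dataclasses import dataclass, replace

SORT_MODES = {"relevance", "new", "hot", "top", "comments"}
TIME_FILTERS = {"hour", "day", "week", "month", "year", "all"}


@dataclass(frozen=True)
class SearchInput:
    # Field names and defaults mirror the input table.
    searchQuery: str
    subreddit: str = ""
    sortBy: str = "relevance"
    topTime: str = "all"
    maxItems: int = 50
    deepScraping: bool = True
    includeComments: bool = False
    headless: bool = True


def parse_input(raw: dict) -> SearchInput:
    """Apply defaults and reject values outside the documented ranges."""
    query = str(raw.get("searchQuery", "")).strip()
    if not query:
        raise ValueError("searchQuery is required")
    known = {k: v for k, v in raw.items() if k in SearchInput.__dataclass_fields__}
    known["searchQuery"] = query
    item = SearchInput(**known)
    if item.sortBy not in SORT_MODES:
        raise ValueError(f"sortBy must be one of {sorted(SORT_MODES)}")
    if item.topTime not in TIME_FILTERS:
        raise ValueError(f"topTime must be one of {sorted(TIME_FILTERS)}")
    if not 1 <= item.maxItems <= 1000:
        raise ValueError("maxItems must be between 1 and 1000")
    # The table asks for the subreddit name without the r/ prefix; strip it if present.
    sub = item.subreddit.strip()
    if sub.lower().startswith("r/"):
        sub = sub[2:]
    return replace(item, subreddit=sub)
```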

Example Inputs

Search all of Reddit for "React hooks"

{
  "searchQuery": "React hooks best practices",
  "sortBy": "relevance",
  "maxItems": 100,
  "deepScraping": true
}

Search within r/python for "web scraping"

{
  "searchQuery": "web scraping",
  "subreddit": "python",
  "sortBy": "top",
  "topTime": "year",
  "maxItems": 50,
  "deepScraping": true,
  "includeComments": true
}

Find the newest posts about "GPT-4o"

{
  "searchQuery": "GPT-4o",
  "sortBy": "new",
  "maxItems": 25,
  "deepScraping": true
}
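
To make the relationship between these inputs and a Reddit search request more concrete, here is a minimal sketch that turns an input object into a search URL. It assumes Reddit's public `search.json` endpoint and its `q`, `sort`, `t`, `limit`, `restrict_sr`, and `after` query parameters; `build_search_url` and `page_size` are hypothetical names, and the Actor may build its requests differently.

```python
from typing import Optional
from urllib.parse import urlencode


def build_search_url(search_input: dict, after: Optional[str] = None, page_size: int = 100) -> str:
    """Build a Reddit search URL from an Actor-style input dict (illustrative only)."""
    sort = search_input.get("sortBy", "relevance")
    params = {
        "q": search_input["searchQuery"],
        "sort": sort,
        # Never ask for more per page than the caller wants in total.
        "limit": min(page_size, search_input.get("maxItems", 50)),
    }
    if sort == "top":
        # The time filter only applies to the Top sort.
        params["t"] = search_input.get("topTime", "all")
    subreddit = search_input.get("subreddit", "")
    if subreddit:
        base = f"https://www.reddit.com/r/{subreddit}/search.json"
        params["restrict_sr"] = 1  # keep results inside this subreddit
    else:
        base = "https://www.reddit.com/search.json"
    if after:
        params["after"] = after  # cursor for the next page
    return f"{base}?{urlencode(params)}"
```

For the r/python example above, this produces a URL under `/r/python/search.json` with `sort=top` and `t=year`; for the "GPT-4o" example, no time filter is added because the sort is `new`.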

Output Dataset Fields

| Field | Type | Description |
|---|---|---|
| id | String | Reddit post ID |
| title | String | Post title |
| url | String | External link URL (or Reddit permalink for self posts) |
| permalink | String | Full Reddit URL |
| subreddit | String | Subreddit name |
| subredditId | String | Subreddit fullname (t5_xxx) |
| subredditType | String | public, private, restricted |
| subredditSubscribers | Number | Subscriber count |
| author | String | Post author username |
| authorFullname | String | Author fullname (t2_xxx) |
| authorFlair | String | Author flair text |
| authorPremium | Boolean | Whether author has Reddit Premium |
| score | Number | Upvotes minus downvotes |
| upvoteRatio | Number | Fraction of upvotes (0.0–1.0) |
| numComments | Number | Total comment count |
| createdAt | String | ISO 8601 creation timestamp |
| editedAt | String/Boolean | ISO 8601 edit timestamp or false |
| selfText | String | Post body text (self posts) |
| postType | String | self · link · image · video · gallery · crosspost |
| isVideo | Boolean | Whether post contains hosted video |
| isSelf | Boolean | Whether it's a text/self post |
| isGallery | Boolean | Whether it's a gallery post |
| isOC | Boolean | Original content flag |
| thumbnail | String | Thumbnail image URL |
| flair | String | Post flair text |
| flairBackgroundColor | String | Flair background hex color |
| domain | String | Link domain |
| nsfw | Boolean | NSFW flag |
| spoiler | Boolean | Spoiler flag |
| locked | Boolean | Whether post is locked |
| pinned | Boolean | Whether post is pinned |
| stickied | Boolean | Whether post is stickied |
| archived | Boolean | Whether post is archived |
| distinguished | String | moderator, admin, or null |
| suggestedSort | String | Suggested comment sort |
| numCrossposts | Number | Cross-post count |
| postHint | String | self · link · image · rich:video · hosted:video |
| isCrossPost | Boolean | Whether it's a cross-post |
| crosspostFrom | String | Original subreddit if cross-posted |
| crosspostOriginal | Object | Full original post object if cross-posted |
| awardsCount | Number | Total awards received |
| gilded | Number | Number of times gilded |
| gildings | Object | Breakdown by award tier |
| awards | Array | Award names and counts |
| previewImages | Array | Preview image objects with resolutions |
| videoData | Object | Video URLs (HLS, DASH, fallback), dimensions, duration |
| galleryImages | Array | Gallery images with URL, caption, mp4 |
| mediaEmbed | Object | Embedded media HTML/dimensions |
| comments | Array | Top comments (if includeComments=true) |
| scrapedAt | String | ISO 8601 scrape timestamp |
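
To illustrate how a few of these fields might be derived from Reddit's raw post JSON, consider the sketch below. It is not the Actor's implementation: `to_iso`, `classify_post`, and `normalize_post` are hypothetical helpers, the raw keys (`created_utc`, `upvote_ratio`, `num_comments`, `over_18`, `edited`, `is_self`, `is_video`, `is_gallery`, `post_hint`, `crosspost_parent_list`) are the ones Reddit's listing JSON uses, and the precedence order inside `classify_post` is an assumption.

```python
from datetime import datetime, timezone


def to_iso(epoch_seconds: float) -> str:
    """Convert a Unix timestamp to an ISO 8601 UTC string with millisecond precision."""
    stamp = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return stamp.isoformat(timespec="milliseconds").replace("+00:00", "Z")


def classify_post(raw: dict) -> str:
    """Pick one postType value; the precedence order here is an assumption."""
    if raw.get("crosspost_parent_list"):
        return "crosspost"
    if raw.get("is_gallery"):
        return "gallery"
    if raw.get("is_video"):
        return "video"
    if raw.get("is_self"):
        return "self"
    if raw.get("post_hint") == "image":
        return "image"
    return "link"


def normalize_post(raw: dict) -> dict:
    """Map a subset of raw Reddit keys to the camelCase output fields."""
    edited = raw.get("edited", False)  # Reddit sends false or an epoch timestamp
    return {
        "id": raw["id"],
        "title": raw["title"],
        "permalink": "https://www.reddit.com" + raw["permalink"],
        "subreddit": raw["subreddit"],
        "author": raw["author"],
        "score": raw["score"],
        "upvoteRatio": raw.get("upvote_ratio"),
        "numComments": raw.get("num_comments", 0),
        "createdAt": to_iso(raw["created_utc"]),
        "editedAt": to_iso(edited) if edited else False,
        "postType": classify_post(raw),
        "nsfw": bool(raw.get("over_18", False)),
    }
```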

Example Output

{
  "id": "1abc123",
  "title": "React hooks best practices for 2025",
  "url": "https://example.com/react-hooks",
  "permalink": "https://www.reddit.com/r/reactjs/comments/1abc123/...",
  "subreddit": "reactjs",
  "author": "web_dev",
  "score": 892,
  "upvoteRatio": 0.95,
  "numComments": 67,
  "createdAt": "2025-05-20T14:00:00.000Z",
  "postType": "link",
  "domain": "example.com",
  "nsfw": false,
  "scrapedAt": "2025-06-01T12:00:00.000Z"
}
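
Once items like the one above have been downloaded from the dataset, they can be post-processed with ordinary Python. The sketch below is one possible approach; `parse_timestamp`, `top_posts`, and `age_at_scrape_hours` are hypothetical helpers, and the default `min_ratio` of 0.9 is an arbitrary example threshold.

```python
from datetime import datetime
from typing import List


def parse_timestamp(value: str) -> datetime:
    """Parse the ISO 8601 strings used in createdAt and scrapedAt."""
    # Rewrite the trailing Z so older Python versions can parse it too.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def top_posts(items: List[dict], min_ratio: float = 0.9, limit: int = 10) -> List[dict]:
    """Keep well-received posts and return the highest-scoring ones first."""
    kept = [item for item in items if item.get("upvoteRatio", 0) >= min_ratio]
    return sorted(kept, key=lambda item: item["score"], reverse=True)[:limit]


def age_at_scrape_hours(item: dict) -> float:
    """Hours between when the post was created and when it was scraped."""
    delta = parse_timestamp(item["scrapedAt"]) - parse_timestamp(item["createdAt"])
    return delta.total_seconds() / 3600
```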

Related Actors

  • Reddit Subreddit Scraper: Scrape new/hot/top posts from a specific subreddit
  • Reddit Post Scraper: Scrape specific Reddit post URLs for full details

Author

Built by Kumar Lakshmanan · lakshmanan.w3dev@gmail.com