Reddit Search Scraper

Search Reddit by keyword and scrape matching posts with full metadata, images, videos, awards, and optional comments. Optionally restrict search to a single subreddit.

Pricing

from $4.99 / 1,000 results

Rating

0.0

(0)

Developer

Coding Frontned

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

Features

Keyword search — Search all of Reddit or within a specific subreddit
Sort modes — Relevance, New, Hot, Top, or Most Comments
Time filters — For Top sort: Past Hour, Today, This Week, This Month, This Year, All Time
Deep scraping — Opens each post in a parallel browser tab for complete fields (self-text, images, video, gallery, awards, crosspost data)
Parallel tabs — Processes 3 posts simultaneously for fast extraction
Pagination — Automatically follows after cursor to fetch all result pages
Comments — Optional top-20 comments with up to 5 nested replies
50+ fields — Author info, flair, media, video HLS/DASH/fallback URLs, gallery images with captions, awards breakdown, moderation metadata, and more
Stealth mode — Random user agents, headers, and fingerprinting to avoid rate limits

Input Options

Field	Type	Default	Description
`searchQuery`	String	required	Keyword or phrase to search for
`subreddit`	String	`""`	Restrict to subreddit (optional, without `r/`)
`sortBy`	Enum	`relevance`	`relevance` · `new` · `hot` · `top` · `comments`
`topTime`	Enum	`all`	For Top: `hour` · `day` · `week` · `month` · `year` · `all`
`maxItems`	Number	`50`	Maximum posts to scrape (1–1000)
`deepScraping`	Boolean	`true`	Open each post for full metadata (recommended)
`includeComments`	Boolean	`false`	Include top 20 comments with nested replies
`headless`	Boolean	`true`	Run browser in headless mode
`proxyConfiguration`	Object	Apify Residential	Proxy settings

Example Inputs

Search all of Reddit for "React hooks"

{
  "searchQuery": "React hooks best practices",
  "sortBy": "relevance",
  "maxItems": 100,
  "deepScraping": true
}

Search within r/python for "web scraping"

{
  "searchQuery": "web scraping",
  "subreddit": "python",
  "sortBy": "top",
  "topTime": "year",
  "maxItems": 50,
  "deepScraping": true,
  "includeComments": true
}

Find the newest posts about "GPT-4o"

{
  "searchQuery": "GPT-4o",
  "sortBy": "new",
  "maxItems": 25,
  "deepScraping": true
}

Output Dataset Fields

Field	Type	Description
`id`	String	Reddit post ID
`title`	String	Post title
`url`	String	External link URL (or Reddit permalink for self posts)
`permalink`	String	Full Reddit URL
`subreddit`	String	Subreddit name
`subredditId`	String	Subreddit fullname (t5_xxx)
`subredditType`	String	`public`, `private`, `restricted`
`subredditSubscribers`	Number	Subscriber count
`author`	String	Post author username
`authorFullname`	String	Author fullname (t2_xxx)
`authorFlair`	String	Author flair text
`authorPremium`	Boolean	Whether author has Reddit Premium
`score`	Number	Upvotes minus downvotes
`upvoteRatio`	Number	Fraction of upvotes (0.0–1.0)
`numComments`	Number	Total comment count
`createdAt`	String	ISO 8601 creation timestamp
`editedAt`	String/Boolean	ISO 8601 edit timestamp or `false`
`selfText`	String	Post body text (self posts)
`postType`	String	`self` · `link` · `image` · `video` · `gallery` · `crosspost`
`isVideo`	Boolean	Whether post contains hosted video
`isSelf`	Boolean	Whether it's a text/self post
`isGallery`	Boolean	Whether it's a gallery post
`isOC`	Boolean	Original content flag
`thumbnail`	String	Thumbnail image URL
`flair`	String	Post flair text
`flairBackgroundColor`	String	Flair background hex color
`domain`	String	Link domain
`nsfw`	Boolean	NSFW flag
`spoiler`	Boolean	Spoiler flag
`locked`	Boolean	Whether post is locked
`pinned`	Boolean	Whether post is pinned
`stickied`	Boolean	Whether post is stickied
`archived`	Boolean	Whether post is archived
`distinguished`	String	`moderator`, `admin`, or `null`
`suggestedSort`	String	Suggested comment sort
`numCrossposts`	Number	Cross-post count
`postHint`	String	`self` · `link` · `image` · `rich:video` · `hosted:video`
`isCrossPost`	Boolean	Whether it's a cross-post
`crosspostFrom`	String	Original subreddit if cross-posted
`crosspostOriginal`	Object	Full original post object if cross-posted
`awardsCount`	Number	Total awards received
`gilded`	Number	Number of times gilded
`gildings`	Object	Breakdown by award tier
`awards`	Array	Award names and counts
`previewImages`	Array	Preview image objects with resolutions
`videoData`	Object	Video URLs (HLS, DASH, fallback), dimensions, duration
`galleryImages`	Array	Gallery images with URL, caption, mp4
`mediaEmbed`	Object	Embedded media HTML/dimensions
`comments`	Array	Top comments (if `includeComments=true`)
`scrapedAt`	String	ISO 8601 scrape timestamp

Example Output

{
  "id": "1abc123",
  "title": "React hooks best practices for 2025",
  "url": "https://example.com/react-hooks",
  "permalink": "https://www.reddit.com/r/reactjs/comments/1abc123/...",
  "subreddit": "reactjs",
  "author": "web_dev",
  "score": 892,
  "upvoteRatio": 0.95,
  "numComments": 67,
  "createdAt": "2025-05-20T14:00:00.000Z",
  "postType": "link",
  "domain": "example.com",
  "nsfw": false,
  "scrapedAt": "2025-06-01T12:00:00.000Z"
}

Reddit Subreddit Scraper — Scrape new/hot/top posts from a specific subreddit
Reddit Post Scraper — Scrape specific Reddit post URLs for full details