Reddit API Scraper avatar

Reddit API Scraper

Pricing

from $5.00 / 1,000 dataset rows

Go to Apify Store
Reddit API Scraper

Reddit API Scraper

Scrape Reddit posts, comments, search results, subreddits, and user profiles via Reddit's OAuth data API. No login required.

Pricing

from $5.00 / 1,000 dataset rows

Rating

0.0

(0)

Developer

R.L.

R.L.

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

Reddit API Scraper extracts posts, comments, search results, subreddit metadata, and user profiles from Reddit — fast, structured, and without a login. It talks directly to Reddit's official OAuth data API (the same endpoints the Reddit mobile app uses), so you get clean JSON instead of brittle HTML scraping. Point it at a subreddit, a search query, a post URL, or a username and download the results as JSON, CSV, or Excel.

Running on the Apify platform, you also get scheduling, a REST API, webhook and integration support, proxy rotation, and run monitoring out of the box.

Why use Reddit API Scraper?

  • Market & audience research — track what communities are discussing around your product, niche, or competitors.
  • Brand & reputation monitoring — surface every mention of a keyword across Reddit and read the full comment threads.
  • Content & trend discovery — pull top/hot/rising posts from any subreddit to spot emerging topics.
  • Sentiment & NLP datasets — collect large, structured post + comment corpora for analysis or model training.
  • Lead and community analysis — profile users and inspect their posting history.

How to use Reddit API Scraper

  1. Open the Actor and go to the Input tab.
  2. Fill in one or more source fields: Subreddits, Search queries, Post URLs / IDs, or Usernames.
  3. Optionally set the sort order, time window, item limits, and whether to include comments.
  4. Click Start and watch the run log.
  5. When it finishes, open the Output / Storage tab and download your data as JSON, CSV, or Excel.

Input

You can combine any of the source types in a single run. At least one is required.

FieldTypeDescription
subredditsarraySubreddit names, r/-prefixed names, or URLs to pull post listings from.
sortenumListing sort: hot, new, top, rising, controversial.
timeenumTime window for top/controversial: hourall.
searchQueriesarrayFull-text search queries to run.
searchSortenumSearch sort: relevance, hot, top, new, comments.
searchInSubredditsbooleanRestrict each search query to the listed subreddits.
postUrlsarrayIndividual post permalinks/IDs — fetches full self-text + comments.
usernamesarrayUsers to scrape (profile + history).
userContentenumoverview, submitted, comments, or about.
includeCommentsbooleanAlso fetch comment trees for posts collected from subreddits/searches.
maxCommentsintegerMax comments emitted per post.
commentSortenumComment sort: confidence, top, new, controversial, old, qa.
maxItemsPerSourceintegerMax posts per subreddit/search/user (0 = unlimited).
includeNsfwbooleanInclude NSFW results in search.
proxyConfigurationobjectProxy settings (Apify Proxy recommended for larger runs).

Example input

{
"subreddits": ["programming", "r/python"],
"sort": "top",
"time": "week",
"maxItemsPerSource": 100,
"includeComments": true,
"maxComments": 50,
"searchQueries": ["large language models"],
"searchSort": "new"
}

Output

Each item is pushed to the dataset with a type field (post, comment, subreddit, or user). You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Example output (a post)

{
"type": "post",
"id": "1ukim8j",
"subreddit": "programming",
"title": "Open source is a thankless job...",
"author": "swithek",
"selftext": "",
"score": 1092,
"upvoteRatio": 0.97,
"numComments": 267,
"createdAt": "2026-07-01T11:09:38+00:00",
"url": "https://example.com/article",
"permalink": "https://www.reddit.com/r/programming/comments/1ukim8j/..."
}

Example output (a comment)

{
"type": "comment",
"id": "ouw01y8",
"postId": "1ukim8j",
"author": "SP-Niemand",
"body": "As a user, you get to demand exactly nothing...",
"score": 453,
"depth": 0,
"createdAt": "2026-07-01T12:30:00+00:00"
}

Data fields

FieldApplies toDescription
typeallpost, comment, subreddit, or user.
idallBase-36 Reddit ID.
subredditpost, commentSubreddit name.
titlepost, subredditPost title / subreddit title.
authorpost, commentUsername of the author.
selftext / bodypost / commentFull text content.
score, ups, upvoteRatiopost, commentVote metrics.
numCommentspostComment count.
postId, parentId, depthcommentThread position.
subscribers, activeUserCountsubredditCommunity size.
commentKarma, linkKarma, totalKarmauserKarma breakdown.
createdAtallISO-8601 UTC creation time.
permalink, urlpost, commentLinks.

How much does it cost to scrape Reddit?

This Actor uses pay-per-result pricing: $5.00 per 1,000 rows ($0.005 per row). Every item written to the dataset — a post, a comment, a subreddit, or a user — counts as one row. So a run that collects 500 posts and 5,000 comments (5,500 rows) costs about $27.50.

Platform usage (compute + proxy) is billed on top of the per-row price at Apify's standard rates. The Actor is lightweight — it calls a JSON API rather than rendering pages — so compute is minimal; the main variable cost is proxy traffic if you enable Apify Proxy. To control spend, set Max total charge on the run and/or lower maxItemsPerSource and maxComments; the Actor stops as soon as the charge cap is reached. Reddit limits each OAuth token to roughly 100 requests per minute, so very large runs are paced accordingly.

Tips & advanced options

  • Fewer requests, faster runs: leave includeComments off if you only need post metadata.
  • Targeted comment collection: put specific posts in postUrls — their comments are always fetched.
  • Scoped search: enable searchInSubreddits with a subreddits list to search within specific communities.
  • Unlimited collection: set maxItemsPerSource to 0 to page through everything the API returns (Reddit typically caps listings around ~1000 items per source).
  • Scale past rate limits: run with Apify Proxy and/or split large jobs into scheduled runs.

FAQ, disclaimers, and support

Is scraping Reddit legal? This Actor collects only publicly available data. You are responsible for how you use it — comply with applicable laws (including data-protection law such as GDPR) and avoid collecting personal or sensitive data without a lawful basis.

Limitations: Reddit caps listing pagination (~1000 items per source) and comment trees may be truncated for very large threads. Search uses Reddit's REST search endpoint.

Support: found a bug or need a field that isn't extracted? Open an issue on the Actor's Issues tab. Custom scraping solutions can also be arranged.