Reddit Scraper avatar

Reddit Scraper

Pricing

from $0.50 / 1,000 reddit post scrapeds

Go to Apify Store
Reddit Scraper

Reddit Scraper

Scrape posts from any subreddit or user profile — no key, no login. Sort by top/hot/new/rising/controversial over any time window. Returns full post data plus optional clean-text/TTS-ready fields. For research, trend monitoring, video pipelines, datasets, lead gen, and more.

Pricing

from $0.50 / 1,000 reddit post scrapeds

Rating

0.0

(0)

Developer

Dami's Studio

Dami's Studio

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Pulls posts from any subreddit or user profile without an API key or login. Point it at a list of sources, pick a sort and time window, and you get back full post data including the body text (selftext), with optional cleaned, TTS-ready narration. Built for people batching Reddit content: short-form video scripts, research datasets, content monitoring.

How it works

Reddit hard-blocks the anonymous .json API but still serves Atom RSS feeds from separate infrastructure that isn't rate-limited the same way, so the default rss method reads those feeds and parses them into structured records. RSS doesn't expose upvote scores or comments, so if you need those, add free Reddit "script" app credentials and the actor switches to the authenticated OAuth endpoint for that data.

Input

Nothing is strictly required. If you pass no sources, the actor falls back to its prefilled examples. The fields you'll actually touch:

FieldRequiredNotes
sourcesnoArray of subreddits and/or user profiles. Accepts tifu, r/tifu, u/spez, user/spez, or a full reddit.com URL. Mixed lists are fine.
sortnoOne of top, hot, new, rising, controversial. Defaults to top.
timenoTime window for top and controversial: hour, day, week, month, year, all. Ignored for the other sorts. Defaults to day.
maxPostsPerSubredditnoHow many posts to return per source. Defaults to 10, max 100.
methodnorss (default, no login), oauth (uses the creds below, adds scores/comments), or json (legacy anonymous API, usually blocked).
cleanTextnoStrips markdown, links, and edit stamps to produce narration sentences. On by default. Turn it off for raw text.
requireStorynoKeep only text posts and apply the word-count fit. Off by default, which returns all post types.
commentLimitnoTop comments to fetch per post. Needs Reddit app creds to be reliable; anonymous comment access is usually blocked. Defaults to 0.
redditClientId / redditClientSecretnoFree "script" app credentials from reddit.com/prefs/apps. Only needed for comments and upvote scores.
minScore, minWords, maxWords, minHookScore, includeNsfw, dedupeAcrossRunsnoFilters and run-to-run dedupe. Sensible defaults; leave them alone unless you're filtering for short-form fit.
proxyConfigurationnoResidential proxy is recommended at volume. The RSS feeds work from most IPs, so it's not mandatory for small runs.

Output

Each post is one dataset item. Alongside the metadata you get selftext (the full post body) and, when cleanText is on, a narration field plus ttsSegments split into sentences ready to hand to a text-to-speech step.

Key fields: id, subreddit, title, url, author, score, numComments, createdUtc, over18, selftext, postType, narration, ttsSegments, wordCount, readTimeSeconds, hookScore, fitsShort. When commentLimit is set, a comments array (author, body, score, depth) and fetchedCommentCount are added. Set rawMode if you'd rather have the unshaped Reddit post object.

Example

{
"sources": ["r/tifu", "r/AskReddit", "u/GallowBoob"],
"sort": "top",
"time": "week",
"maxPostsPerSubreddit": 25,
"cleanText": true
}

Pricing

$0.50 per 1000 posts, pay per result, no subscription. Steps that use an AI model expect your own OpenAI key.

Notes

Scores and comments are not in the RSS feed. If you need them, create a free script app at reddit.com/prefs/apps and pass redditClientId and redditClientSecret. Anonymous comment fetching is mostly blocked by Reddit and will usually come back empty.