Subreddit Scraper - Whole Subreddits, No 1k Post Cap ($1.5/1k) avatar

Subreddit Scraper - Whole Subreddits, No 1k Post Cap ($1.5/1k)

Pricing

from $1.50 / 1,000 result saveds

Go to Apify Store
Subreddit Scraper - Whole Subreddits, No 1k Post Cap ($1.5/1k)

Subreddit Scraper - Whole Subreddits, No 1k Post Cap ($1.5/1k)

Scrape entire subreddits — thousands of posts per community, far beyond Reddit's ~1,000-post listing cap, by combining every sort and time window. Optional comments per post. Archive communities or build ML datasets. No API key. CSV/Excel/JSON. From $1.50 per 1,000 posts.

Pricing

from $1.50 / 1,000 result saveds

Rating

5.0

(2)

Developer

Harsh Maur

Harsh Maur

Maintained by Community

Actor stats

2

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Share

Subreddit Scraper — Scrape an Entire Subreddit Past the 1,000-Post Limit (No API Key)

Download all posts from a subreddit — thousands per community, far beyond Reddit's ~1,000-post listing cap. Optional comments, no API key, no login. From $1.50 per 1,000 results.

Try on Apify Input Schema API Docs

What it doesHow to use1,000-post limitInputOutputPricingFAQ


What does Subreddit Scraper do? {#what-does-subreddit-scraper-do}

Subreddit Scraper is a purpose-built subreddit scraper that takes a community name — AskReddit, r/AskReddit, or a full URL — and pulls as much of that subreddit as possible into one deduplicated dataset. Where an ordinary scrape stops at the ~1,000 posts Reddit exposes per listing, this actor walks every sort order across every time window plus per-flair listings, merging the results so you keep going long after a single listing runs dry.

  • 🗂️ Deep coverage — combines new, top, and controversial sorts over hour/day/week/month/year/all-time windows, then adds flair-filtered listings on top
  • 💬 Optional comments — flip one switch to also pull each post's comment thread
  • 🔑 Zero setup — no Reddit account, no OAuth app, no API key; sidesteps Reddit's 600 requests/10 min API ceiling entirely
  • 📊 70+ fields per post — full text and HTML, media and galleries, flair, flags, and derived engagement metrics
  • 🔄 Export and automate — JSON, CSV, Excel, XML, HTML downloads; integrates with n8n, Zapier, Make, the REST API, and MCP
  • 💰 Pay per result — from $1.50 per 1,000 results, no monthly fee

If you want keyword search, single posts, or user profiles, grab the all-in-one Reddit Scraper instead — this actor does one thing deeply: whole communities.


How to scrape an entire subreddit {#how-to-scrape-an-entire-subreddit}

No code needed — a deep subreddit pull takes five steps:

  1. Sign up for Apify for free (or log in)
  2. Open Subreddit Scraper
  3. Type the communities you want into Subreddits — any format works: AskReddit, r/AskReddit, or https://www.reddit.com/r/AskReddit/
  4. Set Max posts (default 1,000 — raise it up to 50,000 to go deep) and optionally enable Scrape comments for each post
  5. Press Start, then download the dataset as JSON, CSV, Excel, XML, or HTML

💡 Tip: Start with a small maxPostsCount to verify the output shape, then rerun with a high cap for the full archive pull.


How Subreddit Scraper bypasses Reddit's 1,000-post listing cap {#1000-post-limit}

Reddit never serves more than roughly 1,000 posts per listing. Page through r/technology/new as long as you like — the feed simply ends. The same wall applies to top, hot, and controversial, which is why most subreddit scrapers quietly top out around a thousand items no matter what limit you ask for.

This actor's workaround is a union of listings instead of one listing:

  1. Every sort order — it crawls new, top, and controversial separately
  2. Every time windowtop and controversial are fetched for hour, day, week, month, year, and all-time, and each window surfaces a different slice of history
  3. Every flair — the subreddit's flair-filtered listings are crawled too, reaching posts that never appeared in any global sort
  4. Deduplication — overlapping results are merged by post ID, so each post appears exactly once in your dataset

Each listing is individually capped at ~1,000 posts, but their union is not — on active communities this routinely yields several times more than any single listing could. Honest caveat: it still cannot guarantee literally every post ever submitted; very old posts that fall outside all current listings stay out of reach for any scraper.


Input example {#input-example}

The full reference lives on the Input Schema tab. A typical deep-archive run looks like this:

{
"subreddits": ["r/buildapc", "MachineLearning", "https://www.reddit.com/r/cscareerquestions/"],
"maxPostsCount": 20000,
"crawlCommentsPerPost": true,
"maxCommentsPerPost": 25,
"includeNSFW": false,
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}
FieldTypeWhat it controls
subredditsarrayOne or more communities in any format — bare name, r/name, or full URL
maxPostsCountintegerTotal posts to save across all subreddits (default 1,000, max 50,000)
crawlCommentsPerPostbooleanAlso collect comments for every post found (default off — adds run time)
maxCommentsPerPostintegerComment cap per post when comment crawling is on (default 10)
includeNSFWbooleanKeep posts marked 18+ in the results (default off)
proxyobjectProxy settings — Apify residential proxies recommended

Output example {#output-example}

Every post arrives as a structured item with 70+ fields, downloadable as JSON, CSV, Excel, XML, or HTML. A trimmed example:

{
"dataType": "post",
"id": "t3_1xyz789",
"parsedId": "1xyz789",
"title": "Finally finished my first PC build — lessons learned",
"body": "After three months of part-hunting, here is everything I wish I had known...",
"bodyHtml": "<p>After three months of part-hunting...</p>",
"postType": "text",
"flair": "Build Complete",
"authorName": "first_time_builder",
"authorId": "t2_abc123",
"communityName": "r/buildapc",
"parsedCommunityName": "buildapc",
"subredditName": "buildapc",
"subredditId": "t5_2s3nh",
"subredditSubscribers": 7200000,
"score": 2841,
"upVotes": 2841,
"upvoteRatio": 0.96,
"commentsCount": 312,
"postUrl": "https://www.reddit.com/r/buildapc/comments/1xyz789/",
"nsfw": false,
"over18": false,
"isSelf": true,
"locked": false,
"stickied": false,
"archived": false,
"totalAwardsReceived": 1,
"domain": "self.buildapc",
"mediaType": "text",
"hasMedia": false,
"isVideo": false,
"isGallery": false,
"ageHours": 52.4,
"scorePerHour": 54.22,
"commentsPerHour": 5.95,
"engagementTotal": 3153,
"isHighEngagement": true,
"wordCount": 540,
"createdAt": "2026-06-08T09:15:00.000Z",
"crawledAt": "2026-06-10T13:40:00.000Z"
}

Posts with media additionally populate galleryImages, mediaAssets, videoUrl, thumbnail, and the raw Reddit objects (media, secureMedia, mediaMetadata, galleryData). With comment crawling enabled, comment items (dataType: "comment") carry the full thread context — parentId, depth, isSubmitter, controversiality, plus the parent post's title and URL.


What can you do with a full subreddit dataset?

  • 🔬 Community research — study how a community's topics, norms, and vocabulary evolved across thousands of posts, not just last week's front page
  • 🤖 ML and NLP datasets — turn a niche subreddit into a labeled training corpus; flair and engagement fields come free as features
  • 📈 Trend analysis — chart scorePerHour and posting volume over time to spot when a topic took off inside a community
  • 🗄️ Archiving — archive a subreddit before it goes private, gets banned, or its mods wipe history; export to CSV and keep it forever
  • 🕵️ Competitor community analysis — pull a rival product's subreddit end to end and mine complaints, feature requests, and churn signals

How much does it cost to scrape a subreddit? {#pricing}

Subreddit Scraper is pay-per-result — no subscription, no platform fee:

  • Actor start: $0.02 per run
  • Result stored: $0.0015 each

So a 1,000-post pull costs $1.52, a 10,000-post archive $15.02, and a 50,000-post maximum run $75.02. You pay only for items actually saved.

⏱️ Honest note on run time: depth has a cost in requests. Because the actor walks many listings per subreddit (sorts × time windows × flairs) before deduplicating, a deep run takes noticeably longer than a quick scrape of one listing. Budget accordingly for large maxPostsCount values — especially with comments enabled.


FAQ {#faq}

How many posts can I get from one subreddit?

It varies by community. Each Reddit listing maxes out near 1,000 posts, but because this actor merges new, top, and controversial across six time windows plus flair listings, active subreddits commonly yield several thousand unique posts — often 3-10× what a single-listing scrape returns. Quiet subreddits with few total posts will simply return everything they have.

Does it get around the 1,000 post limit?

Yes, in the only way that actually works: by unioning many capped listings rather than fighting the cap on one. You will get far more than 1,000 posts from any sizable subreddit. It does not guarantee every post ever made — content too old to appear in any current listing is unreachable for any scraper, including this one.

Can I download a whole subreddit to CSV?

Yes. After the run finishes, export the dataset straight from Apify as CSV (or Excel, JSON, XML, HTML). Each post becomes one row with 70+ columns — ready for spreadsheets, pandas, or BI tools.

Do I need a Reddit API key or account?

No. There is no OAuth flow, no login, and no Reddit account involved. The actor reads publicly available data directly, which also means Reddit's 600 requests/10 min API quota never applies to your runs.

Can it scrape comments too?

Yes — enable crawlCommentsPerPost and set maxCommentsPerPost to control depth. Be aware this multiplies the work: comments are fetched per post, so a 10,000-post run with comments takes substantially longer. For comment-first jobs, Reddit Comments Scraper is the better fit.

Can I scrape several subreddits in one run?

Yes. subreddits accepts a list, and maxPostsCount caps the total saved across all of them. Mix formats freely — gaming, r/pcgaming, and full URLs in the same array all work.

Does it include NSFW posts?

Only if you ask. 18+ posts are excluded by default; set includeNSFW: true to keep them in the dataset (they are flagged via the nsfw and over18 fields either way).

Collecting publicly visible data is generally permissible, but you are responsible for using it lawfully: respect Reddit's terms, applicable privacy regulations, and the communities you study. Apify's guide on the enforceability of terms of use is a good starting point.


More Reddit scrapers

ActorWhen to use it instead
Reddit ScraperThe all-in-one: keyword search, direct URLs, users, and subreddits in a single actor
Reddit Comments ScraperYou start from specific posts and want complete comment threads, nested replies included
Reddit Search ScraperYou're hunting a topic or brand across all of Reddit rather than inside one community
Reddit User ScraperYou need a redditor's profile, post history, and comment history

Try Subreddit Scraper