Under maintenance

Pricing

from $6.00 / 1,000 posts

Reddit Scraper

Extract posts, comments, user profiles, and search results from Reddit. Pure HTTP, no API key required.

Rating

0.0

(0)

Developer

Arnas

Actor stats

Bookmarked

Total users: 189

Monthly active users

Last modified: 9 days ago

[0.4] — 2026-06-02

Changed

Data source is now old.reddit.com server-rendered HTML parsed with Cheerio, not Reddit's API. The 0.3 OAuth approach proved unworkable: Reddit's Responsible Builder Policy (Nov 2025) ended self-service app creation and does not approve commercial scraping on the free tier, so requiring API credentials locked out the actor's audience. This release restores the no-API-key experience by scraping old.reddit.com HTML behind residential proxies, the same credential-free technique the popular Apify Reddit actors use. Switched HttpCrawler → CheerioCrawler with a generated Chrome browser fingerprint.
Removed the OAuth credential requirement (redditClientId / redditClientSecret and the REDDIT_CLIENT_* env fallback are gone). No Reddit account or API key is needed.
Block handling tuned for HTML scraping: a 403 "blocked by network security" page retires the session so Crawlee retries on a fresh residential IP (how rate-limit blocks clear); 404/private are benign per-source skips; 429 honors Retry-After. The fail-loud guard from 0.3 is kept — a run blocked on every request fails loudly instead of reporting an empty success; a run that reaches Reddit (even if all posts are filtered) succeeds.
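The status-code policy described above can be sketched as a pure function. This is a minimal illustration, not the actor's source: the names `classifyResponse`, `parseRetryAfterMs`, `shouldFailRun`, and `RunStats` are hypothetical, the fallback delay is an assumed value, and treating a 403 without the block marker as a private source is an assumption (the changelog does not say how private sources are detected).

```typescript
// Hypothetical names throughout; a sketch of the policy, not the actor's code.
type BlockAction =
    | { kind: 'ok' }
    | { kind: 'retire-session' }           // 403 block page: retry on a fresh IP
    | { kind: 'skip-source' }              // 404 / private: benign per-source skip
    | { kind: 'backoff'; waitMs: number }  // 429: wait as instructed by Retry-After
    | { kind: 'retry' };                   // any other non-2xx status

const BLOCK_MARKER = 'blocked by network security'; // phrase quoted in the changelog
const DEFAULT_BACKOFF_MS = 5000; // assumed fallback when Retry-After is absent or unparsable

// Retry-After carries either a number of seconds or an HTTP-date.
function parseRetryAfterMs(value: string | undefined, nowMs: number): number {
    if (value === undefined) return DEFAULT_BACKOFF_MS;
    const trimmed = value.trim();
    if (/^\d+$/.test(trimmed)) return Number(trimmed) * 1000;
    const dateMs = Date.parse(trimmed);
    if (Number.isNaN(dateMs)) return DEFAULT_BACKOFF_MS;
    return Math.max(0, dateMs - nowMs);
}

function classifyResponse(
    status: number,
    body: string,
    retryAfter: string | undefined,
    nowMs: number,
): BlockAction {
    if (status >= 200 && status < 300) return { kind: 'ok' };
    if (status === 403 && body.includes(BLOCK_MARKER)) return { kind: 'retire-session' };
    // Assumption: a 403 without the block marker means a private source.
    if (status === 403 || status === 404) return { kind: 'skip-source' };
    if (status === 429) return { kind: 'backoff', waitMs: parseRetryAfterMs(retryAfter, nowMs) };
    return { kind: 'retry' };
}

interface RunStats {
    requestsOk: number;       // responses that actually reached Reddit content
    requestsBlocked: number;  // responses classified as blocked
}

// Fail-loud guard: fail only when every request was blocked.
// A run that reached Reddit succeeds even if all posts were filtered out.
function shouldFailRun(stats: RunStats): boolean {
    return stats.requestsOk === 0 && stats.requestsBlocked > 0;
}
```

In a Crawlee handler, the `retire-session` branch would typically correspond to retiring the current session and rethrowing so the request is retried; that wiring is left out here to keep the sketch dependency-free.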

Limitations (vs the old JSON API)

upvoteRatio, totalAwards, and subredditSubscribers are no longer available from HTML (set to 0 / 0 / null); gallery imageUrls is best-effort.
Comment depth is bounded by what old.reddit renders (fetched with ?limit=500). "Load more comments" / deeply collapsed threads use an AJAX endpoint Reddit blocks, so the deepest tails are not retrieved.
Listing and search posts carry metadata only (no selfText). Self-text and comments are populated in three cases, all of which fetch the post page: scraping a post URL directly, setting includeComments=true, or using one of the AI output formats. Keyword filtering on listings matches the title, since listing entries carry no selfText.
RESIDENTIAL proxies are now required, not just recommended — datacenter IPs are blocked and per-IP rate-limits are cleared by rotation.
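Two of the limitations above are easy to show in code: the post page is fetched from old.reddit.com with `?limit=500`, and the fields that HTML cannot supply get fixed placeholder values. The sketch below is illustrative only; `toOldRedditCommentsUrl`, `HtmlUnavailableFields`, and `titleMatchesKeyword` are hypothetical names, and the case-insensitive substring match is an assumption about how keyword filtering behaves.

```typescript
// Hypothetical helpers; field names follow the changelog's wording.

// Rewrite any Reddit post URL to its old.reddit.com form with limit=500.
function toOldRedditCommentsUrl(postUrl: string): string {
    const url = new URL(postUrl);
    url.hostname = 'old.reddit.com';
    url.searchParams.set('limit', '500');
    return url.toString();
}

// Fields the HTML source cannot provide, with the placeholder values
// the changelog lists (0 / 0 / null).
interface HtmlUnavailableFields {
    upvoteRatio: number;
    totalAwards: number;
    subredditSubscribers: number | null;
}

const UNAVAILABLE_FROM_HTML: HtmlUnavailableFields = {
    upvoteRatio: 0,
    totalAwards: 0,
    subredditSubscribers: null,
};

// Listing entries carry no selfText, so only the title can be matched.
// Assumption: matching is a case-insensitive substring test.
function titleMatchesKeyword(title: string, keywords: string[]): boolean {
    if (keywords.length === 0) return true;
    const haystack = title.toLowerCase();
    return keywords.some((keyword) => haystack.includes(keyword.toLowerCase()));
}
```

Consumers that previously relied on `upvoteRatio` or `totalAwards` should treat 0 as "unknown" rather than as a measured value when reading 0.4 output.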

[0.3] — 2026-06-02 (superseded by 0.4)

Fixed

The actor returned 0 results while reporting success. Reddit shut down its unauthenticated public .json API: every www.reddit.com/*.json request (and old.reddit.com/*.json) now returns an HTTP 403 "blocked by network security" HTML page, regardless of User-Agent, browser fingerprint, cookies, or proxy IP (the block is endpoint-level, so residential proxies don't help). The actor treated those 403s as benign skips for four compounding reasons: ignoreHttpErrorStatusCodes: [403] was set, a 403 was assumed to mean a private subreddit, non-JSON bodies produced only a warning, and there was no zero-result guard. As a result, every run drained its queue and exited successfully with an empty dataset.
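One way to avoid this class of silent failure is to make the parse step strict, so an HTML block page can never pass as an empty result. The following is a sketch under assumptions: `parseListing` and `isListing` are hypothetical names, and the `Listing` shape is reduced to the two properties needed for the check.

```typescript
// Hypothetical strict parser: a non-JSON or non-Listing body is an error,
// never a warning followed by an empty result.
interface Listing {
    kind: 'Listing';
    data: { children: unknown[] };
}

function isListing(value: unknown): value is Listing {
    if (typeof value !== 'object' || value === null) return false;
    const candidate = value as { kind?: unknown; data?: { children?: unknown } | null };
    return (
        candidate.kind === 'Listing' &&
        typeof candidate.data === 'object' &&
        candidate.data !== null &&
        Array.isArray(candidate.data.children)
    );
}

function parseListing(status: number, body: string): Listing {
    if (status === 403) {
        throw new Error(`Blocked: HTTP 403 (${body.slice(0, 60)})`);
    }
    let parsed: unknown;
    try {
        parsed = JSON.parse(body);
    } catch (err) {
        throw new Error('Expected JSON but received a non-JSON body');
    }
    if (!isListing(parsed)) {
        throw new Error('JSON body is not a Reddit Listing');
    }
    return parsed;
}
```

With a parser like this, an empty `children` array is the only way to get a legitimately empty result; everything else surfaces as an error that a zero-result guard can count.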

Changed

Data now comes from Reddit's official OAuth API (oauth.reddit.com), which returns the identical Listing/Thing JSON the parsers already consume — so post/comment mapping is unchanged. Requests carry a bearer token obtained via the application-only (client_credentials) grant.
Credentials are resolved from input (redditClientId / redditClientSecret) or, as a fallback, from REDDIT_CLIENT_ID / REDDIT_CLIENT_SECRET environment variables (so a maintainer can set one shared app via Apify secrets and keep end users key-free). Credentials are validated up front; a missing or invalid pair fails the run immediately with setup instructions (https://www.reddit.com/prefs/apps ).
Error handling fails loud. 401 refreshes the OAuth token and retries; 403/404 are benign per-source skips; 429 honors Retry-After. If a run produces zero items because requests were blocked or rejected, it now calls Actor.fail() with a diagnostic instead of reporting an empty success. A legitimately empty source (valid but no matching posts) still succeeds.
User-Agent is now a stable, descriptive identifier per Reddit's API terms (the previous Chrome-fingerprint rotation was anti-bot theater for the now-dead public endpoints).
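For readers unfamiliar with the application-only grant, the credential lookup and token request described above might look roughly like this. It is a sketch, not the 0.3 source: `resolveCredentials` and `buildTokenRequest` are hypothetical names, the environment is passed in as a plain object to keep the example self-contained, and the example User-Agent string is invented. The token endpoint and the `grant_type=client_credentials` form body follow Reddit's OAuth2 documentation.

```typescript
// Hypothetical helpers illustrating the 0.3 credential flow (superseded by 0.4).
interface Credentials {
    clientId: string;
    clientSecret: string;
}

interface ActorInput {
    redditClientId?: string;
    redditClientSecret?: string;
}

const SETUP_URL = 'https://www.reddit.com/prefs/apps';

// Input wins; environment variables are the fallback.
function resolveCredentials(
    input: ActorInput,
    env: Record<string, string | undefined>,
): Credentials {
    const clientId = input.redditClientId || env.REDDIT_CLIENT_ID;
    const clientSecret = input.redditClientSecret || env.REDDIT_CLIENT_SECRET;
    if (!clientId || !clientSecret) {
        // Fail up front with setup instructions instead of failing mid-run.
        throw new Error(`Missing Reddit API credentials. Create an app at ${SETUP_URL}`);
    }
    return { clientId, clientSecret };
}

// Build (but do not send) the application-only token request.
function buildTokenRequest(creds: Credentials, userAgent: string) {
    const basic = btoa(`${creds.clientId}:${creds.clientSecret}`);
    return {
        url: 'https://www.reddit.com/api/v1/access_token',
        method: 'POST',
        headers: {
            Authorization: `Basic ${basic}`,
            'Content-Type': 'application/x-www-form-urlencoded',
            'User-Agent': userAgent,
        },
        body: 'grant_type=client_credentials',
    };
}
```

The returned object can be handed to any HTTP client; the response's `access_token` is then sent as a bearer token to oauth.reddit.com.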

[0.2] — 2026-04-19

Fixed

Posts and comments now land in the run's default dataset. Previously the actor wrote to account-level named datasets (Actor.openDataset('posts') / 'comments'), which made the Apify Console "Storage" tab, run.defaultDatasetId API access, and standard SDK smoke tests all see an empty dataset even though items were silently accumulating on the user's account-wide named datasets across runs. Both record types now share the per-run default dataset and are discriminated by the existing type: 'post' | 'comment' field on each record.
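Because both record types now share one dataset, consumers need to branch on the `type` field. A minimal sketch of that pattern follows; only the `type: 'post' | 'comment'` discriminator comes from the changelog, while the other field names (`id`, `title`, `postId`, `body`) and the `splitByType` helper are hypothetical.

```typescript
// Only the `type` discriminator is taken from the changelog;
// the remaining fields are placeholders for illustration.
interface PostRecord {
    type: 'post';
    id: string;
    title: string;
}

interface CommentRecord {
    type: 'comment';
    id: string;
    postId: string;
    body: string;
}

type DatasetRecord = PostRecord | CommentRecord;

// Split a mixed default-dataset export back into posts and comments.
function splitByType(items: DatasetRecord[]): { posts: PostRecord[]; comments: CommentRecord[] } {
    const posts: PostRecord[] = [];
    const comments: CommentRecord[] = [];
    for (const item of items) {
        if (item.type === 'post') {
            posts.push(item); // narrowed to PostRecord
        } else {
            comments.push(item); // narrowed to CommentRecord
        }
    }
    return { posts, comments };
}
```

The same check works on raw JSON exports: filter on `type === 'comment'` to recover what the old named "comments" dataset used to hold.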

Changed

dataset_schema.json now declares both post and comment field shapes and adds a "Comments" view alongside the existing "Posts" view. The type enum now includes comment.

Migration note

Existing data on the account-level named "posts" and "comments" datasets is unaffected (still readable via the Apify dataset API by name). New runs from 0.2 onward write to the per-run default dataset only.

[0.1.2] — 2026-04-18

Fixed

Removed redundant explicit Actor.charge('actor_start') call in src/main.ts — Apify Console uses the synthetic apify-actor-start event which fires automatically. The explicit call was logging an unknown-event warning per run without doing anything useful.

[0.1] — 2026-04-18

Added

Initial release. Reddit scraper covering subreddits, posts, comments, users, and search.
Three output formats: default (standard JSON), jsonl-finetune (OpenAI chat-format SFT records), rag-markdown (vector-DB-ready markdown documents with stable chunkId).
Pure HTTP via Crawlee HttpCrawler (no headless browser).
Pay-per-event pricing matching automation-lab/reddit-scraper rates as of 2026-04-18.
RESIDENTIAL proxy default with explicit DATACENTER override option (datacenter is no longer reliable against Reddit in 2026).
Hard cap maxCommentsPerPost ≤ 1000 to bound per-run proxy/compute cost.
Pre-flight DATACENTER + large-run guard with WARN at run start.
Reactive 429 backoff honoring Retry-After headers; session retirement on persistent blocks.
Charge-event ordering: actor_start after input validation (so failed-validation runs don't bill); post and comment charges only after successful dataset writes; comment charges suppressed for AI formats (comments are bundled into the post record).
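The input guards in this release (the comment cap and the DATACENTER large-run warning) could be implemented along these lines. This is a sketch under several assumptions: `validateInput`, `maxPosts`, and `proxyGroup` are hypothetical names, the default values and the "large run" threshold are invented for illustration, and clamping an over-limit value (rather than rejecting it) is a guess at how the hard cap is enforced.

```typescript
// Hypothetical input validation; only maxCommentsPerPost and its cap of 1000
// come from the changelog.
type ProxyGroup = 'RESIDENTIAL' | 'DATACENTER';

interface RawInput {
    maxPosts?: number;            // hypothetical field name
    maxCommentsPerPost?: number;
    proxyGroup?: ProxyGroup;      // hypothetical field name
}

interface ValidatedInput {
    maxPosts: number;
    maxCommentsPerPost: number;
    proxyGroup: ProxyGroup;
    warnings: string[];
}

const MAX_COMMENTS_PER_POST = 1000; // hard cap stated in the changelog
const DEFAULT_MAX_POSTS = 100;      // assumed default
const DEFAULT_MAX_COMMENTS = 100;   // assumed default
const LARGE_RUN_POSTS = 500;        // assumed threshold for a "large" run

function validateInput(raw: RawInput): ValidatedInput {
    const maxPosts = raw.maxPosts ?? DEFAULT_MAX_POSTS;
    const requestedComments = raw.maxCommentsPerPost ?? DEFAULT_MAX_COMMENTS;
    if (!Number.isInteger(maxPosts) || maxPosts < 1) {
        throw new Error('maxPosts must be a positive integer');
    }
    if (!Number.isInteger(requestedComments) || requestedComments < 0) {
        throw new Error('maxCommentsPerPost must be a non-negative integer');
    }

    const proxyGroup: ProxyGroup = raw.proxyGroup ?? 'RESIDENTIAL';
    const warnings: string[] = [];

    // Assumption: values above the cap are clamped rather than rejected.
    const maxCommentsPerPost = Math.min(requestedComments, MAX_COMMENTS_PER_POST);
    if (maxCommentsPerPost < requestedComments) {
        warnings.push(`maxCommentsPerPost clamped to ${MAX_COMMENTS_PER_POST}`);
    }
    if (proxyGroup === 'DATACENTER' && maxPosts >= LARGE_RUN_POSTS) {
        warnings.push('DATACENTER proxies are unreliable against Reddit for large runs');
    }
    return { maxPosts, maxCommentsPerPost, proxyGroup, warnings };
}
```

Running validation before any start charge matches the ordering described above: a run that throws here never reaches the billing step.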
