Pricing

from $1.94 / 1,000 reddit results

Reddit Scraper — Posts & Comments | from $1.50/1K

Scrape Reddit posts, comments, and user activity from any public subreddit. Returns 25+ fields: score, upvote ratio, flair, author, timestamps, parse_confidence. No API key needed — backed by Arctic Shift archive with unlimited historical depth. MCP-callable.

Pricing

from $1.94 / 1,000 reddit results

Rating

0.0

(0)

Developer

Vitalii Bondarev

Actor stats

Bookmarked

Total users

Monthly active users

2 days ago

Last modified

Why This Reddit Scraper Beats the Alternatives

	This scraper	trudax/reddit-scraper-lite	practicaltools/apify-reddit-api
Price	$1.50/1000	$3.40/1000	$2.00/1000
No proxy cost to buyer	✓	✗	✗
Historical data (no 1000-post cap)	✓	✗	✗
No OAuth API dependency	✓	✓	✗
parse_confidence in every record	✓	✗	✗
25+ fields	✓	✓	partial
Comments included	✓	partial	✗

Key advantage: Competitors hitting live Reddit directly require residential proxy to avoid 403s — that cost passes to you. This actor uses Arctic Shift (free Reddit archive API) as its backend, so you pay only for results, not proxy overhead.

Reddit Data Fields

Field	Posts	Comments
id	✓	✓
type (`post`/`comment`)	✓	✓
subreddit	✓	✓
title	✓	—
body	✓	✓
author	✓	✓
score	✓	✓
upvote_ratio	✓	—
num_comments	✓	—
created_utc (ISO-8601)	✓	✓
permalink	✓	✓
url	✓	✓
is_self	✓	—
over_18 (NSFW)	✓	—
flair_text	✓	✓
domain	✓	—
subreddit_subscribers	✓	✓
parent_id	—	✓
depth	—	✓
is_submitter (OP?)	—	✓
parse_confidence	✓	✓
warnings	✓	✓
scraped_at	✓	✓

What `parse_confidence` Means

Every Reddit record includes a score from 0.0 to 1.0:

1.0 — all fields parsed cleanly
0.9–0.95 — minor field missing (e.g. deleted author)
< 0.5 — critical issue (missing ID, no data returned)

warnings lists machine-readable codes explaining any deductions — broken scrapes are visible in your dataset, not silently hidden.

Reddit Scraper Use Cases

Brand monitoring — track keyword mentions across niche subreddits
Lead generation — find users asking questions your product solves
Sentiment analysis — bulk-export posts and comments for NLP pipelines
Competitor research — monitor product-related subreddits
Content strategy — find top-performing posts by score or comment count
AI agent memory — feed recent subreddit discussion into agent context

How to Use Reddit Scraper

Scrape Reddit Subreddit Posts

{
  "subreddits": ["python", "learnpython"],
  "sort": "new",
  "maxItems": 200,
  "includeComments": false
}

Scrape Reddit Posts + Comments Together

{
  "subreddits": ["entrepreneur"],
  "sort": "new",
  "maxItems": 100,
  "includeComments": true,
  "maxCommentsPerPost": 25
}

Scrape Reddit User Activity

{
  "users": ["spez", "some_username"],
  "maxItems": 50
}

Scrape via Reddit URL

{
  "urls": ["https://www.reddit.com/r/investing/"],
  "maxItems": 200
}

Input Parameters

Parameter	Type	Default	Description
`subreddits`	string[]	—	Subreddit names (e.g. `python`, `r/flask`)
`urls`	string[]	—	Reddit subreddit or profile URLs
`users`	string[]	—	Usernames to scrape (e.g. `spez`)
`sort`	`new`/`old`	`new`	Sort order
`maxItems`	integer	100	Max posts per subreddit or user
`includeComments`	boolean	false	Also scrape comments
`maxCommentsPerPost`	integer	50	Max comments per post

Sample Output

{
  "type": "post",
  "id": "1d2e3f4",
  "subreddit": "python",
  "title": "What's the best async HTTP library in 2026?",
  "body": "Looking for recommendations for an async HTTP client...",
  "author": "user123",
  "score": 847,
  "upvote_ratio": 0.97,
  "num_comments": 62,
  "created_utc": "2026-05-20T14:32:11+00:00",
  "permalink": "/r/python/comments/1d2e3f4/whats_the_best_async_http_library_in_2026/",
  "url": "https://www.reddit.com/r/python/comments/1d2e3f4/",
  "flair_text": "Discussion",
  "subreddit_subscribers": 1200000,
  "parse_confidence": 1.0,
  "warnings": [],
  "scraped_at": "2026-06-05T09:00:00+00:00"
}

Pricing — Pay Per Reddit Post or Comment

$1.50 per 1,000 posts · $0.50 per 1,000 comments (when includeComments=true) — PPE, no per-run fee. No proxy cost — Reddit data is fetched via Arctic Shift at no additional infrastructure charge. First $5 Apify credit covers ~3,300 post records.

Data Source & Freshness

This actor fetches from Arctic Shift (arctic-shift.photon-reddit.com), a community-maintained Reddit archive based on historical data dumps. Data is updated continuously with an approximate 36-hour lag on engagement metrics (score, num_comments) for very recent posts. Historical data goes back years with no per-subreddit post cap.

Arctic Shift is a free service with no uptime SLA. The parse_confidence and warnings fields in every record surface any API anomalies so you can filter them downstream.

Use with AI Agents (MCP)

This Reddit scraper is callable as a tool by AI agents (Claude Desktop, Cursor, VS Code, n8n, LangGraph, CrewAI, or any MCP-compatible client) via Apify's hosted Model Context Protocol server.

{
  "mcpServers": {
    "apify": {
      "command": "npx",
      "args": [
        "mcp-remote",
        "https://mcp.apify.com/?tools=bovi/reddit-scraper",
        "--header",
        "Authorization: Bearer <YOUR_APIFY_TOKEN>"
      ]
    }
  }
}

Keep maxItems low (e.g. 25) when calling from agents to limit token volume.

Frequently Asked Questions

Does this Reddit scraper need an API key? No. It uses Arctic Shift (a community Reddit archive), not the official Reddit API. No OAuth, no app registration.

Why is there a 36-hour lag? Arctic Shift syncs from Reddit data dumps continuously. Very recent posts (< 36h) may have slightly outdated score and num_comments — all other fields are accurate.

Can I get more than 1000 posts from a subreddit? Yes. Unlike live Reddit, Arctic Shift has no 1000-post cap. Use maxItems to control volume; the actor paginates via timestamps.

Is residential proxy needed? No — this actor does not hit live Reddit endpoints. No proxy cost to you.

Brand Monitoring & Incremental Scraping

Use sinceDate and Apify schedules to run this actor daily and get only new posts for ongoing brand-monitoring workflows. Set includeComments=true and a low maxCommentsPerPost for lightweight recurring runs that track sentiment changes over time.

Not affiliated with Reddit. Data retrieved from Arctic Shift, a community-maintained public Reddit archive.

Integrations

Built for social-listening and research teams tracking communities, trends, and sentiment at scale — the JSON/dataset output drops into the tools you already run, no glue code:

n8n / Make / Zapier — trigger a run or pipe every new dataset item into 500+ apps (Google Sheets, Airtable, Slack, HubSpot, your database) with no code: n8n, Make, Zapier.
Webhooks — fire your own endpoint the moment a run finishes, to push results straight into your pipeline (docs).
MCP server — expose this actor as a tool to Claude, Cursor, or any MCP client so an AI agent can pull this data mid-conversation (guide).
API & SDKs — fetch the dataset as JSON, CSV, or Excel through the Apify REST API or the Python / JS SDKs.

See all Apify integrations.

More scrapers from our toolkit

Building a data pipeline? These actors pair well with this one — each runs on your own Apify account with the same pay-per-result pricing, no subscription:

Chain any of them together from the Integrations tab (the Run succeeded trigger) to build a multi-step workflow — one actor's output feeds the next.

Use it from your existing tools

Use with Claude Desktop / Cursor / Cline (MCP)

Load this actor as a tool in your AI assistant. Call it directly from your AI assistant via the Apify MCP server — no Store browsing needed. Paste this into your MCP client config (e.g. claude_desktop_config.json) and restart the client:

{
  "mcpServers": {
    "apify-reddit-scraper": {
      "command": "npx",
      "args": [
        "-y",
        "@apify/actors-mcp-server",
        "--tools",
        "bovi/reddit-scraper"
      ],
      "env": {
        "APIFY_TOKEN": "YOUR_APIFY_TOKEN"
      }
    }
  }
}

Replace YOUR_APIFY_TOKEN with your own Apify API token (free at apify.com → Settings → Integrations). Curated to a handful of tools so the agent selects reliably.

Works with Clay

Run this actor as an HTTP enrichment step inside a Clay table:

Method: POST
URL: https://api.apify.com/v2/acts/bovi~reddit-scraper/run-sync-get-dataset-items?token={{apify_token}}
Body (JSON): map your Clay columns to the actor input (see the Input section above), e.g. {"subreddits": "{{clay_column}}"}

The run finishes synchronously and returns the dataset rows straight into your Clay table. It runs on Apify's cloud under your own token and usage. Synchronous runs must complete within 300 seconds.

🗄️ Reddit Archive Scraper - Years of Posts & Comments

benthepythondev/reddit-archive-scraper

Reddit Archive Scraper to extract years of historical Reddit posts and comments from the PullPush archive. Reddit's API caps subreddits at ~1000 posts; this Actor pulls months or years from many subreddits by date range and keyword. For historical backfill, research and AI datasets.

Ben

Reddit Scraper For Posts & Comments

creative_tablecloth/reddit-scraper-for-posts

Access Reddit data freely without authentication. Quickly extract detailed information from Reddit posts and comments, both efficiently and cost-effectively. (approx $0.015 for 1,000 results)

Jinny Kim

460

5.0

Reddit Scraper | Enterprise Grade

fatihtahta/reddit-scraper-search-fast

Extract Reddit posts and full comment threads from searches, subreddits, user pages, and direct post URLs. Built for enterprise-grade speed, richest-in-class data coverage, advanced filtering, and clean JSON for market intelligence, sentiment analysis and analytics.

Fatih Tahta

4.4K

3.6

Reddit Scraper

automation-lab/reddit-scraper

Scrape public Reddit search results and subreddit listings, with posts, comments, and profiles available on a best-effort basis. No Reddit account or API key required.

Stas Persiianenko

2.5K

4.7

Fast Twitter List Scraper API | Extract Tweets & Members

apidojo/twitter-list-scraper

Discover the Twitter (X) List Scraper you've been looking for! Find the ultimate tool for extracting Tweets List from X / Twitter! It offers unparalleled speed and comprehensiveness, ensuring lightning-fast extraction of Tweets.

API Dojo

994

5.0

Reddit Historical Archive Scraper - Old Posts by Date

logiover/reddit-historical-archive-scraper

Pushshift alternative to scrape old Reddit posts and comments without an API key. Full-text comment search, user history, export to CSV/JSON.

Logiover

Tweets-x-Scraper

mikolabs/tweets-x-scraper

Extract anything from X (Twitter) with speed and precision. This smart scraper auto-detects what to collect—tweets, profiles, users, lists, or media—and delivers clean, structured data instantly. Just enter usernames, URLs, or keywords and let automation do the rest.

mikolabs

253

5.0

Reddit Scraper — Posts & Comments by Subreddit or Search

hichemdev/reddit-scraper

Scrape Reddit posts and comments from any subreddit or search query: title, author, score, upvote ratio, text, and metadata. No login or API key.

Hichem Ben Moussa

Reddit Post Scraper

pratikdani/reddit-post-scraper

A Reddit post scraper, fetching data like titles, authors, content, and scores from specified subreddits or search queries. Delivers valuable insights from the Reddit hivemind for analysis and trend identification.

Pratik Dani

408

Reddit Scraper - Posts, Comments, Search & Subreddits ($2/1k)

harshmaur/reddit-scraper

Scrape Reddit posts, comments, subreddits, user profiles, and keyword search results - no API key, no rate limits, no login. From $2 per 1,000 results, pay only for what you use. Full comment threads, 60+ fields per post, media and galleries. Works with AI Agents, MCP, n8n, Make, Zapier and more.