Pricing

from $2.00 / 1,000 results

Reddit Intelligence Scraper

Collect public Reddit posts, comments, communities, and user profile data from searches, subreddit pages, Reddit URLs, and usernames. Export clean datasets for monitoring, research, and AI workflows.

Pricing

from $2.00 / 1,000 results

Rating

0.0

(0)

Developer

Muhammad Qaseem Iqbal

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

🚀 Reddit Intelligence Scraper

Collect public Reddit posts, comments, communities, and user profile data from searches, subreddit pages, Reddit URLs, and usernames. 🔎 Use it to monitor conversations, research customer opinions, follow trends, and export clean Reddit data into spreadsheets, dashboards, databases, AI workflows, or automation tools. 📊

This Actor is designed to be practical for both non-technical users and data teams. ✅ You can start with a keyword or Reddit URL, choose how many results you want, and download the results from the Apify dataset when the run finishes. 📥

🧠 What does this Actor do?

Reddit Intelligence Scraper turns public Reddit pages into structured data. 🧾 Instead of manually copying posts and comments from Reddit, you can run the Actor and get organized records with useful details such as:

📝 post title, body, author, subreddit, score, comment count, and URL
💬 comment text, author, parent post, score, depth, and timestamp
🏘️ subreddit/community name, description, subscriber count, and metadata
👤 public user profile information, including karma and profile URL
🏷️ optional sentiment labels, content categories, engagement metrics, media links, and raw payloads

No Reddit API key, OAuth setup, or Reddit login is required for supported public pages. 🔓

🎯 Common use cases

📣 Track brand, product, or competitor mentions on Reddit
📅 Monitor subreddit discussions on a schedule
💡 Find customer pain points, feature requests, complaints, and praise
🧵 Collect comments from a specific Reddit thread
🔬 Research topics, communities, trends, and market language
🤖 Build datasets for AI search, RAG, clustering, dashboards, or reports
📤 Export Reddit data to CSV, Excel, Google Sheets, Make, Zapier, n8n, webhooks, or your own API workflow

📦 What Reddit data can it collect?

Data type	What you can collect
📝 Posts	Search results, subreddit listings, direct post URLs, user submitted posts, `r/all`, and `r/popular`
💬 Comments	Comment search results and comment threads under posts when comment collection is enabled
🏘️ Communities	Subreddit metadata and community search results
👤 Users	Public Reddit user profile records and optional user activity inputs

The Actor works with several input styles, so you can start broad with keywords or stay precise with direct Reddit URLs. 🧭

⚡ How to scrape Reddit on Apify

🖥️ Open the Actor in Apify Console.
➕ Add at least one source:
- 🔎 keywords in Search terms
- 🔗 Reddit links in Direct Reddit URLs
- 🏘️ subreddit names or URLs in Full subreddit scrape inputs
- 👤 Reddit usernames or profile URLs in User profile inputs
🎚️ Set a result limit, such as maxItems.
⚙️ Choose whether to include comments, media links, sentiment, or other optional data.
▶️ Click Start.
📥 Download the results from the Dataset tab as JSON, CSV, Excel, XML, or RSS.

For a quick test, use a small limit such as maxItems: 10. 🧪 For scheduled monitoring, keep the limit modest and run the Actor repeatedly. 📅

🎛️ Input options

You only need one valid source to start. ✅ The most important fields are below.

Field	Plain-English meaning	Typical use
🔎 `searchTerms`	Keywords or phrases to search across Reddit	Brand monitoring, topic research, competitor tracking
🔗 `startUrls`	Direct Reddit URLs	Scrape a specific post, subreddit, user page, or Reddit search URL
🏘️ `subredditUrls`	Subreddit names or URLs	Collect posts from communities such as `r/startups`
👤 `userUrls`	Reddit usernames or profile URLs	Collect public user profile information
🎚️ `maxItems`	Maximum total records to save	Keep tests and production runs under control
💬 `crawlCommentsPerPost`	Also collect comments under each collected post	Thread research, sentiment, FAQ mining
🧵 `maxCommentsPerPost`	Comment limit for each post	Prevent very large threads from growing too much
🧭 `sort` and `time`	Reddit search ranking and time window	Newest posts, top posts this week, most commented posts, etc.
📍 `withinCommunity`	Search only inside one subreddit	Search for a topic within a specific community
🖼️ `includeMediaLinks`	Save image, video, gallery, and outbound link details	Media analysis or content discovery
😊 `sentimentAnalysis`	Add simple sentiment labels to posts and comments	Positive, negative, neutral, mixed, or uncertain
🏷️ `contentAnalysis`	Add topic/category labels to post records	Routing, grouping, research, and AI workflows
🛡️ `proxyConfiguration`	Optional Apify Proxy settings	Use Residential proxy when Reddit blocks cloud traffic

Advanced settings are available for date filters, comment depth, strict keyword matching, output style, raw data storage, and run reports. 🧰

🧪 Example inputs

🔎 1. Quick keyword search

Use this when you want a small sample of recent posts for a topic. ⚡

{
  "searchTerms": ["AI video generator"],
  "sort": "new",
  "time": "week",
  "maxItems": 25,
  "maxPostsPerSearch": 25
}

📣 2. Brand and competitor monitoring

Use this to track mentions and include comments found through Reddit comment search. 📡

{
  "searchTerms": ["Acme AI", "Acme pricing", "Acme alternative"],
  "searchPosts": true,
  "searchComments": true,
  "sort": "new",
  "time": "week",
  "maxItems": 150,
  "maxPostsPerSearch": 50,
  "maxCommentsCount": 50,
  "sentimentAnalysis": true
}

🏘️ 3. Scrape a subreddit

Use this to collect posts from one or more communities. 🧭

{
  "subredditUrls": ["r/startups"],
  "subredditSort": "new",
  "subredditTime": "month",
  "maxItems": 100,
  "maxPostsPerSubreddit": 100
}

🧵 4. Collect a full post thread

Use this when you already know the Reddit post URL and want the discussion under it. 💬

{
  "startUrls": [
    {
      "url": "https://www.reddit.com/r/Baking/comments/1hvoazn/my_best_cheesecake_so_far/"
    }
  ],
  "crawlCommentsPerPost": true,
  "maxCommentsPerPost": 500,
  "commentDepthLimit": 0
}

💸 5. Low-cost test run

Use this before a larger run to confirm your input works. ✅

{
  "searchTerms": ["customer support software"],
  "maxItems": 10,
  "maxPostsPerSearch": 10,
  "crawlCommentsPerPost": false,
  "includeMediaLinks": false,
  "saveRawData": false,
  "writeHtmlReport": false
}

📤 Output

Results are saved to the default Apify dataset. 📊 Each dataset item is one record.

Possible record types:

📝 post
💬 comment
🏘️ community
👤 user

Every record includes basic tracking fields such as: 🧾

Field	Meaning
🧩 `kind`	Type of record: post, comment, community, or user
🆔 `id`	Reddit item ID
🔗 `url`	Main Reddit URL for the item
✅ `canonicalUrl`	Normalized Reddit URL where available
⏱️ `scrapedAt`	When the Actor collected the record
📍 `source`	Which input produced the record
🔁 `sources`	Other inputs that found the same record, when duplicates are merged

📝 Example post output

{
  "kind": "post",
  "id": "1hvoazn",
  "url": "https://www.reddit.com/r/Baking/comments/1hvoazn/my_best_cheesecake_so_far/",
  "title": "My best cheesecake so far",
  "author": "example_user",
  "subreddit": "Baking",
  "createdAt": "2025-01-07T10:09:56.000Z",
  "score": 3489,
  "numComments": 43,
  "mediaType": "gallery",
  "hasMedia": true,
  "sentimentLabel": "positive",
  "contentCategoryLabel": "Food & Drink"
}

The exact fields depend on the record type and the options you enable. ⚙️

📋 Run summary

At the end of a run, the Actor writes RUN-SUMMARY.json to the key-value store. 🧾 This file is useful when you want a quick overview without opening the full dataset.

The summary includes:

🔢 total records saved
📦 records by type
🔎 query and subreddit breakdowns
⏭️ skipped items and why they were skipped
📈 request statistics
⚠️ warnings and errors
🆔 IDs of the output dataset and key-value store

If you enable writeHtmlReport, the Actor can also create a simple HTML report called RUN-MAP.html. 🗺️

💸 Cost and performance tips

This Actor is configured to keep costs low by default. ✅

🛡️ Residential proxy is enabled by default because Reddit currently blocks direct Apify cloud traffic.
🏠 For the cheapest successful tests, keep runs small and use direct Reddit URLs first.
🎚️ Result limits are conservative by default.
🔁 Request retries are disabled by default to avoid paying for repeated failed requests.
📁 Raw data, media details, awards, and HTML reports are off by default.
💬 Comments are only collected when you enable comment collection.

To keep runs cheap:

🧪 start with maxItems between 10 and 100
💬 keep crawlCommentsPerPost off unless you need thread-level discussion
📦 keep saveRawData off unless you are debugging
🗺️ keep writeHtmlReport off unless you need a visual report
🔭 avoid maximizeCoverage unless recall matters more than speed and cost
🛡️ disable proxy only if direct access works for your run environment

💳 Store pricing

This Actor is designed for simple pay-per-result pricing on Apify Store. 🧾

Recommended paid events:

Event	What it means
🚀 `apify-actor-start`	A very small startup event charged automatically by Apify
📦 `apify-default-dataset-item`	One saved dataset record, such as a post, comment, community, or user

This keeps pricing easy to predict: the more records you save, the more you pay. Apify shows the run cost before and during execution, and you can control spend by setting maxItems, comment limits, and other result caps. 🎚️

📅 Scheduling and integrations

You can schedule this Actor in Apify Console to monitor Reddit regularly. ⏰ For example:

⚡ every hour for fast-moving brand monitoring
📆 once per day for subreddit tracking
📊 once per week for market research exports

After each run, you can send the dataset to:

📗 Google Sheets
🧩 Make
⚡ Zapier
🔄 n8n
🪝 webhooks
☁️ cloud storage
🗄️ databases and warehouses
🔌 custom applications through the Apify API

⚠️ Important notes and limitations

Reddit controls how much public data is available through its pages and listings. 📌 This affects all Reddit scrapers, not only this Actor.

🔒 Some private, restricted, quarantined, deleted, removed, or login-gated content cannot be collected.
🪟 Reddit search and subreddit listings may expose only a limited window of results.
🕰️ Very old posts may require narrower keywords, different sort options, or direct URLs.
🚧 Reddit may rate limit or block traffic from cloud networks or proxies.
❌ If every Reddit request is blocked, the Actor fails the run instead of silently returning an empty successful dataset.
⚙️ This version is HTTP-first and does not use a browser fallback.

If a run is blocked by Reddit, try a smaller run first, reduce concurrency and request rate, try a direct post URL, use different inputs, or run again later. 🧪 Residential proxy settings are often the most reliable cloud option for Reddit, but they can increase cost and are not guaranteed to bypass every Reddit-side block. 🛡️

❓ FAQ

⚖️ Is Reddit scraping legal?

Scraping public Reddit data can be allowed in many cases, but you are responsible for how you collect, store, and use the data. 🛡️ Always follow Reddit's terms, applicable laws, privacy rules, and the rules of any downstream platform where you use the data.

🔑 Do I need a Reddit account or API key?

No. ✅ This Actor is built for supported public Reddit pages and does not require a Reddit login or Reddit API key.

💬 Can it scrape comments?

Yes. ✅ Enable crawlCommentsPerPost to collect comments under posts. You can control the amount with maxCommentsPerPost and commentDepthLimit.

🔗 Can I scrape a specific Reddit post?

Yes. ✅ Add the post URL to startUrls. If you also want the comments, enable crawlCommentsPerPost.

🏘️ Can I scrape a whole subreddit?

Yes. ✅ Add a subreddit name such as r/startups or a full subreddit URL to subredditUrls. You can choose sorting options such as new, hot, top, rising, or most commented.

📉 Why did I get fewer results than expected?

Common reasons include Reddit result limits, strict filters, date filters, duplicate removal, deleted or unavailable items, or Reddit blocking the request. 🔍 Check RUN-SUMMARY.json for warnings, errors, and skip counts.

🪟 Why can't I always get more than about 1,000 posts from a subreddit or search?

Reddit lists are not unlimited. 📌 Search pages and subreddit feeds often stop after a practical result window. To find more unique posts, try narrower keywords, different time windows, different sort options, or direct Reddit URLs.

🛡️ Do I need proxies?

On Apify cloud, usually yes. 🛡️ Reddit is currently blocking direct cloud requests in our tests, while the RESIDENTIAL proxy group succeeded. Residential proxy traffic can increase cost, so keep test runs small and lower maxItems while testing.

📤 Can I export the results?

Yes. ✅ Apify datasets can be exported as JSON, CSV, Excel, XML, RSS, or accessed through the Apify API.

🤖 Can I use the data with AI tools?

Yes. ✅ The output is structured JSON, which makes it suitable for AI search, summarization, clustering, dashboards, and RAG workflows. Make sure your use of the data follows applicable privacy and platform rules.

🛡️ Responsible use

Use this Actor only for public Reddit data that you are allowed to collect and process. ✅ Do not use it to collect private, login-gated, sensitive, or harmful personal data. 🔒 Avoid publishing datasets in a way that exposes individuals unfairly or outside the purpose for which the data was collected.

🧰 Support

If something does not work as expected, include:

🆔 the Apify run ID
📥 your input JSON
📋 the RUN-SUMMARY.json file
📝 a short description of what you expected and what happened

This makes it much easier to diagnose blocked requests, empty datasets, input mistakes, and result-limit questions. 🔍

Reddit Api Scraper

scraper-engine/reddit-api-scraper

Extract posts, comments, subreddit data, and user insights from Reddit using the Reddit API Scraper. Collect titles, scores, authors, timestamps, and full discussions. Ideal for market research, sentiment analysis, trend monitoring, and building datasets from Reddit communities.

Scraper Engine

Reddit Scraper

janbruinier/jan-reddit-scraper

Scrape posts and comments from Reddit

Jan Bruinier

Reddit Scraper

alwaysprimedev/reddit-scraper

Scrape Reddit posts, threads, and comments from any subreddit, search, or user — clean structured JSON, fast.

Always Prime

Reddit User Profile Posts And Comments Scraper

scrapelabsapi/reddit-user-profile-posts-and-comments-scraper

ScrapeLabs

Reddit User Profile Posts And Comments Scraper

scrapemesh/reddit-user-profile-posts-and-comments-scraper

ScrapeMesh

Reddit User Profile Posts And Comments Scraper

scraply/reddit-user-profile-posts-and-comments-scraper

Scraply

Reddit User Profile Posts And Comments Scraper

scrapebase/reddit-user-profile-posts-and-comments-scraper

ScrapeBase

Reddit User Profile Posts And Comments Scraper

scrapeflow/reddit-user-profile-posts-and-comments-scraper

ScrapeFlow

Reddit Scraper

gio21/reddit-scraper

Scrape Reddit posts and comments from any subreddit. Extract titles, scores, authors, comments, and more using Reddit's public JSON API.

Gio

5.0

Reddit Scraper

kawsar/reddit-scraper

Reddit scraper that extracts posts, comments, communities, and user profiles from any subreddit or search query, so marketers and researchers can collect structured Reddit data without API keys or login.

Kawsar

Reddit Intelligence Scraper

🚀 Reddit Intelligence Scraper

🧠 What does this Actor do?

🎯 Common use cases

📦 What Reddit data can it collect?

⚡ How to scrape Reddit on Apify

🎛️ Input options

🧪 Example inputs

🔎 1. Quick keyword search

📣 2. Brand and competitor monitoring

🏘️ 3. Scrape a subreddit

🧵 4. Collect a full post thread

💸 5. Low-cost test run

📤 Output

📝 Example post output

📋 Run summary

💸 Cost and performance tips

💳 Store pricing

📅 Scheduling and integrations

⚠️ Important notes and limitations

❓ FAQ

⚖️ Is Reddit scraping legal?

🔑 Do I need a Reddit account or API key?

💬 Can it scrape comments?

🔗 Can I scrape a specific Reddit post?

🏘️ Can I scrape a whole subreddit?

📉 Why did I get fewer results than expected?

🪟 Why can't I always get more than about 1,000 posts from a subreddit or search?

🛡️ Do I need proxies?

📤 Can I export the results?

🤖 Can I use the data with AI tools?

🛡️ Responsible use

🧰 Support

You might also like

Reddit Api Scraper

Reddit Scraper

Reddit Scraper

Reddit User Profile Posts And Comments Scraper

Reddit User Profile Posts And Comments Scraper

Reddit User Profile Posts And Comments Scraper

Reddit User Profile Posts And Comments Scraper

Reddit User Profile Posts And Comments Scraper

Reddit Scraper

Reddit Scraper