Reddit Fast Scraper avatar
Reddit Fast Scraper

Pricing

$2.00 / 1,000 results

Go to Apify Store
Reddit Fast Scraper

Reddit Fast Scraper

Fast & affordable Reddit scraper. Extract posts, comments & communities using JSON API - up to 80% cheaper than browser-based alternatives.

Pricing

$2.00 / 1,000 results

Rating

0.0

(0)

Developer

Alex41_DownUnder

Alex41_DownUnder

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

3

Monthly active users

4 days ago

Last modified

Share

๐Ÿ” Reddit Scraper Lite

Extract posts, comments, communities, and user data from Reddit at scale using the cost-effective JSON API approach.

Up to 80% cheaper than browser-based scrapers by leveraging Reddit's public JSON endpoints.

What does Reddit Scraper Lite do?

Reddit Scraper Lite is a fast and efficient web scraping tool that extracts public data from Reddit. Unlike browser-based scrapers that require expensive setups, this Actor uses Reddit's JSON API endpoints, making it significantly more cost-effective while maintaining reliability.

Key highlights:

  • ๐Ÿš€ 10x faster than browser-based alternatives
  • ๐Ÿ’ฐ Up to 80% lower costs with the right proxy setup
  • ๐Ÿ“Š Extracts posts, comments, communities, and user profiles
  • ๐Ÿ” Powerful keyword search with filters
  • ๐Ÿ“… Date filtering for recent content only
  • โœ… Works on Free Plan with RESIDENTIAL proxy

โš ๏ธ Important: Proxy Required

Reddit blocks requests from Apify's cloud IPs. You must use a proxy to run this Actor successfully.

Your PlanRecommended ProxyCost per 1000 Requests
FreeRESIDENTIAL~$0.80
Starter+SHADER (Datacenter)~$0.30

The Actor defaults to RESIDENTIAL proxy, which works on all plans including Free.

Why use Reddit Scraper Lite?

Business Use Cases

Use CaseHow It HelpsRecommended Setup
Brand MonitoringTrack mentions of your brand or productKeyword search + Date filter
Sentiment AnalysisCollect comments for NLP analysisSearch comments + maxComments
Market ResearchUnderstand trends and discussionsr/popular + Top sorting
Competitor AnalysisMonitor competitor mentionsMulti-keyword search
Lead GenerationFind potential customersSubreddit-specific search
Content IdeasDiscover trending topicsr/popular + searchSort: top

Why This Scraper Over Others?

FeatureReddit Scraper LiteBrowser-Based Scrapers
TechnologyJSON APIPlaywright/Puppeteer
Cost (Starter plan)~$0.50/1000 posts~$5-10/1000 posts
Cost (Free plan)~$1.00/1000 posts~$10-15/1000 posts
Speed~67 posts/min~8 posts/min
Reliability95%+ success rate70-90% success rate

๐Ÿ’ฐ Proxy & Pricing Guide

Proxy Comparison

Proxy TypeAvailabilityCostSpeedBest For
RESIDENTIALAll plans (Free+)$8/GB (~$0.80/1k req)MediumTesting, small volumes
SHADERStarter+ ($39/mo)$0.60-1/IP (~$0.30/1k req)FastProduction, high volumes

Cost Calculator

VolumeFree Plan (RESIDENTIAL)Starter Plan (SHADER)
100 posts~$0.10~$0.05
1,000 posts~$1.00~$0.50
10,000 posts~$10.00~$5.00
100,000 posts~$100.00~$50.00

Recommendation by Use Case

Use CaseRecommended PlanWhy
Testing/LearningFree + RESIDENTIALNo monthly commitment
Hobby projects (<1k/mo)Free + RESIDENTIALCost-effective for low volume
Regular use (1k-10k/mo)Starter + SHADER50% cheaper than RESIDENTIAL
Production (10k+/mo)Scale + SHADERBest price per request

How to scrape Reddit data

Step 1: Choose Your Input Method

Option A: Search Keywords (Recommended for most use cases)

{
"searches": ["typescript", "react hooks"],
"searchPosts": true,
"maxItems": 100
}

Option B: Start URLs (For specific subreddits/posts)

{
"startUrls": [
{ "url": "https://www.reddit.com/r/webdev/" },
{ "url": "https://www.reddit.com/r/popular/" }
]
}

Step 2: Configure Proxy (Required!)

For Free Plan users:

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

For Starter/Scale/Business Plan users:

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["SHADER"]
}
}

Step 3: Run and Export

Click "Start" and download results in JSON, CSV, Excel, or HTML format.

What data can you extract from Reddit?

๐Ÿ“ Reddit Posts

FieldDescription
idUnique post identifier
titlePost title
bodyPost content (text/markdown)
urlDirect link to the post
authorUsername of the poster
subredditCommunity where posted
scoreUpvotes minus downvotes
upvoteRatioPercentage of upvotes
numCommentsTotal comment count
createdAtPost creation timestamp

๐Ÿ’ฌ Reddit Comments

FieldDescription
idUnique comment identifier
bodyComment text content
authorUsername of commenter
scoreComment score
createdAtComment timestamp
depthNesting level (0 = top-level)

๐Ÿ‘ฅ Reddit Communities

FieldDescription
nameSubreddit name
titleDisplay title
descriptionCommunity description
subscribersMember count

๐Ÿ‘ค Reddit Users

FieldDescription
usernameReddit username
postKarmaTotal post karma
commentKarmaTotal comment karma
createdAtAccount creation date

Input Examples by Use Case

๐Ÿ” Brand Monitoring

Track mentions of your brand in the last week:

{
"searches": ["your-brand", "YourCompany"],
"searchPosts": true,
"searchComments": true,
"searchSort": "new",
"searchTime": "week",
"maxItems": 500,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

๐Ÿ“Š Sentiment Analysis

Collect comments for NLP analysis:

{
"searches": ["product-name"],
"searchPosts": true,
"maxItems": 100,
"maxComments": 50,
"skipComments": false,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

๐Ÿ“ˆ Trend Discovery

Find what's trending right now:

{
"startUrls": [
{ "url": "https://www.reddit.com/r/popular/" }
],
"maxItems": 100,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

๐ŸŽฏ Subreddit Research

Deep dive into specific communities:

{
"startUrls": [
{ "url": "https://www.reddit.com/r/webdev/" },
{ "url": "https://www.reddit.com/r/programming/" }
],
"sort": "top",
"time": "month",
"maxPostCount": 50,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Search within specific subreddits:

{
"searches": ["nextjs 15", "app router"],
"searchPosts": true,
"searchInSubreddits": ["nextjs", "reactjs", "webdev"],
"searchSort": "top",
"searchTime": "month",
"maxItems": 200,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

๐Ÿ‘ค User Research

Analyze a specific user's activity:

{
"startUrls": [
{ "url": "https://www.reddit.com/user/target-user/comments/" }
],
"maxItems": 100,
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

Output Example

{
"dataType": "post",
"id": "1abc123",
"title": "How to optimize React performance in 2024",
"body": "Here are my tips for making React apps faster...",
"url": "https://www.reddit.com/r/reactjs/comments/1abc123/",
"author": "dev_enthusiast",
"subreddit": "reactjs",
"score": 1542,
"upvoteRatio": 0.96,
"numComments": 234,
"createdAt": "2024-11-28T14:30:00.000Z",
"isVideo": false,
"over18": false,
"scrapedAt": "2024-12-01T10:00:00.000Z"
}

Supported URL Types

URL TypeExampleWhat It Scrapes
Subredditreddit.com/r/webdev/Posts from the subreddit
Subreddit (sorted)reddit.com/r/webdev/top/Top posts
Postreddit.com/r/webdev/comments/abc123/Post + comments
Userreddit.com/user/username/User profile
User Commentsreddit.com/user/username/comments/User's comment history
Popularreddit.com/r/popular/Trending posts
Leaderboardreddit.com/subreddits/leaderboard/crypto/Top communities
Searchreddit.com/search/?q=keywordSearch results

Advanced Features

Session Pool

Maintains consistent IP addresses during pagination for better results:

{
"useSessionPool": true,
"proxyRotationOnError": true
}

Date Filtering

Only collect posts after a specific date:

{
"postDateLimit": "2024-11-01",
"searchSort": "new"
}

Note: When postDateLimit is set, the sort order automatically switches to "new" for chronological consistency.

Debug Mode

Enable detailed logging for troubleshooting:

{
"debugMode": true
}

This scraper only collects publicly available data that anyone can view without logging in. It does not:

  • โŒ Access private subreddits
  • โŒ Scrape private messages
  • โŒ Collect email addresses or personal data
  • โŒ Bypass authentication

Important: You are responsible for ensuring your use case complies with:

We recommend:

  • Respecting rate limits
  • Not storing personal data unnecessarily
  • Using data only for legitimate purposes

Troubleshooting

Getting 403 Errors?

This is the most common issue. Reddit blocks Apify's cloud IPs.

Solution: Make sure you have proxy enabled:

{
"proxyConfiguration": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"]
}
}

"You do not have access to proxy groups: SHADER"

SHADER (Datacenter proxy) requires a paid plan (Starter or higher).

Solutions:

  1. Use RESIDENTIAL proxy instead (works on Free plan)
  2. Upgrade to Starter Plan ($39/mo)

No Results?

  1. Check your search keywords aren't too specific
  2. Verify the subreddit exists and is public
  3. Ensure date filter isn't too restrictive

Slow Performance?

  1. Make sure proxy is enabled (required for Reddit)
  2. Reduce maxComments if not needed
  3. Set skipComments: true for posts-only scraping

Integration with Other Tools

Export your data and integrate with:

  • Google Sheets: Direct export via Apify integration
  • Zapier: Automate workflows with new data
  • Make (Integromat): Complex automation scenarios
  • Webhooks: Real-time notifications
  • API: Programmatic access to all results

Support & Feedback

  • ๐Ÿ› Found a bug? Open an issue in the Issues tab
  • ๐Ÿ’ก Feature request? Let us know in the Issues tab
  • ๐Ÿ“ง Questions? Contact support@apify.com

Looking for other social media scrapers?


Built with โค๏ธ using Crawlee and the Apify SDK