Reddit Trends Scraper
Pricing
from $3.99 / 1,000 results
Go to Apify Store
Reddit Trends Scraper
Pricing
from $3.99 / 1,000 results
Rating
0.0
(0)
Developer
ScrapeBase
Maintained by CommunityActor stats
0
Bookmarked
1
Total users
0
Monthly active users
3 days ago
Last modified
Categories
Share
An Apify Actor for scraping Reddit posts, comments, and trends with intelligent proxy fallback.
Features
- Flexible Input: Supports Reddit URLs, usernames, subreddits, or keywords
- Bulk Processing: Process multiple URLs/keywords in a single run
- Smart Proxy Fallback: Automatically falls back through proxy types:
- No proxy (direct connection)
- Datacenter proxy (if blocked)
- Residential proxy with 3 retries (if datacenter fails)
- Detailed Logging: Comprehensive logs to track scraping progress
- Pagination Support: Automatically handles pagination to collect multiple pages
- Sort Options: Support for hot, new, top, and rising sort orders
Input Configuration
- startUrls: Array of Reddit URLs, usernames (e.g.,
usernameoru/username), subreddits (e.g.,r/popular), or keywords - sortOrder: Sort order for posts (
hot,new,top,rising) - default:hot - maxPosts: Maximum number of posts to scrape - default:
500 - maxComments: Maximum number of comments per post - default:
0(not implemented yet) - proxyConfiguration: Apify proxy configuration (optional)
Output
The actor outputs structured data to the Apify dataset with the following fields:
title: Post titlepostUrl: Full URL to the postupvotes: Number of upvotescomments: Number of commentssubreddit: Subreddit name (e.g.,r/popular)subredditUrl: URL to the subredditsubredditType: Type of subreddit (usuallylink)author: Post author usernameauthorProfile: URL to author profilepostTime: Post timestamp inYYYY-MM-DD HH:MM:SSformat
Proxy Fallback Logic
The actor implements intelligent proxy fallback:
- No Proxy: Starts with direct connection
- Datacenter Proxy: If blocked, automatically switches to datacenter proxy
- Residential Proxy: If datacenter fails, switches to residential proxy with 3 retries
- Sticky Proxy: Once a proxy type works, it sticks with it for all subsequent requests
All proxy events are logged clearly for monitoring.
Usage
- Configure input in Apify platform
- Add your Reddit URLs, usernames, or keywords
- Set sort order and maximum posts
- Optionally configure proxy settings
- Run the actor
Development
# Install dependenciespip install -r requirements.txt# Run locallypython -m src
Notes
- The actor uses
old.reddit.comfor scraping but outputs URLs withreddit.com - Rate limiting is implemented to be respectful to Reddit's servers
- All errors are logged and the actor continues processing remaining URLs