Reddit Scraper
Pricing
$19.99/month + usage
Rating
0.0
(0)
Developer
ScraperX
Actor stats
Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 2 days ago
One or more Reddit URLs to scrape. This field supports bulk input: use "+ Add" to add more URLs, "Bulk edit" to paste many at once, or "Text file" to upload a file with one URL per line. Accepted URL types: (1) Post URLs — any link containing /comments/ and a post ID (e.g. https://www.reddit.com/r/subreddit/comments/ABC123/title/); (2) Subreddit URLs — links to a community (e.g. https://www.reddit.com/r/python/); (3) User profile URLs — links to a user (e.g. https://www.reddit.com/user/username/). The actor detects the type of each URL and runs the appropriate scraper (post+comments, subreddit listing, or user profile). At least one Start URL is required unless you run in search-only mode with "Ignore start URLs" enabled.
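The URL type detection described above can be sketched as follows. This is a minimal illustration, not the actor's actual code: the function name `classify_reddit_url` and the regular expressions are assumptions based only on the three URL shapes listed in the description.

```python
import re
from urllib.parse import urlparse

# Hypothetical patterns; the actor's real detection rules are not documented.
_POST = re.compile(r"/comments/([A-Za-z0-9]+)")   # any path containing /comments/<post id>
_SUBREDDIT = re.compile(r"^/r/([^/]+)/?$")        # /r/<name>/
_USER = re.compile(r"^/user/([^/]+)/?$")          # /user/<name>/


def classify_reddit_url(url: str) -> str:
    """Return 'post', 'subreddit', 'user', or 'unknown' for a Reddit URL."""
    path = urlparse(url).path
    if _POST.search(path):
        return "post"
    if _SUBREDDIT.match(path):
        return "subreddit"
    if _USER.match(path):
        return "user"
    return "unknown"
```

Checking for a post first matters because post URLs also start with `/r/<name>/`.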
If enabled, the actor will not fetch comments for any post URLs. Only the post title, body, score, and other post-level data will be collected. Use this to speed up runs or reduce output size when you only need post content.
If enabled, any user profile URLs in Start URLs are ignored and no user data (profile, submitted posts, comments) is scraped. Subreddit and post URLs are still processed as usual.
If enabled, the actor will not fetch subreddit/community metadata (e.g. description, subscriber count, creation date) for subreddits that appear in your Start URLs or in the subreddit info options. Post listings from those subreddits are still scraped when "Enable subreddit posts" is on.
One or more search phrases to run against Reddit. Use "+ Add" to add another term or "Bulk edit" to paste multiple lines. Each term is queried separately and results appear under the "search" key in the output, with the query as the key (e.g. "python programming"). Search is executed only when this array is non-empty and either you have no Start URLs or "Ignore start URLs" is enabled.
Optional subreddit name without the "r/" prefix (e.g. python, learnprogramming). When set, search is restricted to this community only (Reddit's "search within subreddit" behaviour). Leave empty to search all of Reddit. Only has an effect when you have at least one Search Term.
When enabled, the actor does not scrape any Start URLs. Only search runs, using the Search Term(s) you provided. Useful when you want to run a search-only job (e.g. multiple keywords) without scraping specific posts or subreddits. You must provide at least one search term for the run to do anything.
Include posts in search results. When on (default), search returns post listings matching your query. This is the primary search mode for this actor.
Reserved for future use: include comments in the search scope. Currently the actor focuses on post search.
Reserved for future use: include communities/subreddits in the search scope.
Reserved for future use: include user profiles in the search scope.
Controls how Reddit search results are ordered. Options: relevance (best match to query), new (newest first), hot (trending), top (highest score), comments (most comments). Applies to all search terms in this run.
Restricts search results to a relative time window: hour (last hour), day (last 24 hours), week, month, year, or all (no time limit). This is Reddit's "t" parameter and affects which posts are included in search. Use "Filter by date" below for an absolute or custom relative date.
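To show how the sort order, time window, and optional subreddit restriction could map onto a Reddit search request, here is a hedged sketch. It builds a URL for Reddit's public `search.json` endpoint using the `q`, `sort`, `t`, and `restrict_sr` query parameters. Whether the actor uses this endpoint is an assumption, and `build_search_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Values taken from the option lists in the field descriptions.
SORTS = {"relevance", "new", "hot", "top", "comments"}
TIMES = {"hour", "day", "week", "month", "year", "all"}


def build_search_url(query, sort="relevance", time="all", subreddit=None):
    """Build a Reddit search URL (assumed endpoint, illustration only)."""
    if sort not in SORTS:
        raise ValueError(f"unsupported sort: {sort}")
    if time not in TIMES:
        raise ValueError(f"unsupported time window: {time}")
    params = {"q": query, "sort": sort, "t": time}
    if subreddit:
        # Restrict results to one community ("search within subreddit").
        params["restrict_sr"] = 1
        base = f"https://www.reddit.com/r/{subreddit}/search.json"
    else:
        base = "https://www.reddit.com/search.json"
    return f"{base}?{urlencode(params)}"
```

For example, `build_search_url("python programming", sort="new", time="week", subreddit="python")` produces a URL scoped to r/python, ordered newest first, and limited to the last week.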
Optional filter to only include posts from a certain date onward. You can choose an absolute date (e.g. 2024-01-15) or a relative range (e.g. "3 days", "2 weeks", "1 month"). Leave empty to not filter by date. Applies to search and listing results where applicable.
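A date filter that accepts either an absolute date or a relative range could be resolved to a single cutoff timestamp, roughly as sketched below. The helper names are hypothetical, and treating a month as 30 days and a year as 365 days is an assumption, since the description does not say how the actor rounds these units. The `created_utc` value is the epoch-seconds field found in Reddit's post JSON.

```python
import re
from datetime import datetime, timedelta, timezone

# Assumption: months and years are approximated as fixed numbers of days.
_UNIT_DAYS = {"day": 1, "week": 7, "month": 30, "year": 365}


def parse_date_filter(value, now=None):
    """Return a UTC cutoff datetime, or None when the filter is empty."""
    if not value or not value.strip():
        return None
    now = now or datetime.now(timezone.utc)
    text = value.strip().lower()
    match = re.fullmatch(r"(\d+)\s*(day|week|month|year)s?", text)
    if match:
        days = int(match.group(1)) * _UNIT_DAYS[match.group(2)]
        return now - timedelta(days=days)
    # Otherwise expect an absolute date such as 2024-01-15.
    return datetime.strptime(text, "%Y-%m-%d").replace(tzinfo=timezone.utc)


def is_on_or_after(created_utc, cutoff):
    """Keep a post when no cutoff is set or it was created at/after the cutoff."""
    if cutoff is None:
        return True
    return datetime.fromtimestamp(created_utc, tz=timezone.utc) >= cutoff
```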
When on, the actor runs Reddit search if you have provided Search Terms and the run is in search mode (that is, you have no Start URLs or "Ignore start URLs" is enabled). When off, search is skipped even if search terms are present.
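The conditions under which search runs are spread across several field descriptions. Combined, they amount to a small predicate like the one below. The function and parameter names are hypothetical and only mirror the toggles described in the text.

```python
def should_run_search(search_terms, start_urls, ignore_start_urls=False, enable_search=True):
    """Return True when the described conditions for running search are met."""
    if not enable_search:
        return False  # search toggle is off: skip even if terms exist
    has_terms = any(term.strip() for term in search_terms)
    if not has_terms:
        return False  # nothing to search for
    # Search mode: either no Start URLs were given, or they are ignored.
    return ignore_start_urls or not start_urls
```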
When on, any subreddit URLs in Start URLs (e.g. reddit.com/r/python) are scraped for their post listing (hot, new, etc.) up to the limits you set. When off, subreddit URLs are ignored.
How to order posts when fetching from each subreddit: hot (trending), new (newest first), top (highest score), rising (gaining traction), controversial.
When on, any post URLs in Start URLs are scraped for the full post and its comment tree (unless "Skip comments" is enabled). When off, post URLs are ignored.
When on, any user profile URLs in Start URLs are scraped for profile info, submitted posts, and comments (subject to "Skip user posts" and limit settings). When off, user URLs are ignored.
When on, the actor fetches metadata for the subreddits in your Start URLs and, if enabled below, for the popular and new subreddit lists. Metadata includes title, description, subscriber count, creation date, etc. When off, no subreddit info is collected.
Global maximum number of items (e.g. posts, search results) the actor will collect across all sources. Helps control run duration and dataset size. Each scraper type respects this cap where applicable.
Maximum number of posts to fetch from a single subreddit listing (e.g. one "hot" or "new" listing). Reddit returns up to 100 per request; lowering this reduces data per subreddit.
If set, only posts created on or after this date (YYYY-MM-DD) are included. Use the date picker or enter a date. Leave empty to include all posts regardless of date.
Maximum number of comments to retrieve per post. Reddit returns comments in a tree; this limits how many are fetched per post to avoid very large responses.
For each subreddit/community URL, the maximum number of listing pages to paginate through (each page can contain up to the "Limit of posts" value). Use a low number (e.g. 2) for quick runs or higher for more coverage.
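The interaction between the per-page post limit, the page cap, and the global item cap can be illustrated with a small pagination loop. This is a sketch under stated assumptions: `fetch_page` is a hypothetical callable that returns one page of posts plus a cursor for the next page (or `None` when the listing is exhausted), and the actor's real paging code is not shown in the source.

```python
def collect_posts(fetch_page, per_page_limit, max_pages, max_items):
    """Page through a listing until a page cap, item cap, or the end is hit."""
    posts = []
    after = None  # cursor for the next page; None means "start from the top"
    for _ in range(max_pages):
        page, after = fetch_page(limit=per_page_limit, after=after)
        posts.extend(page)
        if len(posts) >= max_items or after is None:
            break
    # Trim any overshoot from the last page so the global cap is respected.
    return posts[:max_items]
```

With a per-page limit of 100 and 2 pages, a run collects at most 200 posts from one subreddit, and fewer if the global cap is lower.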
For each user profile URL, the maximum number of pages to fetch for submitted posts and comments. Each page contains multiple items; this cap controls how deep the actor goes into a user's history.
Timeout in seconds for any page or scroll-related operations during scraping. If a request or scroll step takes longer than this, it may be aborted. Increase for slow networks.
Maximum depth of nested comment replies to parse. Depth 1 means only top-level comments; higher values include more nested threads. Reddit threads can be very deep; limiting depth keeps output manageable (1–20).
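The depth limit can be pictured as pruning a comment tree. In the sketch below, each comment is assumed to be a dict with a `replies` list. That shape and the name `limit_comment_depth` are illustrative assumptions, not the actor's documented output format.

```python
def limit_comment_depth(comments, max_depth, _depth=1):
    """Return a copy of the tree keeping replies only down to max_depth."""
    if max_depth < 1:
        return []
    pruned = []
    for comment in comments:
        kept = dict(comment)  # shallow copy so the input tree is not mutated
        if _depth < max_depth:
            kept["replies"] = limit_comment_depth(
                comment.get("replies", []), max_depth, _depth + 1
            )
        else:
            kept["replies"] = []  # drop everything below the allowed depth
        pruned.append(kept)
    return pruned
```

With `max_depth=1` only top-level comments survive, matching the "Depth 1" behaviour described above.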
Per user, the maximum number of submitted posts and the maximum number of comments to fetch. For example, 20 means up to 20 submitted posts and up to 20 comments per user profile.
When on, fetch each user's profile (name, karma, created date). When off, only submitted posts and/or comments are fetched according to the toggles below.
When on, fetch posts submitted by each user. When off, only profile and/or comments are fetched according to the other toggles.
When on, fetch comments made by each user. When off, only profile and/or submitted posts are fetched according to the other toggles.
When on, in addition to profile, submitted posts, and comments, the actor fetches the user's combined overview and adds a summary (total items, posts count, comments count) to the user output. Slightly increases requests per user.
When on, the actor fetches Reddit's list of popular subreddits and adds them to the "subreddit_info.popular" array in the output. The number of subreddits is capped by "Max subreddits (popular/new lists)" below.
When on, the actor fetches recently created subreddits and adds them to "subreddit_info.new" in the output. The number is capped by "Max subreddits (popular/new lists)" below.
When "Fetch popular subreddits" or "Fetch new subreddits" is enabled, this is the maximum number of subreddits to fetch for each of those lists. Does not affect metadata for subreddits from your Start URLs.
Number of seconds to wait between consecutive HTTP requests to Reddit. A value of 1 or 2 can help avoid rate limiting (429). Set to 0 for no delay (faster but higher risk of blocks).
Configure Apify Proxy for this run. By default the actor uses no proxy (direct connection to Reddit). If Reddit blocks or rate-limits the run, the actor will automatically try datacenter proxy, then residential proxy (up to 3 retries), and keep using the first one that works. Here you can optionally enable Apify proxy from the start (e.g. use Apify Proxy with a specific group) so all requests go through proxy immediately.
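The fallback order described above (direct connection, then datacenter proxy, then residential proxy with up to 3 attempts) might look roughly like this. It is only one reading of the description: the tier names, the `Blocked` exception, and the `fetch` callable are hypothetical stand-ins, and the real retry accounting may differ.

```python
TIERS = ("direct", "datacenter", "residential")


class Blocked(Exception):
    """Raised by the (hypothetical) fetch callable on a block or HTTP 429."""


def fetch_with_fallback(fetch, url, start_tier="direct", residential_retries=3):
    """Try each connection tier in order; return (tier_that_worked, response)."""
    for tier in TIERS[TIERS.index(start_tier):]:
        # Assumption: only the residential tier is retried multiple times.
        attempts = residential_retries if tier == "residential" else 1
        for _ in range(attempts):
            try:
                return tier, fetch(url, tier)
            except Blocked:
                continue
    raise Blocked(f"all tiers failed for {url}")
```

A caller could remember the returned tier and pass it as `start_tier` for later requests, which corresponds to "keep using the first one that works". Passing `start_tier="datacenter"` or `start_tier="residential"` models enabling a proxy from the start.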