Scrape Reddit posts, comments, communities, and profiles via URLs or keyword searches. Features proxy rotation, custom field names, flexible filtering, and automatic retries. Perfect for monitoring, research, and data collection.
Stops the run after this many dataset items have been stored.
Default value of this property is 100
Per feed post cap
postCapintegerOptional
Maximum posts to take from a single subreddit/feed/user listing.
Default value of this property is 50
Per post comment cap
commentCapintegerOptional
Maximum number of comments to fetch for each post.
Default value of this property is 25
Community cap
communityCapintegerOptional
Maximum number of communities to take from leaderboards or searches.
Default value of this property is 25
Profile cap
profileCapintegerOptional
Maximum number of user profiles to take from searches.
Default value of this property is 25
Leaderboard cap
leaderboardCapintegerOptional
Number of leaderboard entries to collect when crawling /subreddits/leaderboard.
Default value of this property is 25
Scroll wait (sec)
scrollWaitSecondsintegerOptional
How long to wait between dynamic pagination requests on infinite feeds.
Default value of this property is 30
Max concurrency
maxConcurrencyintegerOptional
Maximum number of parallel HTTP requests.
Default value of this property is 10
Use Apify Proxy
useApifyProxybooleanOptional
Whether to use Apify Proxy
Default value of this property is true
Proxy configuration
proxyConfigurationobjectOptional
Choose to use no proxy, Apify Proxy, or provide custom proxy URLs.
Default value of this property is {}
Extend result function
extendResultFunctionstringOptional
Javascript function executed for each dataset item. Receives { data, page } where data is the item object and page is always null (no browser context). Return an object to merge with the item.
Verbose logging
debugLogbooleanOptional
When true, prints extra diagnostic information into the actor log.
Default value of this property is false
Minimum score
minScoreintegerOptional
Filter posts and comments by minimum score. Items with score below this value will be excluded. Leave empty to disable filtering.
Include NSFW content
includeNSFWbooleanOptional
When false, excludes NSFW posts and communities from results.
Default value of this property is true
Log performance metrics
logMetricsbooleanOptional
When true, logs performance statistics at the end of the run (items/sec, duration, errors).
Default value of this property is true
Enable pagination
enablePaginationbooleanOptional
When true, automatically paginates through listings to collect more items beyond the initial 100 items per request.
Default value of this property is false
Date from
dateFromstringOptional
Filter items created after this date (ISO 8601 format, e.g., 2024-01-01T00:00:00Z). Leave empty to disable.
Date to
dateTostringOptional
Filter items created before this date (ISO 8601 format, e.g., 2024-12-31T23:59:59Z). Leave empty to disable.
Enable deduplication
enableDeduplicationbooleanOptional
When true, prevents duplicate items (by entityId) from being added to the dataset within the same run.