Reddit Post & Comment Scraper (JSON API) avatar

Reddit Post & Comment Scraper (JSON API)

Pricing

from $1.00 / 1,000 post extracteds

Go to Apify Store
Reddit Post & Comment Scraper (JSON API)

Reddit Post & Comment Scraper (JSON API)

Bypass headless browsers. Extract Reddit posts, comments, and upvote ratios instantly using fast JSON endpoints for social listening.

Pricing

from $1.00 / 1,000 post extracteds

Rating

0.0

(0)

Developer

Andok

Andok

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

a day ago

Last modified

Categories

Share

Reddit Post & Comment Scraper

Monitor Reddit discussions for product signals, brand mentions, and market trends. Extract structured post and comment data from any subreddit or thread URL using Reddit's public JSON endpoints — no browser, no login, no API key required. Processes hundreds of URLs per run with full control over item limits and timeouts.

Features

  • Subreddit scanning — extract all visible posts from any subreddit listing page
  • Thread comments — scrape full comment threads from individual post URLs
  • Structured output — get titles, scores, authors, vote ratios, comment counts, and selftext in clean JSON
  • No authentication — uses Reddit's public .json endpoints, no API keys or login needed
  • Bulk processing — pass multiple URLs and extract up to 500 items per URL
  • Pay-per-result — only pay for posts and comments actually extracted

Input

FieldTypeRequiredDefaultDescription
urlsarrayYesList of Reddit subreddit or post URLs to scrape
maxItemsintegerNo100Maximum number of posts or comments to extract per URL (1-500)
timeoutSecondsintegerNo15HTTP request timeout in seconds per URL (1-120)

Input Example

{
"urls": [
"https://www.reddit.com/r/Entrepreneur/",
"https://www.reddit.com/r/technology/comments/1abc123/example_post"
],
"maxItems": 50,
"timeoutSeconds": 15
}

Output

Each dataset item represents one Reddit post or comment with structured metadata.

Key fields:

  • id (string) — Reddit item ID
  • subreddit (string) — subreddit name
  • title (string) — post title (null for comments)
  • author (string) — Reddit username
  • score (number) — net upvotes
  • upvoteRatio (number) — ratio of upvotes to total votes
  • numComments (number) — comment count (posts only)
  • selftext (string) — post body text or comment body
  • permalink (string) — full URL to the post or comment
  • createdUtc (number) — Unix timestamp
  • isComment (boolean) — true if the item is a comment

Output Example

{
"inputUrl": "https://www.reddit.com/r/Entrepreneur/.json",
"id": "1h2x3y4",
"subreddit": "Entrepreneur",
"title": "How I grew my SaaS to $10k MRR in 6 months",
"author": "startup_founder",
"score": 342,
"upvoteRatio": 0.94,
"numComments": 87,
"permalink": "https://www.reddit.com/r/Entrepreneur/comments/1h2x3y4/how_i_grew_my_saas/",
"selftext": "I started building my product last year after leaving my job...",
"createdUtc": 1709312400,
"isComment": false
}

Pricing

EventCost
Post ExtractedIncluded in platform usage

You are charged per dataset item (post or comment) extracted. The actor respects the per-run max charge limit — processing stops automatically when the budget is reached.

Use Cases

  • Brand monitoring — track mentions of your product or company across relevant subreddits
  • Market research — analyze sentiment and feature requests in niche communities
  • Competitor analysis — monitor what users say about competing products
  • Content strategy — find trending topics and high-engagement discussions in your industry
  • Lead generation — identify users asking for solutions your product solves
  • Academic research — collect discussion data for social media analysis
ActorWhat it adds
Subreddit Media DownloaderDownload images and videos from the same subreddits you monitor
Hacker News API ScraperExtend discussion monitoring to the tech/startup community
Google News ScraperComplement Reddit signals with mainstream news coverage

Notes

  • Reddit's public JSON endpoints may rate-limit aggressive requests. Keep timeoutSeconds reasonable and avoid running thousands of URLs in a single run.
  • Some subreddits or posts may be private, quarantined, or age-restricted — these will return an error in the output.
  • The actor appends .json to the provided URL automatically, so standard Reddit URLs work directly.