Pricing

from $0.001 / reddit post

Reddit Niche Subreddit Scraper | Auto-Tagged | Free

Scrape posts from any list of niche subreddits with automatic keyword tagging. Filter by date, score, comments. Output: clean JSON ready for LLM training, social listening, or brand monitoring. FREE during launch preview.

Pricing

from $0.001 / reddit post

Rating

0.0

(0)

Developer

Polara Data

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

Reddit Niche Subreddit Scraper (Auto-Tagged)

Scrape posts from a curated list of niche subreddits, with optional keyword search and automatic content tagging. Built for ML/LLM training pipelines, social listening, brand monitoring, and trend detection on niche communities that generic scrapers miss.

What it does

Pulls posts from any list of subreddits (no auth, no API key)
Filters by sort order (hot/new/top/rising), time window, min upvotes, min comments
Optional within-subreddit keyword search
Auto-tags every post with your custom keyword list — search the body+title for terms you care about, output them as a tags array
Returns clean structured JSON, ready to drop into ML pipelines or Slack/Notion automations

Use cases

LLM training data — Curate subreddit-specific corpora for fine-tuning domain models (e.g. r/MachineLearning + r/LocalLLaMA + r/datascience for AI dev models).

Social listening (niche) — Track brand mentions or competitor names across vertical subreddits without paying enterprise tools.

Trend detection — Auto-tag posts in r/startups, r/SaaS, r/Entrepreneur for emerging product categories or pain points.

Content discovery — Find high-engagement posts (>100 score, >50 comments) in your niche for content marketing inspiration.

Input

{
  "subreddits": ["MachineLearning", "datascience", "LocalLLaMA"],
  "sort": "hot",
  "searchQuery": "RAG",
  "tagKeywords": ["RAG", "fine-tuning", "Llama", "evaluation", "agent", "embedding"],
  "maxPostsPerSubreddit": 25,
  "minScore": 5,
  "minComments": 0,
  "includeBody": true
}

Field	Type	Default	Description
`subreddits`	array	required	Subreddit names (without /r/)
`sort`	enum	`hot`	hot / new / top / rising
`timeFilter`	enum	`week`	hour / day / week / month / year / all (only for sort=top)
`searchQuery`	string	(none)	Optional keyword search inside each subreddit
`tagKeywords`	array	[]	Auto-tag keywords applied to title+body
`maxPostsPerSubreddit`	int (1-500)	25	Cap per subreddit
`minScore`	int	5	Skip posts below this upvote count
`minComments`	int	0	Skip posts below this comment count
`includeBody`	bool	true	Include selftext body in output

Output

One dataset item per post:

{
  "id": "1abcxyz",
  "subreddit": "MachineLearning",
  "title": "[D] Best practices for evaluating RAG systems in production",
  "body": "...",
  "author": "user123",
  "url": "https://www.reddit.com/r/MachineLearning/comments/1abcxyz/...",
  "linkUrl": "https://arxiv.org/abs/...",
  "score": 234,
  "upvoteRatio": 0.97,
  "numComments": 56,
  "createdUtc": 1730000000,
  "createdAt": "2026-04-29T10:00:00Z",
  "isSelf": true,
  "flair": "Discussion",
  "domain": "self.MachineLearning",
  "tags": ["RAG", "evaluation"]
}

Pricing

Currently FREE during the launch preview — no per-result charges, no monthly cap.

When paid pricing rolls out (notice will be posted at least 14 days in advance):

Event	Price
Actor start	$0.01 (one-time per run)
Result item	$0.001 (per post)

Cost examples (post-launch):

100 posts: ~$0.11
1.000 posts: ~$1.01
10.000 posts: ~$10.01

Limits

Source: Reddit public JSON API (no auth required, no API key)
Rate limit: ~1 req/sec (politely paced internally with 0.6s sleep)
Max posts per subreddit: 500 per run (cumulative pagination)
No private subreddits, no NSFW filtering bypass
No comment scraping in v1 (planned for v2)

Source attribution

Data comes from Reddit's public JSON endpoint (/r/{sub}/.json), which does not require authentication. Subject to Reddit's Public Content Policy.

Author

Polara Data — niche scrapers for Italy, EU & global markets.

Reddit Scraper - Posts, Comments, Subreddits & Users

makework36/reddit-scraper

Fast, reliable Reddit scraper. Extract posts, comments, subreddits & users from any subreddit without Reddit API keys or login. AI-ready JSON for LLM training, sentiment analysis, lead generation. Export JSON/CSV/Excel.

deusex machine

133

HackerNews Monitor + Auto-Tagger | Free Preview

w4rd0g/hackernews-monitor

Search and monitor HackerNews stories, Show HN, Ask HN, polls, jobs by keyword and date range. Auto-tag every post with your custom keyword list. For trend detection, market intel, AI launch monitoring. FREE during launch preview.

Polara Data

Reddit Scraper — Posts, Comments, Users, Subreddits

good-apis/reddit-scraper

Fast Reddit scraper. Search posts, get subreddit data, user profiles, and comments. No login, no browser, clean JSON output. Launch pricing: $1.25 / 1,000 results.

Danny

Reddit Scraper - Posts, Comments & Subreddits

viralanalyzer/reddit-scraper

Extract Reddit posts, comments, subreddit data, and user profiles.

viralanalyzer

5.0

Reddit Scraper — Posts, Comments & Subreddits

junipr/reddit-scraper

Scrape Reddit posts, comments, subreddit feeds, profiles, and search results with threading, filters, media metadata, and JSON/CSV-ready output.

junipr

Reddit Scraper - Posts, Comments, Search & Subreddits

fetch_cat/reddit-scraper

Export public Reddit posts and comments from subreddit, user, search, post, comment, and short-link URLs for social listening, lead research, AI datasets, and alerts.

Hanna Nosova

Reddit Public Post & Comment Scraper

technicaldost/reddit-public-content-scraper

Scrape public Reddit posts and comments by subreddit, search term or URL. Get title, text, author, score, awards and timestamps. Perfect for research and social listening. JSON output.

Technical Dost Solutions

Reddit Scraper – Subreddit Posts & Comments

shuicici/reddit-scraper

Extract posts, comments, upvotes from any public subreddit. Filter by time, sort order, and keywords. Perfect for market research, sentiment tracking, and AI training data. JSON/CSV output.

Clara

Reddit Subreddit Posts Scraper - No API Key

wiry_kingdom/reddit-subreddit-scraper

Scrape any public subreddit. Posts, scores, comments, authors, awards, flairs, timestamps. No API key, no OAuth, no login. Free public Reddit JSON. For alt data, social listening, AI training datasets.

Mohieldin Mohamed

Reddit Lead Gen Scraper

sentry/reddit-lead-gen-scraper

Find Reddit leads by keyword — scrape posts & comments mentioning your product, niche, or pain points. Filter by subreddit, score & time. Export structured data for outreach, sales prospecting & competitor research.