Reddit Q&A Dataset for LLM Fine-Tuning & RAG
Created by
Dami's Studio
Top-voted Reddit Q&A threads with full comment trees, exported as a clean conversational dataset for fine-tuning an LLM or feeding a RAG pipeline.
Reddit Posts Search Scraperdami_studio/reddit-search-scraper
Sub
Title
Author
Score (app creds)
+3 fieldsTextNumberBooleanListObject
Input
Search query(required):how do I
Restrict to subreddit (optional):learnprogramming
Sort by:top
Time window:all
Max posts:200
Include top comments:true
Comments per post:50
Output fields
Sub
Title
Author
Score (app creds)
Comments (app creds)
Posted
URL
Sign up on Apify01
Create your Apify account to access the Reddit Posts Search Scraper.
Start the run02
The Actor will start running based on the input automatically.
Receive the output03
Monitor the progress in real-time. You will be notified as soon as your dataset is complete and ready for review.
Integrate into your workflow04
The final output is delivered in JSON, CSV, or Excel format, ready to be plugged into your workflow.

