Llm Sentiment Scraper
Pricing
from $1.00 / 1,000 results
Scrape Reddit for real human opinions about 30+ LLM models. Sentiment analysis, top comments, and community consensus from r/LocalLLaMA, r/ChatGPT, r/ClaudeAI, and more. The qualitative layer benchmarks can't capture.
Developer: David Flagg
Last modified: 3 days ago
The LLM Field Guide, Phase 2. What people actually think about language models — scraped from Reddit.
What you get
Qualitative human opinions about 30+ language models, scraped from 10 AI-focused subreddits. Each entry includes:
- Model identification — which model the post is about
- Post data — title, body, score, comment count, author, URL, flair
- Sentiment analysis — positive/negative/mixed/neutral label with a -1.0 to 1.0 score
- Top comments — the most upvoted, most relevant comments from each thread
- Signal indicators — whether the model is in the title, opinion keyword presence, mention count
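As a rough illustration of the entry described above, the sketch below models the listed fields as Python dataclasses. Every field name here (`model`, `sentiment_label`, `top_comments`, and so on) is an assumption made for illustration; the actor's actual output keys may differ. Only the four labels and the -1.0 to 1.0 score range come from the description.

```python
from dataclasses import dataclass, field
from typing import List

# The four labels named in the description above.
VALID_LABELS = {"positive", "negative", "mixed", "neutral"}


@dataclass
class Comment:
    # Hypothetical shape for one top comment.
    author: str
    body: str
    score: int


@dataclass
class Entry:
    # Hypothetical field names; check the actor's real dataset for the exact keys.
    model: str                 # model identification
    title: str                 # post data
    body: str
    score: int
    comment_count: int
    author: str
    url: str
    flair: str
    sentiment_label: str       # one of VALID_LABELS
    sentiment_score: float     # -1.0 (negative) to 1.0 (positive)
    top_comments: List[Comment] = field(default_factory=list)
    model_in_title: bool = False        # signal indicators
    has_opinion_keywords: bool = False
    mention_count: int = 0

    def __post_init__(self) -> None:
        # Reject values outside the documented label set and score range.
        if self.sentiment_label not in VALID_LABELS:
            raise ValueError(f"unknown sentiment label: {self.sentiment_label!r}")
        if not -1.0 <= self.sentiment_score <= 1.0:
            raise ValueError("sentiment_score must be between -1.0 and 1.0")
```

Validating on construction makes a malformed row fail early when loading a dataset export, rather than surfacing later during analysis.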
Data sources
Posts and comments from these subreddits (configurable):
| Subreddit | Focus |
|---|---|
| r/LocalLLaMA | Open-weight model comparisons, benchmarks, local deployment |
| r/ChatGPT | End-user GPT experiences |
| r/ClaudeAI | Claude-specific discussion |
| r/OpenAI | OpenAI ecosystem |
| r/ollama | Local model runners |
| r/MachineLearning | Research-oriented ML discussion |
| r/singularity | AI capability discussions |
| r/SillyTavernAI | Creative writing model opinions |
| r/KoboldAI | Another creative-writing LLM community |
| r/artificial | Broader AI discussion |
Default models tracked
GPT-4o, GPT-4.1, o1, o3, o4-mini, Claude Sonnet 4, Claude Opus 4, Claude Sonnet 3.5, Gemini 2.5 Pro/Flash, Llama 4, Llama 3.3, Mistral Large/Small, Devstral, Qwen 3, Qwen 3.5, QwQ, DeepSeek V3/R1, Grok, Phi-4, Gemma 3, and more.
You can also provide your own custom model list with aliases.
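To show how a custom model list with aliases could work, here is a minimal sketch of alias-based mention counting. The dictionary shape, the example aliases, and the `count_mentions` helper are all hypothetical; the actor's real input schema and matching logic are not documented here and may differ.

```python
import re
from typing import Dict, List

# Hypothetical custom model list: canonical name -> aliases to match in text.
CUSTOM_MODELS: Dict[str, List[str]] = {
    "Claude Sonnet 4": ["claude sonnet 4", "sonnet 4"],
    "DeepSeek R1": ["deepseek r1", "deepseek-r1", "r1"],
}


def count_mentions(text: str, models: Dict[str, List[str]]) -> Dict[str, int]:
    """Count case-insensitive, whole-token alias matches per canonical model."""
    counts: Dict[str, int] = {}
    for name, aliases in models.items():
        # Try longer aliases first so "deepseek-r1" is consumed as one match
        # instead of also being counted again through the bare "r1" alias.
        ordered = sorted(aliases, key=len, reverse=True)
        alternation = "|".join(re.escape(alias) for alias in ordered)
        # Lookarounds keep "r1" from matching inside a longer word such as "r10".
        pattern = re.compile(rf"(?<!\w)(?:{alternation})(?!\w)", re.IGNORECASE)
        found = len(pattern.findall(text))
        if found:
            counts[name] = found
    return counts
```

For example, `count_mentions("Sonnet 4 beats DeepSeek-R1 for me, but R1 is cheaper.", CUSTOM_MODELS)` returns `{"Claude Sonnet 4": 1, "DeepSeek R1": 2}`. Short aliases such as `r1` are convenient but can collide with unrelated text, so they are worth using sparingly.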
Use cases
- Model selection — See what real users say before committing to an API
- Competitive analysis — Track sentiment shifts when new models launch
- Content creation — Source quotes and opinions for reviews and articles
- Trend tracking — Run weekly to monitor community consensus over time
- Pair with benchmarks — Combine with the LLM Benchmark Aggregator for the full picture
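For the trend-tracking use case, a weekly comparison could be sketched as follows. The entry keys `model` and `sentiment_score` are assumed names for illustration, and both helper functions are hypothetical and not part of the actor.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, Iterable, Mapping


def mean_sentiment_by_model(entries: Iterable[Mapping]) -> Dict[str, float]:
    """Average the sentiment score of all entries for each model."""
    scores = defaultdict(list)
    for entry in entries:
        # "model" and "sentiment_score" are assumed key names.
        scores[entry["model"]].append(entry["sentiment_score"])
    return {model: round(mean(values), 3) for model, values in scores.items()}


def sentiment_shift(previous: Dict[str, float], current: Dict[str, float]) -> Dict[str, float]:
    """Change in mean sentiment for models present in both runs."""
    return {
        model: round(current[model] - previous[model], 3)
        for model in current
        if model in previous
    }
```

Running the first helper on each weekly dataset and passing two consecutive results to the second gives a per-model delta that can be charted over time.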
No API keys required
Uses public Reddit endpoints. No Reddit API key, no OAuth, no paid tier needed.
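Reddit serves unauthenticated JSON for many listing pages when `.json` is appended to the path. The sketch below builds a subreddit search URL of that kind and shows one way to fetch it. Whether the actor uses this particular endpoint is an assumption, and the function names and the User-Agent string are hypothetical.

```python
import json
import urllib.parse
import urllib.request
from typing import List


def build_search_url(subreddit: str, query: str, limit: int = 25,
                     time_filter: str = "week") -> str:
    """Build a public search URL restricted to one subreddit."""
    params = urllib.parse.urlencode({
        "q": query,
        "restrict_sr": 1,     # keep results inside this subreddit
        "sort": "top",
        "t": time_filter,     # e.g. "day", "week", "month"
        "limit": limit,
    })
    return f"https://www.reddit.com/r/{subreddit}/search.json?{params}"


def fetch_posts(url: str, user_agent: str = "llm-sentiment-sketch/0.1") -> List[dict]:
    """Fetch a listing and return the post objects (performs a network call)."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=30) as response:
        listing = json.load(response)
    return [child["data"] for child in listing["data"]["children"]]
```

A descriptive User-Agent is set because requests with a generic client string are more likely to be rate limited. Direct requests from a single IP can still be blocked, which is where proxying comes in.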
Proxy configuration
The actor uses Apify proxy by default to avoid Reddit's IP blocking. Some subreddits (notably r/LocalLLaMA) block datacenter proxies more aggressively. For these, set proxyGroups to ["RESIDENTIAL"] in the input. This switches to residential IP rotation, which has a near-100% success rate but consumes more Apify credits.
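A small helper can decide when to request residential proxies. In this sketch only `proxyGroups` and the `"RESIDENTIAL"` value come from the text above; the `subreddits` input key, the `STRICT_SUBREDDITS` set, and the `build_run_input` function are assumptions for illustration.

```python
from typing import Any, Dict, List

# Subreddits the text singles out as blocking datacenter proxies more aggressively.
STRICT_SUBREDDITS = {"localllama"}


def build_run_input(subreddits: List[str]) -> Dict[str, Any]:
    """Build an actor input, opting into residential proxies only when needed."""
    # "subreddits" is a hypothetical input key; check the actor's input schema.
    run_input: Dict[str, Any] = {"subreddits": subreddits}
    # Accept both "r/LocalLLaMA" and "LocalLLaMA" spellings.
    names = {name.lower().split("/")[-1] for name in subreddits}
    if names & STRICT_SUBREDDITS:
        # Residential rotation costs more credits, so enable it selectively.
        run_input["proxyGroups"] = ["RESIDENTIAL"]
    return run_input
```

Leaving `proxyGroups` unset for other subreddits keeps the default proxy behaviour and avoids paying for residential traffic on runs that do not need it.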