# Find Reddit pain points and buying intent

**Use case:** Analyze Reddit posts to detect workflow blockers, tool recommendations, competitor mentions, and negative sentiment, and to surface scored buying-intent opportunities.

## Input

```json
{
  "redditItems": [
    {
      "permalink": "https://www.reddit.com/r/InterstellarKinetics/comments/1tyunr0/warning_security_researchers_have_just_revealed/",
      "headline": "WARNING: Security Researchers Have Just Revealed That The Smart TV In Your Living Room May Be Actively Routing AI Web Scraping Traffic Through Your Home Internet Connection Right Now, Without You Realizing What You Agreed To 📺⚠️",
      "textBody": "Cybersecurity firm Include Security published a detailed technical investigation revealing that a company called Bright Data, which markets the world’s largest residential proxy network, embeds a software development kit into consumer apps on smart TVs and mobile phones that silently turns those devices into exit nodes for web scraping traffic routed by paying customers who include AI companies. Bright Data sells access to a network it advertises as containing over 150 million residential IP addresses, and the supply behind that network comes from an SDK embedded in partner apps across major platforms including Roku, LG, Samsung, Vizio, and Comcast-distributed services through partners like PlayWorks Digital, which reports distribution reach across approximately 250 million TV homes, and CloudTV, which is integrated into over 125 TV brands and 15 or more original equipment manufacturers. The SDK is framed as a way for users to get a free app or fewer ads in exchange for allowing the app to occasionally use their device’s resources, but the researcher documented that the SDK’s own publicly accessible configuration file sets a default monthly WiFi bandwidth budget of 200 gigabytes per device, a figure that bears no resemblance to the word “occasionally” used in the consent screens of at least one documented partner app called Petflix on Roku.\n The technical investigation, conducted through 30 days of traffic capture from a research device running a consent-installed partner app and static analysis of the iOS SDK binary, revealed a series of design choices that significantly exceed what a reasonable user would understand themselves to be consenting to. Once installed, the SDK opens a persistent WebSocket connection to Bright Data’s infrastructure and continuously transmits a live feed of the device’s physical state including screen status, battery level, CPU usage, memory usage, network type, and whether the user is actively on a phone call. The SDK’s own configuration explicitly sets a flag called ignore_screen_on to true and another called ignore_on_call to true, meaning a device is considered eligible to relay someone else’s scraping traffic even while the user is actively using the screen or in the middle of a phone call. The SDK also ships a flag called use_netifs that causes the peer tunnel carrying scraping traffic to bind directly to the device’s physical WiFi or cellular interface rather than routing through the system default, which means on iOS the scraping traffic bypasses any VPN the user has configured entirely, even when all other traffic on the device is passing through that VPN.\n Include Security notified Bright Data of the planned publication by email on May 11, 2026, and received no response before the article published. Bright Data’s SDK configuration endpoint, which contains the full partner manifest, per-country bandwidth tiers, and idle detection thresholds, is publicly accessible without any authentication to anyone who supplies a valid app bundle ID and SDK version string. The researchers documented that certain countries receive dramatically different treatment under the SDK’s configuration, with devices in Uzbekistan and Oman permitted to relay traffic down to a battery level of one percent with daily caps 20 times the global default and monthly caps 60 times the global default, while the worldwide default still permits 500 megabytes of third-party scraping traffic per month over a user’s home internet connection. For households that want to block the SDK’s peer tunnel at the router level right now, Include Security identified three specific hostnames that can be blocked via DNS settings without affecting any legitimate Bright Data customer-facing service: proxyjs.brdtnet.com, proxyjs.luminatinet.com, and proxyjs.bright-sdk.com."
    },
    {
      "permalink": "https://www.reddit.com/r/webscraping/comments/1tsrxxq/new_free_opensource_android_automation_for_web/",
      "headline": "New Free open-source Android automation for web scraping - Damru",
      "textBody": "Hey r/webscraping, I’m sharing a free open-source project I’ve been building called Damru: https://github.com/akwin1234/damru \n Damru is a browser automation framework built around real Android environments in Docker for scraping and automation tasks where mobile behavior matters.\n What sets it apart is that it’s not just another desktop browser with stealth patches. The project is built around zero JS injection, with spoofing handled at the OS, binary, and CDP levels instead of the usual JavaScript-heavy tricks used by many stealth tools.\n Compared with tools like Playwright, puppeteer-stealth, undetected-chromedriver, Camoufox, and Fingerprinting Chromium, Damru is trying to solve the problem differently: by running inside a real Android stack rather than faking mobile behavior on desktop Chrome. The idea is to get a more realistic mobile environment, stronger fingerprint control, and less reliance on brittle browser-side patches.\n What makes it different:\n  Zero JS injection: Damru does spoofing at the OS, binary, and CDP levels instead of relying on Object.defineProperty-style JavaScript patches. Real Android OS: It runs inside Redroid, so it’s not just desktop Chrome pretending to be mobile through viewport tricks. Native mobile fingerprinting controls: device profiles, hardware overrides, locale/timezone matching, mobile network emulation, and WebRTC/IPv6 blocking. Multi-instance pooling: built for scaling across multiple containers. Pre-baked image support: reduces setup overhead.  Some of the features include:\n  Android-in-Docker via Redroid. Playwright support. A built-in database of 32+ Android device profiles. Proxy-aware timezone, locale, and language matching. Hardware overrides for CPU, RAM, and touch points. Mobile network emulation. WebRTC and IPv6 leak blocking. Native Android iptables-based network protections. Multi-container pooling for scale. Pre-baked image support to reduce setup time. TLS spoofing and soo many things  Also stronger against systems like CreepJS, BrowserScan, Sannysoft, Cloudflare Turnstile,etc ALL CDN anti-bots dont waana name them than standard Playwright or typical stealth plugins, mainly because of the deeper Android-based approach.\n Pros: Highly UnDetectable\n Cons: Real Android OS hence little slower. Hard to Use (thats why custom docker image included)\n Repo: https://github.com/akwin1234/damru\n Official Website: https://damru.dev/\n Would love feedback from anyone who works on scraping, browser automation, or anti-bot research. I made this because i see many reddit post recommending Android Playwright CDP but there was no framework around it. This is strictly for educational purpose only. Do not do legal abuse."
    },
    {
      "permalink": "https://www.reddit.com/r/WebScrapingInsider/comments/1tyb3yj/new_free_opensource_android_automation_for_web/",
      "headline": "New Free open-source Android automation for web scraping - Damru",
      "textBody": "I’m sharing a free open-source project I’ve been building called Damru: https://github.com/akwin1234/damru\n 100% Bypass rate for all CDN, even websites like shopee, temu, lazada, etc. Because its made with a lot of research\n Damru is a browser automation framework built around real Android environments in Docker for scraping and automation tasks where mobile behavior matters.\n What sets it apart is that it’s not just another desktop browser with stealth patches. The project is built around zero JS injection, with spoofing handled at the OS, binary, and CDP levels instead of the usual JavaScript-heavy tricks used by many stealth tools.\n Compared with tools like Playwright, puppeteer-stealth, undetected-chromedriver, Camoufox, and Fingerprinting Chromium, Damru is trying to solve the problem differently: by running inside a real Android stack rather than faking mobile behavior on desktop Chrome. The idea is to get a more realistic mobile environment, stronger fingerprint control, and less reliance on brittle browser-side patches.\n What makes it different:\n  Zero JS injection: Damru does spoofing at the OS, binary, and CDP levels instead of relying on Object.defineProperty-style JavaScript patches. Real Android OS: It runs inside Redroid, so it’s not just desktop Chrome pretending to be mobile through viewport tricks. Native mobile fingerprinting controls: device profiles, hardware overrides, locale/timezone matching, mobile network emulation, and WebRTC/IPv6 blocking. Multi-instance pooling: built for scaling across multiple containers. Pre-baked image support: reduces setup overhead.  Some of the features include:\n  Android-in-Docker via Redroid. Playwright support. A built-in database of 32+ Android device profiles. Proxy-aware timezone, locale, and language matching. Hardware overrides for CPU, RAM, and touch points. Mobile network emulation. WebRTC and IPv6 leak blocking. Native Android iptables-based network protections. Multi-container pooling for scale. Pre-baked image support to reduce setup time. TLS spoofing and soo many things  Also stronger against systems like CreepJS, BrowserScan, Sannysoft, Cloudflare Turnstile,etc ALL CDN anti-bots dont waana name them than standard Playwright or typical stealth plugins, mainly because of the deeper Android-based approach.\n Pros: Highly UnDetectable\n Cons: Real Android OS hence little slower. Hard to Use (thats why custom docker image included)\n Repo: https://github.com/akwin1234/damru\n Would love feedback from anyone who works on scraping, browser automation, or anti-bot research. I made this because i see many reddit post recommending Android Playwright CDP but there was no framework around it. This is strictly for educational purpose only. Do not do legal abuse."
    },
    {
      "permalink": "https://www.reddit.com/r/ClaudeAI/comments/1uaysmd/best_vps_for_web_scraping/",
      "headline": "Best VPS for Web Scraping?",
      "textBody": "I built a scraper with Claude Code that worked perfectly, deployed on AWS until I got my IP Blocked lmao.\n I’ve since built it into a front end where I can interface with it, but I need a place to deploy it.\n Now I’m looking at using a VPS for scraping with an IP proxy. Does anyone have experience with a service they could recommend?"
    },
    {
      "permalink": "https://www.reddit.com/r/pwnhub/comments/1u07xzu/residential_proxy_architecture_how_embedded_sdks/",
      "headline": "Residential Proxy Architecture: How Embedded SDKs Turn Connected TVs into Silent Web-Scraping Networks for AI Models",
      "textBody": "We just got an incredibly detailed technical post-mortem from Include Security exposing how commercial proxy firms are sourcing their massive residential IP networks. If you’ve ever wondered why scraping networks can effortlessly hit major websites while your custom script gets blacklisted or blocked by a basic Cloudflare challenge within three requests, the answer is sitting right in your living room.\n Free apps on smart TV platforms (including Roku, Samsung Tizen, and LG webOS) are systematically embedding background Software Development Kits (SDKs) to turn consumer devices into unmonitored residential exit nodes.\n The Capture Matrix: Why Connected TVs (CTVs) are the Gold Standard\n For residential proxy operations, standard desktop or mobile exit nodes are unreliable. Mobile devices switch cells, drop onto metered connections, hit sleep mode, or drain their batteries.\n Smart TVs are the absolute holy grail for background scraping infrastructure:\n  Perpetual Uptime: They are plugged into a continuous power source and remain in an active network standby state 24/7. High Bandwidth / Low Churn: They sit on unmetered home fiber/broadband connections with static or long-lease DHCP residential IPs. Low Visibility: Nobody is running standard endpoint detection, netstat audits, or process monitoring on their television.  According to the reverse-engineering telemetry, the embedded SDK (specifically from Bright Data, the successor entity to Luminati/Hola VPN) sets an internal configuration budget allowing a massive 200GB of background data routing per month per device. The traffic is monetized by app publishers (like PlayWorks, CloudTV, and Longvision) who integrate the code to trade user bandwidth for an ad-free application experience.\n The Network Plane and VPN Bypasses\n The deep dive into the iOS framework variant of this SDK revealed some slick, highly intentional routing logic designed to protect the proxy network's integrity at the kernel level:\n  The Control Plane: The SDK establishes initial command-and-control communication back to base using CFNetwork HTTP primitives rather than high-level API calls, fetching target scraping jobs completely silently. The Interface Bypass: To ensure the customer paying for the residential IP gets a clean, local home connection, the SDK’s data plane utilizes the NWConnection API with a strict directive mapping directly to the physical interface (en0). The Leak: Because it forces communication over the hardware layer, the proxy traffic completely sidesteps any local on-device VPN tunnels (tun0) running on the system. It routes traffic raw over the home gateway, completely undermining the user's local privacy stack while utilizing their clean domestic IP address.  The AI Pipeline Demand\n This isn't a malicious botnet compromise in the traditional sense—it relies on legally bulletproof (though highly opaque) opt-in text navigated via a TV remote. The explosive growth of this infrastructure is being entirely driven by the LLM training boom.\n AI data collection pipelines require millions of clean, distinct, non-datacenter IP addresses to scrape massive web data blocks without triggering rate limits or bot-mitigation scripts. If you manage a home network and want to yank your streaming devices out of an automated scraping farm, you can sinkhole the tracking domains directly at your DNS resolver or Pi-hole interface.\n Full Reverse-Engineering Teardown, Peer-Tunnel plane IPs, and Manifest Domain Indicators: https://www.technadu.com/free-smart-tv-apps-embed-bright-data-sdk-to-build-an-ai-web-scraping-proxy-network/629113/"
    }
  ],
  "manualItems": [],
  "painKeywords": [
    "pain",
    "problem",
    "issue",
    "bug",
    "broken",
    "blocked",
    "expensive",
    "alternative",
    "looking for"
  ],
  "buyingIntentKeywords": [
    "looking for",
    "recommend",
    "alternative",
    "switching",
    "tool",
    "software",
    "vendor",
    "solution"
  ],
  "competitorKeywords": [
    "bright data",
    "playwright",
    "cloudflare",
    "aws"
  ],
  "mustIncludeKeywords": [],
  "excludeKeywords": [],
  "maxItems": 5,
  "minOpportunityScore": 30,
  "failOnNoResults": true
}
```
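To get a feel for how the three keyword lists relate to a post, the sketch below matches them against a shortened excerpt of the r/ClaudeAI item from the input. It assumes plain case-insensitive substring matching; the Actor's real matching and scoring logic is not documented on this page, and `match_keywords` and `tag_item` are hypothetical helper names, not part of the Actor.

```python
from typing import Dict, List


def match_keywords(text: str, keywords: List[str]) -> List[str]:
    """Return the keywords that occur in `text`, ignoring case.

    Plain substring matching is an assumption made for this sketch;
    the Actor may tokenize, stem, or weight matches differently.
    """
    lowered = text.lower()
    return [kw for kw in keywords if kw.lower() in lowered]


def tag_item(item: Dict[str, str], config: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Match one redditItems entry against the three keyword lists."""
    text = f"{item.get('headline', '')} {item.get('textBody', '')}"
    return {
        "pain": match_keywords(text, config["painKeywords"]),
        "buyingIntent": match_keywords(text, config["buyingIntentKeywords"]),
        "competitors": match_keywords(text, config["competitorKeywords"]),
    }


# Keyword lists copied from the input above.
config = {
    "painKeywords": ["pain", "problem", "issue", "bug", "broken", "blocked",
                     "expensive", "alternative", "looking for"],
    "buyingIntentKeywords": ["looking for", "recommend", "alternative", "switching",
                             "tool", "software", "vendor", "solution"],
    "competitorKeywords": ["bright data", "playwright", "cloudflare", "aws"],
}

# Shortened excerpt of the r/ClaudeAI post from the input above.
item = {
    "headline": "Best VPS for Web Scraping?",
    "textBody": "I built a scraper with Claude Code that worked perfectly, "
                "deployed on AWS until I got my IP Blocked lmao. "
                "Does anyone have experience with a service they could recommend?",
}

tags = tag_item(item, config)
print(tags)
```

Substring matching is deliberately naive: a short keyword such as `aws` or `tool` would also match inside longer words like "laws" or "toolkit", so keyword lists may need tuning if the Actor behaves similarly.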

## Output

```json
{
  "rank": {
    "label": "Rank",
    "format": "integer"
  },
  "headline": {
    "label": "Headline",
    "format": "string"
  },
  "communityTag": {
    "label": "Community",
    "format": "string"
  },
  "painCategory": {
    "label": "Category",
    "format": "string"
  },
  "intentStage": {
    "label": "Intent",
    "format": "string"
  },
  "opportunityScore": {
    "label": "Opportunity",
    "format": "number"
  },
  "painScore": {
    "label": "Pain",
    "format": "number"
  },
  "matchedPainKeywords": {
    "label": "Pain keywords",
    "format": "array"
  },
  "matchedBuyingIntentKeywords": {
    "label": "Buying keywords",
    "format": "array"
  },
  "permalink": {
    "label": "Reddit",
    "format": "string"
  }
}
```
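When consuming results downstream, it can help to check each row against the declared formats. The following is a minimal sketch that assumes every output row is a flat JSON object keyed by the field names above; `matches_format` and `find_format_errors` are hypothetical helpers, and the sample row uses placeholder values, not real Actor output.

```python
from typing import Any, Dict, List

# Field name -> declared format, copied from the output schema above.
OUTPUT_FORMATS: Dict[str, str] = {
    "rank": "integer",
    "headline": "string",
    "communityTag": "string",
    "painCategory": "string",
    "intentStage": "string",
    "opportunityScore": "number",
    "painScore": "number",
    "matchedPainKeywords": "array",
    "matchedBuyingIntentKeywords": "array",
    "permalink": "string",
}


def matches_format(value: Any, fmt: str) -> bool:
    """Check a single value against one of the schema's format names."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool):
        return False
    if fmt == "integer":
        return isinstance(value, int)
    if fmt == "number":
        return isinstance(value, (int, float))
    if fmt == "string":
        return isinstance(value, str)
    if fmt == "array":
        return isinstance(value, list)
    return False


def find_format_errors(row: Dict[str, Any]) -> List[str]:
    """List fields that are missing from `row` or have the wrong type."""
    errors = []
    for field, fmt in OUTPUT_FORMATS.items():
        if field not in row:
            errors.append(f"{field}: missing")
        elif not matches_format(row[field], fmt):
            errors.append(f"{field}: expected {fmt}")
    return errors


# Placeholder row for illustration only; the values are NOT real Actor output.
sample_row = {
    "rank": 1,
    "headline": "Best VPS for Web Scraping?",
    "communityTag": "placeholder",
    "painCategory": "placeholder",
    "intentStage": "placeholder",
    "opportunityScore": 0.0,
    "painScore": 0.0,
    "matchedPainKeywords": [],
    "matchedBuyingIntentKeywords": [],
    "permalink": "https://www.reddit.com/r/ClaudeAI/comments/1uaysmd/best_vps_for_web_scraping/",
}

print(find_format_errors(sample_row))
```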

## About this Actor

This example demonstrates how to use [Reddit Pain Point Monitor](https://apify.com/runtime/reddit-pain-point-monitor) with a specific input configuration. Visit the [Actor detail page](https://apify.com/runtime/reddit-pain-point-monitor) to learn more, explore other use cases, and run it yourself.
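
One way to run this configuration programmatically is through the `apify-client` Python package. The sketch below is a hedged outline, not an official recipe: it assumes the package is installed, that the Actor ID is `runtime/reddit-pain-point-monitor` (taken from the URL above), that you supply your own API token, and that results land in the run's default dataset. `build_run_input` and `run_actor` are hypothetical helper names.

```python
from typing import Any, Dict, List

ACTOR_ID = "runtime/reddit-pain-point-monitor"  # taken from the Actor URL above


def build_run_input(reddit_items: List[Dict[str, str]]) -> Dict[str, Any]:
    """Assemble a run input with the same settings as the example input."""
    return {
        "redditItems": reddit_items,
        "manualItems": [],
        "painKeywords": ["pain", "problem", "issue", "bug", "broken", "blocked",
                         "expensive", "alternative", "looking for"],
        "buyingIntentKeywords": ["looking for", "recommend", "alternative",
                                 "switching", "tool", "software", "vendor",
                                 "solution"],
        "competitorKeywords": ["bright data", "playwright", "cloudflare", "aws"],
        "mustIncludeKeywords": [],
        "excludeKeywords": [],
        "maxItems": 5,
        "minOpportunityScore": 30,
        "failOnNoResults": True,
    }


def run_actor(token: str, run_input: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Start the Actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so build_run_input works without the package installed.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    if run is None:
        raise RuntimeError("Actor run did not return run details")
    # Assumption: the Actor writes its rows to the run's default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = build_run_input([
    {
        "permalink": "https://www.reddit.com/r/ClaudeAI/comments/1uaysmd/best_vps_for_web_scraping/",
        "headline": "Best VPS for Web Scraping?",
        "textBody": "...",  # full post text goes here
    }
])
```

Calling `run_actor("<YOUR_APIFY_TOKEN>", run_input)` would then start a run and return the rows as dictionaries. Because `failOnNoResults` is `true` in this configuration, expect the run to be reported as failed if no item passes the `minOpportunityScore` threshold.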