Pricing

from $5.00 / 1,000 pages converted

Content Intelligence Extractor

Extract clean Markdown from Reddit threads and news sites. Built for LLM pipelines, n8n workflows, and AI content analysis. Uses Mozilla Readability + Reddit JSON API for noise-free output.

Developer

Andrew Luxem

Last modified

4 months ago

What it does

Give it a list of URLs — Reddit threads or articles from sites like Screen Rant, CBR, IGN, or any news site — and it returns clean Markdown with engagement signals, metadata, and source-specific fields ready to pipe directly into Claude, GPT, or any LLM.

Reddit threads are extracted via Reddit's JSON API (no browser needed) with post body, top comments sorted by upvotes, and engagement data.
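As a rough illustration of the JSON route, the sketch below parses the payload Reddit returns when `.json` is appended to a thread URL. It is not the actor's code: the function names are hypothetical, the URL helper assumes no query string, and fetching (which needs a descriptive User-Agent header) is left out so the parsing logic stays testable offline.

```python
from typing import Any


def thread_json_url(thread_url: str) -> str:
    """Return the JSON endpoint for a Reddit thread URL (assumes no query string)."""
    return thread_url.rstrip("/") + ".json"


def parse_thread(payload: list[dict[str, Any]], max_comments: int = 10) -> dict[str, Any]:
    """Reduce a thread payload to the post plus its most-upvoted top-level comments.

    The payload is a two-element list: the post listing, then the comment listing.
    """
    post = payload[0]["data"]["children"][0]["data"]
    comments = [
        child["data"]
        for child in payload[1]["data"]["children"]
        if child.get("kind") == "t1"  # "t1" marks a comment; "more" stubs are skipped
    ]
    comments.sort(key=lambda c: c.get("ups", 0), reverse=True)
    return {
        "title": post["title"],
        "body": post.get("selftext", ""),
        "subreddit": post["subreddit"],
        "upvotes": post.get("ups", 0),
        "commentCount": post.get("num_comments", 0),
        "topComments": [
            {"body": c.get("body", ""), "upvotes": c.get("ups", 0)}
            for c in comments[:max_comments]
        ],
    }
```

The `max_comments` parameter plays the same role as `maxRedditComments` in the actor input; only top-level comments are considered here, and replies are ignored.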

Film and news sites are extracted with Mozilla Readability and Turndown. Readability, the library behind Firefox's Reader View, strips ads, sidebars, author bios, and newsletter popups; Turndown then converts the cleaned HTML to Markdown.
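Readability and Turndown are JavaScript libraries, so the sketch below does not use them. It is a deliberately simplified stand-in built on Python's standard library that shows the same two-stage idea: drop elements that are usually noise, then emit Markdown for what remains. The tag blocklist and class name are assumptions made for illustration; Readability itself scores content rather than relying on a fixed list.

```python
from html.parser import HTMLParser

# Assumed blocklist of elements that rarely hold article text.
NOISE_TAGS = {"script", "style", "nav", "aside", "footer", "form"}
HEADING_PREFIX = {"h1": "# ", "h2": "## ", "h3": "### "}
BLOCK_TAGS = set(HEADING_PREFIX) | {"p"}


class ArticleToMarkdown(HTMLParser):
    """Collect headings and paragraphs that sit outside noise elements."""

    def __init__(self) -> None:
        super().__init__()
        self.noise_depth = 0      # > 0 while inside a noise element
        self.current_tag = None   # block tag whose text is being collected
        self.buffer: list[str] = []
        self.blocks: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self.noise_depth += 1
        elif tag in BLOCK_TAGS and self.noise_depth == 0:
            self.current_tag = tag
            self.buffer = []

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS:
            self.noise_depth = max(0, self.noise_depth - 1)
        elif tag == self.current_tag:
            # Collapse whitespace, then emit the block with its Markdown prefix.
            text = " ".join("".join(self.buffer).split())
            if text:
                self.blocks.append(HEADING_PREFIX.get(tag, "") + text)
            self.current_tag = None

    def handle_data(self, data):
        if self.current_tag and self.noise_depth == 0:
            self.buffer.append(data)


def html_to_markdown(html: str) -> str:
    parser = ArticleToMarkdown()
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.blocks)
```

Inline formatting such as links and emphasis is flattened to plain text here, which Turndown would preserve.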

Use cases

Content gap analysis — feed competitor articles to an LLM to find unexplored angles
n8n content pipelines — schedule weekly runs, pipe output to Claude or GPT for article briefs
Reddit trend monitoring — extract high-upvote fan theories or discussions for content research
SEO research — extract and analyze top-ranking articles in bulk
RAG knowledge bases — clean Markdown is ideal for vector embeddings

Example input

{
  "urls": [
    "https://www.reddit.com/r/FanTheories/comments/abc123/my_theory/",
    "https://screenrant.com/some-article/"
  ],
  "maxRedditComments": 10,
  "includeEngagementData": true
}
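
Because Reddit threads and articles take different extraction paths, each input URL has to be routed first. The sketch below shows one plausible way to do that; the function name is hypothetical, and the `"article"` label is an assumption, since the example output in this listing only shows `"reddit"` as a `sourceType` value.

```python
from urllib.parse import urlparse


def classify_source(url: str) -> str:
    """Return "reddit" for Reddit thread URLs and "article" for everything else."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.hostname:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")
    host = parsed.hostname.lower()
    is_reddit = host == "reddit.com" or host.endswith(".reddit.com")
    # Thread permalinks contain "/comments/"; other Reddit pages fall through.
    if is_reddit and "/comments/" in parsed.path:
        return "reddit"
    return "article"
```

Validating URLs before a run also avoids spending billable pages on malformed entries.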

Example output

{
  "url": "https://www.reddit.com/r/FanTheories/comments/abc123/",
  "sourceType": "reddit",
  "title": "Theory: The ending means something else entirely",
  "markdown": "# Theory: The ending...\n\nFull post body...\n\n## Top Comments\n\n...",
  "metadata": {
    "wordCount": 847,
    "estimatedReadTime": 4,
    "engagementSignal": 3200
  },
  "redditSpecific": {
    "subreddit": "FanTheories",
    "upvotes": 3200,
    "commentCount": 143,
    "topComments": [{ "body": "...", "upvotes": 412 }]
  }
}
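
On the consuming side, a small sketch of how items shaped like the example output might be filtered by engagement and turned into LLM prompts. The helper names and the threshold are hypothetical; only the field names (`url`, `sourceType`, `markdown`, `metadata.wordCount`, `metadata.engagementSignal`) come from the example above, and items without an engagement value are treated as zero by assumption.

```python
from typing import Any


def engagement(item: dict[str, Any]) -> int:
    """Read metadata.engagementSignal, treating a missing or null value as 0."""
    return item.get("metadata", {}).get("engagementSignal") or 0


def top_items(items: list[dict[str, Any]], min_engagement: int) -> list[dict[str, Any]]:
    """Keep items at or above the threshold, highest engagement first."""
    kept = [item for item in items if engagement(item) >= min_engagement]
    return sorted(kept, key=engagement, reverse=True)


def build_prompt(item: dict[str, Any], instruction: str) -> str:
    """Prefix the extracted Markdown with an instruction and a one-line source header."""
    words = item.get("metadata", {}).get("wordCount", "unknown")
    header = f"Source: {item['url']} ({item['sourceType']}, {words} words)"
    return f"{instruction}\n\n{header}\n\n{item['markdown']}"
```

For example, `top_items(dataset_items, 1000)` followed by `build_prompt(item, "Summarize the main theory.")` would yield one prompt per high-engagement thread.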

n8n integration

Use the native Apify n8n node to trigger this actor on a schedule:

1. Schedule Trigger: weekly or daily
2. Apify: Run Actor, passing your URL list
3. Apify: Get Dataset, to fetch results
4. Loop + LLM node: Claude/GPT analysis prompt
5. Google Sheets / Notion: store content briefs
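
Outside n8n, the run-and-fetch steps above can be approximated with a single call to Apify's REST API. The sketch below is a minimal illustration, not an official client: the function names are hypothetical, and the actor ID must be supplied by you in the `username~actor-name` form, because this actor's identifier does not appear in this listing.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build a POST that runs an actor synchronously and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id: str, token: str, run_input: dict, timeout: float = 300) -> list:
    """Send the request and decode the JSON array of dataset items."""
    request = build_run_request(actor_id, token, run_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

The synchronous endpoint suits short URL lists; longer runs are better started asynchronously and polled, which is effectively what the separate Run Actor and Get Dataset nodes do.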

Pricing

Pay-per-page: $0.005 per URL processed. First 20 pages free.
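
A quick way to budget a run from these numbers is sketched below. It assumes the 20 free pages are deducted once from the total being estimated; the listing does not say whether the allowance applies per run, per month, or per account, so treat the result as an upper-level estimate.

```python
from decimal import Decimal

PRICE_PER_PAGE = Decimal("0.005")  # USD per URL processed, from the listing
FREE_PAGES = 20                    # free allowance, from the listing


def estimate_cost(pages: int) -> Decimal:
    """Return the estimated charge in USD for processing `pages` URLs."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    billable = max(0, pages - FREE_PAGES)
    return billable * PRICE_PER_PAGE
```

Under that assumption, 1,000 URLs leave 980 billable pages, or $4.90, slightly below the headline $5.00 per 1,000 pages.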

Supported sources

Reddit (all subreddits via JSON API)
Screen Rant, CBR, IGN, Variety, Hollywood Reporter
Any article-based news or blog site
Custom CSS selectors to strip site-specific noise
