Reddit Lead Scraper — Emails, Socials & Contact Info

Pricing

from $2.00 / 1,000 results


Turn Reddit into a B2B lead list. Keep only records that expose a contact signal — email, social profile or external website — found across posts, comments and user profiles. AI-ready text included; no login or developer token needed.

Rating: 0.0 (0)

Developer: Black Falcon Data

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 1
  • Monthly active users: 0
  • Last modified: 5 hours ago

What does Reddit Lead Scraper do?

Reddit Lead Scraper turns Reddit discussions into a B2B lead list. Point it at any subreddit, post, profile, or keyword search and it returns only the records that carry a contact signal — an email, a social profile (LinkedIn, X, GitHub, Instagram and more), or an external website — mined from post bodies, comment threads, and user profiles. Each lead includes the contact fields plus the surrounding text in clean text / HTML / Markdown. No Reddit account, login, or API key required.

New to Apify? Sign up free and use the included $5 monthly platform credit to test this actor.

Key features

  • 🎯 Contact-bearing records only — returns only Reddit records that carry an email, a social profile, or an external website, so every row in your dataset is an actual lead. You pay for leads, not noise.
  • 📇 Emails, socials & websites — each lead carries an emails array, a structured socialProfiles map (LinkedIn, X, GitHub, Instagram…), and extractedUrls, parsed from posts, comments, and profiles.
  • 🔎 Search, subreddits & profiles — build lead lists from a subreddit feed, a keyword search across Reddit, a single thread, or specific user profiles — mix and match by dropping in any Reddit URL.
  • 💬 Comment-thread coverage — contacts are often buried in replies, so the actor scans nested comment threads (with depth and volume controls), not just top-level posts.
  • 🧩 Lead context included — every lead keeps the post or comment it came from, in clean text / HTML / Markdown, so you know who to contact and why before you reach out.
  • 🤖 AI-ready & automatable — structured JSON output drops straight into CRMs, enrichment tools, LLMs, and automation pipelines.
  • 🧹 Lean, flexible output — strip empty fields and pick a single description format to keep lead lists small and import-ready.
  • 🔑 No login or API key required — point the actor at any public Reddit URL or search term and run; no Reddit account or app registration needed.

What data can you extract from reddit.com?

This actor returns only records that carry a contact signal — an email, a social profile, or an external (non-Reddit) website. Every other record is skipped, so each row in your dataset is an actual lead. Each record keeps a stable itemType (post, comment, or user) so you can tell the source apart inside a single dataset.

  • emails — de-duplicated, lowercased emails from the body text and mailto: links (tracking / no-reply / asset false positives filtered out).
  • socialProfiles — structured map: linkedin, twitter, github, instagram, facebook, youtube, tiktok, xing, bluesky, threads, mastodon.
  • extractedUrls — outbound (non-Reddit) website links mentioned in the content.
  • Context fields: title, body as text / HTML / Markdown, score, author, community, createdAt, and the canonical url, so you can see where each lead came from.

Leads are richest in business-oriented communities such as r/forhire, r/freelance, r/Entrepreneur, r/startups, and r/SaaS. Point the actor at the subreddits or searches where your audience is active.

Input

Configure the actor through the input schema in Apify Console.

Key parameters:

  • startUrls — Reddit URLs to scrape — subreddits, post pages, user profiles, community pages, or search result pages. Each URL determines what type of content is fetched.
  • searchTerms — Search Reddit for these terms. Each entry becomes an independent search. Search posts are lightweight discovery records (plus their comments) — see Search Type.
  • searchType — Type of results to return when using Search Terms. Post results are lightweight discovery records — id, url, title, subreddit and NSFW flag — plus their comment threads; scrape a result's URL directly for its full post fields (author, body, score, timestamp). (default: "posts")
  • sort — Sort order for posts and search results. (default: "hot")
  • time — Restrict subreddit-feed results to a time window (applies to Top sort on feeds; search is not time-windowed). (default: "all")
  • includeNSFW — Include posts and communities marked as NSFW (18+). (default: false)
  • postDateLimit — Skip posts older than this ISO-8601 date (e.g. "2024-01-01"). Applies to subreddit feeds and post URLs; search results carry no date and are not filtered. Leave blank for no date limit.
  • maxItems — Maximum total records to save across all sources (posts, comments, users, communities). (default: 100)
  • maxComments — Maximum number of comments to collect from each post page. (default: 200)
  • includeCollapsed — Expand and include comments that are initially collapsed (controversial or low-score). Enables deeper thread coverage, up to the comment and depth limits you set. (default: true)
  • commentDepth — Maximum reply nesting depth to collect (1 = top-level only). (default: 10)
  • skipComments — Do not collect comments from post pages — output posts only. (default: false)
  • ...and 5 more parameters
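The parameters above can be assembled programmatically before a run. The following is a minimal sketch, not part of the actor: `build_input` and `DEFAULTS` are hypothetical helper names, the defaults are copied from the list above, and the rule that at least one URL or search term must be present is an assumption about what a useful run needs.

```python
from datetime import date

# Defaults copied from the parameter list above (only the documented ones).
DEFAULTS = {
    "searchType": "posts",
    "sort": "hot",
    "time": "all",
    "includeNSFW": False,
    "maxItems": 100,
    "maxComments": 200,
    "includeCollapsed": True,
    "commentDepth": 10,
    "skipComments": False,
}


def build_input(start_urls=(), search_terms=(), post_date_limit=None, **overrides):
    """Assemble an input dict, keeping only values that differ from the defaults."""
    # Assumption: a run needs at least one source to be useful.
    if not start_urls and not search_terms:
        raise ValueError("provide at least one start URL or search term")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    run_input = {}
    if start_urls:
        # startUrls entries are objects with a "url" key, as in the input examples.
        run_input["startUrls"] = [{"url": u} for u in start_urls]
    if search_terms:
        run_input["searchTerms"] = list(search_terms)
    if post_date_limit is not None:
        date.fromisoformat(post_date_limit)  # raises ValueError if not an ISO date
        run_input["postDateLimit"] = post_date_limit
    run_input.update({k: v for k, v in overrides.items() if DEFAULTS[k] != v})
    return run_input
```

Calling `build_input(start_urls=["https://www.reddit.com/r/forhire/"], sort="new", maxItems=50, maxComments=50)` yields the same structure as the first input example in the next section.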

Input examples

Leads from a hiring subreddit — Scan a freelance/hiring subreddit's posts and comments for contacts.

→ Only r/forhire records that carry an email, social profile, or website.

{
  "startUrls": [
    {
      "url": "https://www.reddit.com/r/forhire/"
    }
  ],
  "sort": "new",
  "maxItems": 50,
  "maxComments": 50
}

Keyword lead search — Run keyword searches across Reddit and keep only contact-bearing posts.

→ Posts matching the search terms that expose a contact signal.

{
  "searchTerms": [
    "looking for a developer"
  ],
  "searchType": "posts",
  "sort": "new",
  "maxItems": 100
}

Subreddit + comment leads — Founder and entrepreneur threads share contacts deep in the comments.

→ Contact-bearing posts and comments from r/Entrepreneur.

{
  "startUrls": [
    {
      "url": "https://www.reddit.com/r/Entrepreneur/"
    }
  ],
  "maxItems": 50,
  "maxComments": 200
}

Output

Each run produces a dataset of structured Reddit records. Results can be downloaded as JSON, CSV, or Excel from the Dataset tab in Apify Console.

Example Reddit record

{
  "itemType": "post",
  "id": "t3_1ttjtwv",
  "url": "https://www.reddit.com/r/programming/comments/1ttjtwv/your_process_memory_is_a_file_the/",
  "title": "Your process' memory is a file: The underappreciated gem that is /proc/<pid>/mem",
  "body": null,
  "bodyHtml": null,
  "contentHref": "https://lcamtuf.substack.com/p/weekend-trivia-your-process-memory",
  "postType": "link",
  "language": "en",
  "score": 129,
  "upvoteRatio": 0.9708029197080292,
  "numComments": 1,
  "awardCount": 0,
  "author": "mttd",
  "authorId": "t2_6gkbb",
  "community": "r/programming",
  "communityId": "t5_2fwo",
  "createdAt": "2026-06-01T08:32:12.581+02:00",
  "icon": "https://www.redditstatic.com/avatars/defaults/v2/avatar_default_7.png",
  "nsfw": false
}

Example lead record (post with contact)

{
  "itemType": "post",
  "id": "t3_1abc234",
  "url": "https://www.reddit.com/r/Entrepreneur/comments/1abc234/launching_my_saas_feedback_welcome/",
  "title": "Launched my SaaS — feedback welcome",
  "descriptionText": "Spent 6 months building this. Would love feedback — site is https://acme.io, I'm on https://www.linkedin.com/in/janedoe, or email jane@acme.io.",
  "score": 48,
  "author": "jane_builds",
  "community": "r/Entrepreneur",
  "createdAt": "2026-06-10T09:22:00.000Z",
  "emails": [
    "jane@acme.io"
  ],
  "extractedUrls": [
    "https://acme.io",
    "https://www.linkedin.com/in/janedoe"
  ],
  "socialProfiles": {
    "linkedin": "https://www.linkedin.com/in/janedoe",
    "twitter": null,
    "github": null
  }
}
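A lead record like the one above can be flattened into one CSV row per lead for CRM import. This is an illustrative sketch only: `lead_rows`, `to_csv`, and the column names in `FIELDNAMES` are hypothetical choices, and the code assumes the field names shown in the example record (`emails`, `extractedUrls`, `socialProfiles`).

```python
import csv
import io

# Hypothetical column layout for the flattened lead file.
FIELDNAMES = ["itemType", "author", "community", "source_url",
              "emails", "linkedin", "websites"]


def lead_rows(records):
    """Yield one flat dict per record, joining list fields with ';'."""
    for rec in records:
        emails = rec.get("emails") or []
        # Drop null entries such as "twitter": null.
        socials = {k: v for k, v in (rec.get("socialProfiles") or {}).items() if v}
        social_urls = set(socials.values())
        # Treat extracted URLs that are not social profiles as websites.
        websites = [u for u in (rec.get("extractedUrls") or []) if u not in social_urls]
        yield {
            "itemType": rec.get("itemType"),
            "author": rec.get("author"),
            "community": rec.get("community"),
            "source_url": rec.get("url"),
            "emails": ";".join(emails),
            "linkedin": socials.get("linkedin", ""),
            "websites": ";".join(websites),
        }


def to_csv(records):
    """Return the flattened leads as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDNAMES)
    writer.writeheader()
    writer.writerows(lead_rows(records))
    return buf.getvalue()
```

Separating social profile links from other extracted URLs keeps the website column free of duplicates when a profile URL appears in both fields, as it does in the example record.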

How to scrape reddit.com

  1. Go to Reddit Lead Scraper in Apify Console.
  2. Configure the input: add Reddit URLs under startUrls or keywords under searchTerms.
  3. Set maxItems to control how many results you need.
  4. Click Start and wait for the run to finish.
  5. Export the dataset as JSON, CSV, or Excel.

Use cases

  • Build B2B lead lists from freelance, hiring, founder, and SaaS communities.
  • Find prospects who share a LinkedIn, GitHub, or website in relevant threads.
  • Source freelancers and candidates posting in r/forhire and r/freelance.
  • Enrich an existing CRM with emails, social profiles, and websites from target subreddits.
  • Monitor specific subreddits on a schedule and collect new leads as they appear.
  • Combine emails, social handles, and websites into a single ready-to-import lead file.

How much does it cost to scrape reddit.com?

Reddit Lead Scraper uses pay-per-event pricing. You pay a small fee when the run starts and then for each result that is actually produced.

  • Run start: $0.008 per run
  • Per result: $0.002 per Reddit record

Example costs:

  • 10 results: $0.028
  • 25 results: $0.058
  • 100 results: $0.208
  • 200 results: $0.408
  • 500 results: $1.008
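The example costs follow from one run-start fee plus a per-result fee. A small sketch of that arithmetic, using the two prices listed above (`run_cost` is a hypothetical helper name, and `Decimal` is used to avoid floating-point rounding noise):

```python
from decimal import Decimal

RUN_START_FEE = Decimal("0.008")   # charged once per run
PER_RESULT_FEE = Decimal("0.002")  # charged per Reddit record produced


def run_cost(results: int) -> Decimal:
    """Return the pay-per-event cost in USD for a run producing `results` records."""
    if results < 0:
        raise ValueError("results must be non-negative")
    return RUN_START_FEE + PER_RESULT_FEE * results
```

For example, `run_cost(10)` evaluates to `Decimal("0.028")`, matching the first entry in the list.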

FAQ

How many results can I get from reddit.com?

The number of results depends on the subreddits, threads, profiles, or search terms you target and on how many of the records found there expose a contact signal. Use the maxItems parameter to cap how many results are saved per run.

Can I integrate Reddit Lead Scraper with other apps?

Yes. Reddit Lead Scraper works with Apify's integrations to connect with tools like Zapier, Make, Google Sheets, Slack, and more. You can also use webhooks to trigger actions when a run completes.

Can I use Reddit Lead Scraper with the Apify API?

Yes. You can start runs, manage inputs, and retrieve results programmatically through the Apify API. Client libraries are available for JavaScript, Python, and other languages.
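As a rough sketch of what that can look like with the Python client (`apify-client` on PyPI): the actor ID and token are placeholders you would take from Apify Console, and `fetch_leads` is a hypothetical helper name, not something this actor ships. The optional `client` argument exists only so the function can be exercised without network access.

```python
def fetch_leads(run_input, actor_id, token=None, client=None):
    """Start a run, wait for it to finish, and return its dataset items as a list."""
    if client is None:
        # Imported lazily so the module loads even without the package installed.
        from apify_client import ApifyClient  # pip install apify-client
        client = ApifyClient(token)

    # call() starts the actor and blocks until the run finishes.
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("actor run did not return run details")

    # Each run writes its records to a default dataset.
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

A call might look like `fetch_leads({"searchTerms": ["looking for a developer"], "maxItems": 100}, "<ACTOR_ID>", token="<APIFY_TOKEN>")`, with both placeholders replaced by your own values.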

Can I use Reddit Lead Scraper through an MCP Server?

Yes. Apify provides an MCP Server that lets AI assistants and agents call this actor directly. Use a single descriptionFormat and excludeEmptyFields to keep payloads manageable for LLM context windows.

This actor extracts publicly available data from reddit.com. Web scraping of public information is generally considered legal, but you should always review the target site's terms of service and ensure your use case complies with applicable laws and regulations, including GDPR where relevant.

Your feedback

If you have questions, need a feature, or found a bug, please open an issue on the actor's page in Apify Console. Your feedback helps us improve.

Getting started with Apify

New to Apify? Create a free account with $5 credit — no credit card required.

  1. Sign up — $5 platform credit included
  2. Open this actor and configure your input
  3. Click Start — export results as JSON, CSV, or Excel

Need more later? See Apify pricing.