Pricing

Pay per event

Substack Newsletter Scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

Rating: 4.0 (1)

Developer: Akram

Last modified: 2 months ago

🚀 Key Features

📊 Publication Metadata: Extract title, description, author info, author ID, categories, and subscriber counts (when publicly visible)
📧 Email Discovery: Automatically find author contact emails from About pages and other content
🔗 Social Media Links: Extract Twitter, LinkedIn, Facebook, Instagram, and YouTube profiles
📝 Flexible Post Scraping: Choose from 3 modes: no posts, post metadata only, or full article content
📈 Engagement Metrics: Extract likes, comments, and word counts for each post
⚡ Batch Processing: Process multiple newsletters in a single run
💰 Pay-per-Event: Only pay for successfully processed newsletters

📋 Input Parameters

Required

newsletterUrls (array): List of Substack newsletter URLs to scrape
- Example: [{"url": "https://platformer.substack.com"}, {"url": "https://lennysnewsletter.com"}]
- Supports both *.substack.com URLs and custom domains
- Process multiple newsletters in a single run

Optional

postScrapingMode (string, default: "information_and_content"): Controls what post data to extract
- "none" - Newsletter metadata only (fastest, minimal data usage)
- "information" - Newsletter metadata + post info (title, date, engagement, preview)
- "information_and_content" - Newsletter metadata + post info + full article content
maxPostsPerNewsletter (number, default: 12, max: 12): Number of recent posts to scrape per newsletter (only applies when postScrapingMode is not "none")
delayBetweenRequests (number, default: 3000, range: 500-10000): Delay in milliseconds between HTTP requests to avoid rate limiting
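
The limits listed above can be checked before a run is started. The following is a minimal Python sketch, not part of the Actor itself: the helper name `build_input` is hypothetical, and the lower bound of 1 for maxPostsPerNewsletter and the non-empty URL check are assumptions, since the parameter list only states a maximum of 12 and marks newsletterUrls as required.

```python
from typing import Any

# The three documented values of postScrapingMode.
POST_SCRAPING_MODES = ("none", "information", "information_and_content")


def build_input(
    urls: list[str],
    post_scraping_mode: str = "information_and_content",
    max_posts_per_newsletter: int = 12,
    delay_between_requests: int = 3000,
) -> dict[str, Any]:
    """Build an input object, rejecting values outside the documented limits."""
    # Assumption: a required array should contain at least one URL.
    if not urls:
        raise ValueError("newsletterUrls needs at least one URL")
    if post_scraping_mode not in POST_SCRAPING_MODES:
        raise ValueError(f"postScrapingMode must be one of {POST_SCRAPING_MODES}")
    # Assumption: the lower bound of 1 is not documented; only max 12 is.
    if not 1 <= max_posts_per_newsletter <= 12:
        raise ValueError("maxPostsPerNewsletter must be between 1 and 12")
    if not 500 <= delay_between_requests <= 10000:
        raise ValueError("delayBetweenRequests must be between 500 and 10000 ms")
    return {
        # Each URL is wrapped in an object with a "url" key, as in the example.
        "newsletterUrls": [{"url": u} for u in urls],
        "postScrapingMode": post_scraping_mode,
        "maxPostsPerNewsletter": max_posts_per_newsletter,
        "delayBetweenRequests": delay_between_requests,
    }
```

Calling `build_input(["https://platformer.substack.com"], post_scraping_mode="none")` returns a dictionary that can be serialized with `json.dumps` and used as the run input.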

💡 Example Input

{
  "newsletterUrls": [
    { "url": "https://platformer.substack.com" },
    { "url": "https://lennysnewsletter.com" },
    { "url": "https://andrewsullivan.substack.com" }
  ],
  "postScrapingMode": "information_and_content",
  "maxPostsPerNewsletter": 12,
  "delayBetweenRequests": 3000
}

📊 Output Structure

The output structure varies based on the postScrapingMode setting:

Mode 1: `"none"` - Newsletter Metadata Only

{
  "author_id": 241262,
  "author_name": "Casey Newton",
  "author_handle": "platformer",
  "author_bio": "Casey Newton is the founder and editor of Platformer...",
  "author_photo_url": "https://substack-post-media.s3.amazonaws.com/...",
  "email": "casey@platformer.news",
  "twitter_url": "https://twitter.com/CaseyNewton",
  "linkedin_url": null,
  "facebook_url": "https://www.facebook.com/...",
  "instagram_url": "https://instagram.com/crumbler",
  "website_url": "https://www.platformer.news",
  "publication_id": 7976,
  "publication_name": "Platformer",
  "publication_subdomain": "platformer",
  "publication_url": "https://www.platformer.news",
  "publication_description": "News at the intersection of Silicon Valley and democracy...",
  "publication_logo_url": "https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984...",
  "publication_custom_domain": "www.platformer.news",
  "publication_created_at": "2019-03-29T13:28:21.009Z",
  "subscriber_count": 176000,
  "subscriber_count_string": "176K+",
  "subscriber_count_visible": true,
  "follower_count": 209469,
  "is_paid_newsletter": false,
  "payments_enabled": false,
  "founding_plan_name": "Mystery Tier",
  "is_active": true,
  "has_posts": true,
  "has_likes": true,
  "profile_set_up_at": "2021-04-22T18:51:48.648Z",
  "scraped_at": "2025-11-07T19:19:13.110957+00:00"
}
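
For outreach work, a record like the one above can be reduced to a handful of contact columns. This is a rough Python sketch under assumptions: the field names are taken from the sample record, the column selection and the helper names `to_contact_row` and `to_csv` are hypothetical, and null values such as `linkedin_url` are written as empty strings.

```python
import csv
import io
from typing import Any

# Hypothetical column selection, using field names from the sample record.
CONTACT_FIELDS = [
    "publication_name",
    "publication_url",
    "author_name",
    "email",
    "twitter_url",
    "linkedin_url",
    "subscriber_count",
]


def to_contact_row(record: dict[str, Any]) -> dict[str, Any]:
    """Keep only the contact columns; missing or null values become ""."""
    row = {}
    for field in CONTACT_FIELDS:
        value = record.get(field)
        row[field] = "" if value is None else value
    return row


def to_csv(records: list[dict[str, Any]]) -> str:
    """Render the contact rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=CONTACT_FIELDS)
    writer.writeheader()
    for record in records:
        writer.writerow(to_contact_row(record))
    return buffer.getvalue()
```

Fields that are absent from a record are treated the same way as null ones, so the sketch does not depend on every newsletter exposing every social link.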

Mode 2: `"information"` - Newsletter + Post Information

Includes all fields from Mode 1, plus a posts array:

{
  // ... all newsletter metadata from Mode 1 ...
  "posts": [
    {
      "url": "https://www.platformer.news/trump-won-heres-what-comes-next/",
      "title": "Trump won. Here's what comes next.",
      "subtitle": "What a second Trump administration could mean for the internet",
      "date": "2024-11-07",
      "author": "Casey Newton",
      "likes": 13,
      "comments_count": 43,
      "content_preview": "Good morning, and wow. Donald Trump won a second term as president...",
      "word_count": 2051
    }
    // ... up to maxPostsPerNewsletter posts ...
  ]
}

Mode 3: `"information_and_content"` - Full Article Content

Includes all fields from Mode 2, plus a content field with the full text of each post:

{
  // ... all newsletter metadata ...
  "posts": [
    {
      "url": "https://www.platformer.news/trump-won-heres-what-comes-next/",
      "title": "Trump won. Here's what comes next.",
      "subtitle": "What a second Trump administration could mean for the internet",
      "date": "2024-11-07",
      "author": "Casey Newton",
      "likes": 13,
      "comments_count": 43,
      "content_preview": "Good morning, and wow. Donald Trump won a second term as president...",
      "word_count": 2051,
      "content": "Good morning, and wow. Donald Trump won a second term as president...\n\n[Full 2000+ word article text extracted here]\n\n..."
    }
    // ... up to maxPostsPerNewsletter posts with full content ...
  ]
}
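
Because the presence of posts and content depends on the mode, downstream code should not assume either key exists. The sketch below is one way to summarize a record from any of the three modes; the function name `summarize_posts` and the shape of the returned summary are hypothetical, and the per-post field names are taken from the samples above.

```python
from typing import Any


def summarize_posts(record: dict[str, Any]) -> dict[str, Any]:
    """Summarize a record's posts, tolerating a missing "posts" or "content" key."""
    # Mode "none" records carry no posts array, so fall back to an empty list.
    posts = record.get("posts") or []
    # Only "information_and_content" records are expected to carry "content".
    has_full_content = any("content" in post for post in posts)
    total_likes = sum(post.get("likes") or 0 for post in posts)
    total_comments = sum(post.get("comments_count") or 0 for post in posts)
    word_counts = [post["word_count"] for post in posts if post.get("word_count")]
    average_word_count = sum(word_counts) / len(word_counts) if word_counts else None
    return {
        "post_count": len(posts),
        "has_full_content": has_full_content,
        "total_likes": total_likes,
        "total_comments": total_comments,
        "average_word_count": average_word_count,
    }
```

For a Mode 1 record the summary reports zero posts and no average, rather than raising a `KeyError`.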

🔧 Use Cases

B2B Outreach: Build targeted lists of newsletter creators with contact information for partnership opportunities
Market Research: Analyze newsletter subscriber counts, engagement metrics, and publishing frequency
Competitive Analysis: Track competitor newsletters, their growth, and content strategies
Content Analysis: Extract full article content for competitive research, trend analysis, or content strategy insights
Lead Generation: Find potential customers or partners in specific niches based on newsletter topics and engagement
Training Data Collection: Gather high-quality written content for AI/ML applications (respect copyright and usage rights)
Publishing Patterns: Analyze posting frequency, optimal publishing times, and content length strategies
Engagement Research: Study which topics, titles, and content formats drive the most likes and comments
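
As an illustration of the publishing-pattern and engagement use cases, the following Python sketch works on a posts array like the ones shown in the output samples. It assumes that date values follow the YYYY-MM-DD form seen there; the function names and the simple "likes plus comments" score are hypothetical choices, not something the Actor computes.

```python
from datetime import date
from typing import Any, Optional


def average_days_between_posts(posts: list[dict[str, Any]]) -> Optional[float]:
    """Average gap in days between consecutive posts, or None with fewer than two dates."""
    # Assumption: "date" is an ISO calendar date such as "2024-11-07".
    dates = sorted(date.fromisoformat(p["date"]) for p in posts if p.get("date"))
    if len(dates) < 2:
        return None
    span_days = (dates[-1] - dates[0]).days
    return span_days / (len(dates) - 1)


def top_posts_by_engagement(
    posts: list[dict[str, Any]], limit: int = 3
) -> list[dict[str, Any]]:
    """Rank posts by likes plus comments (a hypothetical score), highest first."""
    def score(post: dict[str, Any]) -> int:
        return (post.get("likes") or 0) + (post.get("comments_count") or 0)

    return sorted(posts, key=score, reverse=True)[:limit]
```

With at most 12 posts per newsletter, the average gap describes only recent publishing cadence, not the full history of a publication.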

💰 Pricing

This Actor uses pay-per-event pricing:

You're charged only for newsletters that are successfully processed
Failed URLs don't incur charges
Transparent usage-based billing

🏷️ Tags

Newsletter scraping, Substack, B2B outreach, lead generation, market research, email discovery, social media extraction
