Pricing

from $0.35 / 1,000 posts

Substack Newsletter Content Scraper

Scrape Substack newsletter posts, authors, dates, likes, comments, restacks, and article text. Built for content research, competitor tracking, and AI-ready datasets.

Pricing

from $0.35 / 1,000 posts

Rating

2.6

(2)

Developer

LIAICHI MUSTAPHA

Actor stats

Bookmarked

Total users

Monthly active users

5 hours ago

Last modified

Features

Extracts headline, subheading, author, date, post URL, and newsletter URL
Collects publicly visible likes, comments, and restacks
Returns full text for accessible posts and preview text for paywalled posts
Discovers posts through sitemaps with an archive-page fallback
Processes multiple newsletters in bounded batches
Controls post concurrency to reduce timeouts
Optionally builds an HTML newsletter digest

Use Cases

Newsletter research: compare publishing frequency, topics, and engagement
Competitive intelligence: monitor public posts from newsletters in a niche
Content analysis: identify headlines and formats associated with engagement
Media monitoring: create scheduled snapshots of selected publications
AI datasets: collect public article text with source and access metadata

Input

Field	Type	Default	Description
`substackUrls`	array	required	Substack publication URLs
`scrapingMethod`	string	`sitemap`	`sitemap` or `archive` discovery
`maxPostsPerSubstack`	integer	`50`	Maximum posts per publication; `0` means unlimited
`batchSize`	integer	`5`	Publications processed in each batch
`postConcurrency`	integer	`3`	Post pages processed in parallel
`generateNewsletterDigest`	boolean	`false`	Save an optional HTML digest preview

{
  "substackUrls": ["https://tedhope.substack.com"],
  "scrapingMethod": "sitemap",
  "maxPostsPerSubstack": 50,
  "batchSize": 5,
  "postConcurrency": 3,
  "generateNewsletterDigest": false
}

Output

Each dataset item represents one Substack post:

{
  "substack_url": "https://tedhope.substack.com",
  "post_url": "https://tedhope.substack.com/p/example-post",
  "headline": "Example post",
  "subheading": "A public post subtitle",
  "author_name": "Ted Hope",
  "author_url": "https://substack.com/@example",
  "date": "July 10, 2026",
  "free_or_paid": "Free",
  "likes": 156,
  "comments": 23,
  "restacks": 12,
  "article_text": "Available article text...",
  "content_type": "full"
}

content_type is full, preview_only, or failed. The STATS record reports publication count, post count, limits, method, and concurrency.

How to Use

Open the Actor and click Try for free.
Add one or more Substack publication URLs.
Keep sitemap as the preferred discovery method.
Start with maxPostsPerSubstack: 10 for a quick test.
Reduce postConcurrency if a large run encounters timeouts.
Start the run and export the dataset from the Output tab.

from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("scraper_guru/substack-scraper").call(run_input={
    "substackUrls": ["https://tedhope.substack.com"],
    "maxPostsPerSubstack": 50,
    "postConcurrency": 3,
})

Pricing

This is a pay-per-event Actor. The base price is $0.0005 per saved post, or $0.50 per 1,000 posts. Apify plan discounts can reduce the per-post price to $0.00035, and any additional platform usage is shown by Apify.

FAQ

Can it extract full paywalled posts?
No. It returns only the public preview unless the full article is publicly accessible.

Which discovery method should I use?
Use sitemap first. Choose archive when a publication does not expose usable sitemap entries.

How do I collect only recent posts?
Use a small maxPostsPerSubstack. Discovered post URLs are processed in source order, which is typically newest first but can vary by publication.

Why can engagement fields be zero?
The source may show no engagement or may not expose a metric on that page layout.

Is scraping Substack legal?
Collect only public content, respect publisher rights and Substack's terms, and comply with copyright and data-protection rules.

For page-layout changes or extraction problems, open the Issues tab on this Actor.

Substack Scraper - Download Newsletter Content Fast

stanvanrooy6/substack-scraper

Substack scraper for newsletters. Extract posts with titles, dates, authors, tags, and reactions.

Stan Van Rooy

Substack Scraper — Posts, Authors & Newsletters

cryptosignals/substack-scraper

Extract Substack newsletter content. Get post titles, authors, publish dates, paywall status, subscriber counts, and full article text. Ideal for newsletter research and content monitoring. PPE pricing — pay only for results.

Web Data Labs

Substack Scraper

automation-lab/substack-scraper

Scrape Substack newsletters — posts, comments, publication metadata. Full archive depth with no caps. Export to JSON, CSV, Excel, or connect via API.

Stas Persiianenko

237

Substack Posts Scraper 📚

easyapi/substack-posts-scraper

Scrape Substack posts and articles by keywords. Extract comprehensive post data including title, author, publication details, podcast information, reactions, and more. Perfect for content analysis and research.

EasyApi

200

1.9

Substack Newsletter Scraper

digispruce/substack-scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

Akram

4.0

Substack Scraper

qpayre/substack-scraper

The Substack Author Scraper is a powerful Apify actor that makes it easy for content creators to scrape and retrieve all posts from their favorite Substack authors. With structured data presented in a user-friendly format, analyzing and processing valuable information has never been easier.

QPS

458

Substack Scraper | All-In-One

fatihtahta/substack-scraper

Get full articles, user profiles, and search results with All-in-One Substack Scraper. Extract rich data including titles, bios, subscriber counts, social links and engagement metrics. ideal for market research, creator discovery, trend tracking, and audience analysis.

Fatih Tahta

174

Substack Scraper - Newsletters, Posts & Authors

logiover/substack-newsletter-scraper

Substack API alternative: scrape newsletters, posts & authors without login. Export Substack data to CSV/JSON. No key, no proxy.

Logiover

Substack Newsletter Scraper

dataharvest/substack-scraper

Scrape Substack newsletters, posts and comments.

Alex v

Substack Scraper: Posts, Comments & Authors

doggo/substack-scraper-posts-comments-authors

Scrape any Substack publication: post archives, article text, comments, author profiles and subscriber signals. Search across newsletters and export structured data for research, monitoring and AI datasets. No browser. Output to CSV, JSON or Excel.