Pricing

from $3.00 / 1,000 results

Try for free

Go to Apify Store

Substack Scraper

Try for free

Scrape Substack publications via the public RSS feed of any newsletter. Extract post title, URL, author, publication date, body HTML, categories, and enclosures. HTTP-only with TLS impersonation (no auth, no proxy).

Pricing

from $3.00 / 1,000 results

Rating

5.0

(13)

Developer

Crawler Bros

Actor stats

Bookmarked

Total users

Monthly active users

2 days ago

Last modified

What this actor does

Accepts publication URLs in any form: full URL, custom domain, *.substack.com, or bare slug
Auto-rewrites to <publication>/feed
Parses RSS feed → extracts title / link / pubDate / dc:creator / content:encoded / categories / enclosure
Filters: category, published-after, keyword in title/summary
Optional body HTML inclusion (default on)
Approximate wordCount and readingTimeMinutes
Empty fields are omitted

Output per post

title, url, guid
author — from <dc:creator>
publishedAt — ISO 8601 UTC (parsed from RFC 822 pubDate)
publishedAtRaw — original RFC 822 string
summary — plain-text version of <description> (capped at 500 chars)
bodyHtml — full HTML body from <content:encoded> (when includeBody=true)
wordCount, readingTimeMinutes
categories[]
coverImage — from <enclosure> URL
publication, publicationUrl
recordType: "post", scrapedAt

Input

Field	Type	Default	Description
`publications`	array	`["platformer.news"]`	List of publication URLs / domains / slugs (required)
`categoryAnyOf`	array	`[]`	Match at least one RSS `<category>` tag
`publishedAfter`	string	–	YYYY-MM-DD
`containsKeyword`	string	–	Title/summary contains substring
`includeBody`	bool	`true`	Include full body HTML
`maxItems`	int	`50`	Hard cap (1–1000)

Example: scrape Platformer + Noahpinion

{
  "publications": ["platformer.news", "noahpinion.substack.com"],
  "publishedAfter": "2024-01-01",
  "maxItems": 100
}

Example: filter by keyword

{
  "publications": ["platformer.news"],
  "containsKeyword": "antitrust",
  "includeBody": true
}

Example: bare slugs (auto-resolved to .substack.com)

{
  "publications": ["noahpinion", "thedailyupside"]
}

Use cases

Newsletter intel — track competitor publications, harvest content
Market research — newsletters in your domain (analyst notes, sector reports)
RSS aggregation — consolidate multiple Substacks into a single feed
Content analysis — bulk-export newsletter posts for NLP / topic modeling
Backup — archive your own / a friend's Substack posts

FAQ

Do I need a Substack account? No. The actor only reads public RSS feeds.

Why does it use TLS impersonation? Substack's edge sometimes 403s requests with default Python TLS fingerprint. curl_cffi with chrome131 profile sends a real Chrome handshake, which Substack accepts.

What's the post URL format? https://<publication>/p/<slug>. The actor preserves whatever the RSS feed returns.

Are paid-only posts included? Substack's public RSS includes free posts and the public previews of paid posts. Full paid post content is not accessible without a subscription.

How fresh is the data? Real-time. RSS feeds update within minutes of post publish.

Can I scrape multiple publications in one run? Yes — pass multiple entries in publications. The actor walks each feed sequentially and dedupes by URL.

What if a publication's RSS is blocked / rate-limited? The actor retries with exponential backoff on 403/429/5xx. After 3 retries it skips to the next publication and logs a warning.

Custom-domain Substacks? Yes — pass the custom domain (e.g. platformer.news, stratechery.com). The actor appends /feed regardless of subdomain shape.

Substack Scraper | $2 / 1k | All-In-One

fatihtahta/substack-scraper

Get full articles, user profiles, and search results with All-in-One Substack Scraper. Extract rich data including titles, bios, subscriber counts, social links and engagement metrics. ideal for market research, creator discovery, trend tracking, and audience analysis.

Fatih Tahta

Substack Scraper

qpayre/substack-scraper

The Substack Author Scraper is a powerful Apify actor that makes it easy for content creators to scrape and retrieve all posts from their favorite Substack authors. With structured data presented in a user-friendly format, analyzing and processing valuable information has never been easier.

QPS

443

Substack Posts Scraper 📚

easyapi/substack-posts-scraper

Scrape Substack posts and articles by keywords. Extract comprehensive post data including title, author, publication details, podcast information, reactions, and more. Perfect for content analysis and research.

EasyApi

131

1.9

(2)

Substack Scraper

scraper_guru/substack-scraper

Extract complete data from Substack newsletters including posts, authors, engagement metrics, and article text. 13 fields per post. Fast and reliable.

LIAICHI MUSTAPHA

2.6

(2)

Substack Leaderboard Scraper 📊

easyapi/substack-leaderboard-scraper

Scrape detailed publication data from Substack leaderboards. Get comprehensive insights about top newsletters including subscriber counts, pricing, author details, and more. Perfect for newsletter research and market analysis.

EasyApi

Substack Newsletter Scraper

digispruce/substack-scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

Akram

4.0

(1)

Substack Scraper

automation-lab/substack-scraper

Scrape Substack newsletters — posts with full content, comments with nested replies, and publication metadata. Unlimited archive depth, no proxy needed, 100% success rate. Export to JSON, CSV, Excel.

Stas Persiianenko

112

Substack Publications Scraper 📚

easyapi/substack-publications-scraper

Scrape detailed publication information from Substack based on keywords. Get comprehensive data about newsletters, authors, subscriber counts, and publication metrics in structured JSON format.

EasyApi

1.0

(1)

Substack Scraper

dacoder/substack-scraper

A powerful Apify actor that extracts data from Substack newsletters. Collects author profiles, post content, engagement metrics, and publication details. Perfect for content analysis, archiving, and competitive research. Outputs clean, structured data with clickable links and formatted images.

Da Coder

1.0

(1)

Substack Email Scraper – Advanced, Cheapest & Reliable 📧⚡

contactminerlabs/substack-email-scraper---advanced-cheapest-reliable

🔍 Scrape Substack Emails Enter your search parameters to collect verified contact emails from public Substack profiles, along with profile title, bio, source URL & platform info ✉️📊 Perfect for lead generation, influencer outreach & data enrichment in tools like Google Sheets or CRMs⚡🧩