Substack Newsletter Scraper
Pricing
$19.00/month + usage
Extract newsletter content, subscriber data, and author insights from any Substack publication - no API key required!
Rating: 1.0 (2)
Developer: AutomateLab
Actor stats
Bookmarked: 0
Total users: 47
Monthly active users: 2
Last modified: 8 hours ago
Description
Extract newsletter content, subscriber data, and author insights from any Substack publication - no API key required!
Features
Status context [2026-03-12]: outputSchema description added (spec-005)
How It Works
This actor uses Apify's proxy infrastructure to gather public data from Substack publications. It handles authentication, pagination, and error recovery automatically.
Quick Start
# Run via Apify API
apify run -a red.cars/substack-newsletter-scraper
# Or configure input and click Run on:
# https://apify.com/red.cars/substack-newsletter-scraper
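For readers who prefer to start a run from code, the following is a minimal Python sketch that builds a request against Apify's public REST API using only the standard library. It assumes the actor ID `red.cars/substack-newsletter-scraper` is written with `~` in API paths, that the synchronous `run-sync-get-dataset-items` endpoint suits the run length, and that `APIFY_TOKEN` is an environment variable you define yourself. The helper names are illustrative and not part of the actor.

```python
import json
import urllib.request

APIFY_BASE = "https://api.apify.com/v2"
# Assumption: the store ID "red.cars/substack-newsletter-scraper" with "~" for API paths.
ACTOR_ID = "red.cars~substack-newsletter-scraper"


def build_run_request(token: str, run_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that starts a synchronous run."""
    url = f"{APIFY_BASE}/acts/{ACTOR_ID}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(token: str, run_input: dict):
    """Send the request and return the parsed JSON dataset items."""
    request = build_run_request(token, run_input)
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (performs a real, billable run, so it is left commented out):
# import os
# items = run_actor(os.environ["APIFY_TOKEN"], {
#     "mode": "publication",
#     "urls": ["https://newsletter.substack.com/archive"],
# })
```

Keeping request construction separate from sending makes it possible to inspect the URL and payload before committing to a paid run.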
Using as an MCP Tool
Running on Apify Platform
Set API_KEY in your Apify Console secrets, configure input via the Actor input schema UI, and click Run.
Input Schema
| Parameter | Type | Description | Default |
|---|---|---|---|
| mode | string | Choose what type of data to extract. Options: publication, posts, author, bulk | publication |
| urls | array | List of Substack URLs to scrape. For best results, use archive URLs (e.g., newsletter.substack.com/archive) | https://newsletter.substack.com/archive |
| maxPosts | integer | Maximum number of posts to extract per publication (default: 50) | 20 |
| includeContent | boolean | Extract full post content (may slow down the scraper) | false |
| includePaidPosts | boolean | Include paid/premium posts in results | true |
| sortBy | string | How to sort the extracted posts. Options: newest, oldest, popular | newest |
| proxyType | string | Choose your preferred balance of cost vs reliability. Standard (Datacenter) is faster; Premium (Residential) is most reliable for high-security targets. Options: DATACENTER, RESIDENTIAL | DATACENTER |
| maxResults | integer | The maximum total number of results to extract. | 100 |
| debugMode | boolean | Enable minimal extraction for health checks and testing. Guarantees success within 300s. | false |
| exportFormat | string | Choose output format. Use 'markdown' for token-efficient LLM-ready newsletter content. Options: json, csv, markdown, salesforce, hubspot | json |
| checkOnly | boolean | When true, returns immediately with capability metadata without scraping. Used by AI agents to verify availability before committing to a paid run. Free, <5 seconds. | false |
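The parameters above can be assembled and checked locally before a run is started. The sketch below is one possible way to do that in Python. The option lists and defaults are copied from the table (`maxPosts` uses the value in the Default column), while `build_input` and its validation rules are hypothetical helpers, not something the actor provides.

```python
# Allowed option values, copied from the Input Schema table.
ALLOWED = {
    "mode": {"publication", "posts", "author", "bulk"},
    "sortBy": {"newest", "oldest", "popular"},
    "proxyType": {"DATACENTER", "RESIDENTIAL"},
    "exportFormat": {"json", "csv", "markdown", "salesforce", "hubspot"},
}

# Defaults taken from the table's Default column.
DEFAULTS = {
    "mode": "publication",
    "urls": ["https://newsletter.substack.com/archive"],
    "maxPosts": 20,
    "includeContent": False,
    "includePaidPosts": True,
    "sortBy": "newest",
    "proxyType": "DATACENTER",
    "maxResults": 100,
    "debugMode": False,
    "exportFormat": "json",
    "checkOnly": False,
}


def build_input(**overrides) -> dict:
    """Merge overrides into the defaults and reject values the table does not list."""
    unknown = sorted(set(overrides) - set(DEFAULTS))
    if unknown:
        raise ValueError(f"unknown parameters: {unknown}")
    run_input = {**DEFAULTS, **overrides}
    for key, allowed in ALLOWED.items():
        if run_input[key] not in allowed:
            raise ValueError(f"{key} must be one of {sorted(allowed)}")
    urls = run_input["urls"]
    if not isinstance(urls, list) or not urls:
        raise ValueError("urls must be a non-empty list")
    run_input["urls"] = list(urls)  # copy so the shared default list is never mutated
    for key in ("maxPosts", "maxResults"):
        value = run_input[key]
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            raise ValueError(f"{key} must be a positive integer")
    return run_input


# Example: a free availability probe, then a full-content run in markdown.
probe_input = build_input(checkOnly=True)
full_input = build_input(mode="posts", includeContent=True, exportFormat="markdown")
```

Validating locally catches a misspelled option before it turns into a VALIDATION_ERROR on the platform.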
Output Schema
| Field | Type | Description |
|---|---|---|
| results | string | Substack newsletter posts and publication data. Each result contains: url, success, data (post title, content, author, publishDate, subscriberCount, likes), cleanMarkdown. Use for newsletter intelligence, thought leader monitoring, content competitive analysis, media research. AI agents: supply newsletter subdomain URL or author name for automatic discovery. |
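A short Python sketch of how the `results` field described above might be consumed. It assumes each result is an object with `url`, `success`, `data`, and `cleanMarkdown` keys, that the keys inside `data` are spelled `title`, `author`, and `publishDate`, and that `results` may arrive either as a list or as a JSON-encoded string (the table lists its type as string). The key spellings and the `split_results` helper are assumptions for illustration only.

```python
import json


def split_results(payload: dict):
    """Separate successful results from failed URLs."""
    results = payload.get("results", [])
    if isinstance(results, str):
        # Assumption: a string-typed field holds JSON-encoded results.
        results = json.loads(results)
    succeeded, failed_urls = [], []
    for item in results:
        if item.get("success"):
            data = item.get("data") or {}
            succeeded.append({
                "url": item.get("url"),
                "title": data.get("title"),  # assumed key name for "post title"
                "author": data.get("author"),
                "publishDate": data.get("publishDate"),
                "markdown": item.get("cleanMarkdown"),
            })
        else:
            failed_urls.append(item.get("url"))
    return succeeded, failed_urls


# Hypothetical payload, invented to exercise the function.
sample = {
    "results": [
        {
            "url": "https://newsletter.substack.com/p/first",
            "success": True,
            "data": {"title": "First", "author": "A. Writer", "publishDate": "2026-01-01"},
            "cleanMarkdown": "# First",
        },
        {"url": "https://newsletter.substack.com/p/second", "success": False},
    ]
}
posts, failures = split_results(sample)
```

Collecting failed URLs separately makes it straightforward to resubmit only those in a follow-up run.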
Example Output
{
  "results": [
    {
      "url": "https://example.com/profile",
      "username": "example_user",
      "fullName": "Example User",
      "followersCount": 10000,
      "followingCount": 500,
      "postsCount": 250,
      "isVerified": false,
      "biography": "Example bio text",
      "latestPosts": [],
      "scrapedAt": "2026-03-23T13:24:33.183Z"
    }
  ]
}
Use Cases
- Influencer Research: Identify and evaluate potential influencers by engagement metrics and audience quality
- Competitor Monitoring: Track competitor presence and activity on Substack
- Lead Generation: Build targeted lead lists from public profiles and contact information
- Market Intelligence: Gather market intelligence and trend data for business decisions
- Content Aggregation: Collect content for analysis, archiving, or republication
FAQ
Do I need an API key?
No Substack API key is needed; the actor works from public Substack pages. If your deployment uses the API_KEY secret, set it as a secret environment variable in Apify Console.
How does pricing work?
This actor uses Apify's Pay-Per-Event model. You are charged per successful result returned. No charge for queries that return zero results.
What are the rate limits?
Rate limits depend on your Apify plan. Higher-tier plans provide more compute units and faster extraction.
How do I increase success rate?
- Enable Premium (Residential) proxy for high-security targets
- Reduce concurrency for rate-limited profiles
- Use debug mode to test before full extraction
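The tips above can be combined into a simple escalation routine: probe with debug mode first, and switch from the datacenter proxy to the residential proxy only if the cheaper option fails. The Python sketch below illustrates the idea. `run_actor` is a hypothetical callable that you supply, which takes an input dict, returns results, and raises `RuntimeError` on failure. The actor's real failure behavior may differ.

```python
def run_with_escalation(base_input: dict, run_actor):
    """Try DATACENTER first, then RESIDENTIAL, probing each with debugMode."""
    last_error = None
    for proxy in ("DATACENTER", "RESIDENTIAL"):
        try:
            # Minimal debug-mode probe before committing to the full extraction.
            run_actor({**base_input, "proxyType": proxy, "debugMode": True})
            results = run_actor({**base_input, "proxyType": proxy, "debugMode": False})
            return proxy, results
        except RuntimeError as error:
            last_error = error  # remember the failure and try the next proxy type
    raise RuntimeError("all proxy types failed") from last_error


# Stand-in runner for illustration: fails on the datacenter proxy only.
def fake_runner(run_input: dict):
    if run_input["proxyType"] == "DATACENTER":
        raise RuntimeError("blocked")
    return [] if run_input["debugMode"] else ["post-1", "post-2"]


used_proxy, items = run_with_escalation({"mode": "posts"}, fake_runner)
```

Because the residential proxy is described as the more reliable but costlier option, trying it second keeps routine runs on the cheaper setting.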
Can I run this on a schedule?
Yes, use Apify Scheduler to run this actor on a cron schedule for continuous monitoring.
How do I export to CRM?
Set exportFormat to salesforce or hubspot to produce output in a CRM import format.
Troubleshooting
| Error | Cause | Fix |
|---|---|---|
| NETWORK_ERROR | Network connectivity issue | Check internet connection, retry with proxy |
| VALIDATION_ERROR | Invalid input parameters | Review input schema and retry |
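One way to act on the two error codes in the table is to retry only `NETWORK_ERROR` and fail immediately on `VALIDATION_ERROR`, since resubmitting invalid input cannot succeed. The following Python sketch assumes the error code is available to the caller. The `ActorError` exception, the `call_with_retry` helper, and the backoff values are hypothetical, because how the actor actually reports these codes is not documented here.

```python
import time

RETRYABLE_CODES = {"NETWORK_ERROR"}  # VALIDATION_ERROR is deliberately excluded


class ActorError(Exception):
    """Hypothetical wrapper carrying one of the error codes from the table."""

    def __init__(self, code: str, message: str = ""):
        super().__init__(f"{code}: {message}")
        self.code = code


def call_with_retry(run_once, attempts: int = 3, base_delay: float = 1.0, sleep=time.sleep):
    """Call run_once, retrying retryable errors with exponential backoff."""
    for attempt in range(1, attempts + 1):
        try:
            return run_once()
        except ActorError as error:
            if error.code not in RETRYABLE_CODES or attempt == attempts:
                raise
            sleep(base_delay * 2 ** (attempt - 1))  # waits 1s, 2s, 4s, ... by default


# Illustration with a stand-in that fails twice with NETWORK_ERROR, then succeeds.
calls = {"count": 0}
delays = []


def flaky_run():
    calls["count"] += 1
    if calls["count"] < 3:
        raise ActorError("NETWORK_ERROR", "connection reset")
    return "ok"


outcome = call_with_retry(flaky_run, sleep=delays.append)
```

Passing `sleep` as a parameter keeps the helper testable without real waiting.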
Known Limitations
- Private accounts: Cannot extract data from private accounts without following
- Rate limiting: Platform may temporarily block repeated requests from same IP
- Data freshness: Extracted data reflects moment of extraction; historical data may be limited
- Proxy requirements: High-security targets may require residential proxy for reliable extraction
- Content limits: Platform-imposed limits on historical post retrieval
Legal
Data Source: Substack (substack.com)
Terms of Service: This actor is provided for legitimate data collection purposes only. Users must comply with Substack's Terms of Service and applicable laws. Red.cars is not responsible for misuse of this tool.
Privacy: All extracted data is processed in accordance with applicable privacy laws. Do not use this tool for unauthorized data collection or privacy-violating activities.
Attribution: When using Substack data, comply with Substack's attribution requirements and data policies.
red.cars Intelligence Fleet • apify.com/red.cars