Pricing

from $5.99 / 1,000 email validateds

Newsletter List Cleaner - Validate & Score Subscribers

Clean and validate your newsletter subscriber list. Upload exports from Mailchimp, Beehiiv, Substack, ConvertKit, Ghost, ActiveCampaign, MailerLite, or Buttondown. Get quality scores, keep/re-engage/remove recommendations, and a full list health report.

Pricing

from $5.99 / 1,000 email validateds

Rating

0.0

(0)

Developer

ClearPath

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Email Tools ➤	Email Finder API

Changelog

March 2026
NEW 7-Layer Column Detection — auto-finds email columns in 12 languages, LLM fallback for edge cases
NEW 8 Platform Fingerprints — recognizes exports from Mailchimp, Beehiiv, Substack, ConvertKit, Ghost, ActiveCampaign, MailerLite, Buttondown
NEW Health Report — quality scoring, domain analysis, keep/re-engage/remove recommendations
File support — CSV, XLSX, TXT, ZIP (Mailchimp), Google Sheets URLs

Upload any subscriber export. Get a cleaned list back with quality scores and a health report.

Drop your CSV, XLSX, ZIP, or Google Sheets URL. The actor detects your newsletter platform, finds the email column, validates every address, and sorts subscribers into three buckets: keep, re-engage, or remove. No configuration needed for 95%+ of files.

Output Example

Copy to your AI assistant

Copy this block into ChatGPT, Claude, Cursor, or any LLM to start using this actor.

clearpath/newsletter-list-cleaner on Apify. Call: ApifyClient("TOKEN").actor("clearpath/newsletter-list-cleaner").call(run_input={...}), then client.dataset(run["defaultDatasetId"]).list_items().items for results. Key inputs: subscriberFile (string, URL to CSV/XLSX/ZIP/Google Sheets), emailAddresses (array), platform (string). Full actor spec (input schema with all params/enums/defaults, output dataset fields, README): GET https://api.apify.com/v2/acts/clearpath~newsletter-list-cleaner/builds/default (Bearer TOKEN) → inputSchema, actorDefinition.storages.dataset, readme. Pricing: platform usage cost only. Get token: https://console.apify.com/account/integrations

Subscriber lists decay. People abandon email addresses, fill inboxes, sign up with disposable emails, or get flagged as spamtraps. Sending to a dirty list tanks your deliverability score, lands you in spam folders, and wastes money on every send.

This actor cleans your list automatically.

Upload your export from any newsletter platform. Every subscriber gets validated at the mail server level and scored 0-100. You get three lists back: who to keep, who to try re-engaging, and who to remove immediately.

What You Get Per Subscriber

Every validated email returns:

✅ Quality Score (0-100) with keep / re-engage / remove action
✅ Deliverability Status — safe, risky, invalid, disposable, catch-all, unknown
✅ Risk Flags — spamtrap, role account, disabled mailbox, inbox full, catch-all, disposable
✅ Domain Info — MX records, free vs business email detection
✅ Tags — human-readable labels for quick filtering

List Health Report

Every run generates a health report at the end:

Metric	Description
List Health Score	0-100 grade for your entire subscriber list
Action Summary	How many to keep, re-engage, and remove
Risk Breakdown	Safe, risky, invalid, disposable, catch-all, spamtrap counts
Domain Analysis	Top 10 domains, free vs business split, unique domain count
Quality Distribution	Excellent / good / moderate / poor breakdown
Recommendations	Specific tips based on your list's problems
Ready Lists	Clean list, re-engage list, and remove list as email arrays

Supported Platforms

Upload your subscriber export and the platform is detected automatically:

Platform	Auto-detected	Export format
Mailchimp	Yes	CSV or ZIP
Beehiiv	Yes	CSV
Substack	Yes	CSV
ConvertKit (Kit)	Yes	CSV
Ghost	Yes	CSV
ActiveCampaign	Yes	CSV
MailerLite	Yes	CSV
Buttondown	Yes	CSV
Any other	Yes	CSV, XLSX, TXT, Google Sheets

Don't see your platform? The 7-layer detection pipeline handles unknown formats too. It scans column headers in 12 languages and falls back to content analysis when headers don't match.

How Email Column Detection Works

Most tools require you to map columns manually. This actor finds the email column automatically:

Layer	Method	Cost	What it catches
1	User override	Free	Explicit column name if provided
2	Platform fingerprints	Free	Mailchimp, Beehiiv, Substack, etc.
3	Known header patterns	Free	35+ patterns in 12 languages
4	Partial header match	Free	Any header containing "email", "correo", etc.
5	Content scan (80%)	Free	Columns where 80%+ values are emails
6	LLM fallback	~$0.001	Unusual headers, non-Latin scripts
7	Relaxed scan (60%)	Free	Sparse columns with some emails

The LLM only fires when layers 1-5 fail. Zero extra cost for the vast majority of runs.

Quality Scoring

Each subscriber gets a quality score from 0-100, adjusted for newsletter-specific risk factors:

Factor	Impact	Why
Spamtrap	-50	Destroys sender reputation
Disabled mailbox	-40	Dead address, guaranteed bounce
Disposable email	-30	Temporary address, zero engagement
Full inbox	-20	Won't receive your newsletter
Role account	-15	info@, admin@ rarely read newsletters
Catch-all domain	-10	Inflates list without engagement signal

Action buckets:

Keep (score ≥ 70, status safe) — safe to send
Re-engage (score 40-69 or risky) — send re-engagement campaign first
Remove (score < 40, invalid, disposable, spamtrap) — remove immediately

Input

Upload a subscriber file

{
    "subscriberFile": "https://your-file-url.com/subscribers.csv"
}

Supports CSV, XLSX, TXT (one email per line), ZIP (Mailchimp exports), and Google Sheets URLs. The file format and email column are detected automatically.

Paste emails directly

{
    "emailAddresses": [
        "alice@company.com",
        "bob@gmail.com",
        "info@example.org"
    ]
}

Both combined with a limit

{
    "subscriberFile": "https://your-file-url.com/export.xlsx",
    "emailAddresses": ["extra@company.com"],
    "maxItems": 500
}

Google Sheets

Paste a Google Sheets URL directly. The sheet must be set to "Anyone with the link can view."

{
    "subscriberFile": "https://docs.google.com/spreadsheets/d/1cYOJX228.../edit?usp=sharing"
}

Input Parameters

Parameter	Type	Description	Default
`subscriberFile`	string	Upload CSV, XLSX, TXT, ZIP, or paste a Google Sheets URL	—
`emailAddresses`	array	Enter emails manually. Can combine with file upload.	—
`maxItems`	integer	Limit how many subscribers to validate. `0` = validate all.	`0`
`platform`	string	Override automatic platform detection.	`auto`
`csvEmailColumn`	string	Override automatic email column detection.	—

Output Examples

Subscriber result

{
    "email": "subscriber@company.com",
    "action": "keep",
    "qualityScore": 98,
    "status": "safe",
    "tags": [],
    "domain": "company.com",
    "safeToSend": true,
    "deliverable": true,
    "catchAll": false,
    "disposable": false,
    "roleAccount": false,
    "freeEmail": false,
    "spamtrap": false,
    "disabled": false,
    "inboxFull": false,
    "mxRecords": ["mx1.company.com", "mx2.company.com"]
}

Health report (last item in dataset)

{
    "type": "list_health_report",
    "platformDetected": "mailchimp",
    "totalSubscribers": 1250,
    "listHealthScore": 82,
    "actionSummary": {
        "keep": 1020,
        "reEngage": 115,
        "remove": 115
    },
    "riskBreakdown": {
        "safe": 1020,
        "risky": 95,
        "invalid": 85,
        "disposable": 20,
        "catchAll": 30,
        "spamtrap": 0
    },
    "domainAnalysis": {
        "topDomains": [
            {"domain": "gmail.com", "count": 625, "percentage": 50.0},
            {"domain": "outlook.com", "count": 188, "percentage": 15.0}
        ],
        "freeEmailPercentage": 68.5,
        "businessEmailPercentage": 31.5,
        "uniqueDomains": 342
    },
    "qualityDistribution": {
        "excellent_90_100": 980,
        "good_70_89": 40,
        "moderate_40_69": 115,
        "poor_0_39": 115
    },
    "recommendations": [
        "Remove 115 invalid subscribers to reduce bounce rate by ~9.2%.",
        "Send a re-engagement campaign to 115 risky subscribers before your next send.",
        "68% of subscribers use free email providers. Optimize for Gmail and Yahoo rendering."
    ],
    "cleanList": ["subscriber1@company.com", "..."],
    "reEngageList": ["risky@example.com", "..."],
    "removeList": ["invalid@nowhere.xyz", "..."]
}

Pricing

Apify Plan	Per 1,000 emails
Free	$7.99
Starter	$7.49
Scale	$6.99
Business	$5.99

Cost examples

List size	Free plan	Business plan
100	$0.80	$0.60
500	$4.00	$3.00
1,000	$7.99	$5.99

Free tier includes 1 run with up to 10 emails at no cost.

API Integration

Python

from apify_client import ApifyClient

client = ApifyClient("your_token")

run = client.actor("clearpath/newsletter-list-cleaner").call(run_input={
    "subscriberFile": "https://your-file-url.com/subscribers.csv",
    "maxItems": 1000,
})

for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    if item.get("type") == "list_health_report":
        print(f"Health score: {item['listHealthScore']}/100")
    else:
        print(f"{item['email']}: {item['action']} (score: {item['qualityScore']})")

JavaScript

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'your_token' });

const run = await client.actor('clearpath/newsletter-list-cleaner').call({
    subscriberFile: 'https://your-file-url.com/subscribers.csv',
    maxItems: 1000,
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach(item => {
    if (item.type === 'list_health_report') {
        console.log(`Health score: ${item.listHealthScore}/100`);
    } else {
        console.log(`${item.email}: ${item.action} (score: ${item.qualityScore})`);
    }
});

Data Export

Download results as JSON, CSV, or Excel directly from the Apify dataset. The CSV export works well for re-importing your clean list back into your newsletter platform.

Automation

Scheduled runs — clean your list weekly or monthly on autopilot
Webhooks — get notified when cleaning completes
API — integrate into your subscriber management pipeline

Use with AI Assistants (MCP)

Connect this actor to AI assistants via Apify's MCP server.

Setup

Go to mcp.apify.com and add clearpath/newsletter-list-cleaner
Connect to your AI assistant (Claude Desktop, Cursor, etc.)

Example prompts

"Clean my newsletter list from this CSV and tell me how many subscribers to remove"

"Validate these 50 email addresses and flag any spamtraps or disposable addresses"

"Upload my Mailchimp export and give me a health report"

LLM context

Newsletter List Cleaner validates subscriber email lists and returns per-email quality scores (0-100) with keep/re-engage/remove actions. Data includes: deliverability status, risk flags (spamtrap, disposable, catch-all, role account), domain analysis, MX records. Supports CSV, XLSX, TXT, ZIP, Google Sheets. Auto-detects 8 newsletter platforms and email columns in 12 languages. Output: JSON array of scored subscribers + list health report with recommendations.

FAQ

Q: What file formats are supported? A: CSV, XLSX, TXT (one email per line), ZIP archives (Mailchimp exports), and Google Sheets URLs. Detected automatically.

Q: Do I need to specify which column has emails? A: No. The 7-layer detection pipeline finds it automatically. Override in settings if needed.

Q: What does the quality score mean? A: 0-100 based on deliverability and risk factors. 70+ with "safe" status = keep. Below 40 = remove. In between = re-engage.

Q: How accurate is the validation? A: Email deliverability is verified at the mail server level (SMTP). Spamtrap, disposable, and catch-all detection included.

Q: Can I combine file upload with manual emails? A: Yes. Duplicates are removed automatically.

Q: What happens with files that have no email column? A: Clear error message. No credits charged.

Q: How long does validation take? A: A few seconds for small lists. 10,000+ emails take 1-2 minutes.

Q: Does it work with non-English files? A: Yes. Email column headers recognized in 12 languages including German, Spanish, French, Italian, Portuguese, Japanese, Korean, Chinese, and Arabic.

Q: Is there a free trial? A: Free tier includes 1 run with up to 10 emails at no cost.

Q: What's the difference between this and the Email Validator API? A: The Email Validator API validates raw email addresses. This actor is designed for newsletter workflows: it handles file uploads, detects platforms, scores subscribers, and generates a health report with recommendations.

Support

Email: max@mapa.slmail.me
Bugs: Issues tab
Features: Email or issues

Legal Compliance

Validates publicly provided email addresses at the mail server level. No email content is read or stored. Users must comply with applicable data protection regulations (GDPR, CCPA).

Clean your newsletter list. Keep the engaged, re-engage the risky, remove the dead weight.

Beehiiv Newsletter Discovery Scraper

crawlerbros/beehiiv-newsletter-scraper

Discover and scrape newsletters from Beehiiv's public directory. Browse the full newsletter catalog, get detailed newsletter profiles by URL or subdomain, or extract recent posts from any Beehiiv newsletter. No login required

Crawler Bros

Newsletter Scraper — Substack, Beehiiv, Ghost Archives

benthepythondev/newsletter-scraper

Extract newsletter archives from Substack, Beehiiv, and Ghost platforms. Get full content in markdown format, complete metadata, embedded images, word counts, and AI-ready token counts. Perfect for content research, competitive analysis, and training AI models.

ben

Substack Newsletter Scraper

red.cars/substack-newsletter-scraper

Extract newsletter content, subscriber data, and author insights from any Substack publication - no API key required!

AutomateLab

1.0

Substack Newsletter Scraper

digispruce/substack-scraper

Extract comprehensive Substack newsletter data including author profiles, subscriber counts, social media links, and contact information for B2B outreach and market research.

Akram

4.0

Newsletter Archiver: Substack, Beehiiv, Ghost, RSS

aitoolbreakdown/atb-newsletter-archiver

Point it at a newsletter's public RSS feed. Returns every post as structured JSON: title, date, full HTML + plaintext, author, URL. Works with Substack, Beehiiv, Ghost, and any Atom/RSS 2.0 feed.

AI Tool Breakdown

Substack Newsletter Scraper

opalescent_quintet/substack-newsletter-scraper

Substack-Newsletter-Scraper Extract complete newsletter archives from any Substack publication with advanced filtering, multiple export formats, and engagement analytics. ## Features - Scrape entire newsletter archives from Substack - Extract full metadata: titles, content, author details etc.

Aryan

Substack Newsletter Lead Scraper

gocreative.ai/substack-newsletter-leads

Scrape Substack's public discovery API to find B2B newsletters by category. Returns newsletter name, author, subscriber count, URL, and niche. Perfect for newsletter sponsorship sales, content marketing outreach, and B2B audience research.

GoCreative AI

Buttondown Newsletter Archive Scraper

jungle_synthesizer/buttondown-newsletter-archive-scraper

Scrape posts from any Buttondown newsletter publication. Input a list of publication usernames and get back every post with title, date, excerpt, cover image, tags, and optional full body text. Supports multi-publication fan-out. No login required.

BowTiedRaccoon

Substack Posts Scraper - Newsletter Data Extractor

klondikeking/substack-posts-scraper

Extract posts, engagement metrics, and newsletter data from Substack publications. Perfect for content research.

Pierrick McD0nald

InboxReads Newsletter Directory Scraper

jungle_synthesizer/inboxreads-newsletter-directory-scraper

Scrape the InboxReads newsletter directory — 6,600+ cross-platform newsletters (Substack, beehiiv, Ghost, Mailchimp). Extracts name, description, topics, pricing, subscriber count, open rate, click rate, and signup URL. Filter by topic or scrape the full directory.

BowTiedRaccoon

Newsletter List Cleaner - Validate & Score Subscribers

Newsletter List Cleaner — Validate & Score Subscriber Lists (2026)

Changelog

Copy to your AI assistant

The Problem with Newsletter Lists

What You Get Per Subscriber

List Health Report

Supported Platforms

How Email Column Detection Works

Quality Scoring

Input

Upload a subscriber file

Paste emails directly

Both combined with a limit

Google Sheets

Input Parameters

Output Examples

Subscriber result

Health report (last item in dataset)

Pricing

Cost examples

API Integration

Python

JavaScript

Data Export

Automation

Use with AI Assistants (MCP)

Setup

Example prompts

LLM context

FAQ

Support

Legal Compliance

You might also like

Beehiiv Newsletter Discovery Scraper

Newsletter Scraper — Substack, Beehiiv, Ghost Archives

Substack Newsletter Scraper

Substack Newsletter Scraper

Newsletter Archiver: Substack, Beehiiv, Ghost, RSS

Substack Newsletter Scraper

Substack Newsletter Lead Scraper

Buttondown Newsletter Archive Scraper

Substack Posts Scraper - Newsletter Data Extractor

InboxReads Newsletter Directory Scraper