Pricing

Pay per event

NewsletterHunt Archived Issue Scraper

Scrapes NewsletterHunt's cross-publication email archive. Extracts subject lines, sender, date, full email body HTML and plain text, plus the newsletter signup URL. Covers hundreds of publications and tens of thousands of archived issues — no login required.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

What You Get

Each output record is one archived email issue. That includes everything you need to understand what was sent, when, by whom, and where to sign up.

Field	Description
`newsletter_slug`	URL slug identifying the newsletter (e.g. `the-hustle`)
`newsletter_name`	Publication display name (e.g. `The Hustle`)
`newsletter_url`	NewsletterHunt page for this newsletter
`newsletter_signup_url`	The newsletter's own website or subscribe URL
`topic`	Topic/category tags assigned by NewsletterHunt
`email_id`	NewsletterHunt archive ID for this issue
`email_url`	Direct link to the archived issue page
`email_subject`	Subject line of the email issue
`email_sender`	Publication or author who sent it
`email_date`	Publication date (ISO 8601)
`email_body_html`	Full rendered email body HTML (capped at 500 KB)
`email_body_text`	Plain-text extract of the body (tags stripped, capped at 50 KB)
`scraped_at`	Timestamp of when this record was scraped

How It Works

Three-level crawl. Discovers newsletters from the listing page, fetches each newsletter's full issue archive via a JSON endpoint, then retrieves the email body from each issue page.

The email body is embedded directly in the page HTML as an iframe srcdoc attribute — no additional fetches required. Subject, sender, and date are pulled from the page alongside it.

Input

Parameter	Type	Default	Description
`maxItems`	integer	10	Maximum number of email records to return. Set to 0 for no limit.

Usage Notes

The listing page currently shows ~9 newsletters. Each has many archived issues.
Some older archived issues may not have a full email body (displayed as null).
NewsletterHunt is a public archive. No authentication is needed.
Polite crawl rate: 5 concurrent requests.

Example Output

{
  "newsletter_slug": "money-stuff-by-matt-levine",
  "newsletter_name": "Money Stuff by Matt Levine",
  "newsletter_url": "https://newsletterhunt.com/newsletters/money-stuff-by-matt-levine",
  "newsletter_signup_url": "http://link.mail.bloombergbusiness.com/join/4wm/moneystuff-signup",
  "topic": "Finance",
  "email_id": "340238",
  "email_url": "https://newsletterhunt.com/emails/340238",
  "email_subject": "Money Stuff: Index Funds Can't Say No to SpaceX",
  "email_sender": "Money Stuff by Matt Levine",
  "email_date": "2020-12-09T11:43:00",
  "email_body_html": "<!DOCTYPE html>...",
  "email_body_text": "Money Stuff: Index Funds Can't Say No to SpaceX...",
  "scraped_at": "2026-06-02T17:30:00.000Z"
}

Pairs well with actors that target specific newsletter platforms for deeper per-publication archives, or with a newsletter directory scraper for subscriber counts and open rates.

Wayback Machine: Recover Deleted Gov Pages & Data

thoob/gov-data-rescue-retriever

Recovers US federal pages and datasets pulled offline by finding their archived copies through the Internet Archive's official Wayback APIs. Returns the archived snapshot URL, timestamp, and metadata for each source. Billed only per archived snapshot found.

Pono Data

Wayback Machine URL Extractor - Archived URLs

logiover/wayback-machine-url-extractor

Extract every archived URL of any domain from the Internet Archive's Wayback Machine (CDX API). Recover lost or old pages, build redirect maps and run OSINT, with date and status filters. No API key, export to CSV or JSON.

Logiover

Buttondown Newsletter Archive Scraper

jungle_synthesizer/buttondown-newsletter-archive-scraper

Scrape posts from any Buttondown newsletter publication. Input a list of publication usernames and get back every post with title, date, excerpt, cover image, tags, and optional full body text. Supports multi-publication fan-out. No login required.

BowTiedRaccoon

Internet Archive Search — Wayback Machine Advanced Query Tool

maged120/archive-org-advanced-search

Search the Internet Archive (archive.org) with full advanced filter support — date range, media type, language, subject, and more. Returns metadata from archived web pages, books, audio, and video.

Maged

Bulk Email Sender

bhansalisoft/bulk-email-sender

Bulk Email Sender - Send Bulk Emails From Any Email Provider, so that you can send thousands of email.

bhansalisoft

Substack Scraper

crawlerbros/substack-scraper

Scrape Substack publications via the public RSS feed of any newsletter. Extract post title, URL, author, publication date, body HTML, categories, and enclosures. HTTP-only with TLS impersonation (no auth, no proxy).

Crawler Bros

Global News Archive API - Rise of the Phoenix

thescrapelab/Apify-The-Rise-of-the-Phoenix

Search archived global news articles by country, publisher, and date. Export clean article text and metadata for media monitoring, PR research, market intelligence, RAG, and LLM workflows.

Inus Grobler

5.0

Text Sentiment Analysis

easyapi/text-sentiment-analysis

Analyze the sentiment of your text, submit single or multiple lines of text and receive a detailed report, including the number of lines analyzed and the breakdown of sentiments (positive, negative, neutral). Gain insights into the emotional tone of your content effortlessly!

EasyApi

LinkedIn Newsletter Scraper

automation-lab/linkedin-newsletter-scraper

📰 Extract public LinkedIn newsletter metadata, edition URLs, full issue text, authors, dates, images, and engagement counts—without login.

Stas Persiianenko

Wayback Machine Search

crawlerbros/wayback-machine-search

Query Internet Archive's Wayback Machine for historical snapshots of any URL or domain. Filter by date, HTTP status, MIME type, and deduplicate. Optionally fetch the archived page text. Free public CDX API, no authentication.

Crawler Bros