Pricing

Pay per event

Beehiiv Newsletter Scraper

Scrape posts from any beehiiv-powered newsletter. Input publication domains — the actor discovers post URLs via sitemap and extracts title, author, publish date, excerpt, cover image, tags, and word count. Supports multi-newsletter fan-out in a single run.

Pricing

Pay per event

Rating

0.0

(0)

Developer

BowTiedRaccoon

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

What it does

The actor accepts a list of beehiiv publication domains (e.g. readthepeak.com, discover.beehiiv.com) and for each domain:

Fetches <domain>/sitemap.xml to discover all public post URLs matching the /p/<slug> pattern.
Crawls each post page and extracts structured data from the embedded JSON-LD Article schema.
Yields one record per post with all metadata fields.

Publications that sit behind Cloudflare or other anti-bot measures are gracefully skipped with a warning. Free posts are scraped; paywalled posts (where isAccessibleForFree: false in JSON-LD) are automatically skipped.

Input

Parameter	Type	Description
`domains`	array	List of publication domains. Accepts bare domains (`readthepeak.com`), subdomains (`mybrand.beehiiv.com`), or full URLs (`https://readthepeak.com`).
`maxItems`	integer	Maximum posts to scrape per publication (0 = unlimited). Default: 10.

Example input:

{
  "domains": ["readthepeak.com", "discover.beehiiv.com"],
  "maxItems": 50
}

Output

Each record contains:

Field	Description
`publication_domain`	Input domain (e.g. `readthepeak.com`)
`publication_name`	Newsletter name from JSON-LD publisher
`post_url`	Canonical post URL
`post_title`	Post headline
`post_subtitle`	Post subtitle / description
`author`	Author name
`publish_date`	ISO 8601 publish timestamp
`excerpt`	Short description (up to 300 chars)
`cover_image_url`	Cover image URL
`word_count`	Estimated word count of post body
`tags`	Comma-separated tags
`full_text`	Full post body text (empty unless `include_full_text` is set)
`scraped_at`	ISO 8601 scrape timestamp

Example output record:

{
  "publication_domain": "readthepeak.com",
  "publication_name": "The Peak",
  "post_url": "https://www.readthepeak.com/p/canadian-universities-are-falling-behind",
  "post_title": "Canadian universities are falling behind",
  "post_subtitle": "Canada's post-secondary schools are losing their edge.",
  "author": "Lucas Arender",
  "publish_date": "2026-06-02T10:00:00.000Z",
  "excerpt": "Canada's post-secondary schools are losing their edge.",
  "cover_image_url": "https://beehiiv-images-production.s3.amazonaws.com/...",
  "word_count": 291,
  "tags": "Water Cooler, Perspectives",
  "full_text": "",
  "scraped_at": "2026-06-02T20:39:48.116Z"
}

Limitations

Publications behind Cloudflare or PerimeterX (e.g. some high-traffic custom domains) will return a warning and be skipped. Use a different domain format if the publication has a *.beehiiv.com subdomain that is not CF-walled.
Paywalled posts (subscriber-only) are detected via JSON-LD and automatically skipped.
Publications without a sitemap.xml or with no /p/ posts in their sitemap are skipped.
full_text extraction is best-effort — post body selectors may vary slightly across beehiiv themes.

Beehiiv Newsletter Archive Scraper

parseforge/beehiiv-newsletter-scraper

Pull every public post from one or many Beehiiv newsletters: title, description, image, publish date, author, word count, and excerpt. Discover via the public sitemap, fan across multiple newsletters, filter by keyword. Export to JSON, CSV, or Excel for newsletter research and content trends.

ParseForge

Beehiiv Newsletter Discovery Scraper

crawlerbros/beehiiv-newsletter-scraper

Discover and scrape newsletters from Beehiiv's public directory. Browse the full newsletter catalog, get detailed newsletter profiles by URL or subdomain, or extract recent posts from any Beehiiv newsletter. No login required

Crawler Bros

Beehiiv Newsletter Scraper - Low-cost 💲🔥📰📬

delectable_incubator/beehiiv-newsletter-scraper-low-cost

📰 Scrape Beehiiv newsletter articles from one or multiple Beehiiv publications. Extract article titles, URLs, publication dates, authors, descriptions, featured images, and other metadata. Perfect for content monitoring, newsletter analytics, market research, AI datasets and content automation 🚀📊

Prime Scrape

Beehiiv Newsletter Scraper

mattdef/beehiiv-newsletter-scraper

Scrape Beehiiv newsletters: metadata, articles, authors, and publication dates. Perfect for lead generation, content research, and competitive analysis.

Matthieu Cast

Buttondown Newsletter Archive Scraper

jungle_synthesizer/buttondown-newsletter-archive-scraper

Scrape posts from any Buttondown newsletter publication. Input a list of publication usernames and get back every post with title, date, excerpt, cover image, tags, and optional full body text. Supports multi-publication fan-out. No login required.

BowTiedRaccoon

Beehiiv Newsletter Scraper - Posts & Authors

elliotpadfield/beehiiv-newsletter-scraper

Scrape public Beehiiv newsletters by publication URL, custom domain, sitemap, or post URL. Extract posts, authors, full text, HTML, markdown, images, outbound links, sponsor links, and publication metadata.

Elliot Padfield

Substack Newsletter Scraper

scrapers-hub/substack-newsletter-scraper

Substack Newsletter scraper extracts publicly available newsletter posts, titles, authors, publication dates, subscriber-facing content, and metadata 📰📊 Perfect for content research, trend analysis, competitive intelligence, and newsletter monitoring.

Scrapers Hub

Beehiiv Newsletter Scraper

scraper_guru/beehiiv-scraper

Extract complete data from Beehiiv newsletters including posts, authors, engagement metrics, and full article HTML/text. Fast native API discovery & PerimeterX bypass

LIAICHI MUSTAPHA

Substack Newsletter Scraper

cloud9_ai/substack-scraper

Scrape posts from any Substack newsletter publication. Returns post titles, URLs, publish dates, authors, and content previews via RSS feed.

cloud9

Newsletter Scraper – Substack, Beehiiv & Ghost

ninhothedev/newsletter-scraper

$1/1K 🔥 Fast newsletter & RSS scraper! Titles, authors, dates, links & full content from any feed. JSON, CSV, Excel or API in seconds. Paste feed URLs & extract thousands of posts for content & monitoring ⚡