Pricing

Pay per usage

Get Site to Markdown

Website to Markdown Crawler An asynchronous web crawler that mirrors websites into a single organized markdown file, with handling for images and directory structure preservation. Designed to operate with low cost. This works great to build context for AI agents.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

b-w.pro

Actor stats

Bookmarked

Total users

Monthly active users

63 days

Issues response

a year ago

Last modified

Website to Markdown Crawler

An asynchronous web crawler that mirrors websites into a single organized markdown file, with special handling for images and proper directory structure preservation. Built with Python, asyncio, and httpx.

Author: Jordan Haisley (jordan@b-w.pro)

Features

🚀 Fast asynchronous crawling using httpx and asyncio
📁 Preserves site structure - can be limited to specific subdirectories
🖼️ Smart image handling - preserves both alt text and filenames
📝 Clean Markdown output with proper sectioning
🔍 Depth-controlled crawling
🔒 Domain-restricted recursive crawling for safety
🤫 Quiet mode for silent operation

As an Apify Actor

Actor input schema:

{
    "start_urls": [{"url": "https://example.com"}],
    "max_depth": 1
}

Output Format

The generated markdown file contains:

A section for each page
Page title as heading
Original URL reference
Page content in Markdown format
Image references with both alt text and filenames

Example output:

# Page Title
*URL: https://example.com/page*

![Alt text (File: image.jpg)](https://example.com/image.jpg)

Page content in markdown...

----------------

Markdown API

vivid_astronaut/markdown

Fabio Suizu

AI Website Content Markdown Scraper

quaking_pail/ai-website-content-markdown-scraper

This Apify Actor, "Website Content Crawler with Markdown Extraction," is designed to perform a comprehensive crawl of specified websites, extract their text content, convert it into Markdown format, and store it in a structured dataset. The extracted content is suitable for feeding LLMs.

AI_Builder

904

2.3

Docs Markdown Rag Ready Crawler

devwithbobby/docs-markdown-rag-ready-crawler

Turn any documentation site or website into clean, structured markdown—ready for RAG, embeddings, and AI agents.

Dev with Bobby

Webpage to Markdown

extremescrapes/webpage-to-markdown

This actor cost-effectively converts websites into structured markdown optimized for AI processing. It extracts webpage content, formats it into clean markdown, and ensures compatibility with AI models.

Extreme Scrapes

176

5.0

Simple Website Scrapper (markdown format)

manojaditya64/simple-website-scrapper-markdown-format

A simple website scrapper that scrapes websites and converts it into markdown format which is easy to use with LLM. You can feed markdown data to LLM for easy analysis.

Manojaditya Nadar

5.0

Markdown Maker: HTML to Markdown 📝

shahidirfan/Markdown-Maker

Instantly convert complex HTML into clean, structured Markdown. This lightweight actor is optimized to render web content into a format that is easily readable for AI LLMs, reducing token usage and improving context. Perfect for RAG pipelines and preparing data for training.

Shahid Irfan

Ai Ready Web Page To Markdown Converter

mustafa.irshaid.113/ai-ready-web-page-to-markdown-converter

Convert any webpage into structured Markdown and HTML using just a URL. Get the page title, link, and content—perfect for SEO, devs, and AI crawlers. Fast, clean, and ideal for repurposing or analysis. Start turning websites into Markdown instantly.

Mustafa Irshaid

Crawl4ai To Markdown Pro2

juryless_rainbow/crawl4ai-to-markdown-pro2

A high-performance web-to-markdown crawler for AI agents, optimized for LLM data extraction using Crawl4AI. Features stealth browsing and high-fidelity content extraction.