Webpage To Markdown
Pricing
Pay per usage
Webpage To Markdown
Pricing
Pay per usage
Rating
0.0
(0)
Developer

Donny Nguyen
Actor stats
0
Bookmarked
1
Total users
0
Monthly active users
17 hours ago
Last modified
Categories
Share
Webpage to Markdown Converter
What does it do?
Webpage to Markdown Converter takes any URL and converts the webpage content into clean, structured Markdown text. It strips navigation, ads, footers, popups, and other clutter while preserving the document structure including headings, paragraphs, lists, links, images, code blocks, tables, and emphasis. The output is ready for use in LLM prompts, RAG pipelines, documentation, and content processing.
Why use this actor?
Converting web content to Markdown is essential for AI and LLM workflows. Raw HTML is messy and token-heavy, while plain text loses structure. This actor produces clean Markdown that preserves document hierarchy, making it ideal for feeding content to language models, building knowledge bases, creating documentation, or preparing training data. It processes pages in bulk and delivers structured output.
How to use it
- Go to the actor's page on the Apify platform.
- Click Start to open the input configuration.
- Enter one or more URLs to convert.
- Choose whether to include images and links in the output.
- Click Start and wait for the results.
- Download your Markdown data from the Dataset tab.
The actor can process many URLs in a single run and is callable via the Apify API for automation.
Input configuration
| Field | Type | Description | Default |
|---|---|---|---|
| urls | array | URLs to convert to Markdown | ["https://docs.apify.com/platform/actors"] |
| includeImages | boolean | Include image references | true |
| includeLinks | boolean | Preserve hyperlinks | true |
| proxyConfiguration | object | Proxy settings | Apify Proxy |
Output data
Each item in the dataset contains:
{"url": "https://docs.apify.com/platform/actors","title": "Actors | Apify Documentation","markdown": "# Actors\n\nActors are serverless cloud programs...","wordCount": 1250,"headings": 8,"charCount": 7890,"scrapedAt": "2026-02-19T14:30:00.000Z"}
Cost of usage
This actor uses CheerioCrawler for fast HTML processing. A typical run converting 10 URLs takes under 30 seconds and costs less than $0.01 in platform credits. The actor is priced at $0.30 per 1,000 results with pay-per-event pricing. It is one of the most cost-efficient ways to convert web content for AI workflows.
Tips
- Use this as a preprocessing step before feeding content to LLMs to reduce token usage.
- Disable images if you only need text content to keep output smaller.
- The actor automatically removes navigation, ads, and footer content for cleaner output.
- Combine with a web crawler to convert entire websites to Markdown documentation.
- The word count field helps you estimate token counts for LLM processing.
- For JavaScript-heavy single-page apps, consider using a Puppeteer-based scraper instead.
- Schedule runs to keep your Markdown knowledge base updated with the latest web content.
Built with Crawlee and Apify SDK. See more scrapers by consummate_mandala on Apify Store.