Pricing

from $10.00 / 1,000 results

Go to Apify Store

Webpage To Markdown Converter

Try for free

Pricing

from $10.00 / 1,000 results

Rating

0.0

(0)

Developer

Donny

Actor stats

Bookmarked

Total users

Monthly active users

6 hours ago

Last modified

What it does

Converts web pages into clean Markdown format. For each provided URL, the actor fetches the HTML content, extracts the main body, and converts HTML elements to their Markdown equivalents including headers, links, lists, bold and italic text, images, and paragraphs. Outputs the page title, converted Markdown content, word count, and fetch timestamp. Useful for content archival, documentation generation, knowledge base building, and converting web content for use in Markdown-based systems like GitHub or static site generators.

This Apify actor automates the collection of data from a public API or website, extracting structured information and saving it directly into an Apify dataset. It handles pagination automatically where applicable, supports configurable result limits, and includes robust error handling with timeouts on all HTTP requests. The actor is designed for reliability: it validates inputs, applies sensible defaults, and produces a fallback record when no results are found, so your downstream workflows never receive an empty dataset. Built on the Apify SDK with native Node.js 20 fetch for lightweight, fast execution without browser overhead.

Why use it

Manually collecting data from web APIs and websites is tedious and error-prone. This actor eliminates that burden by running in the cloud on the Apify platform, where it can be scheduled, integrated with webhooks, or chained with other actors. Whether you are conducting research, building a knowledge base, monitoring data sources, or feeding data into an analytics pipeline, this actor gives you structured, ready-to-use JSON output with zero browser overhead. It uses lightweight HTTP requests instead of a full browser, which makes it fast and cost-effective. Every request includes a 120-second timeout to prevent hanging, and all string fields are null-checked for data consistency.

Input parameters

urls (array, required): List of webpage URLs to convert to Markdown. Default: ["https://en.wikipedia.org/wiki/Web_scraping"].
maxContentLength (integer, optional): Maximum number of characters of Markdown content per page. Default: 50000. Range: 1000-200000.

All inputs are validated at startup with sensible defaults applied when values are missing. The actor will log warnings for any misconfigured options and continue with safe defaults rather than failing outright.

Output data

Each item in the output dataset contains the following fields:

url: The URL of the converted page
title: Page title from the title tag
markdown: The converted Markdown content (truncated to maxContentLength)
wordCount: Number of words in the Markdown output
fetchedAt: ISO timestamp when the page was fetched

All string fields are null-checked; missing values are stored as null rather than undefined.

Example output

{
    "url": "https://en.wikipedia.org/wiki/Web_scraping",
    "title": "Web scraping - Wikipedia",
    "markdown": "# Web scraping\n\nWeb scraping is the process of...",
    "wordCount": 3500,
    "fetchedAt": "2024-01-15T12:00:00.000Z"
}

Pricing

This actor is priced on a usage basis:

$0.01 per result returned in the dataset.
$0.005 per actor start (fixed platform fee).

For example, scraping 500 results would cost approximately $5.005. Apify provides free monthly credits for new users, so you can try the actor at no charge. Actual costs depend on the number of results, API response times, and memory allocation. You can control costs by setting the maxResults parameter to limit the number of results collected per run. For high-volume use cases, consider running the actor on a schedule during off-peak hours to optimize platform resource usage.

More scrapers from brave_paradise

Check out other useful scrapers built by brave_paradise:

Visit the brave_paradise profile on Apify to see the full catalogue of actors.

Webpage To Markdown

consummate_mandala/webpage-to-markdown

Webpage To Markdown. Powerful automation with structured JSON/CSV output, proxy rotation, and automatic retries. Pay only for results.

Donny Nguyen

Html To Markdown Converter 📄

powerful_bachelor/html-to-markdown-converter

📄✨ HTML to Markdown Converter transforms web pages into clean, portable Markdown. Simply input a URL to extract content while preserving structure, formatting, and media elements.🔄 Perfect for content repurposing, documentation, and creating readable, platform-independent text from any webpage! 🚀

Powerful Bachelor

Ai Ready Web Page To Markdown Converter

mustafa.irshaid.113/ai-ready-web-page-to-markdown-converter

Convert any webpage into structured Markdown and HTML using just a URL. Get the page title, link, and content—perfect for SEO, devs, and AI crawlers. Fast, clean, and ideal for repurposing or analysis. Start turning websites into Markdown instantly.

Mustafa Irshaid

Website To Markdown

hamzasaleem/website-to-markdown

Convert any webpage to clean, readable Markdown format. Perfect for content extraction and readability.

Hmza

Webpage to Markdown

extremescrapes/webpage-to-markdown

This actor cost-effectively converts websites into structured markdown optimized for AI processing. It extracts webpage content, formats it into clean markdown, and ensures compatibility with AI models.

Extreme Scrapes

170

5.0

(3)

Mcp Document Converter

consummate_mandala/mcp-document-converter

Mcp Document Converter. Transform data between formats with high fidelity. Fast processing with structured output.

Donny Nguyen

Webpage Content Scraper to Markdown

riisager/tulabot-cloudflare-markdown

Focus on cost, Scrape any webpage content into LLM-ready Markdown for RAG. Uses a smart hybrid 4 tier engine: Apify for crawling + Cloudflare Browser Rendering for perfect extraction. Automatically saves costs by detecting native markdown support.

Søren Riisager

AI Markdown Maker

onescales/bulk-ai-markdown-maker

Convert any web page into clean, AI ready markdown format in seconds. Perfect for feeding content to AI models, creating documentation, or archiving web content in a portable format. In addition it intelligently parse web content, removing ads, navigation, and other clutter. Generate Markdown Today!

One Scales

5.0

(2)

Website To Markdown

smart_api/website-to-markdown

Convert any webpage into clean, LLM-ready Markdown in seconds — perfect for AI training data, RAG pipelines, and content archiving.

SmartApi

5.0

(1)

URL to Markdown (JustHTML) - Clean Markdown Extractor

macheta/justhtml-link-to-markdown

Convert webpages to clean Markdown for RAG and archiving. Uses JustHTML and supports optional Cloudflare/Turnstile bypass plus CSS selector extraction.