Webpage to Markdown

Pricing

from $2.00 / 1,000 results

Get the main content of any page as Markdown. Great for LLMs and AI agent workflows.

Developer: Epic Scrapers (maintained by Community)

Last modified: 2 days ago

Extract clean Markdown content from any webpage. This Apify Actor converts HTML pages into well-formatted Markdown — perfect for feeding to LLMs, creating knowledge bases, or archiving web content in a readable format.


What can Webpage to Markdown do?

  • Extract main content only — Removes navigation, ads, and clutter using intelligent content detection
  • Clean Markdown output — Properly formatted headings, lists, links, and code blocks
  • Works on any website — No site-specific configuration needed
  • Fast & lightweight — Single-page scraping optimized for quick results
  • API & scheduling ready — Run manually, via API, or on a schedule

What data can you extract from webpages?

| Field | Type | Description |
|---|---|---|
| url | String | The source URL of the webpage |
| content | String | The extracted Markdown content from the page |

How to scrape webpages with Webpage to Markdown

  1. Open the Actor → Go to Webpage to Markdown on Apify
  2. Paste a URL → Enter any webpage URL in the URL input field
  3. Click Run → The Actor will fetch and parse the page
  4. Download results → Get your Markdown as JSON, or export to CSV/Excel

Example Input

{
  "url": "https://docs.apify.com/academy/scraping-basics-javascript"
}

Input

The Actor accepts a simple JSON input:

| Field | Required | Type | Description |
|---|---|---|---|
| url | | String | The full URL of the webpage to convert (e.g., https://example.com/article) |
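
For API use, the input above can be built and sent programmatically. The following is a minimal sketch, not the Actor's official client code: it assumes Apify's public `run-sync-get-dataset-items` endpoint and Bearer-token authentication, and the function names `build_input`, `build_run_request`, and `fetch_markdown` are hypothetical helpers. The Actor ID and token are placeholders you must supply yourself.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_input(url: str) -> dict:
    """Return the Actor input, rejecting anything that is not a full http(s) URL."""
    cleaned = url.strip()
    parts = urllib.parse.urlsplit(cleaned)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected a full http(s) URL, got {url!r}")
    return {"url": cleaned}


def build_run_request(actor_id: str, token: str, url: str) -> urllib.request.Request:
    """Prepare (but do not send) a synchronous run request.

    actor_id is assumed to be in the "username~actor-name" form; the real ID
    of this Actor is not stated here, so it is left to the caller.
    """
    quoted_id = urllib.parse.quote(actor_id, safe="~")
    endpoint = f"{API_BASE}/acts/{quoted_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        endpoint,
        data=json.dumps(build_input(url)).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def fetch_markdown(actor_id: str, token: str, url: str, timeout: float = 120.0) -> str:
    """Send the request and return the "content" field of the first result."""
    req = build_run_request(actor_id, token, url)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        items = json.load(resp)
    return items[0]["content"] if items else ""
```

Validating the URL before sending avoids paying for a run that would fail on malformed input.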

Input Schema

You can also view the input schema in the Input tab of the Apify Console.


Output

Example JSON Output

{
  "url": "https://docs.apify.com/academy/scraping-basics-javascript",
  "content": "# Scraping basics in JavaScript\n\nThis lesson covers the fundamentals...\n\n## What you'll learn\n\n- How to send HTTP requests\n- How to parse HTML with Cheerio\n- How to extract data from pages\n"
}
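
Because the Markdown arrives as an escaped JSON string, a common next step is to write it to a `.md` file. The sketch below shows one way to do that for an item shaped like the example output; the helper names `slug_for` and `save_item`, the filename scheme, and the HTML comment recording the source URL are illustrative assumptions, not part of the Actor.

```python
import re
from pathlib import Path
from urllib.parse import urlsplit


def slug_for(url: str) -> str:
    """Derive a filesystem-safe name from the host and path of a URL."""
    parts = urlsplit(url)
    raw = f"{parts.netloc}{parts.path}".strip("/")
    slug = re.sub(r"[^A-Za-z0-9]+", "-", raw).strip("-").lower()
    return slug or "page"


def save_item(item: dict, out_dir: Path) -> Path:
    """Write one result item ({"url": ..., "content": ...}) to <slug>.md."""
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / f"{slug_for(item['url'])}.md"
    # Keep the source URL with the content so the file stays traceable.
    header = f"<!-- source: {item['url']} -->\n\n"
    path.write_text(header + item["content"], encoding="utf-8")
    return path
```

Called as `save_item(item, Path("markdown_out"))`, this produces one file per scraped page.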

Export Formats

Results are stored in Apify Datasets and can be exported as:

  • JSON — Default format with full content
  • CSV — For spreadsheet applications (truncated long fields)
  • Excel — For Microsoft Excel users
  • HTML — For viewing in browsers
  • XML — For XML-based workflows
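
The export formats listed above can also be requested over the API. The following sketch builds a download URL, assuming Apify's dataset items endpoint and its `format` query parameter; `dataset_export_url` is a hypothetical helper, the dataset ID comes from your own run, and private datasets additionally need your API token.

```python
from urllib.parse import quote, urlencode

# Formats from the list above, using the API's identifier for Excel ("xlsx").
EXPORT_FORMATS = ("json", "csv", "xlsx", "html", "xml")


def dataset_export_url(dataset_id: str, fmt: str = "json") -> str:
    """Return the items URL for a dataset in the requested export format."""
    if fmt not in EXPORT_FORMATS:
        raise ValueError(f"unsupported format {fmt!r}; choose one of {EXPORT_FORMATS}")
    query = urlencode({"format": fmt})
    return f"https://api.apify.com/v2/datasets/{quote(dataset_id, safe='')}/items?{query}"
```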

This Actor only scrapes publicly available data — content that any visitor can see without logging in. It does NOT extract:

  • ❌ Content behind paywalls or authentication
  • ❌ Copyrighted material beyond fair use

⚖️ Important: Always respect website Terms of Service and robots.txt files. This tool is designed for legitimate use cases like:

  • Creating LLM training datasets from your own content
  • Archiving public articles for research
  • Building knowledge bases from documentation

Why use Webpage to Markdown instead of copy-paste?

| Manual Copy-Paste | Webpage to Markdown |
|---|---|
| Includes ads, navigation, sidebars | ✅ Extracts only main article content |
| Messy formatting | ✅ Clean, structured Markdown |
| No metadata | ✅ Includes source URL |
| Manual work | ✅ Automated & scalable |
| Can't schedule | ✅ Schedule runs via API or Apify platform |

FAQ

What websites does this work on?

This Actor works on any publicly accessible webpage. It's designed for article pages, blog posts, documentation, and content pages. Results may vary on heavily JavaScript-rendered sites or pages with unusual HTML structures.

Can I scrape multiple pages at once?

This Actor is designed for single-page extraction. For batch processing, use Apify's API or SDK to queue multiple runs, or check out the Website Content Crawler for full-site scraping.
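
One way to queue multiple runs is to start one run per URL and collect the results. This is a rough sketch under stated assumptions: `run_one` is a hypothetical callable that you provide, which starts a single Actor run for a URL and returns its Markdown (for example via the Apify API), and `run_batch` is an illustrative name. Failures are recorded per URL so that one bad page does not abort the batch.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable


def run_batch(urls: Iterable[str], run_one: Callable[[str], str], max_workers: int = 4) -> list:
    """Run `run_one` once per unique URL and return one record per URL."""
    # Drop blanks and duplicates while preserving the original order.
    unique = list(dict.fromkeys(u.strip() for u in urls if u.strip()))
    results = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {u: pool.submit(run_one, u) for u in unique}
        for url, future in futures.items():
            try:
                results.append({"url": url, "content": future.result(), "error": None})
            except Exception as exc:  # keep going when a single page fails
                results.append({"url": url, "content": None, "error": str(exc)})
    return results
```

Deduplicating first matters here because pricing is per result, so repeated URLs would be billed more than once.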

Does this work with paywalled content?

No — this Actor only extracts content visible without authentication. It respects robots.txt and is not designed for bypassing paywalls.

Can I integrate this with my workflow?

Yes! Use Apify's API, webhooks, or integrations with Zapier, Make, n8n, and more to connect results to your apps.


Support

Need help or have a feature request?

  • 🐛 Issues: Report bugs in the Apify Console Issues tab
  • 💡 Feature requests: Contact via Apify or Discord
  • 📧 Custom solutions: Open to custom Actor development

Apify Platform Features

This Actor benefits from the full Apify platform:

  • 📅 Scheduling — Run daily, hourly, or custom schedules
  • 🔌 API access — Trigger via REST API
  • 🔗 Integrations — Connect to Zapier, Make, n8n, Google Sheets
  • 🔄 Proxy rotation — Automatic proxy handling (if needed)
  • ☁️ Cloud storage — Managed datasets with export options
  • 🔔 Monitoring & alerts — Get notified on errors or completion

Built with ❤️ for the Apify community.