Webpage to Markdown
Pricing
from $2.00 / 1,000 results
Webpage to Markdown
Get the main content of any page as Markdown. Great for LLMs and AI agent workflows.
Pricing
from $2.00 / 1,000 results
Rating
0.0
(0)
Developer
Epic Scrapers
Actor stats
0
Bookmarked
2
Total users
2
Monthly active users
2 days ago
Last modified
Categories
Share
Extract clean Markdown content from any webpage. This Apify Actor converts HTML pages into well-formatted Markdown — perfect for feeding to LLMs, creating knowledge bases, or archiving web content in a readable format.
What can Webpage to Markdown do?
- Extract main content only — Removes navigation, ads, and clutter using intelligent content detection
- Clean Markdown output — Properly formatted headings, lists, links, and code blocks
- Works on any website — No site-specific configuration needed
- Fast & lightweight — Single-page scraping optimized for quick results
- API & scheduling ready — Run manually, via API, or on a schedule
What data can you extract from webpages?
| Field | Type | Description |
|---|---|---|
url | String | The source URL of the webpage |
content | String | The extracted Markdown content from the page |
How to scrape webpages with Webpage to Markdown
- Open the Actor → Go to Webpage to Markdown on Apify
- Paste a URL → Enter any webpage URL in the URL input field
- Click Run → The Actor will fetch and parse the page
- Download results → Get your Markdown as JSON, or export to CSV/Excel
Example Input
{"url": "https://docs.apify.com/academy/scraping-basics-javascript"}
Input
The Actor accepts a simple JSON input:
| Field | Required | Type | Description |
|---|---|---|---|
url | ✅ | String | The full URL of the webpage to convert (e.g., https://example.com/article) |
Input Schema
You can also view the input schema in the Input tab of the Apify Console.
Output
Example JSON Output
{"url": "https://docs.apify.com/academy/scraping-basics-javascript","content": "# Scraping basics in JavaScript\n\nThis lesson covers the fundamentals...\n\n## What you'll learn\n\n- How to send HTTP requests\n- How to parse HTML with Cheerio\n- How to extract data from pages\n"}
Export Formats
Results are stored in Apify Datasets and can be exported as:
- JSON — Default format with full content
- CSV — For spreadsheet applications (truncated long fields)
- Excel — For Microsoft Excel users
- HTML — For viewing in browsers
- XML — For XML-based workflows
Is it legal to scrape webpages?
This Actor only scrapes publicly available data — content that any visitor can see without logging in. It does NOT extract:
- ❌ Content behind paywalls or authentication
- ❌ Copyrighted material beyond fair use
⚖️ Important: Always respect website Terms of Service and robots.txt files. This tool is designed for legitimate use cases like:
- Creating LLM training datasets from your own content
- Archiving public articles for research
- Building knowledge bases from documentation
Why use Webpage to Markdown instead of copy-paste?
| Manual Copy-Paste | Webpage to Markdown |
|---|---|
| Includes ads, navigation, sidebars | ✅ Extracts only main article content |
| Messy formatting | ✅ Clean, structured Markdown |
| No metadata | ✅ Includes source URL |
| Manual work | ✅ Automated & scalable |
| Can't schedule | ✅ Schedule runs via API or Apify platform |
FAQ
What websites does this work on?
This Actor works on any publicly accessible webpage. It's designed for article pages, blog posts, documentation, and content pages. Results may vary on heavily JavaScript-rendered sites or pages with unusual HTML structures.
Can I scrape multiple pages at once?
This Actor is designed for single-page extraction. For batch processing, use Apify's API or SDK to queue multiple runs, or check out the Website Content Crawler for full-site scraping.
Does this work with paywalled content?
No — this Actor only extracts content visible without authentication. It respects robots.txt and is not designed for bypassing paywalls.
Can I integrate this with my workflow?
Yes! Use Apify's API, webhooks, or integrations with Zapier, Make, n8n, and more to connect results to your apps.
Support
Need help or have a feature request?
- 🐛 Issues: Report bugs in the Apify Console Issues tab
- 💡 Feature requests: Contact via Apify or Discord
- 📧 Custom solutions: Open to custom Actor development
Apify Platform Features
This Actor benefits from the full Apify platform:
- 📅 Scheduling — Run daily, hourly, or custom schedules
- 🔌 API access — Trigger via REST API
- 🔗 Integrations — Connect to Zapier, Make, n8n, Google Sheets
- 🔄 Proxy rotation — Automatic proxy handling (if needed)
- ☁️ Cloud storage — Managed datasets with export options
- 🔔 Monitoring & alerts — Get notified on errors or completion
Built with ❤️ for the Apify community.