Mcp Document Converter avatar

Mcp Document Converter

Pricing

Pay per usage

Go to Apify Store
Mcp Document Converter

Mcp Document Converter

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Donny Nguyen

Donny Nguyen

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

19 hours ago

Last modified

Categories

Share

An MCP (Model Context Protocol) server for document conversion. AI agents can use this actor to convert webpages, PDFs, and HTML content to clean Markdown or plain text format, enabling seamless document processing in AI workflows.

Features

  • Convert any webpage to Markdown, plain text, or clean HTML
  • Batch processing of multiple URLs in a single run
  • Intelligent content extraction that removes navigation, ads, and sidebars
  • Configurable image and link inclusion in output
  • Word count and content length metrics for each conversion
  • MCP-compatible output format for AI agent integration

How It Works

The actor fetches the target URL, extracts the main content area, strips unnecessary elements (scripts, styles, navigation), and converts the remaining content to the specified output format. It intelligently identifies the main content area using common selectors like main, article, and .content.

Input Configuration

ParameterTypeDefaultDescription
urlstringhttps://docs.apify.com/platform/actorsSingle URL to convert
urlsarray[]Multiple URLs for batch conversion
outputFormatstringmarkdownOutput format: markdown, text, or html
includeImagesbooleantrueInclude image references in output
includeLinksbooleantrueInclude hyperlinks in output

Output Format

Each converted document produces a dataset item with:

  • url - Source URL of the document
  • title - Page title extracted from HTML
  • outputFormat - The format used for conversion
  • content - The converted document content
  • contentLength - Character count of the output
  • wordCount - Word count of the output
  • convertedAt - ISO timestamp of conversion

Use Cases

  • AI Agent Document Processing: Feed webpage content to LLMs in clean Markdown format
  • Content Migration: Convert web content for use in documentation systems
  • Research Automation: Extract and convert multiple research papers or articles
  • Knowledge Base Building: Convert web resources into structured text for RAG systems
  • Content Archival: Save clean versions of webpages without clutter

API Integration

You can call this actor via the Apify API to integrate document conversion into your workflows. Use the MCP server endpoint for direct AI agent access, or call the actor programmatically from any language.

Pricing

This actor uses pay-per-event pricing. Each document conversion counts as one event. Visit the Apify platform for current pricing details and to manage your subscription.