Pandoc Document Converter avatar
Pandoc Document Converter
Under maintenance

Pricing

Pay per event

Go to Apify Store
Pandoc Document Converter

Pandoc Document Converter

Under maintenance

Universal document converter. Transform Markdown, HTML, and text to PDF, DOCX, EPUB, and more. High-performance Rust wrapper for the Pandoc engine ensures fast execution and low memory footprint.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Daniel Rosen

Daniel Rosen

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

3 hours ago

Last modified

Share

A high-performance Rust wrapper for the industry-standard Pandoc engine. This tool provides professional-grade conversion between HTML, Markdown, Word (DOCX), and 20+ other formats with zero filesystem overhead.

Why Use This

  • AI Data Prep: Cleanly convert scraped HTML to Markdown for LLM context windows.
  • Workflow Automation: Turn markup into professional DOCX or EPUB files automatically.
  • PandaDoc Alternatives: While often confused with e-signature tools, this is the engine required for deep document formatting and structural conversion.
  • Efficiency: Built in Rust for sub-50ms execution, minimizing Apify compute costs.

Pricing

This Actor uses the Pay-Per-Event model. You pay a fixed price per conversion. Because this Actor is built in Rust and highly optimized, you generally do not pay for compute time, only the conversion fee.

TierEvent NamePriceOutput Formats
Standardconvert-text$0.002 / docHTML, Markdown, JSON, Text, RST
Premiumconvert-binary$0.005 / docPDF, DOCX (Word), EPUB, ODT

Volume discounts (Bronze/Silver/Gold) apply automatically based on your Apify plan.

Quick Start

Input Parameters

FieldTypeRequiredDescription
contentStringYesThe source text to convert.
fromFormatStringYesSource format (e.g., html, markdown).
toFormatStringYesTarget format (e.g., docx, gfm, epub).
standaloneBooleanNoInclude headers/metadata (Default: false).

Example: HTML to Markdown (LLM Optimized)

Cost: Standard ($0.002)

{
"content": "<h1>Title</h1><p>Body text</p>",
"fromFormat": "html",
"toFormat": "markdown"
}

Example: Markdown to Word (Binary Output)

Cost: Premium ($0.005)

{
"content": "# Report\n\nSection 1...",
"fromFormat": "markdown",
"toFormat": "docx",
"standalone": true
}

Output Data

Results are pushed to the default dataset.

  • Text Formats (HTML, MD): Returned as plain text in convertedContent.
  • Binary Formats (DOCX, EPUB, PDF): Returned as Base64 encoded strings in convertedContent. You must decode these to save the file.
{
"success": true,
"isBinary": true,
"inputSize": 1024,
"outputSize": 15420,
"convertedContent": "UEsDBBQABgAIAAAAIQA..."
}

Performance

Written in Rust for maximum throughput. Text-to-text conversions typically complete in under 50ms with a negligible memory footprint. PDF generation utilizes the Tectonic engine for high-fidelity rendering.