Universal Document Format Transformer

Pricing: from $5.00 / 1,000 results

Universal Document Format Transformer is a cloud-based Apify Actor that uses Pandoc to convert documents (DOCX, PPTX, HTML, TXT) into Markdown, JSON, CSV, HTML, TXT, or PDF. It provides a REST API for automations (n8n, Zapier, Make), production-ready error handling, and security controls.

Developer: fanio zilla (maintained by Community)

Convert documents between formats instantly without installing any software. Just provide a URL and get your converted file in seconds.

🚀 Get Started in 30 Seconds

  1. Go to the Actor: Universal Document Format Transformer
  2. Click "Run Actor"
  3. Enter your file details:
    {
      "fileUrl": "https://example.com/your-document.docx",
      "fromFormat": "docx",
      "toFormat": "markdown"
    }
  4. Get your converted file - Download link appears in results

That's it! No software to install, no API keys needed for basic use.

✨ What You Can Do

  • 🔄 Convert DOCX to Markdown - Perfect for GitHub documentation
  • 📊 Extract tables from HTML to CSV - Great for data analysis
  • 📄 Transform PPTX to PDF - Ideal for sharing presentations
  • 📝 Convert TXT to HTML - Useful for web publishing
  • 📋 Process multiple formats - DOCX, PPTX, HTML, TXT → Markdown, JSON, CSV, HTML, TXT, PDF

📋 Supported Formats

What You Can Convert From (Input)

Format | Best For | Example Files
DOCX | Reports, articles, documentation | *.docx
PPTX | Presentations, slides, training | *.pptx
HTML | Web pages, online articles | *.html, *.htm
TXT | Plain text, simple data | *.txt

What You Can Convert To (Output)

Format | Perfect For | When to Use
Markdown | GitHub docs, technical writing | Converting Word docs for code repositories
JSON | Data processing, APIs | Extracting structured content
CSV | Spreadsheets, data analysis | Pulling tables from web pages
HTML | Web publishing, email | Converting docs for websites
TXT | Simple text, logging | Extracting plain text from any format
PDF | Sharing, printing | Final document distribution

⚠️ Important: PDF Limitations

  • ❌ Cannot convert FROM PDF - Pandoc, which performs the conversions, has no PDF reader
  • ✅ Can convert TO PDF - Perfect for final output
  • 💡 Workaround: Convert the PDF to HTML with another tool first, then run this Actor on the HTML file

📝 Content Teams & Bloggers

Convert Word documents to Markdown for GitHub

{
  "fileUrl": "https://example.com/blog-post.docx",
  "fromFormat": "docx",
  "toFormat": "markdown"
}

Perfect for:

  • Technical documentation
  • GitHub README files
  • Markdown-based blogs
  • Documentation sites

📊 Data Analysts & Researchers

Extract tables from web pages to CSV

{
  "fileUrl": "https://example.com/financial-report.html",
  "fromFormat": "html",
  "toFormat": "csv"
}

Perfect for:

  • Financial data extraction
  • Research data processing
  • Spreadsheet analysis
  • Data import to Excel

🎯 Product Managers & Business Users

Convert presentations to PDF for sharing

{
  "fileUrl": "https://example.com/presentation.pptx",
  "fromFormat": "pptx",
  "toFormat": "pdf"
}

Perfect for:

  • Client presentations
  • Training materials
  • Meeting handouts
  • Document distribution

🤖 Automation Builders

Process text files to structured data

{
  "fileUrl": "https://example.com/data.txt",
  "fromFormat": "txt",
  "toFormat": "json"
}

Perfect for:

  • n8n workflows
  • Zapier automations
  • Make.com integrations
  • API data processing

📝 How to Use

Step 1: Prepare Your File URL

Your file must be:

  • ✅ Publicly accessible (no login required)
  • ✅ A direct link to the file itself (not a viewer or landing page that wraps it)
  • ✅ Under 50MB in size
  • ✅ Served over HTTP or HTTPS

Good URLs:

https://example.com/report.docx
https://cdn.example.com/files/presentation.pptx
https://storage.googleapis.com/bucket/document.html

Bad URLs:

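https://drive.google.com/file/d/123/view (viewer page, not a direct download; may require login)
https://example.com/page.html (landing page that links to the file; only valid if the HTML page itself is what you want to convert)
ftp://example.com/file.docx (wrong protocol)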
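
The checks above can be run locally before starting a conversion. The following is a minimal sketch, not part of the Actor: `check_file_url` is a hypothetical helper, and the Google Drive rule is only a heuristic based on the bad URL example above. It cannot tell whether a file is publicly accessible, only whether the URL looks plausible.

```python
from urllib.parse import urlparse


def check_file_url(url: str) -> list[str]:
    """Return a list of problems found; an empty list means the URL looks usable."""
    problems = []
    parts = urlparse(url)
    # Only HTTP and HTTPS are accepted according to the requirements above.
    if parts.scheme not in ("http", "https"):
        problems.append("URL must start with http:// or https://")
    if not parts.netloc:
        problems.append("URL has no host name")
    # Heuristic (assumption): Google Drive /view links open a viewer page.
    if parts.netloc == "drive.google.com" and parts.path.endswith("/view"):
        problems.append("Google Drive viewer link, not a direct download")
    return problems


if __name__ == "__main__":
    for candidate in (
        "https://example.com/report.docx",
        "https://drive.google.com/file/d/123/view",
        "ftp://example.com/file.docx",
    ):
        print(candidate, check_file_url(candidate) or "looks OK")
```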

Step 2: Choose Your Formats

Check format compatibility:

From \ To | Markdown | JSON | CSV | HTML | TXT | PDF
DOCX | ✅ | ✅ | ✅ | ✅ | ✅ | ✅
PPTX | ✅ | ✅ | ⚠️ | ✅ | ✅ | ✅
HTML | ✅ | ⚠️ | ⚠️ | ✅ | ✅ | ✅
TXT | ✅ | ⚠️ | ⚠️ | ✅ | ✅ | ✅

Legend:

  • ✅ Excellent - High quality conversion
  • ⚠️ Limited - Works but may lose some formatting
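
For automations, the compatibility matrix can be encoded as a small lookup. This is a sketch under assumptions: `conversion_quality` is a hypothetical helper, and the lowercase identifiers `"html"` and `"txt"` as output values are assumed by analogy with the `"markdown"`, `"json"`, `"csv"` and `"pdf"` values used in the examples in this guide.

```python
EXCELLENT, LIMITED = "excellent", "limited"

INPUT_FORMATS = {"docx", "pptx", "html", "txt"}
OUTPUT_FORMATS = {"markdown", "json", "csv", "html", "txt", "pdf"}

# Pairs marked "Limited" in the matrix above; every other listed pair is "Excellent".
LIMITED_PAIRS = {
    ("pptx", "csv"),
    ("html", "json"), ("html", "csv"),
    ("txt", "json"), ("txt", "csv"),
}


def conversion_quality(from_format: str, to_format: str):
    """Return "excellent", "limited", or None when the pair is not in the matrix."""
    src, dst = from_format.lower(), to_format.lower()
    if src not in INPUT_FORMATS or dst not in OUTPUT_FORMATS:
        return None
    return LIMITED if (src, dst) in LIMITED_PAIRS else EXCELLENT


if __name__ == "__main__":
    print(conversion_quality("docx", "markdown"))
    print(conversion_quality("html", "csv"))
    print(conversion_quality("pdf", "html"))
```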

Step 3: Run the Conversion

Option A: Use Apify Web Interface (Easiest)

  1. Go to Actor Page
  2. Click "Run Actor"
  3. Enter your JSON input
  4. Click "Run"
  5. Download your converted file

Option B: Use API (For Automation)

curl -X POST "https://api.apify.com/v2/acts/WgRQY2Ta2VKQE5NgO/runs?token=YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "fileUrl": "https://example.com/document.docx",
    "fromFormat": "docx",
    "toFormat": "markdown"
  }'
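
The same request can be made from Python using only the standard library. This is a minimal sketch that mirrors the curl call above: `build_run_request` and `start_run` are hypothetical helper names, and `YOUR_TOKEN` is a placeholder for your own Apify API token. Building the request is kept separate from sending it so the payload can be inspected without touching the network.

```python
import json
import urllib.request

ACTOR_ID = "WgRQY2Ta2VKQE5NgO"  # Actor ID taken from the curl example above


def build_run_request(token, file_url, from_format, to_format):
    """Build (but do not send) the POST request that starts an Actor run."""
    url = f"https://api.apify.com/v2/acts/{ACTOR_ID}/runs?token={token}"
    body = json.dumps(
        {"fileUrl": file_url, "fromFormat": from_format, "toFormat": to_format}
    ).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def start_run(request):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


if __name__ == "__main__":
    req = build_run_request(
        "YOUR_TOKEN", "https://example.com/document.docx", "docx", "markdown"
    )
    print(req.get_method(), req.full_url)
    # start_run(req) would perform the real call; it needs a valid token.
```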

📤 What You Get Back

After conversion, you'll receive:

{
  "downloadUrl": "https://api.apify.com/v2/key-value-stores/...",
  "inputFormat": "docx",
  "outputFormat": "markdown",
  "fileSize": 12345,
  "processingTime": 2.5,
  "status": "success"
}

What each field means:

  • downloadUrl: Link to download your converted file (works for 7 days)
  • inputFormat: The format we detected from your file
  • outputFormat: The format you requested
  • fileSize: Size of your converted file in bytes
  • processingTime: How long the conversion took
  • status: "success" or "error"
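
A script consuming this record would typically check `status` before using `downloadUrl`. The sketch below assumes the record has exactly the fields listed above; `summarize_result` is a hypothetical helper, and the shape of an error record (beyond `status` being `"error"`) is not documented here, so the failure branch simply reports the whole record.

```python
def summarize_result(item: dict) -> str:
    """Return a one-line summary of a successful conversion, or raise on failure."""
    if item.get("status") != "success":
        raise RuntimeError(f"Conversion failed: {item}")
    size_kb = item["fileSize"] / 1024  # fileSize is reported in bytes
    return (
        f'{item["inputFormat"]} -> {item["outputFormat"]}: '
        f'{size_kb:.1f} KB at {item["downloadUrl"]}'
    )


if __name__ == "__main__":
    example = {
        "downloadUrl": "https://api.apify.com/v2/key-value-stores/...",
        "inputFormat": "docx",
        "outputFormat": "markdown",
        "fileSize": 12345,
        "processingTime": 2.5,
        "status": "success",
    }
    print(summarize_result(example))
```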

🚨 Common Problems & Solutions

❌ "Invalid URL format"

Problem: Your URL doesn't work
Solution:

  • Check URL starts with http:// or https://
  • Test the URL in your browser first
  • Make sure it's a direct file link, not a web page

❌ "File not found"

Problem: The file doesn't exist or was moved
Solution:

  • Verify the URL is correct
  • Check if the file was deleted or moved
  • Try uploading the file again

❌ "Access denied"

Problem: The file requires a login or permission
Solution:

  • Use a publicly accessible file
  • Upload to public cloud storage (Google Drive, Dropbox, etc.)
  • Make sure sharing permissions allow public access

❌ "Unsupported input format"

Problem: You tried to convert from PDF
Solution:

  • PDF cannot be used as input (Pandoc has no PDF reader)
  • Convert the PDF to HTML first using another tool
  • Then use this Actor to convert the HTML to your desired format

⏰ "Conversion timed out"

Problem: The file is too large or complex
Solution:

  • Keep files under 50MB
  • Try a simpler output format
  • Split large documents into smaller parts
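
Oversized files can often be caught before a run is started by looking at the `Content-Length` header of the source URL. This is a sketch under assumptions: the 50MB limit is interpreted as 50 × 1024 × 1024 bytes (the exact threshold is not stated), the helper names are hypothetical, and some servers do not answer `HEAD` requests or omit the header, in which case the size stays unknown.

```python
import urllib.request

MAX_BYTES = 50 * 1024 * 1024  # assumption: "50MB" means 50 MiB


def is_within_limit(content_length, limit=MAX_BYTES):
    """Interpret a raw Content-Length header value; None means size unknown."""
    if content_length is None:
        return None
    return int(content_length) <= limit


def remote_size_ok(file_url):
    """Send a HEAD request and check the reported size against the limit."""
    request = urllib.request.Request(file_url, method="HEAD")
    with urllib.request.urlopen(request, timeout=15) as response:
        return is_within_limit(response.headers.get("Content-Length"))


if __name__ == "__main__":
    print(is_within_limit("12345"))
    print(is_within_limit(str(60 * 1024 * 1024)))
    print(is_within_limit(None))
```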

💡 Pro Tips

🎯 For Best Results

  1. Test with small files first - Make sure everything works
  2. Choose the right format combination - Check the compatibility matrix
  3. Use direct file URLs - Avoid web pages that require login
  4. Check file size - Keep under 50MB for reliable processing

🔗 Getting File URLs

Google Drive:

  1. Right-click file → "Share"
  2. Set to "Anyone with the link can view"
  3. Copy the link and rewrite https://drive.google.com/file/d/FILE_ID/view?usp=sharing as https://drive.google.com/uc?export=download&id=FILE_ID

Dropbox:

  1. Right-click file → "Share"
  2. Create a link with "Can view" permissions (edit access is not needed)
  3. Copy the link and change dl=0 at the end to dl=1 to get a direct download

OneDrive:

  1. Right-click file → "Share"
  2. Set to "Anyone with the link can view"
  3. Copy the link and ensure it's a direct download URL

S3/Cloud Storage:

  1. Set the bucket/object to public read
  2. Or keep it private and generate a pre-signed URL instead
  3. Ensure URL points directly to the file
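
The link rewriting for Google Drive and Dropbox can be automated. This is a sketch under assumptions: both helpers are hypothetical, the Google Drive target uses the commonly seen `uc?export=download&id=FILE_ID` form, and the Dropbox rule (setting the `dl` query parameter to `1`) is general Dropbox behaviour rather than something this Actor documents. Neither helper verifies that the file is actually shared publicly.

```python
import re
from urllib.parse import parse_qsl, urlencode, urlparse, urlunparse


def drive_direct_link(share_url: str) -> str:
    """Turn a Google Drive /file/d/<id>/view share link into a download link."""
    match = re.search(r"/file/d/([^/]+)", share_url)
    if not match:
        raise ValueError("not a Google Drive file link")
    return f"https://drive.google.com/uc?export=download&id={match.group(1)}"


def dropbox_direct_link(share_url: str) -> str:
    """Force the dl query parameter of a Dropbox share link to 1."""
    parts = urlparse(share_url)
    query = dict(parse_qsl(parts.query))
    query["dl"] = "1"
    return urlunparse(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    print(drive_direct_link("https://drive.google.com/file/d/123/view?usp=sharing"))
    print(dropbox_direct_link("https://www.dropbox.com/s/abc/report.docx?dl=0"))
```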

⚡ Speed Tips

  • TXT files convert fastest - Use when possible
  • DOCX to Markdown is very reliable - Great for documentation
  • HTML to TXT preserves text well - Good for content extraction
  • Simple conversions work best - Avoid complex format chains

🔧 Advanced Configuration

For Power Users

If you're using this in automation, you can adjust these settings:

Setting | Default | What it Does
File Size Limit | 50MB | Maximum input file size
Timeout | 60 seconds | Maximum conversion time
Retry Attempts | 3 | How many times to retry failed downloads

API Usage

For high-volume usage, consider:

  • Batch processing - Process multiple files sequentially
  • Error handling - Check status before processing next file
  • Download timing - Files are available for 7 days only
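
Sequential batch processing with a status check per file might look like the sketch below. It is deliberately independent of how a single conversion is performed: `convert` is any callable you supply (for example, one that calls the API and waits for the result record), and `convert_batch` is a hypothetical helper, not part of the Actor.

```python
def convert_batch(jobs, convert):
    """Run conversions one at a time and separate successes from failures.

    jobs:    iterable of input dicts (fileUrl, fromFormat, toFormat)
    convert: callable taking one job dict and returning a result dict
    """
    succeeded, failed = [], []
    for job in jobs:
        try:
            result = convert(job)
        except Exception as exc:  # a failed job must not stop the batch
            failed.append((job, str(exc)))
            continue
        # Check status before moving on to the next file.
        if result.get("status") == "success":
            succeeded.append(result)
        else:
            failed.append((job, result.get("status", "unknown")))
    return succeeded, failed


if __name__ == "__main__":
    def fake_convert(job):
        # Stand-in for a real API call, used only to demonstrate the flow.
        if job["fromFormat"] == "pdf":
            return {"status": "error"}
        return {"status": "success", "downloadUrl": "https://example.com/out"}

    ok, bad = convert_batch(
        [
            {"fileUrl": "https://example.com/a.docx", "fromFormat": "docx", "toFormat": "markdown"},
            {"fileUrl": "https://example.com/b.pdf", "fromFormat": "pdf", "toFormat": "html"},
        ],
        fake_convert,
    )
    print(len(ok), "succeeded;", len(bad), "failed")
```

Since download links expire after 7 days, a real batch job should fetch each `downloadUrl` soon after the conversion finishes rather than storing the links for later.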

🆘 Need Help?

Quick Troubleshooting

  1. File not working? - Try the URL in your browser first
  2. Conversion failed? - Check if format combination is supported
  3. Taking too long? - File might be too large or complex
  4. Wrong output? - Verify your fromFormat matches the actual file type
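
A mismatch between `fromFormat` and the real file type can be reduced by deriving the value from the URL. This is a sketch: `guess_from_format` is a hypothetical helper, the extension mapping follows the input format table in this guide, and a file extension is only a hint about the actual content.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions from the "What You Can Convert From" table.
EXTENSION_TO_FORMAT = {
    ".docx": "docx",
    ".pptx": "pptx",
    ".html": "html",
    ".htm": "html",
    ".txt": "txt",
}


def guess_from_format(file_url: str):
    """Return the fromFormat implied by the URL's extension, or None if unsupported."""
    suffix = PurePosixPath(urlparse(file_url).path).suffix.lower()
    return EXTENSION_TO_FORMAT.get(suffix)


if __name__ == "__main__":
    print(guess_from_format("https://example.com/report.DOCX"))
    print(guess_from_format("https://example.com/page.htm?x=1"))
    print(guess_from_format("https://example.com/file.pdf"))
```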

🎉 Ready to Start?

▢️ Run the Actor Now

No registration required for basic use. Free tier includes processing credits.