Universal Document Format Transformer
Pricing
from $5.00 / 1,000 results
Universal Document Format Transformer
Universal Document Format Transformer: a cloud-based Apify Actor that converts documents (PDF, DOCX, PPTX, HTML, TXT) into Markdown, JSON, CSV, HTML or TXT using Pandoc. Easy REST API for automations (n8n, Zapier, Make), production-ready error handling, and security controls.
Pricing
from $5.00 / 1,000 results
Rating
0.0
(0)
Developer

fanio zilla
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
24 days ago
Last modified
Categories
Share
Convert documents between formats instantly without installing any software. Just provide a URL and get your converted file in seconds.
π Get Started in 30 Seconds
- Go to the Actor: Universal Document Format Transformer
- Click "Run Actor"
- Enter your file details:
{"fileUrl": "https://example.com/your-document.docx","fromFormat": "docx","toFormat": "markdown"}
- Get your converted file - Download link appears in results
That's it! No software to install, no API keys needed for basic use.
β¨ What You Can Do
- π Convert DOCX to Markdown - Perfect for GitHub documentation
- π Extract tables from HTML to CSV - Great for data analysis
- π Transform PPTX to PDF - Ideal for sharing presentations
- π Convert TXT to HTML - Useful for web publishing
- π Process multiple formats - DOCX, PPTX, HTML, TXT β Markdown, JSON, CSV, HTML, TXT, PDF
π Supported Formats
What You Can Convert From (Input)
| Format | Best For | Example Files |
|---|---|---|
| DOCX | Reports, articles, documentation | *.docx |
| PPTX | Presentations, slides, training | *.pptx |
| HTML | Web pages, online articles | *.html, *.htm |
| TXT | Plain text, simple data | *.txt |
What You Can Convert To (Output)
| Format | Perfect For | When to Use |
|---|---|---|
| Markdown | GitHub docs, technical writing | Converting Word docs for code repositories |
| JSON | Data processing, APIs | Extracting structured content |
| CSV | Spreadsheets, data analysis | Pulling tables from web pages |
| HTML | Web publishing, email | Converting docs for websites |
| TXT | Simple text, logging | Extracting plain text from any format |
| Sharing, printing | Final document distribution |
β οΈ Important: PDF Limitations
- β Cannot convert FROM PDF - This is a technical limitation
- β Can convert TO PDF - Perfect for final output
- π‘ Workaround: Convert PDF to HTML online first, then use this actor
π‘ Popular Use Cases
π Content Teams & Bloggers
Convert Word documents to Markdown for GitHub
{"fileUrl": "https://example.com/blog-post.docx","fromFormat": "docx","toFormat": "markdown"}
Perfect for:
- Technical documentation
- GitHub README files
- Markdown-based blogs
- Documentation sites
π Data Analysts & Researchers
Extract tables from web pages to CSV
{"fileUrl": "https://example.com/financial-report.html","fromFormat": "html","toFormat": "csv"}
Perfect for:
- Financial data extraction
- Research data processing
- Spreadsheet analysis
- Data import to Excel
π― Product Managers & Business Users
Convert presentations to PDF for sharing
{"fileUrl": "https://example.com/presentation.pptx","fromFormat": "pptx","toFormat": "pdf"}
Perfect for:
- Client presentations
- Training materials
- Meeting handouts
- Document distribution
π€ Automation Builders
Process text files to structured data
{"fileUrl": "https://example.com/data.txt","fromFormat": "txt","toFormat": "json"}
Perfect for:
- n8n workflows
- Zapier automations
- Make.com integrations
- API data processing
π How to Use
Step 1: Prepare Your File URL
Your file must be:
- β Publicly accessible (no login required)
- β Direct link to file (not a web page)
- β Under 50MB in size
- β HTTP or HTTPS protocol
Good URLs:
https://example.com/report.docxhttps://cdn.example.com/files/presentation.pptxhttps://storage.googleapis.com/bucket/document.html
Bad URLs:
https://drive.google.com/file/d/123/view (requires login)https://example.com/page.html (web page, not file)ftp://example.com/file.docx (wrong protocol)
Step 2: Choose Your Formats
Check format compatibility:
| From \ To | Markdown | JSON | CSV | HTML | TXT | |
|---|---|---|---|---|---|---|
| DOCX | β | β | β | β | β | β |
| PPTX | β | β | β οΈ | β | β | β |
| HTML | β | β οΈ | β οΈ | β | β | β |
| TXT | β | β οΈ | β οΈ | β | β | β |
Legend:
- β Excellent - High quality conversion
- β οΈ Limited - Works but may lose some formatting
Step 3: Run the Conversion
Option A: Use Apify Web Interface (Easiest)
- Go to Actor Page
- Click "Run Actor"
- Enter your JSON input
- Click "Run"
- Download your converted file
Option B: Use API (For Automation)
curl -X POST "https://api.apify.com/v2/acts/WgRQY2Ta2VKQE5NgO/runs?token=YOUR_TOKEN" \-H "Content-Type: application/json" \-d '{"fileUrl": "https://example.com/document.docx","fromFormat": "docx","toFormat": "markdown"}'
π€ What You Get Back
After conversion, you'll receive:
{"downloadUrl": "https://api.apify.com/v2/key-value-stores/...","inputFormat": "docx","outputFormat": "markdown","fileSize": 12345,"processingTime": 2.5,"status": "success"}
What each field means:
- downloadUrl: Link to download your converted file (works for 7 days)
- inputFormat: The format we detected from your file
- outputFormat: The format you requested
- fileSize: Size of your converted file in bytes
- processingTime: How long the conversion took
- status: "success" or "error"
π¨ Common Problems & Solutions
β "Invalid URL format"
Problem: Your URL doesn't work Solution:
- Check URL starts with
http://orhttps:// - Test the URL in your browser first
- Make sure it's a direct file link, not a web page
β "File not found"
Problem: The file doesn't exist or moved Solution:
- Verify the URL is correct
- Check if the file was deleted or moved
- Try uploading the file again
β "Access denied"
Problem: File requires login or permission Solution:
- Use a publicly accessible file
- Upload to public cloud storage (Google Drive, Dropbox, etc.)
- Make sure sharing permissions allow public access
β "Unsupported input format"
Problem: You tried to convert from PDF Solution:
- PDF cannot be used as input (technical limitation)
- Convert PDF to HTML first using online tools
- Then use this actor to convert HTML to your desired format
β° "Conversion timed out"
Problem: File is too large or complex Solution:
- Keep files under 50MB
- Try a simpler output format
- Split large documents into smaller parts
π‘ Pro Tips
π― For Best Results
- Test with small files first - Make sure everything works
- Choose the right format combination - Check the compatibility matrix
- Use direct file URLs - Avoid web pages that require login
- Check file size - Keep under 50MB for reliable processing
π Getting File URLs
Google Drive:
- Right-click file β "Share"
- Set to "Anyone with the link can view"
- Copy link and change
.../view?usp=sharingto.../uc?export=download
Dropbox:
- Right-click file β "Share"
- Create link with "Can edit" permissions
- Copy the direct download link
OneDrive:
- Right-click file β "Share"
- Set to "Anyone with the link can view"
- Copy the link and ensure it's a direct download URL
S3/Cloud Storage:
- Set bucket/object to public read
- Generate pre-signed URL if needed
- Ensure URL points directly to the file
β‘ Speed Tips
- TXT files convert fastest - Use when possible
- DOCX to Markdown is very reliable - Great for documentation
- HTML to TXT preserves text well - Good for content extraction
- Simple conversions work best - Avoid complex format chains
π§ Advanced Configuration
For Power Users
If you're using this in automation, you can adjust these settings:
| Setting | Default | What it Does |
|---|---|---|
| File Size Limit | 50MB | Maximum input file size |
| Timeout | 60 seconds | Maximum conversion time |
| Retry Attempts | 3 | How many times to retry failed downloads |
API Usage
For high-volume usage, consider:
- Batch processing - Process multiple files sequentially
- Error handling - Check status before processing next file
- Download timing - Files are available for 7 days only
π Need Help?
Quick Troubleshooting
- File not working? - Try the URL in your browser first
- Conversion failed? - Check if format combination is supported
- Taking too long? - File might be too large or complex
- Wrong output? - Verify your
fromFormatmatches the actual file type
Get Support
- π Actor Page: View on Apify
- π Report Issues: GitHub Issues
- π¬ Community: Apify Forum
- π§ Direct Help: Contact through Apify platform
π Ready to Start?
No registration required for basic use. Free tier includes processing credits.