Pricing

from $20.00 / 1,000 pages

Image to Markdown

Image to Markdown converts images and scanned PDFs into structured Markdown using AI-powered document understanding. It recognizes text, tables, mathematical formulas (LaTeX), and figures while preserving the correct reading order and document layout.

Pricing

from $20.00 / 1,000 pages

Rating

0.0

(0)

Developer

AbotAPI

Actor stats

Bookmarked

Total users

Monthly active users

7 days ago

Last modified

What does Image to Markdown do?

Image to Markdown converts images and scanned PDFs into structured Markdown using AI-powered document understanding. It recognizes text, tables, mathematical formulas (LaTeX), and figures while preserving the correct reading order and document layout.

Key features

Layout analysis — Understands document structure (titles, paragraphs, figures, captions)
Table extraction — Recognizes table structures and outputs them as Markdown tables
Formula recognition — Detects mathematical formulas and converts them to LaTeX
PDF support — Process multi-page PDF documents
Multiple input methods — Upload files directly or provide URLs

Use cases

Academic paper digitization — Convert scanned research papers with equations into editable Markdown/LaTeX
Technical document processing — Parse engineering specs, datasheets, and manuals preserving tables and formulas
Invoice and receipt parsing — Extract structured data from scanned financial documents
Book digitization — Convert scanned book pages into searchable, editable text
Data pipeline integration — Use the Apify API to automate document parsing in your workflows
RAG / LLM preparation — Convert documents to Markdown for use as context in AI applications

How to use

Upload files or provide URLs to images (PNG, JPEG) or PDF documents
Run the Actor and get structured Markdown output

Supported file formats

Format	Extensions
PDF	`.pdf`
PNG	`.png`
JPEG	`.jpg`, `.jpeg`

Input

The Actor accepts uploaded files or URLs pointing to images or PDFs. All other parameters are optional with sensible defaults.

Upload Files — Drag and drop image or PDF files directly
URLs — List of URLs pointing to documents to parse
Include Metadata — Include processing metadata in output (default: true)
Output Format — Choose between Dataset (JSON), Key-Value Store (MD files), or both (default: both)

Output

The Actor outputs results to the Dataset (JSON) and/or Key-Value Store (Markdown files), depending on the outputFormat setting.

Dataset output example

{
  "source": "https://example.com/document.png",
  "success": true,
  "markdown": "## Introduction\n\nThe equation $E = mc^2$ describes...\n\n| Column A | Column B |\n|----------|----------|\n| Value 1  | Value 2  |",
  "format": "png",
  "sizeBytes": 245000,
  "processingTimeMs": 3500,
  "pagesProcessed": 1,
  "metadata": {
    "parsed_at": "2026-03-07T10:00:00.000000",
    "processing_time_ms": 3500,
    "file_size_bytes": 245000
  }
}

Key-Value Store output

When using keyValueStore or both output format, each successfully parsed document is saved as a .md file in the Key-Value Store, ready to download or use in downstream workflows.

PDF to Markdown Converter - AI-Powered with OCR & Tables

clearpath/pdf-to-markdown-api

Convert PDFs to clean Markdown with GPU-accelerated AI. Extracts tables, LaTeX formulas, and images from complex layouts. Supports OCR for scanned docs in 8 languages. Batch process hundreds of PDFs in parallel via URL, upload, or API.

ClearPath

Markdown API

vivid_astronaut/markdown

Fabio Suizu

Webpage to Markdown

extremescrapes/webpage-to-markdown

This actor cost-effectively converts websites into structured markdown optimized for AI processing. It extracts webpage content, formats it into clean markdown, and ensures compatibility with AI models.

Extreme Scrapes

174

5.0

Doc To Markdown MCP Server

abotapi/doc-to-markdown-mcp

An MCP server that converts documents to clean Markdown. Convert PDFs, Word docs, Excel spreadsheets, PowerPoints, HTML, images, and more to AI-friendly Markdown format.

AbotAPI

File to Markdown

shahidirfan/file-to-markdown

Transform files into clean, readable Markdown instantly. Convert PDFs, documents, images, and more to structured Markdown format. Perfect for automating documentation workflows, content migration, and building knowledge bases. Ideal for developers, writers, and content teams.

Shahid Irfan

5.0

Html To Markdown Converter 📄

powerful_bachelor/html-to-markdown-converter

📄✨ HTML to Markdown Converter transforms web pages into clean, portable Markdown. Simply input a URL to extract content while preserving structure, formatting, and media elements.🔄 Perfect for content repurposing, documentation, and creating readable, platform-independent text from any webpage! 🚀

Powerful Bachelor

Convert To Markdown

datavault/convert-to-markdown

Convert to Markdown, converts documents, spreadsheets, images (OCR), audio (transcription), and web/data files into clean Markdown. It runs fully locally, requires no API keys, and is ideal for LLMs, docs, and archiving.

Datavault

Ai Ready Web Page To Markdown Converter

mustafa.irshaid.113/ai-ready-web-page-to-markdown-converter

Convert any webpage into structured Markdown and HTML using just a URL. Get the page title, link, and content—perfect for SEO, devs, and AI crawlers. Fast, clean, and ideal for repurposing or analysis. Start turning websites into Markdown instantly.

Mustafa Irshaid

Elite Document Ocr Lite

thepattyroller/elite-document-ocr-lite

Basic document text extraction and processing. Extract text from documents, analyze document structure, and extract structured data from invoices and receipts. Perfect for document automation workflows.

Logan Kiser

🔥 FireScrape AI Website Content Markdown Scraper

mohamedgb00714/fireScraper-AI-Website-Content-Markdown-Scraper

Advanced web scraper powered by Crawlee and Puppeteer — extracts website content, converts it to Markdown, and structures it for LLM training datasets.