Image to Markdown

Pricing

from $20.00 / 1,000 pages

Rating

0.0 (0)

Developer

AbotAPI (maintained by Community)

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 7 days ago

What does Image to Markdown do?

Image to Markdown converts images and scanned PDFs into structured Markdown using AI-powered document understanding. It recognizes text, tables, mathematical formulas (LaTeX), and figures while preserving the correct reading order and document layout.

Key features

  • Layout analysis — Understands document structure (titles, paragraphs, figures, captions)
  • Table extraction — Recognizes table structures and outputs them as Markdown tables
  • Formula recognition — Detects mathematical formulas and converts them to LaTeX
  • PDF support — Process multi-page PDF documents
  • Multiple input methods — Upload files directly or provide URLs

Use cases

  • Academic paper digitization — Convert scanned research papers with equations into editable Markdown/LaTeX
  • Technical document processing — Parse engineering specs, datasheets, and manuals preserving tables and formulas
  • Invoice and receipt parsing — Extract structured data from scanned financial documents
  • Book digitization — Convert scanned book pages into searchable, editable text
  • Data pipeline integration — Use the Apify API to automate document parsing in your workflows
  • RAG / LLM preparation — Convert documents to Markdown for use as context in AI applications
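
For the data pipeline use case, a run can be started over HTTP. The sketch below only builds the request with the Python standard library and does not send it. It assumes the Apify API v2 run-sync-get-dataset-items endpoint with Bearer-token authentication; the Actor ID, the token, and the urls input key are placeholders, not values confirmed by this README.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_request(actor_id: str, token: str, run_input: dict) -> urllib.request.Request:
    """Build a POST request that runs an Actor and returns its dataset items."""
    url = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


# Placeholder Actor ID and token; the "urls" key is an assumed input field name.
request = build_run_request(
    "username~actor-name",
    "my-token",
    {"urls": ["https://example.com/document.png"]},
)

# Sending the request starts a real, billed run, so it is left commented out:
# with urllib.request.urlopen(request) as response:
#     items = json.load(response)
```

For long multi-page PDFs, the asynchronous run endpoints may be a better fit than a synchronous call.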

How to use

  1. Upload files or provide URLs to images (PNG, JPEG) or PDF documents
  2. Run the Actor and get structured Markdown output

Supported file formats

| Format | Extensions |
|--------|------------|
| PDF | .pdf |
| PNG | .png |
| JPEG | .jpg, .jpeg |
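
A pipeline may want to filter out unsupported files before starting a run. The following is a minimal sketch that checks only the file extension, using the extensions listed above; is_supported is a hypothetical helper, and whether the Actor itself relies on the extension or inspects file content is not stated here.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions taken from the supported file formats listed above.
SUPPORTED_EXTENSIONS = {".pdf", ".png", ".jpg", ".jpeg"}


def is_supported(source: str) -> bool:
    """Return True if a file name or URL ends in a supported extension."""
    # urlparse keeps a plain file name in .path and drops any query string
    # or fragment, so the same check works for both kinds of input.
    path = urlparse(source).path
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS


candidates = [
    "https://example.com/scan.PDF?download=1",
    "page-01.jpeg",
    "https://example.com/diagram.tiff",
]
accepted = [c for c in candidates if is_supported(c)]
```

Here accepted keeps the first two entries and drops the .tiff URL.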

Input

The Actor accepts uploaded files or URLs pointing to images or PDFs. All other parameters are optional with sensible defaults.

  • Upload Files — Drag and drop image or PDF files directly
  • URLs — List of URLs pointing to documents to parse
  • Include Metadata — Include processing metadata in output (default: true)
  • Output Format — Choose between Dataset (JSON), Key-Value Store (MD files), or both (default: both)
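
As an illustration, the options above could be assembled into a run input like this. Only outputFormat and its values keyValueStore and both are named in this README; the keys urls and includeMetadata and the value dataset are assumptions, so check the Actor's input schema before relying on them.

```python
from __future__ import annotations

from typing import Any

# "keyValueStore" and "both" appear in this README; "dataset" is an
# assumed name for the JSON-only option.
OUTPUT_FORMATS = {"dataset", "keyValueStore", "both"}


def build_input(
    urls: list[str],
    include_metadata: bool = True,
    output_format: str = "both",
) -> dict[str, Any]:
    """Build a run input dict; "urls" and "includeMetadata" are assumed keys."""
    if not urls:
        raise ValueError("at least one URL is required")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"unknown output format: {output_format!r}")
    return {
        "urls": list(urls),
        "includeMetadata": include_metadata,  # documented default: true
        "outputFormat": output_format,        # documented default: both
    }


run_input = build_input(["https://example.com/document.png"])
```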

Output

The Actor writes results to the Dataset (JSON), the Key-Value Store (Markdown files), or both, depending on the outputFormat setting.

Dataset output example

{
  "source": "https://example.com/document.png",
  "success": true,
  "markdown": "## Introduction\n\nThe equation $E = mc^2$ describes...\n\n| Column A | Column B |\n|----------|----------|\n| Value 1 | Value 2 |",
  "format": "png",
  "sizeBytes": 245000,
  "processingTimeMs": 3500,
  "pagesProcessed": 1,
  "metadata": {
    "parsed_at": "2026-03-07T10:00:00.000000",
    "processing_time_ms": 3500,
    "file_size_bytes": 245000
  }
}
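
Dataset items shaped like the example above can be handled with a few lines of Python. This sketch separates parsed documents from failures and counts processed pages; split_results is a hypothetical helper, and the shape of the failed item is an assumption, since this README shows only a successful result.

```python
# One item shaped like the dataset example above, plus an assumed failure
# item (the README does not show what a failed document looks like).
items = [
    {
        "source": "https://example.com/document.png",
        "success": True,
        "markdown": "## Introduction\n\nThe equation $E = mc^2$ describes...",
        "pagesProcessed": 1,
    },
    {"source": "https://example.com/broken.pdf", "success": False},
]


def split_results(items):
    """Separate parsed documents from failed sources and count pages."""
    parsed, failed, pages = {}, [], 0
    for item in items:
        if item.get("success") and item.get("markdown"):
            parsed[item["source"]] = item["markdown"]
            pages += item.get("pagesProcessed", 0)
        else:
            failed.append(item.get("source"))
    return parsed, failed, pages


parsed, failed, pages = split_results(items)
```

The page count is useful for reconciling a run against per-page pricing.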

Key-Value Store output

When the output format is keyValueStore or both, each successfully parsed document is saved as a .md file in the Key-Value Store, ready to download or use in downstream workflows.
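
One possible downstream step, for example when preparing context for a RAG pipeline, is to split the Markdown into sections at its headings. The sketch below is a simple line-based splitter and not part of the Actor; split_by_heading is a hypothetical helper, and it would also treat a line starting with "#" inside a fenced code block as a heading.

```python
from __future__ import annotations

import re

# ATX-style Markdown heading: one to six "#" characters, then the title.
HEADING = re.compile(r"^#{1,6}\s+(.*)$")


def split_by_heading(markdown: str) -> list[tuple[str, str]]:
    """Split Markdown into (heading, body) pairs, one per heading."""
    sections: list[tuple[str, str]] = []
    title, lines = "", []

    def flush() -> None:
        # Keep a section if it has a heading or any non-blank body text.
        body = "\n".join(lines).strip()
        if title or body:
            sections.append((title, body))

    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            flush()
            title, lines = match.group(1).strip(), []
        else:
            lines.append(line)
    flush()
    return sections


document = "## Introduction\n\nThe equation $E = mc^2$ describes...\n\n## Results\n\n| A | B |"
sections = split_by_heading(document)
```

Text that appears before the first heading is returned with an empty heading string.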