PaddleOCR VL

Try for free

Pricing

$49.00/month + usage

Rating

5.0

(1)

Developer

yeekal

Actor stats

Bookmarked

Total users

Monthly active users

4 months ago

Last modified

Paddle OCR Layout Parser

This Apify Actor provides a powerful interface to the Paddle OCR Layout Parsing API. It allows you to submit an image or a PDF file via a URL and receive structured Markdown content, with all embedded images correctly linked via their absolute URLs. It also provides a visual representation of the parsed layout.

Features

Supports Images and PDFs: Process various image formats (PNG, JPG, etc.) and multi-page PDF documents.
Smart File Type Detection: Automatically determines the file type from the URL, or you can specify it manually.
Markdown Content Extraction: Extracts the full textual content and structure of the document into clean Markdown.
Layout Visualization: Provides a URL to an image that visually highlights the detected layout structure (titles, paragraphs, figures, tables).
File Size Limit: Protects against oversized files by enforcing a 5MB limit.

Input

The Actor requires the following inputs, which are defined in the Input tab.

Field	Type	Description
File URL (`fileUrl`)	String	Required. A publicly accessible URL to the image or PDF file you want to process. The file size must not exceed 5MB.
File Type (`fileType`)	String	The type of the file. It's recommended to leave this as `Autodetect`. Options: `Autodetect`, `Image`, `PDF`.

Output

The Actor stores its results in the Apify default dataset. Each item in the dataset corresponds to a page from the input file.

Output Structure (JSON)

[
  {
    "pageNumber": 1,
    "processedMarkdown": "## This is the Title\n\nAnd this is a paragraph of text. Here is an image:\n\n<div style=\"text-align: center;\"><img src=\"https://example.com/path/to/image.jpg\" alt=\"Image\" width=\"50%\" /></div>",
    "layoutImageUrl": "https://example.com/path/to/layout_visualization.jpg",
  }
]```

- `processedMarkdown`: The primary output. Ready-to-render Markdown with absolute image URLs.
- `layoutImageUrl`: A URL to an image visualizing the detected document layout.