PaddleOCR VL

Pricing

$49.00/month + usage

Rating

5.0

(1)

Developer

yeekal

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 3
  • Monthly active users: 2
  • Last modified: 11 days ago

Paddle OCR Layout Parser

This Apify Actor is an interface to the Paddle OCR Layout Parsing API. Submit an image or a PDF file via a URL and it returns structured Markdown content in which every embedded image is linked by its absolute URL. It also returns a URL to an image that visualizes the parsed layout.

Features

  • Supports Images and PDFs: Process various image formats (PNG, JPG, etc.) and multi-page PDF documents.
  • Smart File Type Detection: Automatically determines the file type from the URL, or you can specify it manually.
  • Markdown Content Extraction: Extracts the full textual content and structure of the document into clean Markdown.
  • Layout Visualization: Provides a URL to an image that visually highlights the detected layout structure (titles, paragraphs, figures, tables).
  • File Size Limit: Protects against oversized files by enforcing a 5MB limit.
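
The Actor's own detection and size-check logic is not documented here. The following is only a minimal sketch of how extension-based detection and a 5 MB guard could work. The function names, the extension set, and the reading of 5 MB as 5 × 1024 × 1024 bytes are all assumptions made for illustration.

```python
from pathlib import PurePosixPath
from typing import Optional
from urllib.parse import urlparse

# Assumption: the README only says "PNG, JPG, etc."; this set is illustrative.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".bmp", ".webp"}
# Assumption: "5MB" is interpreted as 5 * 1024 * 1024 bytes.
MAX_FILE_SIZE_BYTES = 5 * 1024 * 1024


def guess_file_type(file_url: str) -> Optional[str]:
    """Return "PDF", "Image", or None if the URL path has no known extension."""
    path = urlparse(file_url).path  # ignores the query string and fragment
    suffix = PurePosixPath(path).suffix.lower()
    if suffix == ".pdf":
        return "PDF"
    if suffix in IMAGE_EXTENSIONS:
        return "Image"
    return None


def within_size_limit(size_bytes: int, limit: int = MAX_FILE_SIZE_BYTES) -> bool:
    """Return True when a file of size_bytes would pass the size guard."""
    return 0 <= size_bytes <= limit
```

A `None` result corresponds to the case where the URL alone is not enough and the file type would need to be specified manually.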

Input

The Actor requires the following inputs, which are defined in the Input tab.

| Field | Type | Description |
| --- | --- | --- |
| File URL (fileUrl) | String | Required. A publicly accessible URL to the image or PDF file you want to process. The file size must not exceed 5MB. |
| File Type (fileType) | String | The type of the file. It's recommended to leave this as Autodetect. Options: Autodetect, Image, PDF. |
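
As an illustration of supplying these inputs programmatically, here is a sketch that uses the `apify-client` Python package. The Actor ID placeholder, the `APIFY_TOKEN` environment variable, and the helper names are assumptions. The exact string values that `fileType` accepts are not listed above, so the sketch omits that field unless the caller supplies one, which leaves the Actor on its default.

```python
import os
from typing import Optional

# Placeholder: replace with the Actor's real ID from its Apify Store page.
ACTOR_ID = "<username>/<actor-name>"


def build_run_input(file_url: str, file_type: Optional[str] = None) -> dict:
    """Build the input object using the field names from the Input table."""
    if not file_url.startswith(("http://", "https://")):
        raise ValueError("fileUrl must be a publicly accessible http(s) URL")
    run_input = {"fileUrl": file_url}
    if file_type is not None:
        # Assumption: check the Input tab for the exact accepted values.
        run_input["fileType"] = file_type
    return run_input


def run_actor(file_url: str, file_type: Optional[str] = None) -> list:
    """Start a run, wait for it to finish, and return the dataset items."""
    from apify_client import ApifyClient  # third-party: pip install apify-client

    client = ApifyClient(os.environ["APIFY_TOKEN"])
    run = client.actor(ACTOR_ID).call(run_input=build_run_input(file_url, file_type))
    if run is None:
        raise RuntimeError("Actor run did not return a result")
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())
```

The `apify_client` import sits inside `run_actor` so that `build_run_input` can be used and tested without the package installed.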

Output

The Actor stores its results in the Apify default dataset. Each item in the dataset corresponds to a page from the input file.

Output Structure (JSON)

```json
[
  {
    "pageNumber": 1,
    "processedMarkdown": "## This is the Title\n\nAnd this is a paragraph of text. Here is an image:\n\n<div style=\"text-align: center;\"><img src=\"https://example.com/path/to/image.jpg\" alt=\"Image\" width=\"50%\" /></div>",
    "layoutImageUrl": "https://example.com/path/to/layout_visualization.jpg"
  }
]
```
- `processedMarkdown`: The primary output. Ready-to-render Markdown with absolute image URLs.
- `layoutImageUrl`: A URL to an image visualizing the detected document layout.
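
Because each dataset item covers one page, the output of a multi-page PDF may need to be stitched back into a single document. A minimal sketch, assuming items have the shape shown above; the function name and the blank-line separator are illustrative choices, not part of the Actor.

```python
from typing import Iterable, Mapping


def join_pages(items: Iterable[Mapping], separator: str = "\n\n") -> str:
    """Concatenate per-page Markdown in page order."""
    # Sort explicitly rather than relying on dataset order.
    ordered = sorted(items, key=lambda item: item["pageNumber"])
    return separator.join(item["processedMarkdown"].strip() for item in ordered)


pages = [
    {"pageNumber": 2, "processedMarkdown": "Second page text.\n"},
    {"pageNumber": 1, "processedMarkdown": "## Title"},
]
document = join_pages(pages)
```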