PaddleOCR VL
Pricing
$49.00/month + usage
Rating
5.0
(1)
Developer

yeekal
Maintained by Community
11 days ago
Last modified
Paddle OCR Layout Parser
This Apify Actor is an interface to the Paddle OCR Layout Parsing API. Submit an image or a PDF file via a URL and receive structured Markdown content in which all embedded images are linked by their absolute URLs, along with a visual representation of the parsed layout.
Features
- Supports Images and PDFs: Process various image formats (PNG, JPG, etc.) and multi-page PDF documents.
- Smart File Type Detection: Automatically determines the file type from the URL, or you can specify it manually.
- Markdown Content Extraction: Extracts the full textual content and structure of the document into clean Markdown.
- Layout Visualization: Provides a URL to an image that visually highlights the detected layout structure (titles, paragraphs, figures, tables).
- File Size Limit: Protects against oversized files by enforcing a 5MB limit.
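The Actor's own detection logic is not documented here. As a rough illustration only, extension-based detection from a URL could look like the sketch below. The function name `guess_file_type`, the extension sets, and the returned labels are assumptions made for this example, not the Actor's actual implementation.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Hypothetical extension sets; the Actor's real list is not documented.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}
PDF_EXTENSIONS = {".pdf"}


def guess_file_type(file_url: str) -> str:
    """Guess "Image" or "PDF" from the URL path, or "Unknown" if unclear."""
    # Only the path is inspected, so query strings and fragments are ignored.
    suffix = PurePosixPath(urlparse(file_url).path).suffix.lower()
    if suffix in PDF_EXTENSIONS:
        return "PDF"
    if suffix in IMAGE_EXTENSIONS:
        return "Image"
    return "Unknown"
```

If a URL carries no recognizable extension (for example a signed download link), a check like this cannot decide, which is the case where specifying the file type manually is useful.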
Input
The Actor requires the following inputs, which are defined in the Input tab.
| Field | Type | Description |
|---|---|---|
| File URL (fileUrl) | String | Required. A publicly accessible URL to the image or PDF file you want to process. The file size must not exceed 5MB. |
| File Type (fileType) | String | The type of the file. It's recommended to leave this as Autodetect. Options: Autodetect, Image, PDF. |
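When starting a run programmatically, the two fields above form the run input. The following is a minimal sketch of building and sanity-checking that input before sending it. It assumes that the `fileType` values match the labels shown in the Input tab and that a publicly accessible URL means an absolute `http` or `https` URL; the helper `build_actor_input` is hypothetical and not part of the Actor.

```python
import json
from urllib.parse import urlparse

# Assumption: the accepted fileType values mirror the Input tab labels.
ALLOWED_FILE_TYPES = ("Autodetect", "Image", "PDF")


def build_actor_input(file_url: str, file_type: str = "Autodetect") -> dict:
    """Return a run-input dict containing the two documented fields."""
    parsed = urlparse(file_url)
    # Reject relative paths and non-web schemes before starting a run.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("fileUrl must be an absolute http(s) URL")
    if file_type not in ALLOWED_FILE_TYPES:
        raise ValueError(f"fileType must be one of {ALLOWED_FILE_TYPES}")
    return {"fileUrl": file_url, "fileType": file_type}


if __name__ == "__main__":
    # Print the JSON that would be passed as the run input.
    print(json.dumps(build_actor_input("https://example.com/sample.pdf"), indent=2))
```

The resulting dictionary can be passed as the run input through whichever Apify client or API route you use to start the Actor.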
Output
The Actor stores its results in the Apify default dataset. Each item in the dataset corresponds to a page from the input file.
Output Structure (JSON)
```json
[
  {
    "pageNumber": 1,
    "processedMarkdown": "## This is the Title\n\nAnd this is a paragraph of text. Here is an image:\n\n<div style=\"text-align: center;\"><img src=\"https://example.com/path/to/image.jpg\" alt=\"Image\" width=\"50%\" /></div>",
    "layoutImageUrl": "https://example.com/path/to/layout_visualization.jpg"
  }
]
```
- `processedMarkdown`: The primary output. Ready-to-render Markdown with absolute image URLs.
- `layoutImageUrl`: A URL to an image visualizing the detected document layout.
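
Because each dataset item covers one page, a multi-page PDF comes back as several items. The sketch below shows one possible way to stitch them into a single Markdown document and list the embedded image URLs. It assumes the items have already been downloaded as a list of dictionaries with the fields shown above; `merge_pages` and `image_urls` are hypothetical helper names.

```python
import re

# Matches the src attribute of <img> tags embedded in processedMarkdown.
IMG_SRC = re.compile(r'<img[^>]*\bsrc="([^"]+)"')


def merge_pages(items: list[dict]) -> str:
    """Join per-page Markdown in page order, separated by blank lines."""
    ordered = sorted(items, key=lambda item: item["pageNumber"])
    return "\n\n".join(item["processedMarkdown"] for item in ordered)


def image_urls(markdown: str) -> list[str]:
    """Return every image URL referenced by an <img> tag, in order."""
    return IMG_SRC.findall(markdown)
```

Sorting by `pageNumber` keeps the merged text in reading order even if the items are retrieved in a different order.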