Pricing

Pay per event

Pdf Power Tools

Split, merge, compress, convert & OCR PDFs via API. Extract text from scanned documents in 14 languages. Compress files for email, convert pages to PNG/JPEG/WebP, split by pages or ranges, merge multiple PDFs. Perfect for document automation & data extraction workflows.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Agenscrape

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

What is PDF Power Tools?

PDF Power Tools is a comprehensive PDF processing API that handles all your PDF manipulation needs in the cloud. Whether you need to split large documents, merge multiple PDFs, compress files for email, extract text from scanned documents using OCR, or convert PDF pages to images - this actor does it all.

Perfect for:

Document automation workflows - Process PDFs at scale without local software
Data extraction pipelines - Extract text from scanned invoices, receipts, contracts
Content management systems - Generate thumbnails, compress uploads, split documents
Archival and digitization - OCR historical documents, enhance scanned pages
Web applications - Server-side PDF processing via API

Features

Split PDF

Break down large PDF documents into smaller, manageable files. Split options include:

Each page separate - Create individual PDFs for every page
By page ranges - Split into custom ranges (e.g., pages 1-10, 11-20, 21-30)
Split in half - Divide document into two equal parts
Extract specific pages - Pull out only the pages you need
By file size - Automatically split when file exceeds size limit

Merge PDF

Combine multiple PDF files into a single document:

Merge unlimited PDFs in sequence
Custom merge order
Interleave pages from multiple documents
Insert pages from one PDF into another at specific positions

Compress PDF

Reduce PDF file size for email attachments, web uploads, or storage optimization:

Low compression - Minimal size reduction, highest quality
Medium compression - Balanced quality and file size (default)
High compression - Maximum size reduction
Screen preset - Optimized for on-screen viewing
Print preset - Optimized for printing quality

Convert PDF to Images

Transform PDF pages into high-quality images:

Output formats: PNG, JPEG, WebP, TIFF
Customizable DPI (72-600)
Convert all pages or specific page selection
Combine all pages into single tall image
Generate thumbnails

OCR - Text Extraction from Scanned PDFs

Extract text from scanned documents, images, and non-searchable PDFs using Tesseract OCR:

14 supported languages: English, French, German, Spanish, Italian, Portuguese, Dutch, Polish, Russian, Chinese (Simplified & Traditional), Japanese, Korean, Arabic
Image preprocessing for improved accuracy
Confidence scores per page
Word and line count statistics

Enhance Scanned PDFs

Improve readability of scanned documents:

Sharpen blurry text and images
Reduce noise and artifacts
Adjust contrast and brightness
Configurable DPI settings

Page Manipulation

Fine-grained control over PDF pages:

Reorder pages within a document
Remove unwanted pages
Insert pages at specific positions

PDF Information

Analyze PDF files before processing:

Page count and dimensions
File size breakdown
Detect if PDF is scanned or native text
Compression estimate

Input Options

Basic Input

{
    "operation": "split",
    "pdfUrl": "https://example.com/document.pdf"
}

Using Base64 Input

{
    "operation": "compress",
    "pdfBase64": "JVBERi0xLjcKCjEgMCBvYmoK..."
}

Operation Examples

Get PDF Information

{
    "operation": "info",
    "pdfUrl": "https://example.com/document.pdf"
}

Split Into Individual Pages

{
    "operation": "split",
    "pdfUrl": "https://example.com/large-document.pdf",
    "splitMode": "each_page"
}

Split By Page Ranges

{
    "operation": "split",
    "pdfUrl": "https://example.com/document.pdf",
    "splitMode": "ranges",
    "ranges": ["1-10", "11-20", "21-30"]
}

Extract Specific Pages

{
    "operation": "split",
    "pdfUrl": "https://example.com/document.pdf",
    "splitMode": "extract",
    "pages": [1, 5, 10, 15]
}

Merge Multiple PDFs

{
    "operation": "merge",
    "pdfUrls": [
        "https://example.com/part1.pdf",
        "https://example.com/part2.pdf",
        "https://example.com/part3.pdf"
    ]
}

Merge With Custom Order

{
    "operation": "merge",
    "pdfUrls": ["doc1.pdf", "doc2.pdf", "doc3.pdf"],
    "order": [2, 0, 1]
}

Compress PDF

{
    "operation": "compress",
    "pdfUrl": "https://example.com/large-file.pdf",
    "compressionPreset": "high"
}

Convert PDF to PNG Images

{
    "operation": "convert",
    "pdfUrl": "https://example.com/document.pdf",
    "outputFormat": "png",
    "dpi": 200,
    "quality": 95
}

Convert Specific Pages to JPEG

{
    "operation": "convert",
    "pdfUrl": "https://example.com/document.pdf",
    "outputFormat": "jpeg",
    "pages": [1, 3, 5],
    "dpi": 150
}

OCR - Extract Text from Scanned PDF

{
    "operation": "ocr",
    "pdfUrl": "https://example.com/scanned-document.pdf",
    "language": "eng",
    "preprocess": true
}

OCR in French

{
    "operation": "ocr",
    "pdfUrl": "https://example.com/french-scan.pdf",
    "language": "fra"
}

Enhance Scanned Document

{
    "operation": "enhance",
    "pdfUrl": "https://example.com/old-scan.pdf",
    "sharpen": true,
    "denoise": true,
    "contrast": 1.3,
    "brightness": 1.1
}

Generate Thumbnail

{
    "operation": "thumbnail",
    "pdfUrl": "https://example.com/document.pdf",
    "thumbnailWidth": 300,
    "outputFormat": "png"
}

Remove Pages

{
    "operation": "merge",
    "pdfUrl": "https://example.com/document.pdf",
    "pagesToRemove": [2, 5, 8]
}

Reorder Pages

{
    "operation": "merge",
    "pdfUrl": "https://example.com/document.pdf",
    "newPageOrder": [4, 3, 2, 1, 5, 6]
}

Output

Results are saved to the run's Key-Value Store for easy download:

Operation	Output Files
Split	`page_001.pdf`, `page_002.pdf`, ... or `pages_1-10.pdf`, etc.
Merge	`merged.pdf`
Compress	`compressed.pdf`
Convert	`page_001.png`, `page_002.png`, ...
OCR	`extracted_text.txt` + Dataset with per-page results
Enhance	`enhanced.pdf`
Thumbnail	`thumbnail.png`

Sample Output

{
    "operation": "compress",
    "preset": "high",
    "pageCount": 25,
    "originalSize": "4.5 MB",
    "compressedSize": "1.2 MB",
    "compressionRatio": "73.3%",
    "outputKey": "compressed.pdf"
}

Supported Languages for OCR

Code	Language
`eng`	English
`fra`	French
`deu`	German
`spa`	Spanish
`ita`	Italian
`por`	Portuguese
`nld`	Dutch
`pol`	Polish
`rus`	Russian
`chi_sim`	Chinese (Simplified)
`chi_tra`	Chinese (Traditional)
`jpn`	Japanese
`kor`	Korean
`ara`	Arabic

Compression Presets

Preset	Image Quality	Best For
`low`	90%	Archives, legal documents
`medium`	75%	General use, email
`high`	50%	Web uploads, storage saving
`screen`	60%	On-screen viewing
`print`	85%	Print-quality output

Pricing

Event	Price	Description
`pdf-loaded`	$0.005	Each PDF loaded from URL or base64
`page-enhanced`	$0.01	Each page enhanced (sharpen, denoise)
`page-processed`	$0.002	Each page processed (split, merge, compress)
`ocr-page`	$0.02	Each page with OCR text extraction
`pdf-compressed`	$0.01	PDF compression completed
`page-converted`	$0.005	Each page converted to image
`pdf-merged`	$0.01	PDF merge operation completed
`metadata-extracted`	$0.005	PDF info/metadata extraction
`text-extracted`	$0.005	Text extraction completed

Use Cases

Invoice Processing - Extract data from scanned invoices using OCR
Document Splitting - Break down large reports into chapters
PDF Compression - Reduce file size for email attachments
Image Generation - Create thumbnails for document previews
Document Merging - Combine multiple contracts into one file
Archival - Enhance and OCR historical scanned documents
Web Publishing - Convert PDF pages to web-friendly images
Data Extraction - Pull text from non-searchable PDFs

PDF OCR API - Document Extraction

alizarin_refrigerator-owner/pdf-ocr-api

Extract text from PDFs including scanned documents. OCR processing, table extraction & structured data output. Process invoices, contracts & forms at scale.

The Howlers

PDF MCP Server

constant_quadruped/pdf-mcp-server

Stateless PDF tools exposed via MCP. Open, split, merge, extract text, fill and flatten forms, and generate PDFs in a single run. Designed for deterministic, auditable document transformations by agents and workflows.

PDF to Markdown Converter

web.harvester/pdf-to-markdown-converter

Convert PDFs to clean Markdown with optional OCR for scanned documents. Uses PDF.js for text extraction and Tesseract.js for optical character recognition.

Web Harvester

PDF to Markdown Converter - AI-Powered with OCR & Tables

clearpath/pdf-to-markdown-api

Convert PDFs to clean Markdown with GPU-accelerated AI. Extracts tables, LaTeX formulas, and images from complex layouts. Supports OCR for scanned docs in 8 languages. Batch process hundreds of PDFs in parallel via URL, upload, or API.

ClearPath

Convert Image to PDF and PDF to Image

akash9078/image-pdf-converter

Convert images (JPG, PNG, BMP, and more) into high-quality PDFs, or extract images from PDF files in seconds. Image–PDF Converter Pro delivers fast, reliable, and professional results for all your document and image conversion needs.

Akash Kumar Naik

File Converter All-in-one

tufantoksoz/file-converter

Convert files between popular formats at scale on Apify. Transform documents, spreadsheets, and images with professional-grade quality and performance. Convert word to pdf, jpeg to png and more

Tufan Toksöz

Document Extractor API - AI-Powered PDF & Text Analysis

fresh_cliff/document-extractor-api

Extract text and data from PDF, Word, and image documents using AI-powered OCR. Convert documents to structured JSON, analyze content, and extract insights. No API keys required with mirror fallbacks.

Brennan Crawford

Ocr

vivid_astronaut/ocr

Extract text from images using advanced OCR technology. Supports multiple languages and image formats. Perfect for digitizing documents, receipts, screenshots, and scanned text.

Fabio Suizu

PDF to Text API | Document Extraction for LLMs & RAG

andok/pdf-text-converter

Convert bulk PDF documents via URL into clean, raw text. The perfect document scraper for LLMs, vector databases, and RAG pipelines.

Andok

Pdf OCR API

cspnair/pdf-ocr-api

Extract and convert text from PDF documents using advanced optical character recognition technology with support for multiple AI models.

csp

5.0

Pdf Power Tools

What is PDF Power Tools?

Features

Split PDF

Merge PDF

Compress PDF

Convert PDF to Images

OCR - Text Extraction from Scanned PDFs

Enhance Scanned PDFs

Page Manipulation

PDF Information

Input Options

Basic Input

Using Base64 Input

Operation Examples

Get PDF Information

Split Into Individual Pages

Split By Page Ranges

Extract Specific Pages

Merge Multiple PDFs

Merge With Custom Order

Compress PDF

Convert PDF to PNG Images

Convert Specific Pages to JPEG

OCR - Extract Text from Scanned PDF

OCR in French

Enhance Scanned Document

Generate Thumbnail

Remove Pages

Reorder Pages

Output

Sample Output

Supported Languages for OCR

Compression Presets

Pricing

Use Cases

You might also like

PDF OCR API - Document Extraction

PDF MCP Server

PDF to Markdown Converter

PDF to Markdown Converter - AI-Powered with OCR & Tables

Convert Image to PDF and PDF to Image

File Converter All-in-one

Document Extractor API - AI-Powered PDF & Text Analysis

Ocr

PDF to Text API | Document Extraction for LLMs & RAG

Pdf OCR API