Pdf Power Tools avatar
Pdf Power Tools

Pricing

Pay per event

Go to Apify Store
Pdf Power Tools

Pdf Power Tools

Split, merge, compress, convert & OCR PDFs via API. Extract text from scanned documents in 14 languages. Compress files for email, convert pages to PNG/JPEG/WebP, split by pages or ranges, merge multiple PDFs. Perfect for document automation & data extraction workflows.

Pricing

Pay per event

Rating

0.0

(0)

Developer

Agenscrape

Agenscrape

Maintained by Community

Actor stats

1

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Facing an issue, unexpected error, edge case, or have a feature suggestion? Post it here and we'll address it within 24 hours.

What is PDF Power Tools?

PDF Power Tools is a comprehensive PDF processing API that handles all your PDF manipulation needs in the cloud. Whether you need to split large documents, merge multiple PDFs, compress files for email, extract text from scanned documents using OCR, or convert PDF pages to images - this actor does it all.

Perfect for:

  • Document automation workflows - Process PDFs at scale without local software
  • Data extraction pipelines - Extract text from scanned invoices, receipts, contracts
  • Content management systems - Generate thumbnails, compress uploads, split documents
  • Archival and digitization - OCR historical documents, enhance scanned pages
  • Web applications - Server-side PDF processing via API

Features

Split PDF

Break down large PDF documents into smaller, manageable files. Split options include:

  • Each page separate - Create individual PDFs for every page
  • By page ranges - Split into custom ranges (e.g., pages 1-10, 11-20, 21-30)
  • Split in half - Divide document into two equal parts
  • Extract specific pages - Pull out only the pages you need
  • By file size - Automatically split when file exceeds size limit

Merge PDF

Combine multiple PDF files into a single document:

  • Merge unlimited PDFs in sequence
  • Custom merge order
  • Interleave pages from multiple documents
  • Insert pages from one PDF into another at specific positions

Compress PDF

Reduce PDF file size for email attachments, web uploads, or storage optimization:

  • Low compression - Minimal size reduction, highest quality
  • Medium compression - Balanced quality and file size (default)
  • High compression - Maximum size reduction
  • Screen preset - Optimized for on-screen viewing
  • Print preset - Optimized for printing quality

Convert PDF to Images

Transform PDF pages into high-quality images:

  • Output formats: PNG, JPEG, WebP, TIFF
  • Customizable DPI (72-600)
  • Convert all pages or specific page selection
  • Combine all pages into single tall image
  • Generate thumbnails

OCR - Text Extraction from Scanned PDFs

Extract text from scanned documents, images, and non-searchable PDFs using Tesseract OCR:

  • 14 supported languages: English, French, German, Spanish, Italian, Portuguese, Dutch, Polish, Russian, Chinese (Simplified & Traditional), Japanese, Korean, Arabic
  • Image preprocessing for improved accuracy
  • Confidence scores per page
  • Word and line count statistics

Enhance Scanned PDFs

Improve readability of scanned documents:

  • Sharpen blurry text and images
  • Reduce noise and artifacts
  • Adjust contrast and brightness
  • Configurable DPI settings

Page Manipulation

Fine-grained control over PDF pages:

  • Reorder pages within a document
  • Remove unwanted pages
  • Insert pages at specific positions

PDF Information

Analyze PDF files before processing:

  • Page count and dimensions
  • File size breakdown
  • Detect if PDF is scanned or native text
  • Compression estimate

Input Options

Basic Input

{
"operation": "split",
"pdfUrl": "https://example.com/document.pdf"
}

Using Base64 Input

{
"operation": "compress",
"pdfBase64": "JVBERi0xLjcKCjEgMCBvYmoK..."
}

Operation Examples

Get PDF Information

{
"operation": "info",
"pdfUrl": "https://example.com/document.pdf"
}

Split Into Individual Pages

{
"operation": "split",
"pdfUrl": "https://example.com/large-document.pdf",
"splitMode": "each_page"
}

Split By Page Ranges

{
"operation": "split",
"pdfUrl": "https://example.com/document.pdf",
"splitMode": "ranges",
"ranges": ["1-10", "11-20", "21-30"]
}

Extract Specific Pages

{
"operation": "split",
"pdfUrl": "https://example.com/document.pdf",
"splitMode": "extract",
"pages": [1, 5, 10, 15]
}

Merge Multiple PDFs

{
"operation": "merge",
"pdfUrls": [
"https://example.com/part1.pdf",
"https://example.com/part2.pdf",
"https://example.com/part3.pdf"
]
}

Merge With Custom Order

{
"operation": "merge",
"pdfUrls": ["doc1.pdf", "doc2.pdf", "doc3.pdf"],
"order": [2, 0, 1]
}

Compress PDF

{
"operation": "compress",
"pdfUrl": "https://example.com/large-file.pdf",
"compressionPreset": "high"
}

Convert PDF to PNG Images

{
"operation": "convert",
"pdfUrl": "https://example.com/document.pdf",
"outputFormat": "png",
"dpi": 200,
"quality": 95
}

Convert Specific Pages to JPEG

{
"operation": "convert",
"pdfUrl": "https://example.com/document.pdf",
"outputFormat": "jpeg",
"pages": [1, 3, 5],
"dpi": 150
}

OCR - Extract Text from Scanned PDF

{
"operation": "ocr",
"pdfUrl": "https://example.com/scanned-document.pdf",
"language": "eng",
"preprocess": true
}

OCR in French

{
"operation": "ocr",
"pdfUrl": "https://example.com/french-scan.pdf",
"language": "fra"
}

Enhance Scanned Document

{
"operation": "enhance",
"pdfUrl": "https://example.com/old-scan.pdf",
"sharpen": true,
"denoise": true,
"contrast": 1.3,
"brightness": 1.1
}

Generate Thumbnail

{
"operation": "thumbnail",
"pdfUrl": "https://example.com/document.pdf",
"thumbnailWidth": 300,
"outputFormat": "png"
}

Remove Pages

{
"operation": "merge",
"pdfUrl": "https://example.com/document.pdf",
"pagesToRemove": [2, 5, 8]
}

Reorder Pages

{
"operation": "merge",
"pdfUrl": "https://example.com/document.pdf",
"newPageOrder": [4, 3, 2, 1, 5, 6]
}

Output

Results are saved to the run's Key-Value Store for easy download:

OperationOutput Files
Splitpage_001.pdf, page_002.pdf, ... or pages_1-10.pdf, etc.
Mergemerged.pdf
Compresscompressed.pdf
Convertpage_001.png, page_002.png, ...
OCRextracted_text.txt + Dataset with per-page results
Enhanceenhanced.pdf
Thumbnailthumbnail.png

Sample Output

{
"operation": "compress",
"preset": "high",
"pageCount": 25,
"originalSize": "4.5 MB",
"compressedSize": "1.2 MB",
"compressionRatio": "73.3%",
"outputKey": "compressed.pdf"
}

Supported Languages for OCR

CodeLanguage
engEnglish
fraFrench
deuGerman
spaSpanish
itaItalian
porPortuguese
nldDutch
polPolish
rusRussian
chi_simChinese (Simplified)
chi_traChinese (Traditional)
jpnJapanese
korKorean
araArabic

Compression Presets

PresetImage QualityBest For
low90%Archives, legal documents
medium75%General use, email
high50%Web uploads, storage saving
screen60%On-screen viewing
print85%Print-quality output

Pricing

EventPriceDescription
pdf-loaded$0.005Each PDF loaded from URL or base64
page-enhanced$0.01Each page enhanced (sharpen, denoise)
page-processed$0.002Each page processed (split, merge, compress)
ocr-page$0.02Each page with OCR text extraction
pdf-compressed$0.01PDF compression completed
page-converted$0.005Each page converted to image
pdf-merged$0.01PDF merge operation completed
metadata-extracted$0.005PDF info/metadata extraction
text-extracted$0.005Text extraction completed

Use Cases

  • Invoice Processing - Extract data from scanned invoices using OCR
  • Document Splitting - Break down large reports into chapters
  • PDF Compression - Reduce file size for email attachments
  • Image Generation - Create thumbnails for document previews
  • Document Merging - Combine multiple contracts into one file
  • Archival - Enhance and OCR historical scanned documents
  • Web Publishing - Convert PDF pages to web-friendly images
  • Data Extraction - Pull text from non-searchable PDFs