PDF OCR API - Document Extraction
Pricing
from $200.00 / 1,000 page processeds
PDF OCR API - Document Extraction
Extract text from PDFs including scanned documents. OCR processing, table extraction & structured data output. Process invoices, contracts & forms at scale.
Pricing
from $200.00 / 1,000 page processeds
Rating
0.0
(0)
Developer
The Howlers
Actor stats
0
Bookmarked
17
Total users
1
Monthly active users
3 days ago
Last modified
Categories
Share
PDF OCR API
Extract text from PDF files using OCR. Supports scanned documents, images, and multi-page PDFs. Returns structured text with page numbers and confidence scores. Built by John Rippy (https://www.linkedin.com/in/johnrippy/ | https://johnrippy.link/).
Quick Start
Test with Demo Mode (free, no API key needed)
{"demoMode": true,"pdfUrl": ""}
Run with real data
{"demoMode": false,"pdfUrl": "","language": "eng","outputFormat": "json","detectTables": false}
Input Parameters
| Parameter | Type | Default | Required | Description |
|---|---|---|---|---|
pdfUrl | string | - | No | URL of the PDF file to process |
pdfBase64 | string | - | No | Base64-encoded PDF content (alternative to URL) |
language | string | "eng" | No | Language hint for OCR (improves accuracy) |
pageRange | string | - | No | Pages to process (e.g., '1-5' or '1,3,5'). Leave empty for all pages. |
outputFormat | string | "json" | No | How to structure the output |
detectTables | boolean | false | No | Attempt to preserve table structure |
demoMode | boolean | true | No | Return sample output without processing (for testing) |
webhookUrl | string | - | No | Optional URL to receive results via POST request when actor completes |
Pricing
This actor uses pay-per-event billing:
| Event | Description | Price |
|---|---|---|
| Page Processed | Each PDF page processed with OCR | $0.02 |
Demo mode is free -- no charges for sample data.
Troubleshooting
"API error 429" or "Rate limit"
Too many requests. Wait a minute and try again, or reduce the number of items per run.
No results or empty dataset
Check the run log for error messages. Common causes:
- Invalid input format (check the examples above)
- The target data doesn't exist or is too small to track
How do I test without an API key?
Enable Demo Mode in the input. This returns realistic sample data so you can verify the output format works for your workflow.
Built by John Rippy | Actor Arsenal