Pricing

from $0.01 / 1,000 results

Try for free

Go to Apify Store

Pdf OCR API

Try for free

Extract and convert text from PDF documents using advanced optical character recognition technology with support for multiple AI models.

Pricing

from $0.01 / 1,000 results

Rating

5.0

(4)

Developer

csp

Actor stats

Bookmarked

Total users

Monthly active users

79 days

Issues response

3 months ago

Last modified

Categories

Developer tools

Other

OCR Model

ocrModel

Required

Select the OCR model to use for text extraction

Type:string

Default:native

Options:

google-visiondeepseek-ocramazon-textractazure-visionopenaihuggingfacegemininative

PDF Document URLs

pdfUrls

Required

Array of PDF document URLs to process

Type:string[]

Min. items:1

Default:

[
  "https://pdfobject.com/pdf/sample.pdf"
]

Document Language

language

Optional

Primary language of the documents

Type:string

Default:eng

Options:

engspafradeuitaporruschi_simjpnkoraradan

Preserve Document Formatting

preserveFormatting

Optional

Maintain document layout and structure

Type:boolean

Default:true

Extract Images

extractImages

Optional

Extract and process images from PDF

Type:boolean

Default:false

Output Format

outputFormat

Optional

Format for extracted text

Type:string

Default:json

Options:

jsontextmarkdown

Page Range

pageRange

Optional

Specific pages to process (e.g., '1-5' or '1,3,5' or 'all')

Type:string

Default:all

Google Vision API Key

googleVisionApiKey

Optional

Required if using Google Vision API

Type:string

DeepSeek API Key

deepseekApiKey

Optional

Required if using DeepSeek OCR

Type:string

AWS Access Key ID

awsAccessKeyId

Optional

Required if using Amazon Textract

Type:string

AWS Secret Access Key

awsSecretAccessKey

Optional

Required if using Amazon Textract

Type:string

AWS Region

awsRegion

Optional

AWS region for Textract (e.g., us-east-1)

Type:string

Default:us-east-1

Azure Vision Endpoint

azureEndpoint

Optional

Required if using Azure AI Vision

Type:string

Azure Vision API Key

azureApiKey

Optional

Required if using Azure AI Vision

Type:string

OpenAI API Key

openaiApiKey

Optional

Required if using OpenAI GPT-4 Vision

Type:string

Hugging Face API Key

huggingfaceApiKey

Optional

Required if using Hugging Face models

Type:string

Google Gemini API Key

geminiApiKey

Optional

Required if using Gemini API

Type:string

PDF To JSON Parser

parseforge/pdf-to-json-parser

Convert PDF documents into structured JSON using AI-powered OCR and smart data extraction. The Actor processes every page to ensure complete coverage, then identifies text, fields, tables, and key details, delivering clean, organized JSON ready for automation or analysis.

ParseForge

5.0

PDF Scraper

onidivo/pdf-scraper

Scrape and extract text from PDF links.

Onidivo Technologies

494

PDF Text Extractor

jirimoravcik/pdf-text-extractor

PDF Text Extractor allows you to extract text from PDF files. It also supports chunking of the text to prepare the data for usage with large language models.

Jiří Moravčík

984

5.0

Image To Text Ai

welcoming_fireplace/image-to-text-ai

A powerful OCR tool that goes beyond standard text extraction. Powered by a Premium Vision AI model, it accurately reads handwriting, preserves table structures, and converts messy receipts or documents into structured JSON or Markdown. Supports batch processing for high-volume workflows.

Richmond Nkrumah

Image Text Extractor

m3web/image-text-extractor

Extract text from images using OCR (Optical Character Recognition) via direct URLs or uploaded JSON/CSV files. Works with multiple languages and automatically enriches your structured file with the text found inside images.

M3Web

Website Screenshot API - Page Screenshot Generator

code-node-tools/website-screenshot-api

Website screenshot API to capture any webpage as an image. This screenshot API supports full page, viewport, and element screenshots. Website screenshot generator API for automated website screenshot capture, visual testing, monitoring, and thumbnail generation. Reliable page screenshot API.

CodeNodeTools

Docling

vancura/docling

Docling document parser & converter – Convert documents into structured data without complexity. This Actor leverages the powerful Docling library to parse and transform various document formats into clean, structured outputs ready for analysis or integration.

Václav Vančura

395

5.0

PDF to Markdown Converter - AI-Powered with OCR & Tables

clearpath/pdf-to-markdown-api

Convert PDFs to clean Markdown with GPU-accelerated AI. Extracts tables, LaTeX formulas, and images from complex layouts. Supports OCR for scanned docs in 8 languages. Batch process hundreds of PDFs in parallel via URL, upload, or API.

ClearPath

Website API and Endpoint Analyzer

lofomachines/website-api-and-endpoint-analyzer

Analyze one or more page URLs and output one dataset row per detected API or endpoint with network metadata and risk signals.

Lofomachines

Universal Book Engineer: AI Publisher & Ghostwriter

visita/universal-book-engineer-ai-publisher-ghostwriter

Turn ChatGPT/Gemini conversations (or other text), into industry-standard PDFs & EPUBs. Features "Fractal Expansion" to write full books from ideas. Specialized layout engines for Novels, Islamic/Scriptural texts, Children's books, and Technical Manuals. Includes DALL-E 3 art integration.