Deprecated

Pricing

Pay per usage

See alternative Actors

Go to Apify Store

OCR Structured Extractor (AI) — Image/PDF → OCR Text + JSON

Deprecated

See alternative Actors

Extract OCR text and structured JSON from an image or PDF URL. Great for invoices, receipts, forms, IDs, and tables. Powered by Gemini 3 Pro.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Anass

Actor stats

Bookmarked

Total users

Monthly active users

a month ago

Last modified

OCR Structured Extractor (AI) — Image/PDF → OCR Text + Structured JSON

OCR Structured Extractor icon

OCR Structured Extractor banner

Extract OCR text and structured fields from an image URL or PDF URL using Gemini 3 Pro (through the same proxy Worker used by the other AI actors in this repo).

Keywords (SEO)

ocr api, pdf ocr, image ocr, pdf to json, image to json, invoice ocr, receipt ocr, form extraction, document understanding, gemini ocr, structured extraction, data extraction, ai document parser, id card ocr, table extraction

How it works

Downloads your file from fileUrl
Sends the bytes as inlineData to models/{model}:generateContent (JSON mode)
Parses the model response and outputs:
- text: full OCR transcription
- data: structured fields (either a default structure or your extractionSchema)

Best for

Invoices, receipts, and utility bills (key-value extraction)
Forms and screenshots (clean OCR + structured fields)
PDFs that mix text, tables, and images (document understanding)
Identity documents (IDs, passports) and card-style layouts

Input

fileUrl (string, required): Public URL to an image (png/jpg/webp) or a PDF
instructions (string, optional): Extraction instructions for the model
extractionSchema (object, optional): JSON object describing the structure you want in data
model (string, default gemini-3-pro-preview)
maxBytes (int, default 52428800): Max size to download (PDF inline is commonly limited to 50MB)

Supported file types

Images: image/png, image/jpeg, image/webp (and other image/* types if the server reports a correct MIME type)
Documents: application/pdf

Output

The Actor stores:

Dataset: one item with fileUrl, mimeType, text, and data
Key-value store exports:
- ocr.json (full JSON output)
- ocr.txt (OCR text only, if available)

Dataset item example:

{
  "fileUrl": "https://example.com/invoice.pdf",
  "mimeType": "application/pdf",
  "model": "gemini-3-pro-preview",
  "text": "Invoice #INV-1002 ...",
  "data": {
    "summary": "Invoice from ACME Corp for January services.",
    "key_value_pairs": [
      { "key": "Invoice Number", "value": "INV-1002" },
      { "key": "Total", "value": "$1,249.00" }
    ]
  }
}

Example input (custom schema)

{
  "fileUrl": "https://example.com/receipt.jpg",
  "instructions": "Extract receipt line items and totals. Return ONLY JSON.",
  "extractionSchema": {
    "merchant": "string",
    "date": "string",
    "currency": "string",
    "total": "string",
    "items": [
      { "name": "string", "qty": "string", "price": "string" }
    ]
  }
}

Prompt tips

For invoices/receipts: ask for merchant, invoice_number, date, currency, subtotal, tax, total, items[]
For IDs: ask for full_name, document_number, dob, expiry_date
If the document has tables, ask for rows with normalized columns

Document Extractor API - AI-Powered PDF & Text Analysis

fresh_cliff/document-extractor-api

Extract text and data from PDF, Word, and image documents using AI-powered OCR. Convert documents to structured JSON, analyze content, and extract insights. No API keys required with mirror fallbacks.

Brennan Crawford

Restaurant Menu Scraper

wedo_software/wedo-scrape-menu

AI Restaurant Menu Scraper: Extract prices, descriptions, and allergens from images, PDFs, or web pages using OCR. Turn any restaurant URL into a structured Menu API.

Benjamin

Invoice & Receipt Extractor — Automated Document Data Extrac...

apricot_blackberry/invoice-receipt-extractor

Invoices and receipts → structured data. Amounts, dates, vendors, line items, tax details. Clean JSON, zero manual entry.

Creator Fusion

PDF To JSON Parser

parseforge/pdf-to-json-parser

Convert PDF documents into structured JSON using AI-powered OCR and smart data extraction. The Actor processes every page to ensure complete coverage, then identifies text, fields, tables, and key details, delivering clean, organized JSON ready for automation or analysis.

ParseForge

5.0

(1)

esg-csrd-scraper

korobz/esg-csrd-scraper

Automate CSRD compliance. Extract Scope 1, 2, 3 emissions and ESG metrics from corporate reports. Perfect for Carbon Accounting & Supply Chain analysis.

Korobz Korobz

Pdf OCR API

cspnair/pdf-ocr-api

Extract and convert text from PDF documents using advanced optical character recognition technology with support for multiple AI models.

csp

5.0

(3)

Bulk Pdf To Json OCR

gagandeo/bulk-pdf-to-json-ocr

Convert PDF invoices, menus, images with text and documents into structured JSON. Features hybrid Digital+OCR parsing and AI-powered data extraction.

Kumar Gagandeo

Image To Text Ai

welcoming_fireplace/image-to-text-ai

A powerful OCR tool that goes beyond standard text extraction. Powered by a Premium Vision AI model, it accurately reads handwriting, preserves table structures, and converts messy receipts or documents into structured JSON or Markdown. Supports batch processing for high-volume workflows.

Richmond Nkrumah

PDF Intelligence

marielise.dev/pdf-intelligence

Stop fighting PDFs. Extract text, tables, and insights from any document, scanned or digital. Get RAG-ready chunks for LangChain & LlamaIndex. AI-powered summaries, classification, entity extraction. Use our API keys or bring your own (50% discount). From PDF chaos to clean data in minutes.

Marielise

PDF to Markdown Converter - AI-Powered with OCR & Tables

clearpath/pdf-to-markdown-api

Convert PDFs to clean Markdown with GPU-accelerated AI. Extracts tables, LaTeX formulas, and images from complex layouts. Supports OCR for scanned docs in 8 languages. Batch process hundreds of PDFs in parallel via URL, upload, or API.