Pricing

Pay per event + usage

Go to Apify Store

Document AI - Extract, Summarize & Classify Files

Try for free

Automatically read PDFs and documents. Extract text, create summaries, and categorize documents by type.

Pricing

Pay per event + usage

Rating

0.0

(0)

Developer

daehwan kim

Actor stats

Bookmarked

Total users

Monthly active users

3 hours ago

Last modified

Document Intelligence MCP Server

Extract text (OCR), summarize, classify, and extract tables from document images using local AI vision models.

Features

Tool	Description	Price
`extract_text`	OCR - Extract all text from document images in markdown format	$0.05/use
`summarize_document`	Analyze and summarize document content with key information	$0.10/use
`classify_document`	Classify document type (invoice, contract, receipt, etc.) with confidence	$0.05/use
`extract_tables`	Extract all tables as structured JSON (headers + rows)	$0.10/use

Connect via Claude Desktop

Add to your Claude Desktop MCP settings:

{
  "mcpServers": {
    "document-intelligence": {
      "url": "https://ntriqpro--document-intelligence-mcp.apify.actor/mcp?token=YOUR_APIFY_TOKEN"
    }
  }
}

Supported Document Types

Invoices and receipts
Contracts and agreements
Business letters and memos
Reports and presentations
Forms and applications
ID documents and certificates
Tables, spreadsheets, and charts

Supported Languages

Responses can be generated in: English, Korean, Japanese, Chinese, Spanish.

Input

Each tool accepts:

imageUrl (required): URL of the document image
language (optional): Response language code (default: "en")

Output

Each tool returns structured JSON with the analysis results.

Example: extract_text

Input:

{
  "imageUrl": "https://example.com/invoice.png"
}

Output:

{
  "status": "success",
  "text": "# Invoice\n\n**Invoice #**: 2024-001\n**Date**: March 15, 2024\n...",
  "model": "qwen2.5-vl",
  "imageUrl": "https://example.com/invoice.png"
}

Example: classify_document

Output:

{
  "status": "success",
  "classification": {
    "document_type": "invoice",
    "confidence": 0.95,
    "language": "en",
    "key_fields": ["invoice_number", "date", "total_amount", "vendor"]
  }
}

Example: extract_tables

Output:

{
  "status": "success",
  "tables": {
    "tables": [
      {
        "headers": ["Item", "Quantity", "Price"],
        "rows": [
          ["Widget A", "10", "$5.00"],
          ["Widget B", "5", "$12.00"]
        ]
      }
    ]
  }
}

Technology

Vision Model: Qwen2.5-VL (Apache 2.0 License)
Processing: Local AI inference, zero external API calls
Privacy: Document images are processed in real-time and not stored

Open Source Licenses

This service uses the following open source models:

Qwen2.5-VL — Apache 2.0 License

Platform usage is free. You only pay per event (see pricing above).

Extend this actor with the ntriqpro intelligence network:

supply-chain-risk-mcp — MCP server for supply chain risk
video-intelligence-mcp — MCP server for video intelligence
content-factory-mcp — MCP server for content-factory

⭐ Love it? Leave a Review

Your rating helps professionals discover this actor. Rate it here.

Document Extractor API - AI-Powered PDF & Text Analysis

fresh_cliff/document-extractor-api

Extract text and data from PDF, Word, and image documents using AI-powered OCR. Convert documents to structured JSON, analyze content, and extract insights. No API keys required with mirror fallbacks.

Brennan Crawford

AI Document Assistant

devninja/ai-document-assistant

This actor analyzes uploaded documents using AI to extract and process information. It helps businesses quickly get answers from their documents and automate decision-making.

Devinja

5.0

Elite Document Ocr Lite

thepattyroller/elite-document-ocr-lite

Basic document text extraction and processing. Extract text from documents, analyze document structure, and extract structured data from invoices and receipts. Perfect for document automation workflows.

Logan Kiser

Zip Download Extraction Scraper

fresh_cliff/zip-download-extraction-scraper

Download and extract zip files automatically. Extract archives, process documents, analyze logs, backup files. Batch extract text, JSON, CSV content. Real-time data extraction API.

Brennan Crawford

Federal Register Documents Scraper

compute-edge/federal-register-scraper

Extract U.S. Federal Register documents via the public API. Filter by query, document types (rules, notices, proposed rules), publication date, and agencies. Includes full document details, citations, and regulatory information.

Compute Edge

AI Meeting Assistant - Transcribe & Summarize

ntriqpro/meeting-notes-mcp

Record meetings, automatically convert speech to text, create summaries, and generate action items.

daehwan kim

Text Summarizer

vivid_astronaut/text-summarizer

Summarize long text automatically. AI-powered compression.

Fabio Suizu

PDF OCR API - Document Extraction

alizarin_refrigerator-owner/pdf-ocr-api

Extract text from PDFs including scanned documents. OCR processing, table extraction & structured data output. Process invoices, contracts & forms at scale.

The Howlers

PDF to Markdown & JSON Converter (Docling)

actorzlab/docling-pdf-converter

Convert PDF documents to clean Markdown, structured JSON, and plain text using IBM's open-source Docling AI. Handles text PDFs and scanned documents (OCR), extracts tables and images. No external API key required — runs fully on-device.

Khalil Drissi

Pdf Power Tools

agenscrape/pdf-power-tools

Split, merge, compress, convert & OCR PDFs via API. Extract text from scanned documents in 14 languages. Compress files for email, convert pages to PNG/JPEG/WebP, split by pages or ranges, merge multiple PDFs. Perfect for document automation & data extraction workflows.