ID to JSON Parser avatar

ID to JSON Parser

Pricing

Pay per event

Go to Apify Store
ID to JSON Parser

ID to JSON Parser

Extract structured JSON data from passports, driver’s licenses, and ID cards using advanced AI vision. Automatically capture personal details, document info, dates, and all relevant fields from ID images, turning them into clean, accurate JSON for fast verification workflows.

Pricing

Pay per event

Rating

5.0

(1)

Developer

ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

4

Total users

0

Monthly active users

2 days ago

Last modified

Share

ParseForge Banner

🪪 ID to JSON Parser

🚀 Extract structured data from identity documents in seconds. Upload passports, driver's licenses, or ID cards. No coding, no manual data entry required.

🕒 Last updated: 2026-04-16 · 📊 15+ fields · 🔄 Runs on Apify cloud or locally · 📁 Export: JSON, CSV, Excel

The ID to JSON Parser converts identity documents into clean, structured JSON data using advanced AI vision. Upload images of passports, driver's licenses, or ID cards and get back structured data with names, document numbers, dates, addresses, and all other visible fields. Each document yields 15+ data points extracted automatically, ready for database ingestion, compliance workflows, or verification systems.

Built for companies processing identity documents at scale, KYC compliance teams automating customer verification, HR departments digitizing employee records, and developers building identity data pipelines. The parser handles multiple image formats (JPEG, PNG, WebP) and PDFs, processes multi-page documents, and lets you specify exactly which fields to extract. No manual data entry, no template configuration, just upload and get structured output.

Target AudienceUse Cases
KYC/Compliance TeamsCustomer identity verification
HR DepartmentsEmployee document digitization
Financial InstitutionsAccount opening automation
Immigration ServicesDocument processing and validation
Insurance CompaniesPolicy holder verification
Software DevelopersID extraction pipeline integration

📋 What the ID to JSON Parser does

  • 🆔 Identifies document type automatically, distinguishing between passports, driver's licenses, national ID cards, and other identity documents
  • 👤 Extracts full personal details including first name, last name, middle name, date of birth, gender, and nationality
  • 📄 Captures document metadata such as document number, issue date, expiry date, and issuing authority
  • 📍 Pulls address information including street, city, state, and postal code when present on the document
  • 🧬 Reads physical details like height, weight, eye color, and hair color from driver's licenses and ID cards
  • 🔧 Supports custom field selection so you can specify exactly which fields to extract for your workflow

The parser processes uploaded images through AI vision models that understand document layouts across countries and formats. It handles different orientations, image quality levels, and document types without manual template configuration.

💡 Why it matters: Manual data entry from ID documents is slow, expensive, and error-prone. This parser automates the entire process, extracting structured data from any identity document in seconds instead of minutes.


🎬 Full Demo

🚧 Coming soon...


⚙️ Input

FieldTypeRequiredDescription
idImagearrayYesOne or more ID document images or PDFs. Supports JPEG, PNG, GIF, WebP, and PDF formats.
maxItemsintegerNoMaximum documents to process. Free users: limited to 10. Paid users: up to 1,000,000.
fieldsToExtractstringNoComma-separated list of specific fields to extract (e.g., "firstName, lastName, dateOfBirth"). Leave blank for all fields.
systemPromptstringNoCustom instructions to guide extraction. Leave blank for default behavior.

Example 1: Extract all fields from an ID image

{
"idImage": ["https://example.com/passport-scan.jpg"],
"maxItems": 10
}

Example 2: Extract specific fields only

{
"idImage": ["https://example.com/drivers-license.png"],
"fieldsToExtract": "firstName, lastName, dateOfBirth, documentNumber, expiryDate",
"maxItems": 5
}

⚠️ Good to Know: You can upload multiple documents in a single run by adding more URLs to the idImage array. The parser handles both image files and multi-page PDFs. Free users are automatically limited to 10 documents per run.


📊 Output

🧾 Schema

EmojiFieldTypeDescription
📄documentTypestringType of document (passport, driver's license, ID card)
🆔documentNumberstringUnique document number or ID
👤firstNamestringFirst name
👤lastNamestringLast name
👤middleNamestringMiddle name (if present)
🌍nationalitystringNationality or citizenship
🌍countrystringCountry of issue
📅dateOfBirthstringDate of birth
📅issueDatestringDocument issue date
📅expiryDatestringDocument expiry date
🧬genderstringGender
📏heightstringHeight (driver's licenses)
👁️eyeColorstringEye color (driver's licenses)
📍addressstringFull address on document
🖼️sourceImagestringURL of the processed image
errorstringError message if extraction failed

📦 Sample records


✨ Why choose this Actor

FeatureDetails
🪪 Multi-document supportPassports, driver's licenses, national ID cards, and more
🌍 International coverageHandles documents from countries worldwide
🔧 Custom field selectionExtract only the fields you need
📄 PDF supportProcesses multi-page PDF documents
🖼️ Multiple image formatsJPEG, PNG, GIF, WebP
⚡ Fast processingSeconds per document, not minutes
📁 Structured outputClean JSON ready for databases and APIs

📈 Typical performance: Processes 1 document in 3-5 seconds. A batch of 50 documents completes in about 4 minutes.


📈 How it compares to alternatives

FeatureThis ActorManual Data EntryGeneric OCR Tools
Structured JSON outputPartial
Multi-document type support✅ (slow)Partial
International document handling✅ (slow)Partial
Custom field selectionN/A
Batch processingPartial
No template configurationN/A
API and automation supportPartial

Built specifically for identity documents, with AI vision that understands document layouts rather than generic text recognition.


🚀 How to use

  1. Create a free Apify account - Sign up here (includes free credits)
  2. Open the ID to JSON Parser - Navigate to the Actor page and click "Start"
  3. Upload your documents - Add image URLs or PDF links to the idImage field
  4. Choose fields (optional) - Specify which fields to extract, or leave blank for all
  5. Run and download - Click "Start", wait for results, then export as JSON, CSV, or Excel

⏱️ First results appear in under 10 seconds. A typical run processing 10 documents completes in about 1 minute.


💼 Business use cases

KYC & Compliance

  • Automate identity verification workflows
  • Extract document data for compliance checks
  • Process customer onboarding documents at scale
  • Build audit trails from ID submissions

Human Resources

  • Digitize employee identity documents
  • Verify work authorization documents
  • Build structured employee records from scans
  • Process new hire paperwork automatically

Financial Services

  • Accelerate account opening processes
  • Extract data for anti-money laundering checks
  • Validate customer identity across channels
  • Automate loan application document processing

Insurance & Healthcare

  • Process policyholder identity documents
  • Verify patient identity for medical records
  • Extract data from insurance claims submissions
  • Digitize legacy paper-based ID archives

🔌 Automating ID to JSON Parser

Node.js

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor("parseforge/id-to-json-parser").call({
idImage: ["https://example.com/passport.jpg"],
maxItems: 10
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("parseforge/id-to-json-parser").call(run_input={
"idImage": ["https://example.com/passport.jpg"],
"maxItems": 10
})
items = list(client.dataset(run["defaultDatasetId"]).iterate_items())
print(items)

Schedules: While ID parsing is typically triggered on demand, you can set up Apify Schedules to process batch uploads at regular intervals, such as daily processing of submitted documents.


❓ Frequently Asked Questions


🔌 Integrate with any app

  • 🔗 Make (Integromat) - Connect ID parsing output to CRM, compliance, or database apps
  • 🔗 Zapier - Trigger workflows when new ID data is extracted
  • 🔗 Slack - Send notifications when document processing completes
  • 🔗 Airbyte - Sync extracted data to your data warehouse
  • 🔗 GitHub - Automate document processing pipelines with GitHub Actions
  • 🔗 Google Drive - Export parsed data to Google Sheets

ActorDescription
📄 PDF to JSON ParserExtract structured data from any PDF document
🔍 HTML to JSON Smart ParserConvert HTML content to structured JSON
🎤 Audio TranscriberTranscribe audio files to text
📝 CV OptimizerAnalyze and optimize resume/CV documents
🔗 Address NormalizerStandardize and validate extracted addresses

💡 Pro Tip: Use the ID to JSON Parser to extract address data, then pipe it through the Address Normalizer to standardize and validate the results.


🆘 Need Help? Open our contact form and we will get back to you within 24 hours. For bug reports, feature requests, or integration help, we are here to assist.


Disclaimer: This Actor is provided as-is, without warranty. Users are responsible for ensuring they have proper authorization to process identity documents and comply with applicable privacy laws and data protection regulations. The authors are not responsible for how the extracted data is used. Always verify extracted data for critical applications.