Pricing

Pay per event

Try for free

Go to Apify Store

ID to JSON Parser

Try for free

Extract structured JSON data from passports, driver’s licenses, and ID cards using advanced AI vision. Automatically capture personal details, document info, dates, and all relevant fields from ID images, turning them into clean, accurate JSON for fast verification workflows.

Pricing

Pay per event

Rating

5.0

(1)

Developer

ParseForge

Actor stats

Bookmarked

Total users

Monthly active users

20 days ago

Last modified

🪪 ID to JSON Parser

🚀 Extract structured data from identity documents in seconds. Upload passports, driver's licenses, or ID cards. No coding, no manual data entry required.

🕒 Last updated: 2026-04-23 · 📊 15+ fields · 🔄 Runs on Apify cloud or locally · 📁 Export: JSON, CSV, Excel

The ID to JSON Parser converts identity documents into clean, structured JSON data using advanced AI vision. Upload images of passports, driver's licenses, or ID cards and get back structured data with names, document numbers, dates, addresses, and all other visible fields. Each document yields 15+ data points extracted automatically, ready for database ingestion, compliance workflows, or verification systems.

Built for companies processing identity documents at scale, KYC compliance teams automating customer verification, HR departments digitizing employee records, and developers building identity data pipelines. The parser handles multiple image formats (JPEG, PNG, WebP) and PDFs, processes multi-page documents, and lets you specify exactly which fields to extract. No manual data entry, no template configuration, just upload and get structured output.

Target Audience	Use Cases
KYC/Compliance Teams	Customer identity verification
HR Departments	Employee document digitization
Financial Institutions	Account opening automation
Immigration Services	Document processing and validation
Insurance Companies	Policy holder verification
Software Developers	ID extraction pipeline integration

📋 What the ID to JSON Parser does

🆔 Identifies document type automatically, distinguishing between passports, driver's licenses, national ID cards, and other identity documents
👤 Extracts full personal details including first name, last name, middle name, date of birth, gender, and nationality
📄 Captures document metadata such as document number, issue date, expiry date, and issuing authority
📍 Pulls address information including street, city, state, and postal code when present on the document
🧬 Reads physical details like height, weight, eye color, and hair color from driver's licenses and ID cards
🔧 Supports custom field selection so you can specify exactly which fields to extract for your workflow

The parser processes uploaded images through AI vision models that understand document layouts across countries and formats. It handles different orientations, image quality levels, and document types without manual template configuration.

💡 Why it matters: Manual data entry from ID documents is slow, expensive, and error-prone. This parser automates the entire process, extracting structured data from any identity document in seconds instead of minutes.

🎬 Full Demo

🚧 Coming soon...

⚙️ Input

Field	Type	Required	Description
idImage	array	Yes	One or more ID document images or PDFs. Supports JPEG, PNG, GIF, WebP, and PDF formats.
maxItems	integer	No	Maximum documents to process. Free users: limited to 10. Paid users: up to 1,000,000.
fieldsToExtract	string	No	Comma-separated list of specific fields to extract (e.g., "firstName, lastName, dateOfBirth"). Leave blank for all fields.
systemPrompt	string	No	Custom instructions to guide extraction. Leave blank for default behavior.

Example 1: Extract all fields from an ID image

{
  "idImage": ["https://example.com/passport-scan.jpg"],
  "maxItems": 10
}

Example 2: Extract specific fields only

{
  "idImage": ["https://example.com/drivers-license.png"],
  "fieldsToExtract": "firstName, lastName, dateOfBirth, documentNumber, expiryDate",
  "maxItems": 5
}

⚠️ Good to Know: You can upload multiple documents in a single run by adding more URLs to the idImage array. The parser handles both image files and multi-page PDFs. Free users are automatically limited to 10 documents per run.

📊 Output

🧾 Schema

Emoji	Field	Type	Description
📄	documentType	string	Type of document (passport, driver's license, ID card)
🆔	documentNumber	string	Unique document number or ID
👤	firstName	string	First name
👤	lastName	string	Last name
👤	middleName	string	Middle name (if present)
🌍	nationality	string	Nationality or citizenship
🌍	country	string	Country of issue
📅	dateOfBirth	string	Date of birth
📅	issueDate	string	Document issue date
📅	expiryDate	string	Document expiry date
🧬	gender	string	Gender
📏	height	string	Height (driver's licenses)
👁️	eyeColor	string	Eye color (driver's licenses)
📍	address	string	Full address on document
🖼️	sourceImage	string	URL of the processed image
❌	error	string	Error message if extraction failed

📦 Sample records

✨ Why choose this Actor

Feature	Details
🪪 Multi-document support	Passports, driver's licenses, national ID cards, and more
🌍 International coverage	Handles documents from countries worldwide
🔧 Custom field selection	Extract only the fields you need
📄 PDF support	Processes multi-page PDF documents
🖼️ Multiple image formats	JPEG, PNG, GIF, WebP
⚡ Fast processing	Seconds per document, not minutes
📁 Structured output	Clean JSON ready for databases and APIs

📈 Typical performance: Processes 1 document in 3-5 seconds. A batch of 50 documents completes in about 4 minutes.

📈 How it compares to alternatives

Feature	This Actor	Manual Data Entry	Generic OCR Tools
Structured JSON output	✅	❌	Partial
Multi-document type support	✅	✅ (slow)	Partial
International document handling	✅	✅ (slow)	Partial
Custom field selection	✅	N/A	❌
Batch processing	✅	❌	Partial
No template configuration	✅	N/A	❌
API and automation support	✅	❌	Partial

Built specifically for identity documents, with AI vision that understands document layouts rather than generic text recognition.

🚀 How to use

Create a free Apify account - Sign up here (includes free credits)
Open the ID to JSON Parser - Navigate to the Actor page and click "Start"
Upload your documents - Add image URLs or PDF links to the idImage field
Choose fields (optional) - Specify which fields to extract, or leave blank for all
Run and download - Click "Start", wait for results, then export as JSON, CSV, or Excel

⏱️ First results appear in under 10 seconds. A typical run processing 10 documents completes in about 1 minute.

💼 Business use cases

KYC & Compliance

Automate identity verification workflows
Extract document data for compliance checks
Process customer onboarding documents at scale
Build audit trails from ID submissions

Human Resources

Digitize employee identity documents
Verify work authorization documents
Build structured employee records from scans
Process new hire paperwork automatically

Financial Services

Accelerate account opening processes
Extract data for anti-money laundering checks
Validate customer identity across channels
Automate loan application document processing

Insurance & Healthcare

Process policyholder identity documents
Verify patient identity for medical records
Extract data from insurance claims submissions
Digitize legacy paper-based ID archives

🌟 Beyond business use cases

Data like this powers more than commercial workflows. The same structured records support research, education, civic projects, and personal initiatives.

🎓 Research and academia

Empirical datasets for papers, thesis work, and coursework
Longitudinal studies tracking changes across snapshots
Reproducible research with cited, versioned data pulls
Classroom exercises on data analysis and ethical scraping

🎨 Personal and creative

Side projects, portfolio demos, and indie app launches
Data visualizations, dashboards, and infographics
Content research for bloggers, YouTubers, and podcasters
Hobbyist collections and personal trackers

🤝 Non-profit and civic

Transparency reporting and accountability projects
Advocacy campaigns backed by public-interest data
Community-run databases for local issues
Investigative journalism on public records

🧪 Experimentation

Prototype AI and machine-learning pipelines with real data
Validate product-market hypotheses before engineering spend
Train small domain-specific models on niche corpora
Test dashboard concepts with live input

🤖 Ask an AI assistant about this scraper

Open a ready-to-send prompt about this ParseForge actor in the AI of your choice:

❓ Frequently Asked Questions

🔌 Automating ID to JSON Parser

Node.js

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });
const run = await client.actor("parseforge/id-to-json-parser").call({
  idImage: ["https://example.com/passport.jpg"],
  maxItems: 10
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient
client = ApifyClient("YOUR_API_TOKEN")
run = client.actor("parseforge/id-to-json-parser").call(run_input={
    "idImage": ["https://example.com/passport.jpg"],
    "maxItems": 10
})
items = list(client.dataset(run["defaultDatasetId"]).iterate_items())
print(items)

Schedules: While ID parsing is typically triggered on demand, you can set up Apify Schedules to process batch uploads at regular intervals, such as daily processing of submitted documents.

🔌 Integrate with any app

🔗 Make (Integromat) - Connect ID parsing output to CRM, compliance, or database apps
🔗 Zapier - Trigger workflows when new ID data is extracted
🔗 Slack - Send notifications when document processing completes
🔗 Airbyte - Sync extracted data to your data warehouse
🔗 GitHub - Automate document processing pipelines with GitHub Actions
🔗 Google Drive - Export parsed data to Google Sheets

🔗 Recommended Actors

Actor	Description
📄 PDF to JSON Parser	Extract structured data from any PDF document
🔍 HTML to JSON Smart Parser	Convert HTML content to structured JSON
🎤 Audio Transcriber	Transcribe audio files to text
📝 CV Optimizer	Analyze and optimize resume/CV documents
🔗 Address Normalizer	Standardize and validate extracted addresses

💡 Pro Tip: Use the ID to JSON Parser to extract address data, then pipe it through the Address Normalizer to standardize and validate the results.

🆘 Need Help? Open our contact form and we will get back to you within 24 hours. For bug reports, feature requests, or integration help, we are here to assist.

Disclaimer: This Actor is provided as-is, without warranty. Users are responsible for ensuring they have proper authorization to process identity documents and comply with applicable privacy laws and data protection regulations. The authors are not responsible for how the extracted data is used. Always verify extracted data for critical applications.

Facebook Url To Id Scraper

api-empire/facebook-url-to-id-scraper

Facebook Url To Id Scraper converts any Facebook profile, page, group, or post URL into its exact numeric ID. Get fast, reliable ID extraction for automation, data workflows, and integrations. Ideal for marketers, analysts, and developers needing clean structured IDs.

API Empire

Document Verification

vivid_astronaut/document-verification

Verify identity documents using AI. Validates passports, driver licenses, and national IDs. Checks authenticity, extracts data, and validates MRZ codes. Essential for KYC compliance.

Fabio Suizu

Facebook Url To Id Scraper

scraper-engine/facebook-url-to-id

The Facebook URL To ID Scraper converts Facebook profile, page, group, and post URLs into their numeric IDs. It extracts clean, structured identifiers for automation, data matching, integrations, and advanced scraping workflows, delivered in JSON or CSV.

Scraper Engine

5.0

mango product images actor

concrete2/mango-product-images-actor

Find all related images to products by product ID and color ID

George

HTML to JSON Smart Parser

parseforge/html-to-json-smart-parser

Convert HTML to structured JSON using AI! Uses OpenAI to extract and structure data from HTML into clean JSON format. Perfect for developers and data analysts who need to transform HTML into structured data without manual parsing.

ParseForge

5.0

TikTok Profile Id Scraper

alpha-scraper/tiktok-profile-id-scraper

TikTok Profile Scraper extracts public profile data like user ID, sec_uid, and username from one or multiple TikTok profiles. Supports URLs or usernames, delivers clean structured JSON output, and is perfect for automation, analytics, and bulk data collection workflows.

Alpha Scraper

Facebook Url To Id

scrapers-hub/facebook-url-to-id

Facebook URL to ID tool to convert Facebook profile or page URLs into unique IDs quickly 🔗🆔 Ideal for developers, data tracking, and integrations. Fast, simple, and accurate conversion.

Scrapers Hub

Document To Json MCP

opportunity-biz/document-to-json-mcp

Turn PDFs into structured JSON in seconds. AI-powered, no coding needed.

opportunity-biz

Pdf Json Extractor

p6t_p10n/pdf-json-extractor

Convert any PDF into structured JSON using AI and OCR (Tesseract or Google Vision). Supports custom schemas, validation, and auto-repair. Ideal for invoices, contracts, receipts, and automation workflows. Fast, accurate, and easy to integrate.

Peerapat Pongnipakorn

PDF To JSON Parser

parseforge/pdf-to-json-parser

Convert PDF documents into structured JSON using AI-powered OCR and smart data extraction. The Actor processes every page to ensure complete coverage, then identifies text, fields, tables, and key details, delivering clean, organized JSON ready for automation or analysis.

ParseForge

5.0