Pricing

from $10.00 / 1,000 schema generateds

JSON Schema Auto-Generator (Infer from Samples)

Provide one or more JSON samples (inline or from URLs) and get an inferred JSON Schema (Draft 7 / 2020-12) describing their shape. Bootstrap API validators, Apify input schemas, BigQuery / DuckDB schemas. Powered by genson. $0.01 per inference.

Pricing

from $10.00 / 1,000 schema generateds

Rating

0.0

(0)

Developer

Hojun Lee

Actor stats

Bookmarked

Total users

Monthly active users

9 days ago

Last modified

JSON Schema Auto-Generator

Provide one or more JSON samples (inline or from URLs) and get an inferred JSON Schema (Draft 7 / 2020-12) describing their shape. Bootstrap API validators, Apify input schemas, BigQuery / DuckDB schemas. Powered by genson. $0.01 per inference.

⚡ Run in 30 seconds

Click Start with default settings — takes the sample JSON object and returns a complete JSON Schema (Draft 7) inferred from its structure, including property types, required fields, and nested object definitions, ready to paste into a validator or API spec.

Input Parameters

Parameter	Type	Default	Description
`samples`	array	`[]`	Array of JSON objects to merge into one inferred schema.
`sample`	object	`{}`	Used when 'samples' is empty.
`sampleUrls`	array	`[]`	Fetch each URL as JSON and add to the sample set.
`schemaUri`	string	`https://json-schema.org/draft/2020-12/schema`	Which JSON Schema version to target.
`schemaTitle`	string	``	Embed in the generated schema's 'title' field.
`schemaDescription`	string	``	Embed in the generated schema's 'description' field.
`requireAll`	boolean	`true`	If false, drop the 'required' array.
`userAgent`	string	``	Custom UA when fetching sampleUrls.

Why this exists

You hit an API that returns JSON. You want to validate downstream payloads, store them in a typed table, or auto-generate TypeScript types — but writing a JSON Schema by hand from 30 nested fields is tedious.

This actor takes one or more sample payloads and infers the schema. The result is a real, RFC-compliant JSON Schema you can drop into Ajv, BigQuery, OpenAPI, Apify input_schema.json, etc.

What you get

{
  "_type": "schema",
  "samples_used": 3,
  "schema_uri": "https://json-schema.org/draft/2020-12/schema",
  "title": "User",
  "inferred_schema": {
    "$schema": "https://json-schema.org/draft/2020-12/schema",
    "type": "object",
    "title": "User",
    "additionalProperties": false,
    "properties": {
      "id": {"type": "integer"},
      "email": {"type": "string"},
      "tags": {"type": "array", "items": {"type": "string"}},
      "meta": {
        "type": "object",
        "properties": {
          "created": {"type": "string"},
          "verified": {"type": "boolean"}
        },
        "required": ["created"]
      }
    },
    "required": ["id", "email"]
  },
  "schema_str": "<pretty-printed schema as text>",
  "top_level_keys": ["id", "email", "tags", "meta"],
  "top_level_required": ["id", "email"]
}

The full schema is also saved as inferred_schema.json in the run's KeyValueStore — easy to download.

Quick start

Single sample

{
  "sample": {
    "id": 1,
    "email": "test@example.com",
    "tags": ["a", "b"],
    "meta": {"created": "2024-01-01", "verified": true}
  }
}

Merge multiple samples (recommended — better union types)

{
  "samples": [
    {"id": 1, "name": "foo"},
    {"id": 2, "name": "bar", "deleted_at": "2024-01-01"},
    {"id": 3, "name": "baz", "deleted_at": null}
  ]
}

Fetch samples from URLs

{
  "sampleUrls": [
    "https://api.github.com/users/torvalds",
    "https://api.github.com/users/octocat"
  ],
  "schemaTitle": "GitHub User",
  "schemaUri": "https://json-schema.org/draft/2020-12/schema"
}

Pricing

Pay-Per-Event: $0.01 per schema inference.

Cheap, fixed-cost. Run as many times as you want during API evolution.

Use cases

Bootstrap Apify input_schema.json — Use it on a sample input, drop the schema into your actor's .actor/input_schema.json
REST API validators — Generate Ajv-compatible schemas from real production responses
BigQuery / DuckDB tables — Convert JSON Schema → DDL with a follow-up tool
OpenAPI — Drop schemas under components.schemas for type-safe SDK generation
Comparison — Run on prod vs staging payloads; diff the schemas to spot drift

Output details

inferred_schema — the schema as a JS object (use this in code)
schema_str — pretty-printed text (paste into a file)
top_level_keys — convenience for renaming / sorting fields
additionalProperties: false is set by default for objects (strict mode). To allow extras, remove it before using the schema.

Limitations / gotchas

More samples = better schema. A single sample can't tell required vs optional. Pass 5-10 samples covering all common cases.
null handling — When a field is sometimes null, the inferred type becomes ["string", "null"] (or similar). This is correct JSON Schema but some tools (e.g. old Avro) don't like unions.
Format detection — genson doesn't infer format: date-time etc. by default. For format-aware generation, post-process manually.

Engine

genson v1.2+ — the reference Python implementation. Well-maintained, used in many data pipelines.

Feedback

A short review helps developers find it: Leave a review on Apify Store

JSON Schema Generator — Infer Draft-07 Schema from Any JSON

eliai/json-schema-inferer

Infer a JSON Schema from a sample via API. Input: a JSON URL or pasted JSON. Output: a draft-07 JSON Schema with types, required fields, nested object and array item schemas, string formats, and detected enums. Cheap flat pay-per-file pricing per inference.

Anthony Snider

JSON Schema Validator & Generator — Infer, Validate & Document

perryay/json-schema-validator-generator

Infer JSON Schema from sample JSON data and validate JSON documents against existing schemas. Supports Draft-04, Draft-07, and 2019-09. Features nested objects, array item type inference, enum detection, batch validation, and human-readable schema documentation generation.

Perry AY

JSON Schema - Infer, Validate, TypeScript

lazymac/json-schema-api

🔥 7-DAY LAUNCH SPRINT (May 1–8, 2026): First 100 runs free for new users. JSON Schema toolkit: automatically infer schemas from JSON data, validate documents against schemas, and generate TypeScript type definitions. Essential for API development and data validation workflows.

2x lazymac

CSV to JSON Converter with Schema Inference & Validation

nibble/csv-json-schema-converter

Convert CSV files to clean, typed JSON. Auto-detects delimiter, infers a JSON Schema, and validates rows against your own schema. Ideal for APIs, data pipelines and AI agents.

Simon Fletcher

MCP Tool Schema Validator

junipr/mcp-tool-schema-validator

Validate MCP tool schemas and manifests for agent-readiness, input/output clarity, and common schema mistakes.

junipr

Output & Dataset Schema Creator

zuzka/output-dataset-schema-creator

Generate JSON schemas for output and dataset on your Actor using AI. Perfect for testing new actors.

Zuzka Pelechová

Structured Data Extractor — URL to JSON

shelvick/structured-extractor

Extract structured data from a batch of URLs as schema-validated JSON. Send web pages and a JSON Schema; it scrapes each (stealth + residential proxy as needed), runs an LLM to convert the page to JSON matching your schema, and validates per URL. Omit schema for best-effort. Public pages only.

Scott Helvick

Web Structured Data Extractor (Claude, JSON Schema)

gochujang/web-structured-extractor

Pass a URL + JSON schema (or natural-language goal). Claude reads the page and returns a strict JSON object matching your schema. Product / news / hotel / real-estate / job-board extraction. BYO Anthropic API key. $0.01 per page.