Pricing

Pay per usage

Structured Data Validator + Rich Results & AEO Checker

Detect ALL schema.org structured data (JSON-LD + microdata) on any public page, validate each type against Google Rich Results requirements, and get a 0-100 validity score, per-type eligibility, AEO readiness signals, and concrete prop-level fixes.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Tommy G

Actor stats

Bookmarked

Total users

Monthly active users

2 months ago

Last modified

Structured Data Validator + Rich Results & AEO Checker (Apify Actor)

Give it any public page URL and it detects every schema.org structured-data type on the page (both JSON-LD and microdata), validates each type against Google Rich Results requirements, and tells you exactly what's missing to win a rich result — plus an AEO (Answer-Engine Optimization) readiness read for AI search. HTML-only (no headless browser), fast, cheap, deterministic.

Built for SEO audits at scale, rich-result eligibility monitoring, schema QA in CI, competitor structured-data teardown, and AEO readiness checks.

What it validates

Per-type Google REQUIRED + RECOMMENDED property coverage for:

Article / NewsArticle / BlogPosting, Product (merchant listing and product snippet), Offer / AggregateOffer, AggregateRating / Review, Recipe, Event, JobPosting, LocalBusiness (+ subtypes like Restaurant/Store), FAQPage, HowTo, BreadcrumbList, VideoObject, Organization, Course.

It encodes the real-world gotchas: HowTo is deprecated (detected for AEO but never a rich result), FAQPage rich results are deprecated (validated structurally only), Article & Organization have no Google-required props, Product splits into two experiences, conditional-required logic (offers ⇒ price), nested vs standalone ratings/reviews, BreadcrumbList needs ≥2 items, ratingValue must be in range, and microdata counts exactly like JSON-LD.

Input

{ "startUrls": [{ "url": "https://www.example.com/product/123" }], "maxConcurrency": 5, "maxPages": 100 }

maxPages capped at 200, maxConcurrency at 20.

{
  "status": "ok",
  "requested_url": "...",
  "final_url": "...",
  "http_status": 200,
  "found": true,
  "complete": true,
  "page_type": "rich-eligible",
  "source": "json-ld",
  "detected_types": ["Product", "Offer", "AggregateRating", "BreadcrumbList"],
  "entities": [
    {
      "type": "Product",
      "source": "json-ld",
      "required_missing": [],
      "recommended_missing": ["offers.shippingDetails", "offers.hasMerchantReturnPolicy"],
      "rich_result_eligible": true,
      "errors": []
    }
  ],
  "rich_result_eligible_types": ["Product", "BreadcrumbList"],
  "validity_score": 88,
  "aeo_signals": {
    "has_faq": false,
    "has_howto": false,
    "has_breadcrumb": true,
    "has_article_meta": false,
    "answer_upfront": false
  },
  "fixes": [
    "Product.offers.priceCurrency missing — add ISO 4217 code for merchant listing eligibility."
  ],
  "extracted_at": "2026-06-04T..."
}

Field guide

detected_types — every distinct schema.org @type found (JSON-LD + microdata, @graph and nested types flattened in, deduped, first-seen order).
entities[] — one per recognized typed node: its required_missing, recommended_missing, rich_result_eligible, and hard errors (malformed date, rating out of range, non-numeric price, breadcrumb < 2 items, conditional-required miss).
rich_result_eligible_types — types where at least one entity fully passes Google's REQUIRED props for a live rich result.
validity_score — integer 0-100. REQUIRED-coverage dominates (80%), RECOMMENDED is bonus (20%), hard errors subtract. Capped at 79 until at least one type is rich-result eligible (so a page can't score "green" without a real, eligible type). 0 when nothing is found.
aeo_signals — has_faq, has_howto, has_breadcrumb, has_article_meta, answer_upfront (answer-first content ≤300 chars) for Answer-Engine Optimization.
fixes[] — concrete, prop-level remediation naming Type.property + why it matters.

found / complete semantics

found = true iff ≥1 entity exists with a recognized non-empty schema.org @type (JSON-LD or microdata). A page with only <title>/OpenGraph, or only empty {} / @type-less nodes ⇒ found = false (never overbills).
complete = true iff at least one type is rich_result_eligible. HowTo (deprecated) and bare FAQPage never make a page complete on their own.

Run locally / test

npm install
npm test     # 60 unit tests on the pure validator (node:test) — rich-result rules + edge cases

Publish to Apify (account-holder's step)

$npm install -g apify-cli && apify login && apify push

Notes / safety

SSRF-guarded, robots-respecting, rate-limited, cost-capped (shared src/lib/actor_runner.js).
Stores only derived validation results — no raw page bodies.
HTML-only: client-rendered pages that inject JSON via JS return found:false with render_required:true. Core logic in src/validate.js (pure, deterministic, unit-tested).
Validation rules web-verified against developers.google.com/search/docs structured-data docs (June 2026).

Schema.org / JSON-LD Validator

pattonholdings/schema-validator

Extract and validate JSON-LD + microdata structured data on any URL against schema.org. Checks 17 Google Rich Results types (Article, Product, Recipe, FAQ, etc), flags missing required properties. Use for SEO rich-snippet audits. Input: url or urls[]. Output: JSON entities + grade.

Coleton Patton

Structured Data Validator (JSON-LD / OG)

jungle_synthesizer/structured-data-validator-pro

Extract and validate structured data from any URL: JSON-LD, Open Graph, Twitter Cards, microdata, RDFa, meta tags. Local schema.org validation. Flags Google rich-result eligibility and AI-discovery readiness. Pure HTTP. Built for SEO audits and structured-data debugging at scale.

BowTiedRaccoon

Public Structured Data & Rich Results Readiness Agent

jacksu/public-structured-data-readiness-agent

Audit public JSON-LD, Microdata, RDFa, Schema.org types, rich-result candidate signals, missing recommended fields, and change hashes with useful-result pricing.

jack su

Website JSON-LD and Schema.org Extractor

automationagents/web-json-ld

Extract structured JSON-LD and Schema.org data from any web page. Pull products, articles, breadcrumbs, and rich results for SEO and data work.

Alex Jordan

Schema.org Structured Data Validator

anaselgamed/schema-validator

Validate and audit Schema.org structured data (JSON-LD) on any webpage. Check for errors, warnings, and missing fields. Ensure your rich snippets display correctly in Google search results.

Anas Hossam

Rich Results Schema.org Tester

alizarin_refrigerator-owner/rich-results-tester

Test any URL for Google Rich Results eligibility. Validate structured data, identify schema errors & get recommendations. Batch test multiple pages for SEO audits.

The Howlers

Schema Markup Validator

maximedupre/schema-markup-validator

Validate schema markup on public pages. Extract JSON-LD, Microdata, RDFa, Open Graph, Twitter Cards, meta tags, schema.org types, issue counts, and rich-result readiness signals.

Maxime Dupré

GEO / AEO Website Audit — AI Search Readiness

rtworule/ai-search-readiness-auditor

Audit public websites for GEO and AEO readiness: AI crawler access, robots.txt, llms.txt, sitemaps, Schema.org, social metadata, scores, and prioritized fixes.

Kunteper Koyu

JSON-LD & Schema.org Extractor

andok/jsonld-extractor

Extract structured microdata (JSON-LD) from webpages to audit SEO schema implementations and rich snippets.

Andok

Schema Markup Generator: JSON-LD for Google Rich Results

raional/schema-markup-generator

Generate valid schema.org JSON-LD structured data for Product, Article, FAQPage, LocalBusiness, Review, Recipe, Event, JobPosting, HowTo, VideoObject, Organization and WebSite — from structured fields, or auto-detected from a page URL. Validated against Google's rich-result requirements.