Schema.org Structured Data Bulk Validator

Pricing

from $9.00 / 1,000 results

Developer

太郎 山田

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 2
Monthly active users: 1
Last modified: 2 days ago

Crawl and validate JSON-LD and Microdata structured data across multiple pages. Detect missing, malformed, or incomplete Schema.org markup at scale.

What does this actor do?

Fetches HTML pages, extracts JSON-LD (<script type="application/ld+json">) and Microdata (itemscope/itemtype) blocks, then validates them against Schema.org best practices. Each page gets a quality score (0-100) and grade (A-F).
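
The extraction step described above can be sketched with Python's standard library alone. This is an illustrative sketch, not the actor's actual code: the class name `StructuredDataExtractor` and the function `extract` are hypothetical, and only top-level `itemtype` values are collected for Microdata.

```python
import json
from html.parser import HTMLParser


class StructuredDataExtractor(HTMLParser):
    """Collects raw JSON-LD script bodies and Microdata itemtype values."""

    def __init__(self):
        super().__init__()
        self.json_ld_raw = []      # text content of each ld+json script
        self.microdata_types = []  # itemtype of each itemscope element
        self._in_json_ld = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        script_type = (attr_map.get("type") or "").strip().lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []
        if "itemscope" in attr_map:
            # itemtype may be absent; None is recorded in that case
            self.microdata_types.append(attr_map.get("itemtype"))

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self.json_ld_raw.append("".join(self._buffer))
            self._in_json_ld = False


def extract(html):
    """Return parsed JSON-LD blocks, Microdata types, and parse errors."""
    parser = StructuredDataExtractor()
    parser.feed(html)
    parser.close()
    blocks, errors = [], []
    for raw in parser.json_ld_raw:
        try:
            blocks.append(json.loads(raw))
        except json.JSONDecodeError as exc:
            errors.append(f"invalid JSON-LD: {exc.msg}")
    return {"jsonLd": blocks, "microdata": parser.microdata_types, "errors": errors}
```

Calling `extract(page_html)` on a fetched page returns a dictionary whose keys loosely mirror the `jsonLd`, `microdata`, and `errors` fields of the output example further down; the exact shape the actor produces may differ.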

Key Features

  • 🔍 JSON-LD + Microdata extraction — Both formats supported
  • 📊 Quality scoring — 0-100 with A-F grade per page
  • ⚠️ Error detection — Missing @type, invalid JSON, missing @context
  • 💡 Warnings — Sparse data, missing recommended properties
  • 📋 Bulk processing — Check up to 200 URLs per run
  • 🪝 Webhook delivery — Integrate into SEO monitoring workflows
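
The error and warning categories in the list above could be checked along these lines. This is a minimal sketch under stated assumptions: `check_block` is a hypothetical helper, the message strings are invented for illustration, and counting all top-level keys (including `@context` and `@type`) toward the sparse-data threshold is a guess about how the actor counts properties.

```python
def check_block(block, min_properties=5):
    """Return (errors, warnings) for one parsed JSON-LD block."""
    errors, warnings = [], []
    if not isinstance(block, dict):
        return ["JSON-LD block is not an object"], warnings
    if "@type" not in block:
        errors.append("missing @type")
    if "@context" not in block:
        errors.append("missing @context")
    elif "schema.org" not in str(block["@context"]):
        warnings.append("@context does not reference schema.org")
    # Assumption: every top-level key counts as a property
    if len(block) < min_properties:
        warnings.append(f"sparse data: only {len(block)} top-level keys")
    return errors, warnings
```

Invalid JSON is not handled here because it is detected earlier, when the script body is parsed. Blocks that wrap their entities in `@graph` would need extra handling that this sketch omits.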

Validation Rules

| Check | Points | Description |
| --- | --- | --- |
| JSON-LD present | 40 | At least one valid JSON-LD block |
| Microdata present | 20 | itemscope/itemtype found |
| No errors | 20 | No parse errors or missing @type |
| Schema.org context | 10 | Proper @context with schema.org |
| Rich data | 10 | 5+ properties in JSON-LD |
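
The five checks sum to 100 points, so the score can be sketched as a simple additive function. The point values below come from the table; the grade thresholds are an assumption, since the only documented data point is that a total of 80 maps to grade A in the output example. `score_page` and `grade_for` are hypothetical names.

```python
def grade_for(total):
    """Map a 0-100 score to a letter grade (thresholds are assumed)."""
    # Only "80 -> A" is confirmed by the output example; the rest is a guess.
    for threshold, letter in ((80, "A"), (60, "B"), (40, "C"), (20, "D")):
        if total >= threshold:
            return letter
    return "F"


def score_page(json_ld_blocks, microdata_items, errors):
    """Add up the points from the validation rules table."""
    total = 0
    if json_ld_blocks:
        total += 40  # JSON-LD present
    if microdata_items:
        total += 20  # Microdata present
    if not errors:
        total += 20  # No errors
    if any("schema.org" in str(b.get("@context", "")) for b in json_ld_blocks):
        total += 10  # Schema.org context
    if any(len(b) >= 5 for b in json_ld_blocks):
        total += 10  # Rich data: 5+ properties
    return {"total": total, "grade": grade_for(total)}
```

Under these rules, a page with one rich, error-free JSON-LD block and no Microdata scores 40 + 20 + 10 + 10 = 80, which matches the output example below.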

Input Example

{
  "urls": ["https://www.google.com", "https://github.com", "https://schema.org"],
  "concurrency": 3
}
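
Before starting a run, the input can be checked on the client side. The sketch below is a hypothetical helper (`build_input` is not part of the actor): the field names `urls` and `concurrency` and the 200-URL limit come from this page, while the deduplication step and the default concurrency of 3 are assumptions.

```python
from urllib.parse import urlparse

MAX_URLS = 200  # per-run limit stated in the feature list


def build_input(urls, concurrency=3):
    """Validate and deduplicate URLs, then return the actor input object."""
    seen, cleaned = set(), []
    for url in urls:
        url = url.strip()
        parsed = urlparse(url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError(f"not an absolute http(s) URL: {url!r}")
        if url not in seen:  # keep first occurrence, preserve order
            seen.add(url)
            cleaned.append(url)
    if not cleaned:
        raise ValueError("at least one URL is required")
    if len(cleaned) > MAX_URLS:
        raise ValueError(f"too many URLs: {len(cleaned)} > {MAX_URLS}")
    if concurrency < 1:
        raise ValueError("concurrency must be at least 1")
    return {"urls": cleaned, "concurrency": concurrency}
```

The returned dictionary can be serialized with `json.dumps` and pasted into the actor's input editor or passed to whichever client is used to start the run.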

Output Example

{
  "url": "https://www.google.com",
  "jsonLd": [
    { "type": "WebSite", "context": "https://schema.org", "name": "Google", "_keyCount": 7 }
  ],
  "microdata": [],
  "errors": [],
  "warnings": [],
  "score": { "total": 80, "grade": "A" }
}
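
Results in this shape are easy to post-process for SEO monitoring. The following sketch assumes a list of result items with the fields shown above; `summarize` and its `min_score` cutoff of 60 are hypothetical choices, not part of the actor.

```python
from collections import Counter


def summarize(items, min_score=60):
    """Count pages per grade and list URLs that need attention."""
    grades = Counter(item["score"]["grade"] for item in items)
    needs_attention = [
        item["url"]
        for item in items
        # Flag pages with a low score or with any reported error
        if item["score"]["total"] < min_score or item["errors"]
    ]
    return {"grades": dict(grades), "needsAttention": needs_attention}
```

For example, feeding it the dataset items of a run yields a grade histogram plus a short list of URLs to review first.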

Cost

Zero external costs. Simple HTTP page fetches — no API keys, no proxies, no browser. A run checking 50 URLs takes ~20 seconds.
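
As a rough planning aid, the listed price and the quoted timing can be turned into an estimate. This is back-of-the-envelope arithmetic under loud assumptions: one result per URL, the starting price of $9.00 per 1,000 results, and run time scaling linearly from the "50 URLs in ~20 seconds" figure. `estimate_run` is a hypothetical helper.

```python
def estimate_run(n_urls, price_per_1000=9.00, seconds_per_50_urls=20.0):
    """Return (cost in USD, duration in seconds) for a run of n_urls."""
    # Assumption: each URL produces exactly one billed result
    cost = n_urls / 1000 * price_per_1000
    # Assumption: duration scales linearly with the number of URLs
    seconds = n_urls / 50 * seconds_per_50_urls
    return round(cost, 2), round(seconds, 1)
```

Under these assumptions, a 50-URL run comes to about $0.45 and a full 200-URL run to about $1.80 and 80 seconds; actual figures depend on the pricing tier and on how quickly the target sites respond.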

Commercial Ops

Set up .env first:

cp -n .env.example .env

Cloud Task/Schedule setup (idempotent):

npm run apify:cloud:setup

Daily reliability checks:

npm run canary:check
npm run contract:test:live

OpenClaw cron commands:

  • openclaw-cron-commands.md