Structured Data Extractor - JSON-LD, Microdata & RDFa

Pricing

Pay per usage

Extract and validate structured data from any web page for SEO. Parses JSON-LD, detects Microdata and RDFa, highlights schema.org types, and reports common markup issues.

Rating

0.0 (0)

Developer

Bikram Adhikari

Maintained by Community

Actor stats

  • Bookmarked: 0
  • Total users: 2
  • Monthly active users: 1
  • Last modified: 6 days ago

Structured Data Extractor (JSON-LD, Microdata & RDFa)

Extract structured data from any web page for SEO audits.

This Actor:

  • Extracts JSON-LD blocks (<script type="application/ld+json">)
  • Detects & extracts Microdata (itemscope, itemtype, itemprop)
  • Detects & extracts basic RDFa (property, typeof, about, resource, vocab)
  • Highlights detected schema.org types and reports common issues (missing @type, non-schema.org @context, parse errors)
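
The JSON-LD step and the issue checks listed above can be sketched with Python's standard library. This is only an illustration of the idea, not the Actor's actual code: the names `JsonLdCollector` and `audit_json_ld` are hypothetical, and the sketch inspects only top-level nodes and `@graph` members.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._buffer = None  # None means "not inside a JSON-LD script"

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").strip().lower()
        if tag == "script" and script_type == "application/ld+json":
            self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._buffer is not None:
            self.blocks.append("".join(self._buffer))
            self._buffer = None


def audit_json_ld(html):
    """Return (schema_types, issues) for the JSON-LD found in one HTML document."""
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()
    types, issues = [], []
    for index, raw in enumerate(collector.blocks):
        try:
            doc = json.loads(raw)
        except json.JSONDecodeError as exc:
            issues.append(f"block {index}: parse error ({exc.msg})")
            continue
        for node in doc if isinstance(doc, list) else [doc]:
            if not isinstance(node, dict):
                continue
            context = node.get("@context")
            # Assumption: only string contexts are checked in this sketch.
            if isinstance(context, str) and "schema.org" not in context:
                issues.append(f"block {index}: non-schema.org @context ({context})")
            graph = node.get("@graph")
            for member in graph if isinstance(graph, list) else [node]:
                if not isinstance(member, dict):
                    continue
                type_value = member.get("@type")
                if type_value is None:
                    issues.append(f"block {index}: missing @type")
                elif isinstance(type_value, list):
                    types.extend(type_value)
                else:
                    types.append(type_value)
    return types, issues
```

Calling `audit_json_ld(html)` on a fetched page returns the detected types and a list of human-readable issues. Microdata and RDFa detection would need a similar pass over `itemscope`/`itemprop` and `typeof`/`property` attributes, which this sketch leaves out.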

Input

  • Start URLs: pages to analyze.
  • Follow internal links (optional): crawl additional pages for site-wide audits.
  • Extraction toggles for JSON-LD / Microdata / RDFa.
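
To make the "follow internal links" option more concrete, here is a minimal sketch of how a same-site link filter could work. The Actor's real crawl logic is not documented on this page, so the function name `internal_links` and its `limit` parameter are hypothetical.

```python
from urllib.parse import urldefrag, urljoin, urlparse


def internal_links(page_url, hrefs, limit):
    """Resolve hrefs against page_url and keep unique same-host http(s) links."""
    host = urlparse(page_url).netloc.lower()
    seen, result = set(), []
    for href in hrefs:
        # Resolve relative links and drop the #fragment so duplicates collapse.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skips mailto:, javascript:, tel:, and similar
        if parsed.netloc.lower() != host:
            continue  # external site
        if absolute not in seen:
            seen.add(absolute)
            result.append(absolute)
        if len(result) >= limit:
            break
    return result
```

A stricter filter might also normalize trailing slashes or query strings. This sketch compares hosts exactly, so `www.example.com` and `example.com` count as different sites.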

Output

  • Dataset: one item per analyzed page (counts, detected types, warnings/errors).
  • Key-Value Store:
    • SUMMARY: run summary + top schema.org types
    • REPORT: compact per-page report

Example API call (input body)

{
  "startUrls": [{"url": "https://example.com"}, {"url": "https://json-ld.org/"}],
  "maxPages": 10,
  "followLinks": false,
  "validateSchemaOrg": true
}
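
One way to send this input over HTTP is sketched below with the Python standard library. It assumes the Apify API's synchronous "run and return dataset items" endpoint and an Actor ID derived from the Store page URL (`scrappy_garden~structured-data-extractor`). Both should be checked against the Actor's API tab before use. The helper name `build_run_request` is hypothetical.

```python
import json
import urllib.request

# Assumption: Actor ID derived from the Store URL, with "/" written as "~".
ACTOR_ID = "scrappy_garden~structured-data-extractor"


def build_run_request(token, run_input):
    """Prepare (but do not send) a POST request that runs the Actor with run_input."""
    url = f"https://api.apify.com/v2/acts/{ACTOR_ID}/run-sync-get-dataset-items"
    return urllib.request.Request(
        url,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


# Sending the request (requires a valid API token and network access):
#
#     request = build_run_request("<YOUR_API_TOKEN>", run_input)
#     with urllib.request.urlopen(request) as response:
#         items = json.load(response)
```

Building the request separately from sending it keeps the token handling and payload easy to inspect before any run is started.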

Quick start

Store page: https://apify.com/scrappy_garden/structured-data-extractor

Paste this into Input and click Run:

{
  "startUrls": [
    {
      "url": "https://example.com/"
    }
  ],
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}

Outputs (what you get)

  • Dataset: items typically include fields such as url, statusCode, title, jsonLdCount, microdataItemCount, rdfaStatementCount, schemaTypes, warnings, errors, and extractedAt.
  • Key-value store: REPORT, SUMMARY
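
For monitoring, the dataset fields listed above could feed a simple check like the following sketch. The field types are assumptions (counts as integers, `errors` as a list), and `pages_needing_attention` is a hypothetical helper, not part of the Actor.

```python
COUNT_FIELDS = ("jsonLdCount", "microdataItemCount", "rdfaStatementCount")


def pages_needing_attention(items):
    """Return (url, reasons) pairs for pages that look problematic."""
    flagged = []
    for item in items:
        reasons = []
        status = item.get("statusCode")
        if status != 200:
            reasons.append(f"HTTP status {status}")
        errors = item.get("errors") or []
        if errors:
            reasons.append(f"{len(errors)} error(s)")
        # Assumption: a missing count field means nothing was extracted.
        if sum(item.get(field, 0) for field in COUNT_FIELDS) == 0:
            reasons.append("no structured data found")
        if reasons:
            flagged.append((item.get("url"), reasons))
    return flagged
```

Pages with a clean status, no errors, and at least one extracted item are left out of the result, so an empty list means nothing needs review under these assumed rules.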

Tips (trust + predictable results)

  • Start with 1–3 URLs to validate behavior, then scale up.
  • If a target blocks requests, enable Proxy, lower the concurrency in Input, or both.
  • Use the SUMMARY / REPORT keys (when present) for automation pipelines and monitoring.
