Structured Data Extractor - JSON-LD, Microdata & RDFa
Pricing
Pay per usage
Go to Apify Store

Structured Data Extractor - JSON-LD, Microdata & RDFa
Extract and validate structured data from any web page for SEO. Parses JSON-LD, detects Microdata and RDFa, highlights schema.org types, and reports common markup issues.
Pricing
Pay per usage
Rating
0.0
(0)
Developer

Bikram Adhikari
Maintained by Community
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
6 days ago
Last modified
Categories
Share
Structured Data Extractor (JSON-LD, Microdata & RDFa)
Extract structured data from any web page for SEO audits.
This Actor:
- Extracts JSON-LD blocks (
<script type="application/ld+json">) - Detects & extracts Microdata (
itemscope,itemtype,itemprop) - Detects & extracts basic RDFa (
property,typeof,about,resource,vocab) - Highlights detected schema.org types and reports common issues (missing
@type, non-schema.org@context, parse errors)
Input
- Start URLs: pages to analyze.
- Follow internal links (optional): crawl additional pages for site-wide audits.
- Extraction toggles for JSON-LD / Microdata / RDFa.
Output
- Dataset: one item per analyzed page (counts, detected types, warnings/errors).
- Key-Value Store:
SUMMARY: run summary + top schema.org typesREPORT: compact per-page report
Example API call
{"startUrls": [{"url": "https://example.com"}, {"url": "https://json-ld.org/"}],"maxPages": 10,"followLinks": false,"validateSchemaOrg": true}
Quick start
Store page: https://apify.com/scrappy_garden/structured-data-extractor
Paste this into Input and click Run:
{"startUrls": [{"url": "https://example.com/"}],"proxyConfiguration": {"useApifyProxy": false}}
Outputs (what you get)
- Dataset: Dataset items typically include fields like:
url,statusCode,title,jsonLdCount,microdataItemCount,rdfaStatementCount,schemaTypes,warnings,errors,extractedAt. - Key-value store:
REPORT,SUMMARY
Tips (trust + predictable results)
- Start with 1–3 URLs to validate behavior, then scale up.
- If a target blocks requests, enable Proxy and/or slow down concurrency in Input.
- Use the
SUMMARY/REPORTkeys (when present) for automation pipelines and monitoring.
Related actors
- meta-tag-analyzer (https://apify.com/scrappy_garden/meta-tag-analyzer)
- open-graph-tag-checker (https://apify.com/scrappy_garden/open-graph-tag-checker)
- twitter-card-validator (https://apify.com/scrappy_garden/twitter-card-validator)
- canonical-url-checker (https://apify.com/scrappy_garden/canonical-url-checker)
Search keywords
structured data extractor, structured data extractor - json-ld, microdata & rdfa, website audit, seo