Structured Data Extractor - JSON-LD, OpenGraph, Microdata

Pricing

from $2.00 / 1,000 URLs extracted

Extract every piece of structured data from any URL: JSON-LD blocks by schema.org type, OpenGraph and Twitter Card tags, microdata items, canonical and meta basics. Batch over URL lists or call synchronously from AI agents.

Rating: 0.0 (0)

Developer: Jimmy A

Maintained by Community

Actor stats

Bookmarked: 0

Total users: 1

Monthly active users: 0

Last modified: 8 hours ago

Structured Data Extractor - JSON-LD, OpenGraph, Twitter Cards, Microdata

Extract all the structured data from any URL in one call: every JSON-LD block with its schema.org types, OpenGraph and Twitter Card tags, canonical URL, meta basics, and microdata items. Batch over URL lists, or call it synchronously from AI agents and SEO tools.

What you get per URL

{
  "url": "https://example.com/article",
  "status": 200,
  "title": "Page title",
  "metaDescription": "...",
  "canonical": "https://example.com/article",
  "jsonLd": [ { "@context": "https://schema.org", "@type": "Article", "...": "..." } ],
  "jsonLdTypes": ["Article"],
  "openGraph": { "title": "...", "image": "...", "type": "article" },
  "twitter": { "card": "summary_large_image" },
  "microdataItems": [],
  "errors": []
}

Malformed JSON-LD is reported in errors instead of being silently dropped - useful for SEO audits. Unreachable URLs produce an error record without killing the batch.
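Because failures arrive as ordinary records, a consumer can sort a batch into clean and flagged pages after the run. The following is a minimal sketch, assuming each record carries an errors list as in the sample above; the function name, the sample records, and the wording of the error message are hypothetical.

```python
from typing import Any

Record = dict[str, Any]


def split_by_errors(records: list[Record]) -> tuple[list[Record], list[Record]]:
    """Return (clean, flagged); flagged records have a non-empty "errors" list."""
    clean: list[Record] = []
    flagged: list[Record] = []
    for record in records:
        # A missing or empty "errors" field is treated as "no problems".
        (flagged if record.get("errors") else clean).append(record)
    return clean, flagged


# Hypothetical sample records shaped like the output above.
sample = [
    {"url": "https://example.com/article", "status": 200,
     "jsonLdTypes": ["Article"], "errors": []},
    {"url": "https://example.com/broken", "status": 200,
     "jsonLdTypes": [], "errors": ["malformed JSON-LD block"]},
]

clean, flagged = split_by_errors(sample)
for record in flagged:
    print(record["url"], record["errors"])
```

The check only relies on errors being empty or not, so it works whether the entries are strings or objects.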

Use cases

  • SEO audits: verify structured data across a site (pair with a sitemap extractor for full coverage)
  • Rich-result debugging: see exactly what crawlers see, including parse errors
  • Content pipelines: pull clean Article/Product/Recipe/Event objects without writing per-site scrapers
  • AI agents: a fetch-and-understand primitive - one call returns the machine-readable summary of any page
  • Competitive research: what schema types and OG strategies competitors deploy
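For the audit and competitive-research cases, the jsonLdTypes field is enough to build a quick site-wide summary. A minimal sketch, assuming the records have already been downloaded from the run's dataset; summarize_types and the sample data are hypothetical and not part of the actor.

```python
from collections import Counter
from typing import Any


def summarize_types(records: list[dict[str, Any]]) -> tuple[Counter, list[str]]:
    """Count schema.org types across records and list URLs that have none."""
    counts: Counter = Counter()
    missing: list[str] = []
    for record in records:
        types = record.get("jsonLdTypes") or []
        counts.update(types)
        if not types:
            missing.append(record["url"])
    return counts, missing


# Hypothetical audit input.
pages = [
    {"url": "https://example.com/a", "jsonLdTypes": ["Article"]},
    {"url": "https://example.com/b", "jsonLdTypes": ["Article", "BreadcrumbList"]},
    {"url": "https://example.com/c", "jsonLdTypes": []},
]

type_counts, pages_without_jsonld = summarize_types(pages)
print(type_counts.most_common())
print(pages_without_jsonld)
```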

API / Standby mode

GET /?url=https://example.com/page

Returns the extraction JSON synchronously. Works as a tool for agent frameworks that support Apify actors.
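A synchronous call could look roughly like the sketch below. The base URL is a placeholder, since the actor's actual Standby hostname is not given here, and passing the Apify token as a Bearer header is an assumption about how the endpoint is authenticated. Only the url query parameter comes from the request format above.

```python
import json
import urllib.parse
import urllib.request
from typing import Any


def build_request_url(base_url: str, page_url: str) -> str:
    """Build GET /?url=... with the target page URL percent-encoded."""
    query = urllib.parse.urlencode({"url": page_url})
    return f"{base_url.rstrip('/')}/?{query}"


def extract(base_url: str, page_url: str, token: str, timeout: float = 30.0) -> dict[str, Any]:
    """Fetch the extraction JSON for one page (performs a network request)."""
    request = urllib.request.Request(
        build_request_url(base_url, page_url),
        # Assumption: the endpoint accepts an Apify API token as a Bearer header.
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Placeholder hostname; substitute the actor's real Standby URL.
print(build_request_url("https://standby.example.invalid", "https://example.com/page"))
```

Encoding the target URL keeps any query string it carries from being mistaken for parameters of the actor call itself.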

Pricing

Event                 Price
Actor start           $0.0001
Per URL extracted     $0.002
API call (standby)    $0.01

A 1,000-page site audit costs about $2 (1,000 × $0.002 per URL, plus the $0.0001 actor start).
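The batch arithmetic can be checked with a few lines. This sketch assumes one actor-start event per run and covers batch runs only, because the price list does not say whether Standby calls also incur the per-URL charge. The function name is hypothetical.

```python
from decimal import Decimal

# Prices taken from the table above, in USD.
ACTOR_START = Decimal("0.0001")
PER_URL = Decimal("0.002")


def batch_cost(url_count: int, runs: int = 1) -> Decimal:
    """Estimated cost of extracting url_count URLs spread over a number of runs."""
    return ACTOR_START * runs + PER_URL * url_count


print(batch_cost(1000))  # 2.0001
```

Decimal avoids the rounding noise that binary floats would introduce at these fractions of a cent.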

FAQ

Does it render JavaScript? No - it reads the served HTML, which is where structured data lives for any site that wants search engines to see it. That keeps it fast and cheap. JS-only SPAs may show fewer blocks.
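To illustrate what reading the served HTML means in practice, here is a minimal sketch that pulls JSON-LD blocks out of raw markup using only the standard library and records malformed blocks instead of dropping them. It is not the actor's implementation; the class name and error wording are assumptions.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect parsed JSON-LD blocks and parse errors from static HTML."""

    def __init__(self) -> None:
        super().__init__()
        self._in_jsonld = False
        self._chunks: list[str] = []
        self.blocks: list = []
        self.errors: list[str] = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").strip().lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._chunks = []

    def handle_data(self, data):
        # Script content may arrive in several pieces, so buffer it.
        if self._in_jsonld:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.blocks.append(json.loads("".join(self._chunks)))
            except json.JSONDecodeError as exc:
                self.errors.append(f"malformed JSON-LD: {exc.msg}")


html = """<html><head>
<script type="application/ld+json">{"@context": "https://schema.org", "@type": "Article"}</script>
<script type="application/ld+json">{"@type": "Product",}</script>
</head></html>"""

collector = JsonLdCollector()
collector.feed(html)
collector.close()
print(len(collector.blocks), "valid block(s);", len(collector.errors), "error(s)")
```

Blocks that a page injects with client-side JavaScript never appear in this markup, which is why JS-only SPAs may show fewer blocks.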

What about pages behind bot protection? You get an honest error record with the HTTP status. No retries through gray-zone proxies.

Nested @graph and arrays? Handled - types are collected recursively from @graph structures and type arrays.
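One way such recursive collection can work is sketched below. It is a hypothetical illustration, not the actor's code. It walks every nested value, so it also picks up types on embedded objects such as an author, which may or may not match what the actor reports in jsonLdTypes.

```python
from typing import Any, Optional


def collect_types(node: Any, found: Optional[list[str]] = None) -> list[str]:
    """Collect unique @type values, in first-seen order, from a JSON-LD structure."""
    if found is None:
        found = []
    if isinstance(node, list):
        for item in node:
            collect_types(item, found)
    elif isinstance(node, dict):
        types = node.get("@type", [])
        if isinstance(types, str):
            types = [types]          # single type given as a plain string
        elif not isinstance(types, list):
            types = []               # ignore unexpected shapes
        for type_name in types:
            if isinstance(type_name, str) and type_name not in found:
                found.append(type_name)
        # Recurse into every value, which covers @graph and nested objects.
        for value in node.values():
            collect_types(value, found)
    return found


doc = {
    "@context": "https://schema.org",
    "@graph": [
        {"@type": ["Article", "NewsArticle"], "author": {"@type": "Person"}},
        {"@type": "WebSite"},
    ],
}
print(collect_types(doc))  # ['Article', 'NewsArticle', 'Person', 'WebSite']
```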