Schema Markup Extractor - Structured Data & SEO

Pricing

Pay per usage

Extract JSON-LD structured data, Open Graph tags, Twitter Card metadata, and all meta tags from any URL. Returns @type values, schema objects, and og: properties. A fast, pure-HTTP SEO audit tool.

Rating: 0.0 (0)

Developer: Ava Torres (maintained by Community)

Actor stats

Bookmarked: 0
Total users: 3
Monthly active users: 2
Last modified: 3 days ago

Schema Markup & SEO Data Extractor

Extract JSON-LD structured data, Open Graph tags, Twitter Card metadata, and meta tags from any URL. Built for SEO auditors, developers, and data engineers who need structured page metadata at scale.

Pricing: $0.002 per URL (~$2 per 1,000 URLs)
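
As a quick budgeting aid, the per-URL price above can be turned into a run estimate. This is only a sketch: `estimate_cost` is a hypothetical helper, and it assumes the charge is exactly the listed per-URL price with no other platform fees, which the listing does not confirm.

```python
from decimal import Decimal

# USD per URL, taken from the pricing line above.
PRICE_PER_URL = Decimal("0.002")


def estimate_cost(url_count: int) -> Decimal:
    """Return the estimated charge in USD for processing url_count URLs."""
    if url_count < 0:
        raise ValueError("url_count must be non-negative")
    return PRICE_PER_URL * url_count


print(estimate_cost(1_000))  # 2.000
```

`Decimal` is used instead of `float` so that small per-URL prices add up without binary rounding noise.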


What It Extracts

| Data Type | Examples |
| --- | --- |
| JSON-LD | Product, Article, BreadcrumbList, FAQPage, LocalBusiness, WebSite, Person, Organization |
| Open Graph | og:title, og:description, og:image, og:url, og:type, og:site_name |
| Twitter Card | twitter:card, twitter:title, twitter:description, twitter:image, twitter:site |
| Meta Tags | description, keywords, author, robots, viewport, canonical |
| Schema Types | Deduplicated list of all @type values found on the page |
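
To make the data types above concrete, here is a minimal sketch of how JSON-LD blocks and og:/twitter: tags can be pulled out of raw HTML with Python's standard library. It is not the actor's actual implementation; `MetadataParser` and `SAMPLE_HTML` are hypothetical names, and skipping malformed JSON-LD silently is a simplification made for this example.

```python
import json
from html.parser import HTMLParser


class MetadataParser(HTMLParser):
    """Collects JSON-LD blocks plus og:/twitter: meta tags from one HTML page."""

    def __init__(self):
        super().__init__()
        self.json_ld = []        # parsed JSON-LD objects, in document order
        self.open_graph = {}     # "og:title" is stored under "title"
        self.twitter_card = {}   # "twitter:card" is stored under "card"
        self._in_json_ld = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        attr = {name: (value or "") for name, value in attrs}
        if tag == "script" and attr.get("type", "").lower() == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []
        elif tag == "meta":
            # Accept both property="..." and name="..." spellings.
            key = attr.get("property") or attr.get("name") or ""
            content = attr.get("content", "")
            if key.startswith("og:"):
                self.open_graph[key[len("og:"):]] = content
            elif key.startswith("twitter:"):
                self.twitter_card[key[len("twitter:"):]] = content

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self._in_json_ld = False
            try:
                self.json_ld.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # malformed block: skipped in this sketch


SAMPLE_HTML = """
<html><head>
<meta property="og:title" content="Widget Pro - Best Widgets">
<meta name="twitter:card" content="summary_large_image">
<meta property="twitter:title" content="Widget Pro">
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Product", "name": "Widget Pro"}
</script>
</head><body></body></html>
"""

parser = MetadataParser()
parser.feed(SAMPLE_HTML)
print(parser.json_ld, parser.open_graph, parser.twitter_card)
```

Because this approach only reads the HTML returned by the server, markup injected later by client-side JavaScript would not be seen.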

Input

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| urls | array | required | URLs to extract from |
| includeJsonLd | boolean | true | Parse JSON-LD script blocks |
| includeOpenGraph | boolean | true | Parse og: meta properties |
| includeTwitterCard | boolean | true | Parse twitter: meta tags |
| includeMetaTags | boolean | true | Parse all <meta name=...> tags |
| concurrency | integer | 5 | Parallel requests (1-20) |
| timeout | integer | 30 | Per-URL timeout in seconds |
| maxResults | integer | 50 | Cap on URLs processed |
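
The fields above map naturally onto a JSON input object. The following sketch builds one with the documented defaults; `build_input` is a hypothetical helper, and its local checks (non-empty `urls`, `concurrency` within 1-20) are assumptions based on the table, not a statement of how the actor itself validates input.

```python
import json


def build_input(urls, *, concurrency=5, timeout=30, max_results=50,
                include_json_ld=True, include_open_graph=True,
                include_twitter_card=True, include_meta_tags=True):
    """Build an input object using the field names and defaults from the table."""
    urls = list(urls)
    if not urls:
        raise ValueError("urls is required and must not be empty")
    if not 1 <= concurrency <= 20:
        raise ValueError("concurrency must be between 1 and 20")
    return {
        "urls": urls,
        "includeJsonLd": include_json_ld,
        "includeOpenGraph": include_open_graph,
        "includeTwitterCard": include_twitter_card,
        "includeMetaTags": include_meta_tags,
        "concurrency": concurrency,
        "timeout": timeout,        # seconds, per URL
        "maxResults": max_results,
    }


run_input = build_input(
    ["https://example.com/product/widget"],
    concurrency=10,
    include_meta_tags=False,
)
print(json.dumps(run_input, indent=2))
```

Turning off the sections you do not need, as with `include_meta_tags=False` here, keeps each output record smaller.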

Output

Each URL produces one dataset record:

    {
      "url": "https://example.com/product/widget",
      "jsonLd": [
        {
          "@context": "https://schema.org",
          "@type": "Product",
          "name": "Widget Pro",
          "description": "A professional widget",
          "offers": {
            "@type": "Offer",
            "price": "29.99",
            "priceCurrency": "USD"
          }
        }
      ],
      "openGraph": {
        "title": "Widget Pro - Best Widgets",
        "description": "A professional widget for professionals",
        "image": "https://example.com/widget.jpg",
        "type": "product"
      },
      "twitterCard": {
        "card": "summary_large_image",
        "title": "Widget Pro",
        "image": "https://example.com/widget-twitter.jpg"
      },
      "metaTags": [
        { "name": "description", "content": "A professional widget for professionals" },
        { "name": "keywords", "content": "widget, pro, professional" }
      ],
      "schemaTypes": ["Product", "Offer"]
    }

If a URL fails to fetch or parse, the record includes an error field and empty arrays/objects for the structured data fields.
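
When consuming the dataset, it helps to separate failed records from successful ones before further processing. The sketch below assumes that successful records either omit `error` or leave it empty; the exact content of the `error` field is not documented here, so the sample message is made up for illustration.

```python
def split_records(records):
    """Partition dataset records into (ok, failed) based on the error field."""
    ok, failed = [], []
    for record in records:
        (failed if record.get("error") else ok).append(record)
    return ok, failed


# Hypothetical records shaped like the output example above.
records = [
    {"url": "https://example.com/product/widget",
     "jsonLd": [{"@type": "Product"}], "schemaTypes": ["Product"]},
    {"url": "https://example.com/missing",
     "error": "illustrative error message",
     "jsonLd": [], "openGraph": {}, "twitterCard": {},
     "metaTags": [], "schemaTypes": []},
]

ok, failed = split_records(records)
print(len(ok), "ok;", len(failed), "failed")
for record in failed:
    print(record["url"], "->", record["error"])
```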


Use Cases

  • SEO audits — verify JSON-LD is present and correct across hundreds of pages
  • Competitor research — see what schema types competitors implement
  • Rich result eligibility — check if pages qualify for Google rich results (Product, FAQ, Article, etc.)
  • Content aggregation — extract og:image and og:title for link previews
  • Schema validation — identify missing or malformed structured data before a site launch
  • Crawl pipelines — feed output into downstream validators or dashboards
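
For the audit-style use cases above, a small downstream check over each record is often enough to get started. This is a sketch under stated assumptions: `audit_record` is a hypothetical helper that only tests for the presence of fields and types in the output record. It does not validate schema correctness or decide Google rich result eligibility.

```python
def audit_record(record, expected_types):
    """Return a list of human-readable issues for one output record."""
    if record.get("error"):
        return [f"fetch/parse failed: {record['error']}"]

    issues = []
    if not record.get("jsonLd"):
        issues.append("no JSON-LD found")

    missing = sorted(set(expected_types) - set(record.get("schemaTypes", [])))
    if missing:
        issues.append("missing schema types: " + ", ".join(missing))

    open_graph = record.get("openGraph", {})
    for key in ("title", "image"):  # the fields a link preview usually needs
        if not open_graph.get(key):
            issues.append(f"missing og:{key}")
    return issues


sample = {
    "url": "https://example.com/product/widget",
    "jsonLd": [{"@type": "Product"}],
    "openGraph": {"title": "Widget Pro - Best Widgets"},
    "schemaTypes": ["Product", "Offer"],
}
issues = audit_record(sample, expected_types={"Product", "FAQPage"})
print(issues)  # ['missing schema types: FAQPage', 'missing og:image']
```

The returned list can be written to a report or dashboard, with an empty list meaning the page passed every check in this sketch.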

Notes

  • Uses a pure HTTP client — no browser required, fast and cost-efficient
  • Handles @graph arrays in JSON-LD (common on WordPress/Yoast sites)
  • Handles both property="twitter:..." and name="twitter:..." meta tag formats
  • Follows up to 10 redirects per URL
  • Response body capped at 10 MB per page
  • No API key required
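
To illustrate the @graph handling mentioned above, here is one way a deduplicated list of @type values could be gathered from a JSON-LD document. It is a sketch, not the actor's code: `collect_schema_types` is a hypothetical function, and preserving first-seen order is a choice made for this example.

```python
def collect_schema_types(node, found=None):
    """Walk a parsed JSON-LD value and return unique @type values in first-seen order."""
    if found is None:
        found = []
    if isinstance(node, list):
        for item in node:
            collect_schema_types(item, found)
    elif isinstance(node, dict):
        types = node.get("@type", [])
        if isinstance(types, str):  # @type may be a string or a list of strings
            types = [types]
        for type_name in types:
            if isinstance(type_name, str) and type_name not in found:
                found.append(type_name)
        # Recurse into every other value, which covers @graph and nested objects.
        for key, value in node.items():
            if key != "@type":
                collect_schema_types(value, found)
    return found


doc = {
    "@context": "https://schema.org",
    "@graph": [
        {"@type": "WebSite", "name": "Example"},
        {"@type": "Product", "name": "Widget Pro",
         "offers": {"@type": "Offer", "price": "29.99"}},
        {"@type": ["Organization", "LocalBusiness"], "name": "Example Inc"},
        {"@type": "WebSite", "name": "Duplicate entry"},
    ],
}
schema_types = collect_schema_types(doc)
print(schema_types)
# ['WebSite', 'Product', 'Offer', 'Organization', 'LocalBusiness']
```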