Meta Tags Extractor

Pricing

Pay per usage

Developer

Praveen Kumar

Maintained by Community

Extract SEO meta tags, Open Graph data, Twitter Cards, JSON-LD structured data, and heading text from one or more web page URLs. Built for SEO analysis, competitor research, and content auditing.

What it does

For each page, the Actor extracts:

  • Basic SEO tags - title, meta description, keywords, canonical URL, robots directive, language
  • Open Graph tags - og:title, og:description, og:image, og:url, og:type
  • Twitter Card tags - twitter:card, twitter:title, twitter:description, twitter:image
  • JSON-LD structured data - All <script type="application/ld+json"> blocks parsed as objects
  • Headings analysis - H1 and H2 counts with full text content
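
As a rough illustration of what this kind of extraction involves, the sketch below pulls a subset of the fields listed above out of an HTML string using only Python's standard library. It is not the Actor's implementation, which is built on Crawlee and Cheerio; the class name MetaTagParser and the sample HTML are made up for this example, and keywords, robots, and language are left out to keep it short.

```python
import json
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Hypothetical helper: collects a subset of the tag groups listed above."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.result = {
            "title": None,
            "metaDescription": None,
            "canonicalUrl": None,
            "openGraph": {},
            "twitterCard": {},
            "jsonLd": [],
            "headings": {"h1": [], "h2": []},
        }
        self._capture = None  # element whose text is currently being buffered
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "meta":
            # Open Graph uses property="og:*"; Twitter Cards use name="twitter:*".
            key = a.get("property") or a.get("name") or ""
            content = a.get("content")
            if key.startswith("og:"):
                self.result["openGraph"][key] = content
            elif key.startswith("twitter:"):
                self.result["twitterCard"][key] = content
            elif key.lower() == "description":
                self.result["metaDescription"] = content
        elif tag == "link" and (a.get("rel") or "").lower() == "canonical":
            self.result["canonicalUrl"] = a.get("href")
        elif tag in ("title", "h1", "h2"):
            self._capture, self._buffer = tag, []
        elif tag == "script" and a.get("type") == "application/ld+json":
            self._capture, self._buffer = "jsonld", []

    def handle_data(self, data):
        if self._capture:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if self._capture is None:
            return
        text = "".join(self._buffer).strip()
        if self._capture == "jsonld" and tag == "script":
            try:
                self.result["jsonLd"].append(json.loads(text))
            except json.JSONDecodeError:
                pass  # skip malformed blocks rather than failing the page
        elif tag == self._capture == "title":
            self.result["title"] = text
        elif tag == self._capture and tag in ("h1", "h2"):
            self.result["headings"][tag].append(text)
        else:
            return  # closing tag of a nested element; keep buffering
        self._capture = None


SAMPLE = """<html lang="en"><head>
<title>Example page</title>
<meta name="description" content="A short summary.">
<meta property="og:title" content="Example OG title">
<meta name="twitter:card" content="summary">
<link rel="canonical" href="https://example.com/">
<script type="application/ld+json">{"@type": "Organization", "name": "Example"}</script>
</head><body><h1>Main <em>heading</em></h1><h2>First</h2><h2>Second</h2></body></html>"""

parser = MetaTagParser()
parser.feed(SAMPLE)
parser.close()
page = parser.result
print(json.dumps(page, indent=2))
```

Malformed JSON-LD blocks are skipped here so that one bad script tag does not lose the rest of the page; whether the Actor behaves the same way is not stated in this README.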

Input

Field                Type     Default   Description
startUrls            array    required  URLs of the web pages to extract meta tags from
includeOpenGraph     boolean  true      Extract Open Graph (og:*) meta tags
includeTwitterCards  boolean  true      Extract Twitter Card (twitter:*) meta tags
includeJsonLd        boolean  true      Extract JSON-LD structured data
maxPages             integer  10        Maximum number of pages to crawl (0 = unlimited)

Example input

{
  "startUrls": [
    { "url": "https://apify.com" },
    { "url": "https://crawlee.dev" }
  ],
  "includeOpenGraph": true,
  "includeTwitterCards": true,
  "includeJsonLd": true,
  "maxPages": 10
}
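
When building this input programmatically, it can help to check it and fill in the documented defaults before starting a run. The following is a minimal sketch under that assumption; normalize_input and DEFAULTS are hypothetical names, and the checks reflect only the input table above, not the Actor's own validation.

```python
# Defaults copied from the input table; startUrls has none because it is required.
DEFAULTS = {
    "includeOpenGraph": True,
    "includeTwitterCards": True,
    "includeJsonLd": True,
    "maxPages": 10,
}


def normalize_input(raw):
    """Return a copy of raw with the documented defaults filled in."""
    start_urls = raw.get("startUrls")
    if not start_urls:
        raise ValueError("startUrls is required and must not be empty")
    for entry in start_urls:
        if not isinstance(entry, dict) or not entry.get("url"):
            raise ValueError(f"each startUrls entry needs a 'url' key: {entry!r}")

    merged = {**DEFAULTS, **raw}  # caller-supplied values win over defaults

    max_pages = merged["maxPages"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(max_pages, bool) or not isinstance(max_pages, int) or max_pages < 0:
        raise ValueError("maxPages must be a non-negative integer (0 = unlimited)")
    return merged


actor_input = normalize_input({"startUrls": [{"url": "https://apify.com"}]})
```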

Output

Each page produces a dataset item with this structure:

{
  "url": "https://apify.com",
  "title": "Apify: Full-stack web scraping and data extraction platform",
  "metaDescription": "Cloud platform for web scraping and automation...",
  "metaKeywords": null,
  "canonicalUrl": "https://apify.com/",
  "robots": "index, follow",
  "openGraph": {
    "og:title": "Apify: Full-stack web scraping and data extraction platform",
    "og:description": "Cloud platform for web scraping...",
    "og:image": "https://apify.com/img/og-image.png",
    "og:url": "https://apify.com/",
    "og:type": "website"
  },
  "twitterCard": {
    "twitter:card": "summary_large_image",
    "twitter:title": "Apify: Full-stack web scraping and data extraction platform",
    "twitter:description": "Cloud platform for web scraping...",
    "twitter:image": "https://apify.com/img/twitter-card.png"
  },
  "jsonLd": [
    {
      "@context": "https://schema.org",
      "@type": "Organization",
      "name": "Apify"
    }
  ],
  "headings": {
    "h1": { "count": 1, "texts": ["Full-stack web scraping platform"] },
    "h2": { "count": 4, "texts": ["Features", "Pricing", "Resources", "Company"] }
  },
  "language": "en",
  "scrapedAt": "2026-03-01T12:00:00.000Z"
}
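
To show how an item with this structure might be consumed, here is a small sketch of an audit pass over one dataset item. The function name audit_item and the specific rules (for example, expecting exactly one H1) are assumptions chosen for illustration, not checks performed by the Actor.

```python
def audit_item(item):
    """Return a list of human-readable findings for one dataset item."""
    findings = []

    if not item.get("title"):
        findings.append("missing title")
    if not item.get("metaDescription"):
        findings.append("missing meta description")
    if not item.get("canonicalUrl"):
        findings.append("missing canonical URL")
    if "noindex" in (item.get("robots") or "").lower():
        findings.append("page is set to noindex")

    # headings.h1 has the shape {"count": int, "texts": [...]} in the sample output.
    headings = item.get("headings") or {}
    h1_count = (headings.get("h1") or {}).get("count", 0)
    if h1_count != 1:
        findings.append(f"expected exactly one H1, found {h1_count}")

    # openGraph may be absent when includeOpenGraph is false.
    og = item.get("openGraph") or {}
    for key in ("og:title", "og:description", "og:image"):
        if not og.get(key):
            findings.append(f"missing {key}")

    return findings
```

A page that passes every rule yields an empty list, so the result can be used directly as a pass/fail signal in a report.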

Pages that fail to load are recorded with an error field instead:

{
  "url": "https://example.com/broken",
  "error": "Failed after 3 retries",
  "scrapedAt": "2026-03-01T12:00:00.000Z"
}
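
Because failed pages share the dataset with successful ones, downstream code usually needs to separate the two. A minimal sketch, assuming the dataset has already been loaded as a list of dictionaries (split_results is a hypothetical helper name):

```python
def split_results(items):
    """Separate successful items from those carrying an 'error' field."""
    ok = [item for item in items if "error" not in item]
    failed = [item for item in items if "error" in item]
    return ok, failed


items = [
    {"url": "https://apify.com", "title": "Apify", "scrapedAt": "2026-03-01T12:00:00.000Z"},
    {"url": "https://example.com/broken", "error": "Failed after 3 retries",
     "scrapedAt": "2026-03-01T12:00:00.000Z"},
]
ok, failed = split_results(items)
for item in failed:
    print(f"{item['url']}: {item['error']}")
```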

Use cases

  • SEO auditing - Verify meta tags, canonical URLs, and robots directives across your site
  • Competitor analysis - Compare meta tag strategies across competing websites
  • Content monitoring - Track changes to titles, descriptions, and structured data over time
  • Social media preview checks - Validate Open Graph and Twitter Card tags before sharing
  • Structured data validation - Extract and review JSON-LD schema markup
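
For the content monitoring case, one possible approach is to keep the item from a previous run and compare it field by field with the current one. The sketch below assumes both items follow the output structure shown earlier; diff_snapshots and TRACKED_FIELDS are hypothetical names, and the choice of tracked fields is only an example.

```python
TRACKED_FIELDS = ("title", "metaDescription", "canonicalUrl", "robots", "jsonLd")


def diff_snapshots(previous, current, fields=TRACKED_FIELDS):
    """Return {field: {"before": ..., "after": ...}} for every field that changed."""
    changes = {}
    for field in fields:
        before, after = previous.get(field), current.get(field)
        if before != after:
            changes[field] = {"before": before, "after": after}
    return changes


last_week = {"url": "https://example.com", "title": "Old title", "robots": "index, follow"}
today = {"url": "https://example.com", "title": "New title", "robots": "index, follow"}
changes = diff_snapshots(last_week, today)
```

Fields missing from both snapshots compare as equal, so only real differences are reported.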

Running locally

apify run --input='{"startUrls": [{"url": "https://example.com"}], "maxPages": 5}'

Deploy to Apify

apify login
apify push

Technology

  • Crawlee - Web scraping library with built-in anti-blocking
  • Cheerio - Fast HTML parser
  • Apify SDK - Actor platform toolkit