Meta Tags Extractor
Pricing
Pay per usage
Go to Apify Store
Meta Tags Extractor
Extract SEO meta tags, Open Graph, Twitter Cards, JSON-LD structured data, and headings from any website. Perfect for SEO analysis, competitor research, and content audits.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
Krawlify Krawlify
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
0
Monthly active users
3 months ago
Last modified
Categories
Share
Extract SEO meta tags, Open Graph data, Twitter Cards, JSON-LD structured data, and heading analysis from any website URL(s). Built for SEO analysis, competitor research, and content auditing.
What it does
For each page, the Actor extracts:
- Basic SEO tags - title, meta description, keywords, canonical URL, robots directive, language
- Open Graph tags - og:title, og:description, og:image, og:url, og:type
- Twitter Card tags - twitter:card, twitter:title, twitter:description, twitter:image
- JSON-LD structured data - All
<script type="application/ld+json">blocks parsed as objects - Headings analysis - H1 and H2 counts with full text content
Input
| Field | Type | Default | Description |
|---|---|---|---|
startUrls | array | required | URLs of web pages to extract meta tags from |
includeOpenGraph | boolean | true | Extract Open Graph (og:*) meta tags |
includeTwitterCards | boolean | true | Extract Twitter Card (twitter:*) meta tags |
includeJsonLd | boolean | true | Extract JSON-LD structured data |
maxPages | integer | 10 | Maximum number of pages to crawl (0 = unlimited) |
Example input
{"startUrls": [{ "url": "https://apify.com" },{ "url": "https://crawlee.dev" }],"includeOpenGraph": true,"includeTwitterCards": true,"includeJsonLd": true,"maxPages": 10}
Output
Each page produces a dataset item with this structure:
{"url": "https://apify.com","title": "Apify: Full-stack web scraping and data extraction platform","metaDescription": "Cloud platform for web scraping and automation...","metaKeywords": null,"canonicalUrl": "https://apify.com/","robots": "index, follow","openGraph": {"og:title": "Apify: Full-stack web scraping and data extraction platform","og:description": "Cloud platform for web scraping...","og:image": "https://apify.com/img/og-image.png","og:url": "https://apify.com/","og:type": "website"},"twitterCard": {"twitter:card": "summary_large_image","twitter:title": "Apify: Full-stack web scraping and data extraction platform","twitter:description": "Cloud platform for web scraping...","twitter:image": "https://apify.com/img/twitter-card.png"},"jsonLd": [{"@context": "https://schema.org","@type": "Organization","name": "Apify"}],"headings": {"h1": { "count": 1, "texts": ["Full-stack web scraping platform"] },"h2": { "count": 4, "texts": ["Features", "Pricing", "Resources", "Company"] }},"language": "en","scrapedAt": "2026-03-01T12:00:00.000Z"}
Pages that fail to load are recorded with an error field instead:
{"url": "https://example.com/broken","error": "Failed after 3 retries","scrapedAt": "2026-03-01T12:00:00.000Z"}
Use cases
- SEO auditing - Verify meta tags, canonical URLs, and robots directives across your site
- Competitor analysis - Compare meta tag strategies across competing websites
- Content monitoring - Track changes to titles, descriptions, and structured data over time
- Social media preview checks - Validate Open Graph and Twitter Card tags before sharing
- Structured data validation - Extract and review JSON-LD schema markup
Running locally
$apify run --input='{"startUrls": [{"url": "https://example.com"}], "maxPages": 5}'
Deploy to Apify
apify loginapify push