Open Graph & Meta Tag Extractor avatar

Open Graph & Meta Tag Extractor

Pricing

Pay per event

Go to Apify Store
Open Graph & Meta Tag Extractor

Open Graph & Meta Tag Extractor

This actor fetches any list of URLs and extracts all social media meta tags (Open Graph, Twitter Cards), SEO metadata (title, description, canonical, robots), structured data (JSON-LD), and internationalization (hreflang). Use it for social media audits, SEO analysis, link preview...

Pricing

Pay per event

Rating

0.0

(0)

Developer

Stas Persiianenko

Stas Persiianenko

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Categories

Share

Extract Open Graph, Twitter Card, meta tags, JSON-LD structured data, and hreflang from any URL. Get social media preview data, SEO metadata, and structured data at scale.

What does Open Graph & Meta Tag Extractor do?

This actor fetches any list of URLs and extracts all social media meta tags (Open Graph, Twitter Cards), SEO metadata (title, description, canonical, robots), structured data (JSON-LD), and internationalization (hreflang). Use it for social media audits, SEO analysis, link preview validation, or content monitoring. Process hundreds of URLs in a single run with fast HTTP-only requests.

Use cases

  • Social media managers -- verify OG tags and Twitter Cards render correctly before launching campaigns
  • SEO specialists -- audit title tags, descriptions, canonical URLs, and hreflang across entire websites
  • Content teams -- validate how URLs appear when shared on LinkedIn, Facebook, Twitter, and Slack
  • Developers -- monitor structured data (JSON-LD) changes across pages after deployments
  • Migration teams -- preserve and verify metadata integrity during site migrations

Why use Open Graph & Meta Tag Extractor?

  • Complete metadata extraction -- Open Graph, Twitter Cards, JSON-LD, hreflang, and all meta tags in one pass
  • Bulk processing -- process hundreds of URLs in a single run
  • Fast and lightweight -- HTTP-only, no browser needed, so runs complete quickly
  • Structured JSON output -- clean, machine-readable data ready for analysis or integration
  • API access -- call programmatically from Node.js, Python, or any HTTP client
  • Pay-per-event pricing -- only pay for what you use, starting at fractions of a cent per URL

Input parameters

ParameterTypeRequiredDefaultDescription
urlsstring[]Yes--List of URLs to extract meta tags from. Domain names without protocol are auto-prefixed with https://

Example input

{
"urls": ["https://apify.com", "https://github.com", "https://stripe.com"]
}

Output example

{
"url": "https://apify.com",
"title": "Apify: Full-stack web scraping and data extraction platform",
"description": "Build reliable web scrapers...",
"canonical": "https://apify.com/",
"favicon": "/favicon.ico",
"language": "en",
"charset": "utf-8",
"viewport": "width=device-width, initial-scale=1",
"robots": null,
"openGraph": {
"title": "Apify: Full-stack web scraping platform",
"description": "Build reliable web scrapers...",
"image": "https://apify.com/og-image.png",
"url": "https://apify.com/",
"type": "website",
"siteName": "Apify",
"locale": "en_US"
},
"twitterCard": {
"card": "summary_large_image",
"title": "Apify",
"description": "Build reliable web scrapers...",
"image": "https://apify.com/og-image.png",
"site": "@apaborovicka",
"creator": null
},
"alternateLanguages": [],
"jsonLd": [{ "@context": "https://schema.org", "@type": "Organization", "name": "Apify" }],
"allMetaTags": [
{ "name": "description", "content": "Build reliable web scrapers..." },
{ "name": "viewport", "content": "width=device-width, initial-scale=1" }
],
"checkTimeMs": 456,
"error": null,
"checkedAt": "2026-03-01T12:00:00.000Z"
}

How much does it cost?

EventPriceDescription
Start$0.035One-time per run
URL extracted$0.001Per URL processed

Example costs:

  • 10 URLs: $0.035 + 10 x $0.001 = $0.045
  • 100 URLs: $0.035 + 100 x $0.001 = $0.135
  • 1,000 URLs: $0.035 + 1,000 x $0.001 = $1.035

Using the Apify API

Node.js

import { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: 'YOUR_TOKEN' });
const run = await client.actor('automation-lab/og-meta-extractor').call({
urls: ['https://apify.com', 'https://github.com'],
});
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);

Python

from apify_client import ApifyClient
client = ApifyClient('YOUR_TOKEN')
run = client.actor('automation-lab/og-meta-extractor').call(run_input={
'urls': ['https://apify.com', 'https://github.com'],
})
items = client.dataset(run['defaultDatasetId']).list_items().items
for item in items:
print(f'{item["url"]}: OG title = {item["openGraph"]["title"]}')

Integrations

Open Graph & Meta Tag Extractor integrates with your existing workflow through the Apify platform. Connect it to Make (formerly Integromat), Zapier, or n8n to automate metadata audits on a schedule. Export results to Google Sheets for team review, send alerts to Slack when OG tags are missing or incorrect, or use webhooks to trigger downstream actions when a run finishes. The actor's API is compatible with any HTTP client for custom integrations.

Tips and best practices

  • Audit before sharing -- run the extractor on your URLs before major social media campaigns to ensure previews look correct.
  • Monitor competitor metadata -- schedule regular runs on competitor URLs to track changes in their SEO and social strategy.
  • Check JSON-LD regularly -- structured data errors can silently hurt your search appearance; use this actor to catch them early.
  • Combine with a sitemap -- feed your XML sitemap URLs into the actor for a comprehensive metadata audit of your entire site.
  • Use the allMetaTags field -- if you need a meta tag not covered by the structured fields, check the raw allMetaTags array.

FAQ

What social platforms does it cover? It extracts Open Graph tags (used by Facebook, LinkedIn, Slack, and most platforms) and Twitter Card tags (used by Twitter/X). These two standards cover the vast majority of social media preview data.

Does it render JavaScript? No. The actor uses fast HTTP requests without a browser. If your meta tags are injected by client-side JavaScript, they may not be captured. Most modern frameworks server-render meta tags, so this is rarely an issue.

Can I extract metadata from pages behind a login? No. The actor makes unauthenticated HTTP requests and can only process publicly accessible URLs.

What is JSON-LD and why does it matter? JSON-LD is a structured data format used by search engines to understand page content. It powers rich search results like star ratings, FAQs, and product listings. This actor extracts all JSON-LD blocks from the page so you can verify your structured data is correct.

How many URLs can I process in one run? There is no hard limit. The actor processes URLs concurrently, making it efficient for runs with hundreds or thousands of URLs.