Open Graph & Meta Tag Extractor
Pricing
Pay per event
Open Graph & Meta Tag Extractor
This actor fetches any list of URLs and extracts all social media meta tags (Open Graph, Twitter Cards), SEO metadata (title, description, canonical, robots), structured data (JSON-LD), and internationalization (hreflang). Use it for social media audits, SEO analysis, link preview...
Pricing
Pay per event
Rating
0.0
(0)
Developer

Stas Persiianenko
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Extract Open Graph, Twitter Card, meta tags, JSON-LD structured data, and hreflang from any URL. Get social media preview data, SEO metadata, and structured data at scale.
What does Open Graph & Meta Tag Extractor do?
This actor fetches any list of URLs and extracts all social media meta tags (Open Graph, Twitter Cards), SEO metadata (title, description, canonical, robots), structured data (JSON-LD), and internationalization (hreflang). Use it for social media audits, SEO analysis, link preview validation, or content monitoring. Process hundreds of URLs in a single run with fast HTTP-only requests.
Use cases
- Social media managers -- verify OG tags and Twitter Cards render correctly before launching campaigns
- SEO specialists -- audit title tags, descriptions, canonical URLs, and hreflang across entire websites
- Content teams -- validate how URLs appear when shared on LinkedIn, Facebook, Twitter, and Slack
- Developers -- monitor structured data (JSON-LD) changes across pages after deployments
- Migration teams -- preserve and verify metadata integrity during site migrations
Why use Open Graph & Meta Tag Extractor?
- Complete metadata extraction -- Open Graph, Twitter Cards, JSON-LD, hreflang, and all meta tags in one pass
- Bulk processing -- process hundreds of URLs in a single run
- Fast and lightweight -- HTTP-only, no browser needed, so runs complete quickly
- Structured JSON output -- clean, machine-readable data ready for analysis or integration
- API access -- call programmatically from Node.js, Python, or any HTTP client
- Pay-per-event pricing -- only pay for what you use, starting at fractions of a cent per URL
Input parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
urls | string[] | Yes | -- | List of URLs to extract meta tags from. Domain names without protocol are auto-prefixed with https:// |
Example input
{"urls": ["https://apify.com", "https://github.com", "https://stripe.com"]}
Output example
{"url": "https://apify.com","title": "Apify: Full-stack web scraping and data extraction platform","description": "Build reliable web scrapers...","canonical": "https://apify.com/","favicon": "/favicon.ico","language": "en","charset": "utf-8","viewport": "width=device-width, initial-scale=1","robots": null,"openGraph": {"title": "Apify: Full-stack web scraping platform","description": "Build reliable web scrapers...","image": "https://apify.com/og-image.png","url": "https://apify.com/","type": "website","siteName": "Apify","locale": "en_US"},"twitterCard": {"card": "summary_large_image","title": "Apify","description": "Build reliable web scrapers...","image": "https://apify.com/og-image.png","site": "@apaborovicka","creator": null},"alternateLanguages": [],"jsonLd": [{ "@context": "https://schema.org", "@type": "Organization", "name": "Apify" }],"allMetaTags": [{ "name": "description", "content": "Build reliable web scrapers..." },{ "name": "viewport", "content": "width=device-width, initial-scale=1" }],"checkTimeMs": 456,"error": null,"checkedAt": "2026-03-01T12:00:00.000Z"}
How much does it cost?
| Event | Price | Description |
|---|---|---|
| Start | $0.035 | One-time per run |
| URL extracted | $0.001 | Per URL processed |
Example costs:
- 10 URLs: $0.035 + 10 x $0.001 = $0.045
- 100 URLs: $0.035 + 100 x $0.001 = $0.135
- 1,000 URLs: $0.035 + 1,000 x $0.001 = $1.035
Using the Apify API
Node.js
import { ApifyClient } from 'apify-client';const client = new ApifyClient({ token: 'YOUR_TOKEN' });const run = await client.actor('automation-lab/og-meta-extractor').call({urls: ['https://apify.com', 'https://github.com'],});const { items } = await client.dataset(run.defaultDatasetId).listItems();console.log(items);
Python
from apify_client import ApifyClientclient = ApifyClient('YOUR_TOKEN')run = client.actor('automation-lab/og-meta-extractor').call(run_input={'urls': ['https://apify.com', 'https://github.com'],})items = client.dataset(run['defaultDatasetId']).list_items().itemsfor item in items:print(f'{item["url"]}: OG title = {item["openGraph"]["title"]}')
Integrations
Open Graph & Meta Tag Extractor integrates with your existing workflow through the Apify platform. Connect it to Make (formerly Integromat), Zapier, or n8n to automate metadata audits on a schedule. Export results to Google Sheets for team review, send alerts to Slack when OG tags are missing or incorrect, or use webhooks to trigger downstream actions when a run finishes. The actor's API is compatible with any HTTP client for custom integrations.
Tips and best practices
- Audit before sharing -- run the extractor on your URLs before major social media campaigns to ensure previews look correct.
- Monitor competitor metadata -- schedule regular runs on competitor URLs to track changes in their SEO and social strategy.
- Check JSON-LD regularly -- structured data errors can silently hurt your search appearance; use this actor to catch them early.
- Combine with a sitemap -- feed your XML sitemap URLs into the actor for a comprehensive metadata audit of your entire site.
- Use the allMetaTags field -- if you need a meta tag not covered by the structured fields, check the raw
allMetaTagsarray.
FAQ
What social platforms does it cover? It extracts Open Graph tags (used by Facebook, LinkedIn, Slack, and most platforms) and Twitter Card tags (used by Twitter/X). These two standards cover the vast majority of social media preview data.
Does it render JavaScript? No. The actor uses fast HTTP requests without a browser. If your meta tags are injected by client-side JavaScript, they may not be captured. Most modern frameworks server-render meta tags, so this is rarely an issue.
Can I extract metadata from pages behind a login? No. The actor makes unauthenticated HTTP requests and can only process publicly accessible URLs.
What is JSON-LD and why does it matter? JSON-LD is a structured data format used by search engines to understand page content. It powers rich search results like star ratings, FAQs, and product listings. This actor extracts all JSON-LD blocks from the page so you can verify your structured data is correct.
How many URLs can I process in one run? There is no hard limit. The actor processes URLs concurrently, making it efficient for runs with hundreds or thousands of URLs.