Schema Markup Scraper & SEO Auditor

Pricing: from $3.00 / 1,000 results

Extract JSON-LD, Microdata, RDFa, Open Graph & Twitter Cards. Runs a 0-100 SEO audit — checks canonical, hreflang, headings, image alt, EEAT author signals. Detects 80+ schema.org types including LocalBusiness with NAP, geo coordinates, and Google Place IDs.

Developer: Richard Feng (Maintained by Community)

Actor stats: 6 bookmarks · 77 total users · 7 monthly active users · last modified 11 days ago

Schema Markup Scraper & SEO Auditor

Extract structured data, metadata, and SEO signals from any web page. Built for technical SEO auditing, local business intelligence, content aggregation, and competitive analysis.

What it does

This scraper visits one or more URLs and extracts everything a search engine sees: structured data (JSON-LD, Microdata, RDFa), social meta tags (Open Graph, Twitter Cards), and dozens of SEO signals. It then runs an automated audit and returns a 0-100 SEO score with actionable issues — aligned with Google's 2025 ranking signals and EEAT guidelines.

Key capabilities

Structured data extraction

  • JSON-LD — Parses all <script type="application/ld+json"> blocks, including nested @graph structures
  • Microdata — Extracts itemscope/itemprop schema.org markup with full nesting support
  • RDFa (opt-in) — Parses typeof/property/vocab attributes with schema.org vocabulary resolution
  • Schema type detection — Identifies all schema.org types present (Product, Article, LocalBusiness, BreadcrumbList, etc.)
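
The JSON-LD step can be sketched in plain Node.js, assuming raw HTML as input. The `extractJsonLd` helper below is illustrative, not the actor's internal code; it shows the `@graph` flattening described above:

```javascript
// Hypothetical sketch of JSON-LD extraction with @graph flattening.
function extractJsonLd(html) {
  const blocks = [];
  const re = /<script[^>]*type=["']application\/ld\+json["'][^>]*>([\s\S]*?)<\/script>/gi;
  let match;
  while ((match = re.exec(html)) !== null) {
    try {
      const parsed = JSON.parse(match[1]);
      // A top-level @graph wraps several nodes in one block; flatten it
      // so each schema.org node becomes its own item.
      if (parsed && Array.isArray(parsed["@graph"])) {
        blocks.push(...parsed["@graph"]);
      } else {
        blocks.push(parsed);
      }
    } catch {
      // Skip malformed JSON rather than failing the whole page.
    }
  }
  return blocks;
}
```

A real extractor would use an HTML parser rather than a regex, but the parse-then-flatten flow is the same.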

Social & meta tags

  • Open Graph — All og:* properties (title, description, image, type, locale, etc.)
  • Twitter Cards — All twitter:* properties with special handling for summary_large_image
  • Dublin Core — DC.* and DCTerms.* academic/institutional metadata
  • Standard meta tags — viewport, description, keywords, robots, theme-color, and all others

SEO analysis

  • Canonical URL — Detects <link rel="canonical">
  • Robots meta — Extracts directives for robots, googlebot, bingbot, etc.
  • Heading hierarchy — Maps H1–H6 structure, counts H1 tags, detects skipped levels
  • Image alt text audit — Counts images with/without alt attributes, calculates coverage percentage
  • Viewport & charset — Verifies mobile-first indexing prerequisites
  • SEO score (0-100) — Automated audit checking 15 Google ranking signals with error/warning/info severity
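
The heading-hierarchy check above can be illustrated like this, assuming headings have already been collected into a flat list of levels in document order (`auditHeadings` is a hypothetical name, not the actor's API):

```javascript
// Illustrative heading audit: count H1s and detect skipped levels.
function auditHeadings(levels) {
  const issues = [];
  const h1Count = levels.filter((l) => l === 1).length;
  if (h1Count === 0) issues.push("Missing H1");
  if (h1Count > 1) issues.push(`Multiple H1 tags (${h1Count})`);
  for (let i = 1; i < levels.length; i++) {
    // A jump of more than one level downward (e.g. h2 -> h4)
    // counts as a skipped level.
    if (levels[i] - levels[i - 1] > 1) {
      issues.push(`Skipped heading level: h${levels[i - 1]} to h${levels[i]}`);
    }
  }
  return { h1Count, issues };
}
```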

International SEO

  • Hreflang tags — Extracts all <link rel="alternate" hreflang="..."> with built-in validation:
    • Flags missing x-default fallback
    • Validates ISO 639-1 language codes (catches common mistakes like en-UK → should be en-GB)
    • Detects missing self-referencing tags
  • Language detection — <html lang>, <meta http-equiv="content-language">, og:locale
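
The three hreflang validations above might look roughly like the sketch below. The `{ lang, url }` tag shape matches the documented output; `validateHreflang` itself and the correction table are illustrative assumptions:

```javascript
// Common region-code mistakes and their corrections (illustrative subset).
const REGION_FIXES = { "en-uk": "en-gb" };

function validateHreflang(tags, pageUrl) {
  const issues = [];
  const langs = tags.map((t) => t.lang.toLowerCase());
  // 1. Every cluster should declare a fallback for unmatched locales.
  if (!langs.includes("x-default")) {
    issues.push("Missing x-default fallback");
  }
  // 2. Catch invalid ISO 639-1 / region combinations.
  for (const lang of langs) {
    if (REGION_FIXES[lang]) {
      issues.push(`Invalid code ${lang} (should be ${REGION_FIXES[lang]})`);
    }
  }
  // 3. Each page should also reference itself in its own tag set.
  if (!tags.some((t) => t.url === pageUrl)) {
    issues.push("Missing self-referencing hreflang tag");
  }
  return { hasXDefault: langs.includes("x-default"), issues };
}
```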

EEAT & author signals

  • Author extraction — Pulls author info from JSON-LD (Person type with sameAs links), <meta name="author">, and <a rel="author">
  • Article metadata — datePublished, dateModified, headline, wordCount, publisher from Article/NewsArticle/BlogPosting schema

Local / Geo SEO

  • LocalBusiness extraction — Detects 80+ schema.org LocalBusiness subtypes (Restaurant, Hotel, Dentist, Store, etc.) and extracts NAP, geo coordinates, opening hours, price range
  • NAP (Name/Address/Phone) — From any Organization or LocalBusiness schema
  • Geo meta tags — geo.region, geo.placename, geo.position, ICBM
  • Google Maps references — Embedded map iframes, Place IDs, CID numbers
  • hCard/vCard (opt-in) — .vcard/.h-card microformat contact data
  • Breadcrumbs — Extracts BreadcrumbList schema items with position, name, and URL; validates sequential positions and flags relative URLs
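
The two breadcrumb checks can be sketched as follows; the item shape mirrors the documented `{ position, name, url }` output, while `validateBreadcrumbs` is a hypothetical helper:

```javascript
// Illustrative breadcrumb validation: sequential positions, absolute URLs.
function validateBreadcrumbs(items) {
  const issues = [];
  items.forEach((item, i) => {
    // Positions should run 1, 2, 3, ... without gaps or reordering.
    if (item.position !== i + 1) {
      issues.push(`Non-sequential position at index ${i}: ${item.position}`);
    }
    // Absolute URLs are recommended in BreadcrumbList items.
    if (item.url && !/^https?:\/\//i.test(item.url)) {
      issues.push(`Relative URL in breadcrumb position ${item.position}: "${item.url}"`);
    }
  });
  return issues;
}
```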

Input parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| startUrls | Array | (required) | URLs to scrape |
| proxy | Object | Apify Proxy | Proxy configuration |
| maxRequestsPerCrawl | Integer | 100 | Maximum pages to scrape (1–100,000) |
| maxConcurrency | Integer | 10 | Parallel pages (1–100) |
| extractMetaTags | Boolean | true | Extract all meta tags |
| extractSeoAnalysis | Boolean | true | SEO signals: canonical, hreflang, robots, headings, author, images, breadcrumbs, Dublin Core, viewport, charset |
| extractGeoData | Boolean | true | Geo tags, LocalBusiness, NAP, Google Maps references |
| computeSeoScore | Boolean | true | Run SEO audit (0-100 score + issues list) |
| extractRdfa | Boolean | false | RDFa structured data (opt-in) |
| extractHCard | Boolean | false | hCard/vCard microformats (opt-in) |
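
A minimal run input exercising these parameters might look like this. The URL is a placeholder, and startUrls is assumed to follow Apify's usual `{ "url": ... }` request format:

```json
{
  "startUrls": [{ "url": "https://example.com/" }],
  "maxRequestsPerCrawl": 50,
  "maxConcurrency": 10,
  "extractMetaTags": true,
  "extractSeoAnalysis": true,
  "extractGeoData": true,
  "computeSeoScore": true,
  "extractRdfa": false,
  "extractHCard": false
}
```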

Output fields

Each scraped page produces a JSON object with these fields:

Core metadata

| Field | Type | Description |
|---|---|---|
| url | String | Final URL after redirects |
| title | String | Page <title> content |
| icon | String | Favicon/apple-touch-icon URL |
| linkedData | Array | JSON-LD structured data blocks |
| microdata | Array | Microdata (schema.org) items |
| openGraph | Object | Open Graph properties |
| twitterCard | Object | Twitter Card properties |
| metaTags | Object | All other meta tags |

SEO analysis (when extractSeoAnalysis is enabled)

| Field | Type | Description |
|---|---|---|
| canonical | String/null | Canonical URL |
| robotsMeta | Object | Robots directives ({ robots: "index, follow", googlebot: "noarchive" }) |
| hreflang | Object | { tags: [{lang, url}], hasXDefault: bool, issues: string[] } |
| language | Object | { htmlLang, contentLanguage, ogLocale } |
| dublinCore | Object | Dublin Core metadata |
| viewport | String/null | Viewport meta tag content |
| charset | String/null | Character encoding |
| headings | Object | { headings: [{level, text}], h1Count, issues[] } |
| imageAudit | Object | { totalImages, imagesWithAlt, imagesWithoutAlt, altTexts[], issues[] } |
| authorInfo | Object/null | { name, url, sameAs[], jobTitle, source } |
| schemaTypes | Array | All schema.org types detected (e.g., ["Product", "BreadcrumbList"]) |
| articleMetadata | Object/null | { datePublished, dateModified, headline, description, wordCount, publisher } |
| breadcrumbs | Object/null | { items: [{position, name, url}], issues[] } |

Geo / Local SEO (when extractGeoData is enabled)

| Field | Type | Description |
|---|---|---|
| geoTags | Object/null | Geo meta tags ({ region, placename, position, icbm }) |
| localBusiness | Object/null | LocalBusiness schema data with NAP, geo coordinates, opening hours |
| nap | Object/null | Name, Address, Phone from Organization/LocalBusiness |
| mapReferences | Object/null | { googleMapsEmbeds[], placeIds[], cids[] } |

SEO audit (when computeSeoScore is enabled)

| Field | Type | Description |
|---|---|---|
| seoAudit | Object | { score: 0-100, issues: [{severity, code, message}] } |

Optional extractors

| Field | Type | Description |
|---|---|---|
| rdfa | Array | RDFa structured data (when extractRdfa is enabled) |
| hCards | Array | hCard/vCard contact data (when extractHCard is enabled) |

SEO audit checks

The audit starts at 100 and deducts points for each issue found:

| Check | Severity | Points | What it catches |
|---|---|---|---|
| Missing <title> | Error | -10 | Core ranking signal |
| Title > 60 chars | Warning | -5 | SERP truncation |
| Missing meta description | Error | -10 | CTR impact |
| Description > 160 chars | Warning | -5 | SERP truncation |
| Missing or multiple H1 | Error | -10 | Content hierarchy |
| Missing canonical URL | Warning | -5 | Duplicate content risk |
| Missing og:title | Warning | -5 | Social share CTR |
| Missing og:description | Warning | -5 | Social share CTR |
| Missing og:image | Warning | -5 | Social CTR (40-60% impact) |
| No structured data | Warning | -5 | Rich results eligibility |
| Missing favicon | Warning | -5 | Brand trust signal |
| Missing viewport | Warning | -5 | Mobile-first indexing |
| Missing hreflang x-default | Warning | -5 | International SEO trust |
| Missing author (on articles) | Warning | -5 | EEAT signal |
| Missing datePublished (on articles) | Warning | -5 | Freshness signal |
| > 50% images without alt | Warning | -5 | Accessibility + AI |
| No BreadcrumbList (deep pages) | Info | -1 | Navigation hierarchy |
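
The deduction model can be sketched as below. This only illustrates the table's arithmetic (per-severity penalties, clamped at 0); the actor's exact bookkeeping may differ, and a real run can surface additional internal checks:

```javascript
// Illustrative per-severity penalties matching the table above.
const PENALTY = { error: 10, warning: 5, info: 1 };

// Start at 100, subtract each triggered check's points, never go below 0.
function computeScore(issues) {
  const total = issues.reduce(
    (sum, issue) => sum + (PENALTY[issue.severity] || 0),
    0
  );
  return { score: Math.max(0, 100 - total), issues };
}
```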

Use cases

Technical SEO audit

Crawl your site and get an instant SEO health check across every page. Identify missing titles, broken heading hierarchies, absent structured data, and more — with a prioritized issues list.

E-commerce competitive analysis

Extract product schema (pricing, availability, reviews, return policies), breadcrumb structures, and rich snippet eligibility from competitor product pages.

Local business intelligence

Scrape LocalBusiness schema from directories, review sites, or business websites. Extract NAP data, opening hours, geo coordinates, Google Place IDs, and CID numbers for lead generation or data enrichment.

International SEO validation

Audit hreflang implementations across multilingual sites. Catch the errors that 75% of implementations contain: missing x-default fallbacks, invalid ISO codes, missing self-references.

EEAT & content analysis

Extract author information, sameAs links to verified profiles, publication dates, and publisher data from article pages. Monitor how well your content signals expertise and authority.

Social media preview testing

Verify how pages will appear when shared on Facebook (Open Graph) and Twitter/X (Twitter Cards). Check for missing images, truncated descriptions, and incomplete metadata.

Content aggregation

Build news aggregators or content feeds by extracting article metadata, publication dates, authors, and descriptions from multiple sources in a single crawl.

Example output

News article (CNN)

```json
{
  "url": "https://edition.cnn.com/2025/04/18/politics/...",
  "title": "Supreme Court temporarily pauses deportations under Alien Enemies Act",
  "canonical": "https://www.cnn.com/2025/04/18/politics/...",
  "language": {
    "htmlLang": "en",
    "contentLanguage": null,
    "ogLocale": "en_US"
  },
  "hreflang": {
    "tags": [
      { "lang": "en-gb", "url": "https://edition.cnn.com/..." },
      { "lang": "en-us", "url": "https://www.cnn.com/..." },
      { "lang": "x-default", "url": "https://edition.cnn.com/..." }
    ],
    "hasXDefault": true,
    "issues": ["Missing self-referencing hreflang tag"]
  },
  "authorInfo": {
    "name": "Tierney Sneed, John Fritze",
    "url": null,
    "sameAs": [],
    "jobTitle": null,
    "source": "meta"
  },
  "schemaTypes": ["NewsArticle", "Person", "ImageObject", "Organization", "WebPage", "NewsMediaOrganization"],
  "seoAudit": {
    "score": 79,
    "issues": [
      { "severity": "warning", "code": "TITLE_TOO_LONG", "message": "Title is 94 chars (recommended: max 60)" },
      { "severity": "warning", "code": "DESCRIPTION_TOO_LONG", "message": "Meta description is 267 chars (recommended: max 160)" },
      { "severity": "warning", "code": "MISSING_DATE_PUBLISHED", "message": "Article page missing datePublished (impacts freshness signals)" },
      { "severity": "info", "code": "NO_BREADCRUMBS", "message": "Deep page with no BreadcrumbList schema (helps navigation hierarchy)" }
    ]
  }
}
```

E-commerce product (Farfetch)

```json
{
  "url": "https://www.farfetch.com/shopping/women/jacquemus-les-doubles-sandals-item-28543291.aspx",
  "title": "Jacquemus Les Doubles Sandals | Brown | FARFETCH",
  "canonical": "https://www.farfetch.com/shopping/women/jacquemus-les-doubles-sandals-item-28543291.aspx",
  "schemaTypes": ["ProductGroup", "ImageObject", "Brand", "Product", "Offer", "MerchantReturnPolicy", "UnitPriceSpecification", "BreadcrumbList", "ListItem"],
  "breadcrumbs": {
    "items": [
      { "position": 1, "name": "Women Home", "url": "/shopping/women/items.aspx" },
      { "position": 2, "name": "Jacquemus", "url": "/shopping/women/jacquemus/items.aspx" },
      { "position": 3, "name": "Shoes", "url": "/shopping/women/jacquemus/shoes-1/items.aspx" },
      { "position": 4, "name": "Heeled Sandals", "url": "/shopping/women/jacquemus/heeled-sandals-1/items.aspx" }
    ],
    "issues": ["Relative URL in breadcrumb position 1: \"/shopping/women/items.aspx\""]
  },
  "headings": {
    "h1Count": 1,
    "issues": ["Skipped heading level: h2 to h4"]
  },
  "imageAudit": {
    "totalImages": 6,
    "imagesWithAlt": 5,
    "imagesWithoutAlt": 1,
    "issues": ["1 image(s) missing alt attribute"]
  },
  "robotsMeta": { "robots": "noindex" },
  "seoAudit": { "score": 100, "issues": [] }
}
```

Wikipedia (RDFa + hCard)

```json
{
  "url": "https://en.wikipedia.org/wiki/San_Francisco",
  "title": "San Francisco - Wikipedia",
  "canonical": "https://en.wikipedia.org/wiki/San_Francisco",
  "schemaTypes": ["Article", "Organization", "ImageObject"],
  "authorInfo": {
    "name": "Contributors to Wikimedia projects",
    "source": "json-ld"
  },
  "articleMetadata": {
    "datePublished": "2001-11-13T04:30:40Z",
    "dateModified": "2026-03-28T17:14:57Z",
    "headline": "consolidated city and county in California, United States",
    "publisher": { "name": "Wikimedia Foundation, Inc." }
  },
  "rdfa": ["... 114 RDFa items extracted ..."],
  "hCards": [{ "name": "San Francisco", "... ": "..." }],
  "imageAudit": {
    "totalImages": 119,
    "imagesWithAlt": 43,
    "imagesWithoutAlt": 76,
    "issues": ["64% of images missing alt text (accessibility + AI understanding)"]
  },
  "seoAudit": {
    "score": 80,
    "issues": [
      { "severity": "error", "code": "MISSING_DESCRIPTION", "message": "Page has no meta description" },
      { "severity": "warning", "code": "MISSING_OG_DESCRIPTION", "message": "Missing og:description meta tag" },
      { "severity": "warning", "code": "IMAGES_MISSING_ALT", "message": "64% of images missing alt text" }
    ]
  }
}
```

Technical details

  • Engine: CheerioCrawler (server-side HTML parsing, no JavaScript execution)
  • Runtime: Node.js 22, Apify SDK 3.5.3, Crawlee 3.15.3
  • Performance: Lightweight and fast — no browser overhead
  • Sessions: Automatic session rotation with cookie persistence
  • Proxies: Full proxy support including Apify Proxy residential groups
  • Retries: Up to 3 retries per request with automatic session rotation on blocks
  • Link following: Optional crawling with configurable depth

Limitations

  • No JavaScript rendering — Pages that require JS to load content (e.g., SPAs, IMDb) will return minimal data. Use a browser-based scraper for these.
  • Anti-bot protection — Some sites (Yelp, BBC, Medium) may block requests even with residential proxies. Results depend on proxy quality.
  • RDFa complexity — The RDFa extractor handles the common schema.org vocabulary case. Exotic namespace prefixes may not be fully resolved.