Metadata Extractor
Pricing
Pay per usage
Go to Apify Store
Metadata Extractor
A small efficient actor that loads a web page, parses its HTML using Cheerio library and extracts the following meta-data from the <HEAD> tag, such as page title, description, author etc.
Pricing
Pay per usage
Rating
0.0
(0)
Developer

Jan Čurn
Maintained by Community
Actor stats
15
Bookmarked
1.2K
Total users
20
Monthly active users
2 years ago
Last modified
Categories
Share
The actor takes a list of URLs of web pages on input, loads the HTML, and then extracts metadata from the HTML. The result is stored as a JSON file into the default dataset.
For example, for https://www.apify.com, the JSON result looks as follows:
{"url": "https://www.apify.com/","title": "Web Scraping, Data Extraction and Automation · Apify","meta": {"X-UA-Compatible": "IE=edge,chrome=1","viewport": "width=device-width,minimum-scale=1,initial-scale=1","copyright": "Copyright© 2019 Apify Technologies s.r.o. All rights reserved.","keywords": "web scraper, web crawler, scraping, data extraction, API","robots": "index,follow","referrer": "origin","googlebot": "index,follow","description": "Apify extracts data from websites, crawls lists of URLs and automates workflows on the web. Turn any website into an API in a few minutes!","twitter:card": "summary_large_image","twitter:creator": "@apify","fb:app_id": "1636933253245869","og:url": "https://apify.com/","og:type": "website","og:title": "Web Scraping, Data Extraction and Automation · Apify","og:description": "Apify extracts data from websites, crawls lists of URLs and automates workflows on the web. Turn any website into an API in a few minutes!","og:image": "https://apify.com/img/og-image.png","og:image:alt": "Apify","og:image:width": "1200","og:image:height": "630","og:locale": "en_IE","og:site_name": "Apify","next-head-count": "19"}}
