Metadata Extractor avatar
Metadata Extractor

Pricing

Pay per usage

Go to Apify Store
Metadata Extractor

Metadata Extractor

A small efficient actor that loads a web page, parses its HTML using Cheerio library and extracts the following meta-data from the <HEAD> tag, such as page title, description, author etc.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Jan Čurn

Jan Čurn

Maintained by Community

Actor stats

15

Bookmarked

1.2K

Total users

20

Monthly active users

2 years ago

Last modified

Share

The actor takes a list of URLs of web pages on input, loads the HTML, and then extracts metadata from the HTML. The result is stored as a JSON file into the default dataset.

For example, for https://www.apify.com, the JSON result looks as follows:

{
"url": "https://www.apify.com/",
"title": "Web Scraping, Data Extraction and Automation · Apify",
"meta": {
"X-UA-Compatible": "IE=edge,chrome=1",
"viewport": "width=device-width,minimum-scale=1,initial-scale=1",
"copyright": "Copyright&copy; 2019 Apify Technologies s.r.o. All rights reserved.",
"keywords": "web scraper, web crawler, scraping, data extraction, API",
"robots": "index,follow",
"referrer": "origin",
"googlebot": "index,follow",
"description": "Apify extracts data from websites, crawls lists of URLs and automates workflows on the web. Turn any website into an API in a few minutes!",
"twitter:card": "summary_large_image",
"twitter:creator": "@apify",
"fb:app_id": "1636933253245869",
"og:url": "https://apify.com/",
"og:type": "website",
"og:title": "Web Scraping, Data Extraction and Automation · Apify",
"og:description": "Apify extracts data from websites, crawls lists of URLs and automates workflows on the web. Turn any website into an API in a few minutes!",
"og:image": "https://apify.com/img/og-image.png",
"og:image:alt": "Apify",
"og:image:width": "1200",
"og:image:height": "630",
"og:locale": "en_IE",
"og:site_name": "Apify",
"next-head-count": "19"
}
}