Pricing

Pay per usage

Metadata Extractor

A small efficient actor that loads a web page, parses its HTML using Cheerio library and extracts the following meta-data from the <HEAD> tag, such as page title, description, author etc.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Jan Čurn

Actor stats

Bookmarked

1.4K

Total users

Monthly active users

3 years ago

Last modified

Categories

Developer tools

Open source

You can access the Metadata Extractor programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

1from apify_client import ApifyClient
2
3# Initialize the ApifyClient with your Apify API token
4# Replace '<YOUR_API_TOKEN>' with your token.
5client = ApifyClient("<YOUR_API_TOKEN>")
6
7# Prepare the Actor input
8run_input = {
9    "urls": [
10        "https://www.apify.com/",
11        "https://blog.apify.com",
12    ],
13    "proxy": { "useApifyProxy": True },
14}
15
16# Run the Actor and wait for it to finish
17run = client.actor("jancurn/extract-metadata").call(run_input=run_input)
18
19# Fetch and print Actor results from the run's dataset (if there are any)
20print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
21for item in client.dataset(run["defaultDatasetId"]).iterate_items():
22    print(item)
23
24# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

Metadata Extractor API in Python

The Apify API client for Python is the official library that allows you to use Metadata Extractor API in Python, providing convenience functions and automatic retries on errors.

Install the apify-client

$pip install apify-client

Other API clients include:

Metadata Extractor API in JavaScript

Metadata Extractor API through CLI

Metadata Extractor OpenAPI definition

Metadata Extractor API

Backlinks Checker

s-r/backlinks-checker

Get the full backlink profile for any domain — domain score, referring-domain count, total-link count, and per-link records (origin URL, target URL, anchor text, follow/nofollow, dates).

Smart Article Extractor

lukaskrivka/article-extractor-smart

📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as HTML table, JSON, Excel, RSS feed, and more.

Lukáš Křivka

7.5K

4.1

Articles Extractor

web.harvester/articles-extractor

The Article Extractor is an enterprise-grade web scraping solution designed specifically for extracting structured data from news articles, blog posts, and online publications. Our advanced HTML parsing engine delivers unmatched accuracy in content extraction across thousands of websites.

Web Harvester

747

5.0

News Website Crawler & Article Extractor

xtech/news-source-crawler

Scrape all articles from any news website. Extract full text, metadata, keywords, and summaries. Ideal for content analysis, research, and news aggregation.

Xtech

394

4.8

Article Extractor & News Scraper

web.harvester/article-extractor-news-scraper

Extract articles from any news site, blog, or webpage. Get title, full text, author, date, images & metadata using 7 extraction engines (Newspaper4k, Trafilatura, Goose3). Anti-bot bypass, proxy rotation, automatic fallback. Perfect for news monitoring, NLP datasets & content aggregation.

Web Harvester

5.0

Article Content Extractor 📄

easyapi/article-content-extractor

Extract clean article content, metadata and structured information from any web page. Supports multiple URLs and returns well-formatted JSON with title, description, content, author, publish date and more. 🔍📄

EasyApi

121

Indonews Aggregator

nadpra/indonews

Automatically crawls, enriches, and summarizes news articles (artikel berita) from major Indonesian media outlets: Detik, Kompas, CNN, Beritasatu, etc. Provides sentiment, category, and emotion analysis with the latest news updates.

Nadda Pratama

180

5.0

Google News Scraper

epctex/google-news-scraper

Unlock timely news insights with our Google News data retrieval tool. Get the latest news on any news at any time, and more. Effortless and powerful. 📰🔍 #NewsData

epctex

588

5.0

Google News Realtime Scraper

devisty/google-news

Provide real-time news and articles sourced from Google News

Devisty

257

Google News Scraper

easyapi/google-news-scraper

Powerful Google News scraper, collect up to 5000 news articles with flexible search options, language support. Perfect for news aggregation, market research, and sentiment analysis. 📰🔍

EasyApi

1.6K

3.8