Metascraper — Web Metadata Extractor

Pricing

from $0.50 / 1,000 results

Extract structured metadata (title, description, author, image, publisher, date) from any web page using the metascraper library.

Developer

daehwan kim

Maintained by Community

Last modified: 11 days ago

What does Metascraper Actor do?

Metascraper Actor extracts structured metadata (title, description, author, publisher, date, image, logo, and language) from any web page in a single API call. It is powered by the open-source metascraper library (MIT license).

Provide a list of URLs and receive clean, structured JSON. No browser is required: extraction runs over plain HTTP, which is fast and works for most public pages. Results are stored in an Apify Dataset and can be downloaded as JSON, CSV, Excel, or HTML.
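To make "HTTP-based extraction" concrete, here is a minimal Python sketch of the kind of tag reading involved. It is not the metascraper library's implementation, which applies a much richer rule set. The class `MetaTagParser`, the function `extract_metadata`, and the fallback order between tags are all assumptions made for illustration.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collects <title> text, <meta> tags, and the <html lang> attribute."""

    def __init__(self):
        super().__init__()
        self.meta = {}      # maps a meta name/property to its content
        self.title = None
        self.lang = None
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "html":
            self.lang = attrs.get("lang")
        elif tag == "title":
            self._in_title = True
        elif tag == "meta":
            key = attrs.get("property") or attrs.get("name")
            content = attrs.get("content")
            if key and content:
                # Keep the first occurrence of each key.
                self.meta.setdefault(key.lower(), content.strip())

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title and self.title is None and data.strip():
            self.title = data.strip()


def extract_metadata(html):
    """Map common tags to the output field names (fallback order is assumed)."""
    parser = MetaTagParser()
    parser.feed(html)
    meta = parser.meta
    return {
        "pageTitle": meta.get("og:title") or meta.get("twitter:title") or parser.title,
        "pageDescription": meta.get("og:description") or meta.get("description"),
        "author": meta.get("author"),
        "publisher": meta.get("og:site_name"),
        "publishedDate": meta.get("article:published_time"),
        "imageUrl": meta.get("og:image") or meta.get("twitter:image"),
        "lang": parser.lang,
    }


SAMPLE_HTML = """
<html lang="en">
  <head>
    <title>Fallback title</title>
    <meta property="og:title" content="Example article">
    <meta name="description" content="A short summary.">
    <meta property="og:site_name" content="Example Site">
    <meta property="og:image" content="https://example.com/cover.png">
  </head>
  <body></body>
</html>
"""

record = extract_metadata(SAMPLE_HTML)
print(record["pageTitle"])  # Example article
```

Fields with no matching tag come back as `None`, which mirrors the `null` values in the Actor's output records.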

Why use Metascraper Actor?

  • Content research: harvest article titles, authors, publish dates, and featured images at scale.
  • SEO auditing: check Open Graph / Twitter Card metadata across pages.
  • Feed generation: build structured feeds from arbitrary web pages without RSS.
  • Data enrichment: enrich link lists with titles, descriptions, and thumbnails.
  • Scheduling: run on a schedule via Apify Console to monitor metadata changes.

How to use Metascraper Actor

  1. Open the Actor in Apify Console.
  2. In the Input tab, enter one or more URLs under URLs to scrape.
  3. Optionally set Max items to limit processing.
  4. Click Start and wait a few seconds.
  5. View results in the Output tab or download from the Dataset tab.

Input

| Field | Type | Description |
| --- | --- | --- |
| urls | string[] | List of web page URLs to process (required) |
| maxItems | integer | Max URLs to process (0 = no limit, default 10) |
| proxyConfiguration | object | Optional Apify proxy settings |

Example input:

{
  "urls": [
    "https://www.apify.com/",
    "https://crawlee.dev/"
  ],
  "maxItems": 5
}
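The same input can also be sent programmatically. The sketch below uses only the Python standard library and the Apify API's `run-sync-get-dataset-items` endpoint, which runs an Actor and returns its dataset items in one response. The helper names are illustrative, and the Actor ID and token are placeholders you must supply; this page does not state the Actor's ID.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_run_input(urls, max_items=10):
    """Build the Actor input from the documented fields (urls, maxItems)."""
    cleaned = [u.strip() for u in urls if u and u.strip()]
    if not cleaned:
        raise ValueError("urls must contain at least one URL")
    if max_items < 0:
        raise ValueError("max_items must be 0 (no limit) or a positive integer")
    return {"urls": cleaned, "maxItems": max_items}


def build_request(actor_id, token, run_input):
    """Prepare the POST request; nothing is sent until urlopen is called."""
    endpoint = f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items"
    return urllib.request.Request(
        endpoint,
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def run_actor(actor_id, token, urls, max_items=10):
    """Run the Actor synchronously and return its dataset items as a list."""
    request = build_request(actor_id, token, build_run_input(urls, max_items))
    with urllib.request.urlopen(request, timeout=300) as response:
        return json.load(response)


# Placeholder values: replace with the real Actor ID and your own API token.
# records = run_actor("<username>~<actor-name>", "<YOUR_APIFY_TOKEN>",
#                     ["https://www.apify.com/", "https://crawlee.dev/"], max_items=5)
```

The synchronous endpoint suits short runs like this one. For long URL lists, starting the run asynchronously and reading the dataset afterwards is likely the safer choice.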

Output

Each processed URL produces one JSON record in the dataset.

{
  "url": "https://www.apify.com/",
  "pageTitle": "Apify: Full-Stack Web Scraping and Data Extraction Platform",
  "pageDescription": "Extract structured data from any website at scale.",
  "author": null,
  "publisher": "Apify",
  "publishedDate": null,
  "imageUrl": "https://apify.com/og-image.png",
  "logoUrl": "https://apify.com/favicon.ico",
  "lang": "en"
}

You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.

Data fields

| Field | Description |
| --- | --- |
| url | Final URL after redirects |
| pageTitle | Page title |
| pageDescription | Meta description |
| author | Content author |
| publisher | Site publisher / brand name |
| publishedDate | Publication or last-modified date |
| imageUrl | Featured / Open Graph image |
| logoUrl | Site logo |
| lang | Page language code |
| error | Error message if the fetch failed (only present on failure) |
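Because `error` appears only on failed records, downstream code can use it to separate failures from usable results. The following Python sketch shows one way to do that and to flag records with empty fields. The function names, the default list of required fields, and the failed sample record (including its error text) are hypothetical.

```python
def split_results(records):
    """Separate successful records from failed ones using the optional `error` field."""
    succeeded = [r for r in records if not r.get("error")]
    failed = [r for r in records if r.get("error")]
    return succeeded, failed


def missing_fields(record, required=("pageTitle", "pageDescription", "imageUrl")):
    """List the required fields that are absent, null, or empty in a record."""
    return [name for name in required if not record.get(name)]


records = [
    {
        "url": "https://www.apify.com/",
        "pageTitle": "Apify: Full-Stack Web Scraping and Data Extraction Platform",
        "pageDescription": "Extract structured data from any website at scale.",
        "author": None,
        "publisher": "Apify",
        "publishedDate": None,
        "imageUrl": "https://apify.com/og-image.png",
        "logoUrl": "https://apify.com/favicon.ico",
        "lang": "en",
    },
    # Hypothetical failure record; the real error text may differ.
    {"url": "https://example.com/missing", "error": "HTTP 404"},
]

succeeded, failed = split_results(records)
for record in succeeded:
    gaps = missing_fields(record, required=("author", "publishedDate"))
    print(record["url"], "is missing:", gaps)
for record in failed:
    print(record["url"], "failed:", record["error"])
```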

Pricing / Cost estimation

This Actor uses the Pay-per-event model. Each URL processed counts as one event. At the listed starting price of $0.50 per 1,000 results, that works out to about $0.0005 per URL.

Apify offers a free tier with $5/month of platform credits — enough for thousands of metadata extractions.
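The arithmetic behind these figures can be sketched in a few lines of Python. It assumes the listed starting price of $0.50 per 1,000 results applies to every URL and ignores any other platform charges, which this page does not describe. The function names are illustrative.

```python
from decimal import Decimal

PRICE_PER_1000_RESULTS = Decimal("0.50")  # USD, the listed starting price
FREE_MONTHLY_CREDIT = Decimal("5.00")     # USD, free-tier platform credits


def estimate_cost(url_count, price_per_1000=PRICE_PER_1000_RESULTS):
    """Estimated cost in USD for processing url_count URLs (one event each)."""
    return price_per_1000 * url_count / 1000


def urls_covered(budget, price_per_1000=PRICE_PER_1000_RESULTS):
    """Number of URLs a given budget in USD covers at the given price."""
    return int(budget / price_per_1000 * 1000)


print(f"2,500 URLs cost about ${estimate_cost(2500):.2f}")
print(f"$5 of credit covers about {urls_covered(FREE_MONTHLY_CREDIT):,} URLs")
```

Under these assumptions, the free monthly credit covers roughly 10,000 URLs, consistent with "thousands of metadata extractions".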

Tips

  • For pages behind paywalls or pages that rely on heavy JavaScript rendering, consider pairing with a browser-based Actor.
  • Use maxItems during testing to avoid unnecessary costs.
  • Schedule the Actor via Apify Console to track metadata changes over time.
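Tracking metadata changes over time amounts to comparing the dataset from one scheduled run with the dataset from the next. The Actor itself does not provide this comparison, so the Python sketch below is one possible approach. The function `diff_snapshots` and the sample records are hypothetical.

```python
TRACKED_FIELDS = (
    "pageTitle", "pageDescription", "author", "publisher",
    "publishedDate", "imageUrl", "logoUrl", "lang",
)


def diff_snapshots(previous, current, fields=TRACKED_FIELDS):
    """Return {url: {field: (old, new)}} for URLs in both runs whose fields changed."""
    previous_by_url = {record["url"]: record for record in previous}
    changes = {}
    for record in current:
        old = previous_by_url.get(record["url"])
        if old is None:
            continue  # URL not seen in the previous run, nothing to compare
        changed = {
            name: (old.get(name), record.get(name))
            for name in fields
            if old.get(name) != record.get(name)
        }
        if changed:
            changes[record["url"]] = changed
    return changes


# Hypothetical records from two consecutive scheduled runs.
last_week = [{"url": "https://example.com/post", "pageTitle": "Draft title", "lang": "en"}]
this_week = [
    {"url": "https://example.com/post", "pageTitle": "Final title", "lang": "en"},
    {"url": "https://example.com/new", "pageTitle": "New page", "lang": "en"},
]

for url, changed in diff_snapshots(last_week, this_week).items():
    for name, (old, new) in changed.items():
        print(f"{url}: {name} changed from {old!r} to {new!r}")
```

Keying on `url` works here because the output reports the final URL after redirects. A page whose redirect target changes between runs would appear as a new URL rather than as a change.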

FAQ, Disclaimers, and Support

Is it legal? This Actor fetches publicly accessible web pages using standard HTTP requests, the same way a browser does. It is your responsibility to comply with each website's Terms of Service, robots.txt, and applicable laws (GDPR, CCPA, etc.). The Actor does not bypass authentication or access non-public data.

Known limitations: Pages requiring JavaScript execution or login are not supported. For JS-heavy sites, the metadata extraction may be incomplete.

Issues or feature requests? Use the Issues tab on the Actor page.


Legal Disclaimer: This Actor is an unofficial integration of metascraper. It is not affiliated with the original project.

