Metascraper — Web Metadata Extractor
Pricing
from $0.50 / 1,000 results
Metascraper — Web Metadata Extractor
Extract structured metadata (title, description, author, image, publisher, date) from any web page using the metascraper library.
Pricing
from $0.50 / 1,000 results
Rating
0.0
(0)
Developer
daehwan kim
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
11 days ago
Last modified
Categories
Share
What does Metascraper Actor do?
Metascraper Actor extracts structured metadata from any web page — title, description, author, publisher, date, image, logo, and language — in a single API call. It is powered by the open-source metascraper library (MIT license).
Provide a list of URLs and receive clean, structured JSON. No browser required — fast HTTP-based extraction works for most public pages. Results are stored in an Apify Dataset and downloadable in JSON, CSV, Excel, or HTML.
Why use Metascraper Actor?
- Content research: harvest article titles, authors, publish dates, and featured images at scale.
- SEO auditing: check Open Graph / Twitter Card metadata across pages.
- Feed generation: build structured feeds from arbitrary web pages without RSS.
- Data enrichment: enrich link lists with titles, descriptions, and thumbnails.
- Scheduling: run on a schedule via Apify Console to monitor metadata changes.
How to use Metascraper Actor
- Open the Actor in Apify Console.
- In the Input tab, enter one or more URLs under URLs to scrape.
- Optionally set Max items to limit processing.
- Click Start and wait a few seconds.
- View results in the Output tab or download from the Dataset tab.
Input
| Field | Type | Description |
|---|---|---|
urls | string[] | List of web page URLs to process (required) |
maxItems | integer | Max URLs to process (0 = no limit, default 10) |
proxyConfiguration | object | Optional Apify proxy settings |
Example input:
{"urls": ["https://www.apify.com/","https://crawlee.dev/"],"maxItems": 5}
Output
Each processed URL produces one JSON record in the dataset.
{"url": "https://www.apify.com/","pageTitle": "Apify: Full-Stack Web Scraping and Data Extraction Platform","pageDescription": "Extract structured data from any website at scale.","author": null,"publisher": "Apify","publishedDate": null,"imageUrl": "https://apify.com/og-image.png","logoUrl": "https://apify.com/favicon.ico","lang": "en"}
You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.
Data fields
| Field | Description |
|---|---|
url | Final URL after redirects |
pageTitle | Page title |
pageDescription | Meta description |
author | Content author |
publisher | Site publisher / brand name |
publishedDate | Publish or modified date |
imageUrl | Featured / Open Graph image |
logoUrl | Site logo |
lang | Page language code |
error | Error message if fetch failed (only present on failure) |
Pricing / Cost estimation
This Actor uses the Pay-per-event model. Each URL processed counts as one event. Typical cost is fractions of a cent per URL.
Apify offers a free tier with $5/month of platform credits — enough for thousands of metadata extractions.
Tips
- For pages behind paywalls or heavy JS rendering, consider pairing with a browser-based Actor.
- Use
maxItemsduring testing to avoid unnecessary costs. - Schedule the Actor via Apify Console to track metadata changes over time.
FAQ, Disclaimers, and Support
Is it legal? This Actor fetches publicly accessible web pages using standard HTTP requests, the same way a browser does. It is your responsibility to comply with each website's Terms of Service, robots.txt, and applicable laws (GDPR, CCPA, etc.). The Actor does not bypass authentication or access non-public data.
Known limitations: Pages requiring JavaScript execution or login are not supported. For JS-heavy sites, the metadata extraction may be incomplete.
Issues or feature requests? Use the Issues tab on the Actor page.
Legal Disclaimer: This Actor is an unofficial integration of metascraper. Not affiliated with original project.
🔗 Related Actors by ntriqpro
Extend this actor with the ntriqpro intelligence network:
- video-intelligence-analyzer — Video/image AI analyzer (transcripts, scenes, objects)
- vehicle-damage-assessment — Vehicle damage AI assessment
- table-chart-extractor — Table and chart data extraction
⭐ Love it? Leave a Review
Your rating helps professionals discover this actor. Rate it here.
