Pricing

$2.50/month + usage

Try for free

Go to Apify Store

URL to Metadata

Try for free

Developed by

njoylab

A powerful Apify actor that extracts essential website information, including title, description, images, and social media links. Perfect for quick data gathering and insights from any URL.

5.0 (2)

Pricing

$2.50/month + usage

Last modified

3 months ago

Automation

Lead generation

Website URL to Metadata

This project is a web scraping tool designed to extract metadata from websites. It uses libraries like Axios for HTTP requests, Cheerio for HTML parsing, and Apify SDK for actor management. The scraper can fetch metadata such as titles, descriptions, social media links, and more from a webpage.

Features

Fetches metadata from a given URL.
Supports custom user-agent strings.
Respects robots.txt rules unless explicitly ignored.
Extracts social media links and contact information.
Extracts external links.

Usage

Prepare the input:

It should include at least the url field.

{
  "url": "https://example.com",
  "language": "en-US",
  "ignoreRobots": false,
  "ignoreExternalLinks": false,
  "ignoreInteralLinks": false
}

Output:

{
  "title": "Example Domain",
  "description": "This domain is for use in illustrative examples in documents.",
  "keywords": "example, domain, illustrative, examples, documents",
  "image": "https://example.com/image.png",
  "facebook": "https://facebook.com/example",
  "x": "https://twitter.com/example",
  "linkedin": "https://linkedin.com/company/example",
  "instagram": "https://instagram.com/example",
  "youtube": "https://youtube.com/example",
 "trustpilot": "https://trustpilot.com/review/example.com",
  "canonical": "https://example.com",
  "url_fetched": "https://example.com",
  "url": "https://example.com",
  "mail": "contact@example.com",
  "robotsAllow": true,
  "linksExternal": ["https://example.com/external1", "https://example.com/external2"],
  "linksInternal": ["https://example.com/about", "https://example.com/contacts"]
}

Configuration

User-Agent: The scraper uses a random user-agent string for each request to mimic a real browser.
Language: You can specify the Accept-Language header in the input payload.
Robots.txt: By default, the scraper respects robots.txt rules. Set ignoreRobots to true in the input payload to bypass this.
External Links: By default, the scraper extracts external links. Set ignoreExternalLinks to true in the input payload to bypass this.
Internal Links: By default, the scraper extracts internal links. Set ignoreInternalLinks to true in the input payload to bypass this.

On this page

Website URL to Metadata

Share Actor:

🔗✨ Link Extractor Pro: URL to HTML List Downloader

dainty_screw/link-extractor-pro-url-to-html-list-downloader

Maximize productivity with HTML URL List Downloader. Quickly extract, manage, and organize URLs from HTML pages. Ideal for SEO professionals and digital marketers. Streamline your workflow today!

codemaster devops

135

My Actor

david15999/my-actor

HTML scraper

David Emanuel Moreira

Metadata Extractor

jancurn/extract-metadata

A small efficient actor that loads a web page, parses its HTML using Cheerio library and extracts the following meta-data from the <HEAD> tag, such as page title, description, author etc.

Jan Čurn

1.3K

Get Metadata

maged120/get-metadata

The actor extracts comprehensive metadata including image previews, titles, descriptions, author, time of publish, fav icon, and a lot more

Maged

5.0

HTML Scraper pro

scrapingxpert/html-scraper-pro

The HTML Scraper Pro is a powerful tool designed to extract the HTML source code and metadata from websites. It uses advanced web scraping techniques to retrieve the full HTML content of web pages,page title and HTTP status code.This tool is ideal for data extraction, website analysis, and archiving

scrapingxpert

115

Extract Website With URL

mrahil/extract-website-with-url

The Extract Website with URL API allows users to extract structured data from any webpage by providing a URL. It retrieves HTML, metadata, tables, and images, returning data in JSON format. Ideal for web scraping, SEO analysis, and content extraction. Use it for e-commerce data, news scraping

Mohammed Rahil

104

Metadata Scraper

louisdeconinck/metadata-scraper

Automatically scrape metadata such as title, description, heading and article from websites. It will crawl the start URLs and then scrape the metadata from the detail pages automatically navigating through the pagination.

Louis Deconinck

5.0

Download HTML from URLs

mtrunkat/url-list-download-html

This actor takes a list of URLs and downloads HTML of each page.

Marek Trunkát

8.7K

Html To Markdown Converter 📄

powerful_bachelor/html-to-markdown-converter

📄✨ HTML to Markdown Converter transforms web pages into clean, portable Markdown. Simply input a URL to extract content while preserving structure, formatting, and media elements.🔄 Perfect for content repurposing, documentation, and creating readable, platform-independent text from any webpage! 🚀

Powerful Bachelor

Metadata Scraper

autofacts/metadata-scraper

A powerful web scraper that extracts various types of structured metadata from web pages, including JSON-LD, Microdata, Open Graph, Twitter Cards, and more. Perfect for SEO analysis, content aggregation, and research purposes.