# Favicon & Brand Asset Extractor (`santamaria-automations/favicon-extractor`) Actor

Extract favicons, Apple touch icons, PWA manifest icons, logos, and brand colors from any website. Perfect for link preview generation, brand research, directory building, and ML training datasets. Export data, run via API, schedule and monitor runs, or integrate with other tools.

- **URL**: https://apify.com/santamaria-automations/favicon-extractor.md
- **Developed by:** [Alessandro Santamaria](https://apify.com/santamaria-automations) (community)
- **Categories:** Developer tools, Lead generation
- **Stats:** 2 total users, 1 monthly users, 100.0% runs succeeded, NaN bookmarks
- **User rating**: No ratings yet

## Pricing

from $2.00 / 1,000 website extracteds

This Actor is paid per event. You are not charged for the Apify platform usage, but only a fixed price for specific events.

Learn more: https://docs.apify.com/platform/actors/running/actors-in-store#pay-per-event

## What's an Apify Actor?

Actors are a software tools running on the Apify platform, for all kinds of web data extraction and automation use cases.
In Batch mode, an Actor accepts a well-defined JSON input, performs an action which can take anything from a few seconds to a few hours,
and optionally produces a well-defined JSON output, datasets with results, or files in key-value store.
In Standby mode, an Actor provides a web server which can be used as a website, API, or an MCP server.
Actors are written with capital "A".

## How to integrate an Actor?

If asked about integration, you help developers integrate Actors into their projects.
You adapt to their stack and deliver integrations that are safe, well-documented, and production-ready.
The best way to integrate Actors is as follows.

In JavaScript/TypeScript projects, use official [JavaScript/TypeScript client](https://docs.apify.com/api/client/js.md):

```bash
npm install apify-client
```

In Python projects, use official [Python client library](https://docs.apify.com/api/client/python.md):

```bash
pip install apify-client
```

In shell scripts, use [Apify CLI](https://docs.apify.com/cli/docs.md):

````bash
# MacOS / Linux
curl -fsSL https://apify.com/install-cli.sh | bash
# Windows
irm https://apify.com/install-cli.ps1 | iex
```bash

In AI frameworks, you might use the [Apify MCP server](https://docs.apify.com/platform/integrations/mcp.md).

If your project is in a different language, use the [REST API](https://docs.apify.com/api/v2.md).

For usage examples, see the [API](#api) section below.

For more details, see Apify documentation as [Markdown index](https://docs.apify.com/llms.txt) and [Markdown full-text](https://docs.apify.com/llms-full.txt).


# README

## Favicon & Brand Asset Extractor

Extract favicons, app icons, and brand colors from any website — perfect for link previews, directories, and ML datasets.

One API call gives you every icon variant a website exposes: the classic `/favicon.ico`, HTML `<link rel="icon">` entries at every resolution, Apple touch icons for iOS home screens, PWA manifest icons for Android, Microsoft tile images, plus Open Graph and Twitter Card preview images — and the brand's theme color. Build rich link preview cards, bookmark thumbnails, or directory listings in minutes.

### Features

- **All favicon variants** — Standard favicons, shortcut icons, SVG mask icons, `/favicon.ico` probe
- **Apple Touch Icons** — All sizes used by iOS/macOS home screens
- **PWA Web App Manifest** — Parses `manifest.json` for Android icons, app name, theme color, background color
- **Microsoft tiles** — `msapplication-TileColor` and `TileImage`
- **Social previews** — Open Graph image and Twitter Card image
- **Logo detection** — Schema.org `logo` property + `<img class="logo">` heuristics
- **Theme color** — `<meta name="theme-color">` for brand color sampling
- **Optional base64 download** — Fetch the primary icon bytes when you want to store them offline
- **Relative URLs resolved** — All returned URLs are absolute and ready to use
- **Fast & cheap** — minimal compute usage keeps your bill low

### Example output

```json
{
  "website": "https://www.apple.com",
  "final_url": "https://www.apple.com/",
  "page_title": "Apple",
  "favicon_url": "https://www.apple.com/favicon.ico",
  "favicons": [
    { "url": "https://www.apple.com/favicon.ico", "size": "", "type": "image/x-icon" }
  ],
  "default_favicon_ico_exists": true,
  "apple_touch_icon": "https://www.apple.com/apple-touch-icon.png",
  "apple_touch_icons": [
    { "url": "https://www.apple.com/apple-touch-icon.png", "size": "" }
  ],
  "manifest_url": "",
  "manifest_name": "",
  "manifest_icons": [],
  "theme_color": "#000000",
  "og_image": "https://www.apple.com/ac/structured-data/images/open_graph_logo.png",
  "twitter_image": "",
  "logo_url": "https://www.apple.com/ac/globalnav/7/en_US/images/be15095f-5a20-57d0-ad14-cf4c638e223a/globalnav_apple_image__b5er5ngrzxqq_large.svg",
  "scraped_at": "2026-04-07T10:30:00Z"
}
````

### Use cases

- **Link preview cards** — Build Slack, WhatsApp, or Discord-style link preview tiles with the correct hi-res icon for every URL
- **Bookmark managers** — Enrich saved links with crisp favicons instead of broken `/favicon.ico` fallbacks
- **Directory sites** — Display clean logos for listed businesses without manual upload
- **Brand research** — Analyze visual identity across hundreds of competitors at scale
- **ML training data** — Build logo and icon datasets for computer vision models
- **Competitor monitoring** — Detect brand refreshes by diffing icons and theme colors over time
- **Link aggregators** — Add visual identity to URL lists, RSS readers, and curated feeds

### Input

| Field | Type | Default | Description |
|---|---|---|---|
| `websites` | `string[]` | — | URLs to extract brand assets from |
| `downloadImages` | `boolean` | `false` | Fetch and base64-encode the primary icon |
| `checkManifest` | `boolean` | `true` | Parse the Web App Manifest when present |
| `timeoutSeconds` | `integer` | `15` | Per-request fetch timeout |
| `maxConcurrency` | `integer` | `5` | How many websites to process in parallel |
| `proxyConfiguration` | `object` | — | Optional Apify Proxy configuration |

### Pricing

- **Start:** $0.001 per run
- **Per website:** $0.002

**1,000 websites ≈ $2.00** — fast, bulk, and predictable.

### Issues & Feedback

Found a bug or want a new field? [Open an issue](https://console.apify.com/actors/ACTOR_ID_PLACEHOLDER/issues) — feedback is read and acted on quickly.

### Related Actors

- [SEO Metadata Extractor](https://apify.com/santamaria-automations/seo-metadata-extractor) — Title, meta, Open Graph, Schema.org, SEO score
- [Website Tech Stack Detector](https://apify.com/santamaria-automations/website-tech-detector) — CMS, framework, analytics, CDN
- [Domain WHOIS & DNS](https://apify.com/santamaria-automations/domain-whois-dns) — Registrar, creation date, DNS records
- [Image Metadata Extractor](https://apify.com/santamaria-automations/image-metadata-extractor) — EXIF, dimensions, format

# Actor input Schema

## `websites` (type: `array`):

List of websites to extract brand assets from. Include the scheme (https://) for best results. One URL per line.

## `downloadImages` (type: `boolean`):

Download the primary favicon and include it as base64 in the output. Careful: bloats dataset size.

## `checkManifest` (type: `boolean`):

Fetch and parse the PWA manifest.json when present to extract app name, colors, and manifest icons.

## `timeoutSeconds` (type: `integer`):

How long to wait for a website to respond before giving up.

## `maxConcurrency` (type: `integer`):

How many websites to analyze in parallel.

## `proxyConfiguration` (type: `object`):

Select a proxy to use when fetching pages. Use Apify Proxy to avoid rate limits on large runs.

## Actor input object example

```json
{
  "websites": [
    "https://www.apple.com",
    "https://github.com",
    "https://stripe.com",
    "https://www.wikipedia.org"
  ],
  "downloadImages": false,
  "checkManifest": true,
  "timeoutSeconds": 15,
  "maxConcurrency": 5,
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}
```

# Actor output Schema

## `defaultDataset` (type: `string`):

Per-website favicon and brand asset extraction results.

# API

You can run this Actor programmatically using our API. Below are code examples in JavaScript, Python, and CLI, as well as the OpenAPI specification and MCP server setup.

## JavaScript example

```javascript
import { ApifyClient } from 'apify-client';

// Initialize the ApifyClient with your Apify API token
// Replace the '<YOUR_API_TOKEN>' with your token
const client = new ApifyClient({
    token: '<YOUR_API_TOKEN>',
});

// Prepare Actor input
const input = {
    "websites": [
        "https://www.apple.com",
        "https://github.com",
        "https://stripe.com",
        "https://www.wikipedia.org"
    ],
    "proxyConfiguration": {
        "useApifyProxy": false
    }
};

// Run the Actor and wait for it to finish
const run = await client.actor("santamaria-automations/favicon-extractor").call(input);

// Fetch and print Actor results from the run's dataset (if any)
console.log('Results from dataset');
console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.dir(item);
});

// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

```

## Python example

```python
from apify_client import ApifyClient

# Initialize the ApifyClient with your Apify API token
# Replace '<YOUR_API_TOKEN>' with your token.
client = ApifyClient("<YOUR_API_TOKEN>")

# Prepare the Actor input
run_input = {
    "websites": [
        "https://www.apple.com",
        "https://github.com",
        "https://stripe.com",
        "https://www.wikipedia.org",
    ],
    "proxyConfiguration": { "useApifyProxy": False },
}

# Run the Actor and wait for it to finish
run = client.actor("santamaria-automations/favicon-extractor").call(run_input=run_input)

# Fetch and print Actor results from the run's dataset (if there are any)
print("💾 Check your data here: https://console.apify.com/storage/datasets/" + run["defaultDatasetId"])
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)

# 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/python/docs/quick-start

```

## CLI example

```bash
echo '{
  "websites": [
    "https://www.apple.com",
    "https://github.com",
    "https://stripe.com",
    "https://www.wikipedia.org"
  ],
  "proxyConfiguration": {
    "useApifyProxy": false
  }
}' |
apify call santamaria-automations/favicon-extractor --silent --output-dataset

```

## MCP server setup

```json
{
    "mcpServers": {
        "apify": {
            "command": "npx",
            "args": [
                "mcp-remote",
                "https://mcp.apify.com/?tools=santamaria-automations/favicon-extractor",
                "--header",
                "Authorization: Bearer <YOUR_API_TOKEN>"
            ]
        }
    }
}

```

## OpenAPI specification

```json
{
    "openapi": "3.0.1",
    "info": {
        "title": "Favicon & Brand Asset Extractor",
        "description": "Extract favicons, Apple touch icons, PWA manifest icons, logos, and brand colors from any website. Perfect for link preview generation, brand research, directory building, and ML training datasets. Export data, run via API, schedule and monitor runs, or integrate with other tools.",
        "version": "1.0",
        "x-build-id": "0D2mDGKyIdhaeRYTo"
    },
    "servers": [
        {
            "url": "https://api.apify.com/v2"
        }
    ],
    "paths": {
        "/acts/santamaria-automations~favicon-extractor/run-sync-get-dataset-items": {
            "post": {
                "operationId": "run-sync-get-dataset-items-santamaria-automations-favicon-extractor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        },
        "/acts/santamaria-automations~favicon-extractor/runs": {
            "post": {
                "operationId": "runs-sync-santamaria-automations-favicon-extractor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor and returns information about the initiated run in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK",
                        "content": {
                            "application/json": {
                                "schema": {
                                    "$ref": "#/components/schemas/runsResponseSchema"
                                }
                            }
                        }
                    }
                }
            }
        },
        "/acts/santamaria-automations~favicon-extractor/run-sync": {
            "post": {
                "operationId": "run-sync-santamaria-automations-favicon-extractor",
                "x-openai-isConsequential": false,
                "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
                "tags": [
                    "Run Actor"
                ],
                "requestBody": {
                    "required": true,
                    "content": {
                        "application/json": {
                            "schema": {
                                "$ref": "#/components/schemas/inputSchema"
                            }
                        }
                    }
                },
                "parameters": [
                    {
                        "name": "token",
                        "in": "query",
                        "required": true,
                        "schema": {
                            "type": "string"
                        },
                        "description": "Enter your Apify token here"
                    }
                ],
                "responses": {
                    "200": {
                        "description": "OK"
                    }
                }
            }
        }
    },
    "components": {
        "schemas": {
            "inputSchema": {
                "type": "object",
                "required": [
                    "websites"
                ],
                "properties": {
                    "websites": {
                        "title": "Websites",
                        "type": "array",
                        "description": "List of websites to extract brand assets from. Include the scheme (https://) for best results. One URL per line.",
                        "items": {
                            "type": "string"
                        }
                    },
                    "downloadImages": {
                        "title": "Download primary icon",
                        "type": "boolean",
                        "description": "Download the primary favicon and include it as base64 in the output. Careful: bloats dataset size.",
                        "default": false
                    },
                    "checkManifest": {
                        "title": "Parse Web App Manifest",
                        "type": "boolean",
                        "description": "Fetch and parse the PWA manifest.json when present to extract app name, colors, and manifest icons.",
                        "default": true
                    },
                    "timeoutSeconds": {
                        "title": "Fetch timeout (seconds)",
                        "minimum": 3,
                        "maximum": 60,
                        "type": "integer",
                        "description": "How long to wait for a website to respond before giving up.",
                        "default": 15
                    },
                    "maxConcurrency": {
                        "title": "Max concurrency",
                        "minimum": 1,
                        "maximum": 20,
                        "type": "integer",
                        "description": "How many websites to analyze in parallel.",
                        "default": 5
                    },
                    "proxyConfiguration": {
                        "title": "Proxy configuration",
                        "type": "object",
                        "description": "Select a proxy to use when fetching pages. Use Apify Proxy to avoid rate limits on large runs.",
                        "default": {
                            "useApifyProxy": false
                        }
                    }
                }
            },
            "runsResponseSchema": {
                "type": "object",
                "properties": {
                    "data": {
                        "type": "object",
                        "properties": {
                            "id": {
                                "type": "string"
                            },
                            "actId": {
                                "type": "string"
                            },
                            "userId": {
                                "type": "string"
                            },
                            "startedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "finishedAt": {
                                "type": "string",
                                "format": "date-time",
                                "example": "2025-01-08T00:00:00.000Z"
                            },
                            "status": {
                                "type": "string",
                                "example": "READY"
                            },
                            "meta": {
                                "type": "object",
                                "properties": {
                                    "origin": {
                                        "type": "string",
                                        "example": "API"
                                    },
                                    "userAgent": {
                                        "type": "string"
                                    }
                                }
                            },
                            "stats": {
                                "type": "object",
                                "properties": {
                                    "inputBodyLen": {
                                        "type": "integer",
                                        "example": 2000
                                    },
                                    "rebootCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "restartCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "resurrectCount": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "computeUnits": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "options": {
                                "type": "object",
                                "properties": {
                                    "build": {
                                        "type": "string",
                                        "example": "latest"
                                    },
                                    "timeoutSecs": {
                                        "type": "integer",
                                        "example": 300
                                    },
                                    "memoryMbytes": {
                                        "type": "integer",
                                        "example": 1024
                                    },
                                    "diskMbytes": {
                                        "type": "integer",
                                        "example": 2048
                                    }
                                }
                            },
                            "buildId": {
                                "type": "string"
                            },
                            "defaultKeyValueStoreId": {
                                "type": "string"
                            },
                            "defaultDatasetId": {
                                "type": "string"
                            },
                            "defaultRequestQueueId": {
                                "type": "string"
                            },
                            "buildNumber": {
                                "type": "string",
                                "example": "1.0.0"
                            },
                            "containerUrl": {
                                "type": "string"
                            },
                            "usage": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "integer",
                                        "example": 1
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            },
                            "usageTotalUsd": {
                                "type": "number",
                                "example": 0.00005
                            },
                            "usageUsd": {
                                "type": "object",
                                "properties": {
                                    "ACTOR_COMPUTE_UNITS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATASET_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "KEY_VALUE_STORE_WRITES": {
                                        "type": "number",
                                        "example": 0.00005
                                    },
                                    "KEY_VALUE_STORE_LISTS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_READS": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "REQUEST_QUEUE_WRITES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_INTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "DATA_TRANSFER_EXTERNAL_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                                        "type": "integer",
                                        "example": 0
                                    },
                                    "PROXY_SERPS": {
                                        "type": "integer",
                                        "example": 0
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}
```
