Fetch Branding
Pricing
$15.00 / 1,000 results
Apify Actor for extracting branding information from websites including logos, colors, metadata, and social links.
Developer: Jotunweb
Actor stats
- Bookmarked: 0
- Total users: 87
- Monthly active users: 16
- Issues response: 36 days
- Last modified: 15 days ago
Website Branding Extractor
An Apify Actor that extracts comprehensive branding information from websites including logos, colors, metadata, and social media links.
Features
- Logo Extraction: Favicon, apple-touch-icon, Open Graph images, and common logo selectors
- Color Detection: CSS custom properties and inline style colors
- Metadata Extraction: Meta tags, Open Graph, and Twitter Card data
- Social Media Links: Detects links to major social platforms
- Error Handling: Comprehensive error reporting for failed extractions
- Configurable Output: Filter results by success status and limit processing
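The color detection feature listed above can be illustrated with a small sketch. This is not the Actor's code, which this page does not show: it uses plain regular expressions instead of Cheerio, handles only hex colors, and the name `extractColors` and the returned shape are assumptions made for illustration.

```typescript
// Hypothetical result shape: hex colors found in CSS custom properties
// and in inline style attributes.
interface FoundColors {
  customProperties: { [name: string]: string };
  inline: string[];
}

// Hypothetical helper, not part of the Actor's published API.
function extractColors(html: string): FoundColors {
  const customProperties: { [name: string]: string } = {};
  const inline: string[] = [];
  let m: RegExpExecArray | null;

  // CSS custom properties whose value is a hex color, e.g. "--primary: #007bff".
  const propRe = /--([\w-]+)\s*:\s*(#[0-9a-fA-F]{3,8})\b/g;
  while ((m = propRe.exec(html)) !== null) {
    const propName = m[1] ?? "";
    const value = m[2] ?? "";
    customProperties[propName] = value.toLowerCase();
  }

  // Hex colors inside double-quoted style="..." attributes, de-duplicated.
  const styleRe = /style\s*=\s*"([^"]*)"/gi;
  while ((m = styleRe.exec(html)) !== null) {
    const style = m[1] ?? "";
    const hexRe = /#[0-9a-fA-F]{3,8}\b/g;
    let h: RegExpExecArray | null;
    while ((h = hexRe.exec(style)) !== null) {
      const color = h[0].toLowerCase();
      if (inline.indexOf(color) === -1) inline.push(color);
    }
  }
  return { customProperties, inline };
}
```

How the Actor decides which detected color counts as "primary" or "secondary" is not described here, so the sketch returns the raw findings only.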
How it works
- Input Processing: Accepts URLs via the startUrls array
- HTTP Fetching: Uses Axios with timeout, user-agent spoofing, and redirect handling
- HTML Parsing: Parses content with Cheerio for data extraction
- Multi-faceted Extraction: Simultaneously extracts logos, colors, metadata, and social links
- Result Filtering: Applies configurable filters based on success status and limits
- Data Storage: Stores results in Apify Dataset with multiple view options
Input Configuration
The Actor accepts the following input parameters:
URLs
- Start URLs: Array of websites to extract branding from
Extraction Options
- Extract Logos: Enable/disable logo detection (default: true)
- Extract Colors: Enable/disable color extraction (default: true)
- Extract Metadata: Enable/disable metadata extraction (default: true)
- Extract Social Links: Enable/disable social media link detection (default: true)
Request Configuration
- Timeout: Request timeout in milliseconds (default: 30000)
- Max Redirects: Maximum redirects to follow (default: 5)
- User Agent: Custom user agent string
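As a rough illustration, the request settings above map naturally onto Axios request options (`timeout`, `maxRedirects`, `headers`). The input field names `timeout`, `maxRedirects`, and `userAgent` and the helper `buildRequestOptions` are assumptions; the Actor's real input keys may differ.

```typescript
// Hypothetical input shape; the field names are assumptions.
interface RequestInput {
  timeout?: number;
  maxRedirects?: number;
  userAgent?: string;
}

// The subset of Axios request options this sketch fills in.
interface RequestOptions {
  timeout: number;
  maxRedirects: number;
  headers: { [name: string]: string };
}

// Applies the documented defaults (30000 ms timeout, 5 redirects).
function buildRequestOptions(input: RequestInput): RequestOptions {
  const headers: { [name: string]: string } = {};
  if (input.userAgent) headers["User-Agent"] = input.userAgent;
  return {
    timeout: input.timeout ?? 30000,
    maxRedirects: input.maxRedirects ?? 5,
    headers,
  };
}
```

With Axios installed, the result could be passed as the second argument of `axios.get(url, options)`.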
Output Options
- Include Failed URLs: Include failed extractions in output (default: true)
- Max Results: Maximum number of URLs to process (default: 1000)
- Only Successful: Filter to only successful extractions (default: false)
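The interaction of these three options can be sketched as follows. The option names are assumptions, and so is the precedence: this sketch drops failed results whenever "Only Successful" is on, even if "Include Failed URLs" is also on, which the description above does not confirm.

```typescript
// Hypothetical option names mirroring the Output Options above.
interface OutputOptions {
  includeFailed?: boolean;
  maxResults?: number;
  onlySuccessful?: boolean;
}

interface ResultLike {
  url: string;
  success: boolean;
}

// "Max Results" limits how many input URLs are processed (default 1000).
function limitUrls(urls: string[], options: OutputOptions): string[] {
  return urls.slice(0, options.maxResults ?? 1000);
}

// Keeps failed results only when includeFailed is on and onlySuccessful is off.
function filterResults<T extends ResultLike>(results: T[], options: OutputOptions): T[] {
  const includeFailed = options.includeFailed ?? true;
  const onlySuccessful = options.onlySuccessful ?? false;
  const keepFailed = includeFailed && !onlySuccessful;
  return results.filter((r) => r.success || keepFailed);
}
```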
Proxy Configuration
- Proxy Configuration: Optional Apify proxy settings
Output Data
Each extracted result contains:
{
  "url": "https://example.com",
  "success": true,
  "title": "Example Site",
  "description": "An example website description",
  "logo": [
    {
      "url": "https://example.com/favicon.ico",
      "type": "favicon",
      "sizes": "32x32"
    }
  ],
  "colors": {
    "primary": "#007bff",
    "secondary": "#6c757d"
  },
  "metadata": {
    "keywords": "example, website",
    "ogTitle": "Example Site",
    "twitterCard": "summary"
  },
  "socialLinks": [
    "https://twitter.com/example",
    "https://facebook.com/example"
  ],
  "timestamp": "2024-01-01T12:00:00.000Z"
}
For failed extractions, the result includes an error field with details about the failure.
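For consumers of the dataset, the result shape above could be typed roughly as below. The interface is inferred from the single example, so which fields are optional is a guess, and typing `error` as a plain string is an assumption; `describeResult` is a hypothetical helper.

```typescript
interface LogoEntry {
  url: string;
  type: string;
  sizes?: string;
}

// Inferred from the example result; optionality is an assumption.
interface BrandingResult {
  url: string;
  success: boolean;
  title?: string;
  description?: string;
  logo?: LogoEntry[];
  colors?: { [name: string]: string };
  metadata?: { [name: string]: string };
  socialLinks?: string[];
  timestamp: string;
  error?: string; // assumed to be a message string on failed extractions
}

// Hypothetical helper: one-line summary of a result.
function describeResult(r: BrandingResult): string {
  if (!r.success) return `FAILED ${r.url}: ${r.error ?? "unknown error"}`;
  const logos = r.logo ? r.logo.length : 0;
  const links = r.socialLinks ? r.socialLinks.length : 0;
  return `OK ${r.url}: ${logos} logo(s), ${links} social link(s)`;
}
```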
Dataset Views
The Actor provides multiple views of the extracted data:
- Overview: Basic results without error details
- Detailed: Complete data including errors and metadata
- Logo Assets: Individual logo entries for analysis
- Failed Extractions: Only URLs that failed processing
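On the Apify platform, views like these are typically declared in the Actor's dataset schema rather than computed in code. Purely to illustrate what the Logo Assets and Failed Extractions views contain, the same reshaping could be done client-side as sketched here; the row field names and both helper names are hypothetical.

```typescript
interface LogoItem {
  url: string;
  type: string;
  sizes?: string;
}

interface ResultWithLogos {
  url: string;
  success: boolean;
  logo?: LogoItem[];
}

// Hypothetical row shape: one row per logo entry.
interface LogoRow {
  pageUrl: string;
  logoUrl: string;
  type: string;
  sizes: string | null;
}

// Flattens results into individual logo entries ("Logo Assets").
function toLogoRows(results: ResultWithLogos[]): LogoRow[] {
  const rows: LogoRow[] = [];
  for (const r of results) {
    for (const l of r.logo ?? []) {
      rows.push({ pageUrl: r.url, logoUrl: l.url, type: l.type, sizes: l.sizes ?? null });
    }
  }
  return rows;
}

// Keeps only results that failed processing ("Failed Extractions").
function failedOnly<T extends { success: boolean }>(results: T[]): T[] {
  return results.filter((r) => !r.success);
}
```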
Getting Started
Local Development
# Install dependencies
npm install
# Run locally
apify run
# Build for production
npm run build
Deploy to Apify
- Connect Git Repository:
  - Go to Actor creation page
  - Click Link Git Repository
- Push from Local Machine:
  # Login to Apify
  apify login
  # Deploy Actor
  apify push
Supported Platforms
Logo Sources
- Favicon links
- Apple touch icons
- Open Graph images
- Common logo selectors (alt/src/class containing "logo")
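The four logo sources above can be sketched without any dependencies. The Actor itself parses HTML with Cheerio; this sketch swaps in regular expressions so it stays self-contained, and it only understands double-quoted attributes. The names `parseAttrs` and `findLogos` and the `type` labels other than "favicon" are assumptions.

```typescript
interface LogoCandidate {
  url: string;
  type: string;
  sizes?: string;
}

// Parses the double-quoted attributes of one tag into a name -> value map.
function parseAttrs(tag: string): { [name: string]: string } {
  const attrs: { [name: string]: string } = {};
  const re = /([\w:-]+)\s*=\s*"([^"]*)"/g;
  let m: RegExpExecArray | null;
  while ((m = re.exec(tag)) !== null) {
    attrs[(m[1] ?? "").toLowerCase()] = m[2] ?? "";
  }
  return attrs;
}

// Hypothetical helper covering favicon, apple-touch-icon, og:image, and <img> heuristics.
function findLogos(html: string): LogoCandidate[] {
  const logos: LogoCandidate[] = [];
  const tagRe = /<(link|meta|img)\b[^>]*>/gi;
  let m: RegExpExecArray | null;
  while ((m = tagRe.exec(html)) !== null) {
    const tag = (m[1] ?? "").toLowerCase();
    const a = parseAttrs(m[0]);
    const href = a["href"];
    const content = a["content"];
    const src = a["src"];
    const rel = (a["rel"] ?? "").toLowerCase();
    if (tag === "link" && href) {
      let type: string | null = null;
      if (rel.indexOf("apple-touch-icon") !== -1) type = "apple-touch-icon";
      else if (rel.split(/\s+/).indexOf("icon") !== -1) type = "favicon";
      if (type !== null) {
        const entry: LogoCandidate = { url: href, type };
        if (a["sizes"]) entry.sizes = a["sizes"];
        logos.push(entry);
      }
    } else if (tag === "meta" && a["property"] === "og:image" && content) {
      logos.push({ url: content, type: "og:image" });
    } else if (tag === "img" && src) {
      // "Common logo selectors": alt, src, or class containing "logo".
      const haystack = [a["alt"], src, a["class"]].join(" ").toLowerCase();
      if (haystack.indexOf("logo") !== -1) logos.push({ url: src, type: "img" });
    }
  }
  return logos;
}
```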
Social Media Platforms
- Twitter/X
- YouTube
- TikTok
- Snapchat
- GitHub
- GitLab
- Bitbucket
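Detecting these platforms usually comes down to matching a link's hostname. A minimal sketch follows; the domain for each platform is an assumption (the list above names platforms, not domains), as is the helper name `socialPlatform`.

```typescript
// Hostname -> platform label. The domains are assumptions.
const SOCIAL_HOSTS: { [host: string]: string } = {
  "twitter.com": "Twitter/X",
  "x.com": "Twitter/X",
  "youtube.com": "YouTube",
  "tiktok.com": "TikTok",
  "snapchat.com": "Snapchat",
  "github.com": "GitHub",
  "gitlab.com": "GitLab",
  "bitbucket.org": "Bitbucket",
};

// Returns the platform label for an absolute URL, or null if it is not recognized.
function socialPlatform(link: string): string | null {
  let host: string;
  try {
    host = new URL(link).hostname.toLowerCase();
  } catch {
    return null; // not an absolute URL
  }
  host = host.replace(/^(www|m)\./, "");
  if (!Object.prototype.hasOwnProperty.call(SOCIAL_HOSTS, host)) return null;
  return SOCIAL_HOSTS[host] ?? null;
}
```

Matching on the parsed hostname rather than on a substring avoids false positives such as a path that merely contains "twitter.com".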
Metadata Standards
- Basic HTML meta tags
- Open Graph protocol
- Twitter Cards
- Viewport and charset information
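The metadata standards above all surface as `<meta>` tags, so a single pass can collect them. This is a regex-based sketch rather than the Actor's Cheerio code; it keeps the raw keys (for example "og:title"), whereas the sample output uses camelCase keys such as ogTitle, a mapping not reproduced here. The helper names are hypothetical.

```typescript
// Reads one double-quoted attribute from a tag, or null if absent.
function readAttr(tag: string, attrName: string): string | null {
  const re = new RegExp("\\s" + attrName + '\\s*=\\s*"([^"]*)"', "i");
  const m = re.exec(tag);
  return m ? (m[1] ?? "") : null;
}

// Collects name/property -> content pairs, plus charset and <title>.
function extractMetadata(html: string): { [key: string]: string } {
  const meta: { [key: string]: string } = {};
  const tagRe = /<meta\b[^>]*>/gi;
  let m: RegExpExecArray | null;
  while ((m = tagRe.exec(html)) !== null) {
    const tag = m[0];
    const charset = readAttr(tag, "charset");
    if (charset !== null) meta["charset"] = charset;
    // Basic meta tags and Twitter Cards use name=, Open Graph uses property=.
    const key = readAttr(tag, "name") ?? readAttr(tag, "property");
    const content = readAttr(tag, "content");
    if (key !== null && content !== null) meta[key.toLowerCase()] = content;
  }
  const title = /<title[^>]*>([^<]*)<\/title>/i.exec(html);
  if (title) meta["title"] = (title[1] ?? "").trim();
  return meta;
}
```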
Error Handling
The Actor provides detailed error messages for common scenarios:
- Network Issues: Domain not found, connection refused
- HTTP Errors: 403 (forbidden), 404 (not found), 429 (rate limited), 5xx (server errors)
- Timeouts: Request timeout handling
- Invalid URLs: URL format validation
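The scenarios above could be mapped to messages along these lines. The exact wording of the Actor's error messages is not given, so the strings are placeholders; the error codes are the ones Node.js and Axios commonly report (ENOTFOUND, ECONNREFUSED, and ECONNABORTED or ETIMEDOUT for timeouts). `FetchFailure`, `describeFailure`, and `isValidHttpUrl` are hypothetical names.

```typescript
// Hypothetical summary of a failed request: a network error code or an HTTP status.
interface FetchFailure {
  code?: string;
  status?: number;
}

function describeFailure(f: FetchFailure): string {
  if (f.code === "ENOTFOUND") return "Domain not found";
  if (f.code === "ECONNREFUSED") return "Connection refused";
  if (f.code === "ECONNABORTED" || f.code === "ETIMEDOUT") return "Request timed out";
  if (f.status === 403) return "Access forbidden (403)";
  if (f.status === 404) return "Page not found (404)";
  if (f.status === 429) return "Rate limited (429)";
  if (f.status !== undefined && f.status >= 500 && f.status <= 599) {
    return `Server error (${f.status})`;
  }
  return "Unknown error";
}

// URL format validation: accept only absolute http(s) URLs.
function isValidHttpUrl(value: string): boolean {
  try {
    const u = new URL(value);
    return u.protocol === "http:" || u.protocol === "https:";
  } catch {
    return false;
  }
}
```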
Technical Details
- Framework: Built with Apify SDK and TypeScript
- HTTP Client: Axios with comprehensive error handling
- HTML Parser: Cheerio for server-side DOM manipulation
- Rate Limiting: 1-second delay between requests for multiple URLs
- URL Resolution: Automatic conversion of relative URLs to absolute URLs
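The last two points can be sketched with standard APIs only: the WHATWG `URL` constructor resolves a relative reference against the page URL, and a simple promise-based pause spaces out requests. The helper names are hypothetical, and the 1000 ms default simply mirrors the delay stated above.

```typescript
// Resolves href against the page URL; returns null if either part is invalid.
function toAbsoluteUrl(href: string, pageUrl: string): string | null {
  try {
    return new URL(href, pageUrl).href;
  } catch {
    return null;
  }
}

// Resolves after the given number of milliseconds.
function pause(ms: number): Promise<void> {
  return new Promise<void>((resolve) => {
    setTimeout(resolve, ms);
  });
}

// Processes URLs one at a time, waiting pauseMs between consecutive requests.
async function processSequentially<T>(
  urls: string[],
  handle: (url: string) => Promise<T>,
  pauseMs: number = 1000,
): Promise<T[]> {
  const out: T[] = [];
  let first = true;
  for (const url of urls) {
    if (!first) await pause(pauseMs);
    first = false;
    out.push(await handle(url));
  }
  return out;
}
```

For instance, `toAbsoluteUrl("/favicon.ico", "https://example.com/about/")` yields `https://example.com/favicon.ico`.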