WebSight API — AI-Native Web Intelligence
Pricing
Pay per usage
WebSight API — AI-Native Web Intelligence
One API call for 7 types of web intelligence from any URL: clean markdown, tech stack, SEO audit, contacts, structured data, AI score, domain intel. Token-optimized for AI agents. Cached.
Pricing
Pay per usage
Rating
0.0
(0)
Developer
George Kioko
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
8 hours ago
Last modified
Categories
Share
WebSight API
One API call. Seven intelligence modules. Any URL.
Extract clean content, tech stack, SEO scores, contacts, structured data, AI detection, and domain intel — all optimized for AI agents and LLM applications.
The Problem
Building an AI agent that needs web intelligence? Today you need:
| Task | Separate API | Cost |
|---|---|---|
| Content extraction | Diffbot / Jina | $0.01+ |
| Tech stack detection | BuiltWith / Wappalyzer | $0.02+ |
| SEO analysis | Moz / Ahrefs API | $0.05+ |
| Contact extraction | Hunter.io | $0.01+ |
| Structured data | Custom parser | Dev time |
| AI content detection | GPTZero / Originality | $0.01+ |
| Domain intel | WHOIS API | $0.01+ |
| Total per page | 7 APIs, 7 integrations | $0.12+ |
WebSight does ALL of this in one call for $0.005.
How It Works
flowchart LRA["URL Input"] --> B["Fetch & Parse"]B --> C1["Clean Content"]B --> C2["Tech Stack"]B --> C3["SEO Audit"]B --> C4["Contacts"]B --> C5["Structured Data"]B --> C6["AI Score"]B --> C7["Domain Intel"]C1 --> D["Combined JSON"]C2 --> DC3 --> DC4 --> DC5 --> DC6 --> DC7 --> DD --> E["1hr Cache"]E --> F["Response"]style A fill:#3B82F6,color:#fff,stroke:nonestyle B fill:#8B5CF6,color:#fff,stroke:nonestyle C1 fill:#10B981,color:#fff,stroke:nonestyle C2 fill:#10B981,color:#fff,stroke:nonestyle C3 fill:#10B981,color:#fff,stroke:nonestyle C4 fill:#10B981,color:#fff,stroke:nonestyle C5 fill:#10B981,color:#fff,stroke:nonestyle C6 fill:#10B981,color:#fff,stroke:nonestyle C7 fill:#10B981,color:#fff,stroke:nonestyle D fill:#F59E0B,color:#fff,stroke:nonestyle E fill:#EF4444,color:#fff,stroke:nonestyle F fill:#3B82F6,color:#fff,stroke:none
All 7 modules execute in parallel. Average response time under 2 seconds.
Architecture
graph TBsubgraph "Client Layer"AI["AI Agents<br/>Claude, GPT, Gemini"]MCP["MCP Servers<br/>Cursor, Windsurf"]APP["Applications<br/>JS, Python, cURL"]endsubgraph "WebSight API"LB["HTTP Server<br/>Standby Mode"]CACHE["In-Memory Cache<br/>1hr TTL, LRU"]SSRF["SSRF Protection<br/>Private IP Block"]subgraph "Parallel Module Engine"M1["Content Extractor<br/>Markdown / Text / HTML"]M2["Tech Detector<br/>40+ Technologies"]M3["SEO Analyzer<br/>Score 0-100"]M4["Contact Finder<br/>Emails, Phones, Social"]M5["Structured Parser<br/>OG, Twitter, JSON-LD"]M6["AI Scorer<br/>5-Factor Heuristic"]M7["Domain Intel<br/>DNS, SPF, DMARC, WHOIS"]endendsubgraph "Infrastructure"APIFY["Apify Platform"]PPE["Pay-Per-Event<br/>$0.005/page"]endAI --> LBMCP --> LBAPP --> LBLB --> CACHELB --> SSRFSSRF --> M1 & M2 & M3 & M4 & M5 & M6 & M7LB --> APIFYAPIFY --> PPEstyle AI fill:#8B5CF6,color:#fff,stroke:nonestyle MCP fill:#8B5CF6,color:#fff,stroke:nonestyle APP fill:#8B5CF6,color:#fff,stroke:nonestyle LB fill:#3B82F6,color:#fff,stroke:nonestyle CACHE fill:#F59E0B,color:#fff,stroke:nonestyle SSRF fill:#EF4444,color:#fff,stroke:nonestyle M1 fill:#10B981,color:#fff,stroke:nonestyle M2 fill:#10B981,color:#fff,stroke:nonestyle M3 fill:#10B981,color:#fff,stroke:nonestyle M4 fill:#10B981,color:#fff,stroke:nonestyle M5 fill:#10B981,color:#fff,stroke:nonestyle M6 fill:#10B981,color:#fff,stroke:nonestyle M7 fill:#10B981,color:#fff,stroke:nonestyle APIFY fill:#00B3E6,color:#fff,stroke:nonestyle PPE fill:#00B3E6,color:#fff,stroke:none
Why WebSight
vs. Using 7 Separate APIs
| Feature | 7 Separate APIs | WebSight API |
|---|---|---|
| API integrations to maintain | 7 | 1 |
| Authentication tokens | 7 | 1 |
| Cost per page | ~$0.12 | $0.005 |
| Response calls needed | 7 sequential | 1 parallel |
| Latency | 5-10s total | <2s |
| Caching | Build your own | Built-in (free) |
| Bulk support | Manual orchestration | Native (20 URLs) |
vs. Raw HTML for AI Agents
| Metric | Raw HTML | WebSight Markdown |
|---|---|---|
| Tokens (typical page) | ~15,000 | ~2,000 |
| Token reduction | 0% | ~87% |
| Noise (ads, nav, scripts) | Included | Removed |
| Format for LLMs | Unparseable | Clean Markdown |
| Structured metadata | Must parse | Pre-extracted JSON |
| Cost at GPT-4 rates ($10/1M tokens) | $0.15/page | $0.02/page |
Token Savings Example
Raw HTML of stripe.com homepage:<html><head><meta charset="utf-8"><meta name="viewport"...<script>window.__NEXT_DATA__={...15KB of JSON...}</script><nav class="TopNav">...<div class="cookie-banner">...========================~18,400 tokens ($0.184 at GPT-4)WebSight markdown output:# Stripe: Financial Infrastructure for the InternetMillions of companies use Stripe to accept payments...## Products- Payments: Accept online and in-person payments- Billing: Build and scale subscriptions...========================~1,800 tokens ($0.018 at GPT-4)Token savings: 90% | Cost savings: $0.166 per page
Quick Start
cURL
# Full analysis — all 7 modulescurl "https://george-the-developer--websight-api.apify.actor/analyze?url=https://stripe.com"# Selective modules — only what you needcurl "https://george-the-developer--websight-api.apify.actor/analyze?url=https://stripe.com&modules=tech,seo,contacts"# Bulk analysis — up to 20 URLscurl -X POST "https://george-the-developer--websight-api.apify.actor/analyze/bulk" \-H "Content-Type: application/json" \-d '{"urls": ["https://stripe.com", "https://shopify.com", "https://vercel.com"]}'
JavaScript / Node.js
// Using the Apify Standby endpointconst BASE = 'https://george-the-developer--websight-api.apify.actor';// Single URL — full analysisconst response = await fetch(`${BASE}/analyze?url=https://stripe.com`);const data = await response.json();console.log(data.modules.content.text); // Clean markdownconsole.log(data.modules.tech.detected); // [{name: "React", category: "framework"}, ...]console.log(data.modules.seo.score); // 78console.log(data.modules.contacts.emails); // ["sales@stripe.com"]console.log(data.modules.aiContent.verdict); // "likely_human"// Selective modules — save processing timeconst seoOnly = await fetch(`${BASE}/analyze?url=https://stripe.com&modules=seo,tech`);// Bulk analysisconst bulk = await fetch(`${BASE}/analyze/bulk`, {method: 'POST',headers: { 'Content-Type': 'application/json' },body: JSON.stringify({urls: ['https://stripe.com', 'https://shopify.com'],modules: ['tech', 'seo', 'contacts']})});const results = await bulk.json();console.log(`Analyzed ${results.total} pages, charged for ${results.charged}`);
Python
import requestsBASE = "https://george-the-developer--websight-api.apify.actor"# Single URLresp = requests.get(f"{BASE}/analyze", params={"url": "https://stripe.com"})data = resp.json()# Access modulesprint(data["modules"]["content"]["text"][:200]) # First 200 chars of markdownprint(data["modules"]["tech"]["detected"]) # Tech stackprint(data["modules"]["seo"]["score"]) # SEO score 0-100print(data["modules"]["domain"]["dns"]["a"]) # IP addresses# Bulk analysisbulk = requests.post(f"{BASE}/analyze/bulk", json={"urls": ["https://stripe.com", "https://shopify.com", "https://vercel.com"],"modules": ["tech", "contacts"]})for result in bulk.json()["results"]:print(f"{result['url']}: {len(result['modules']['tech']['detected'])} technologies found")
AI Agent Integration (LangChain / Claude Tools)
// Define as a tool for your AI agentconst websightTool = {name: 'analyze_website',description: 'Extract comprehensive intelligence from any URL: content, tech stack, SEO, contacts, AI detection, domain info',parameters: {url: { type: 'string', description: 'URL to analyze' },modules: { type: 'string', description: 'Comma-separated: content,tech,seo,contacts,structured,ai_score,domain' }},execute: async ({ url, modules }) => {const params = new URLSearchParams({ url });if (modules) params.set('modules', modules);const res = await fetch(`https://george-the-developer--websight-api.apify.actor/analyze?${params}`);return res.json();}};// Your agent can now call:// "Analyze stripe.com and tell me what tech stack they use"// "Check if this blog post was written by AI"// "Find contact information for this company"
MCP Server (Coming Soon)
WebSight API is designed to work as an MCP tool for AI coding assistants like Cursor, Windsurf, and Claude Desktop. Native MCP server configuration will be published soon.
{"mcpServers": {"websight": {"command": "npx","args": ["@anthropic/websight-mcp"],"env": {"WEBSIGHT_API_URL": "https://george-the-developer--websight-api.apify.actor"}}}}
API Reference
Endpoints
| Method | Path | Description |
|---|---|---|
GET | /analyze?url=<url> | Full analysis (all modules) |
GET | /analyze?url=<url>&modules=tech,seo | Selective module analysis |
POST | /analyze | JSON body: { url, modules?, contentFormat?, maxContentLength? } |
POST | /analyze/bulk | JSON body: { urls: [...], modules? } (max 20 URLs) |
GET | / | Health check + API documentation |
Query Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
url | string | required | URL to analyze |
modules | string | all | Comma-separated module list |
format | string | markdown | Content format: markdown, text, html |
maxLength | integer | 10000 | Max content characters (500-100,000) |
Available Modules
| Module | Key | What It Returns |
|---|---|---|
| Clean Content | content | Token-optimized markdown/text/HTML. Strips ads, nav, footers, cookie banners, scripts. 80-90% token reduction. |
| Tech Stack | tech | Array of detected technologies with categories. 40+ techs: CMS (WordPress, Shopify, Wix), Frameworks (React, Next.js, Vue, Angular, Svelte), Analytics (GA, GTM, Hotjar, Segment), Infrastructure (Cloudflare, Vercel, AWS), Payments (Stripe, PayPal), Security (reCAPTCHA, hCaptcha). |
| SEO Audit | seo | Score 0-100, title analysis (length + optimal range), meta description audit, heading structure (H1/H2), canonical URL, robots meta, viewport, favicon, image alt coverage, internal/external link counts, word count. |
| Contacts | contacts | Emails (filtered, deduplicated), phone numbers, social profiles: LinkedIn, Twitter/X, Facebook, Instagram, GitHub, YouTube. |
| Structured Data | structured | OpenGraph tags (title, image, description, type), Twitter Card data, JSON-LD schemas (parsed and normalized). |
| AI Content Score | ai_score | Score 0-100 with verdict (ai_generated, likely_ai, mixed, likely_human, human_written). 5-factor analysis: sentence uniformity, AI phrase density, vocabulary diversity (TTR), readability (Flesch-Kincaid), paragraph uniformity. |
| Domain Intel | domain | DNS records (A, AAAA, MX, NS), SPF validation, DMARC check, WHOIS data (creation date, registrar, domain age in days). |
Input Schema
When running as a standard Apify Actor (non-API mode):
{"url": "https://stripe.com","urls": ["https://stripe.com", "https://shopify.com"],"modules": ["content", "tech", "seo", "contacts", "structured", "ai_score", "domain"],"contentFormat": "markdown","maxContentLength": 10000}
Output Example
Full response from GET /analyze?url=https://stripe.com:
{"url": "https://stripe.com","analyzedAt": "2026-03-22T14:30:00.000Z","fetchInfo": {"statusCode": 200,"responseTimeMs": 847,"contentLength": 198432},"modules": {"content": {"format": "markdown","text": "# Stripe: Financial Infrastructure for the Internet\n\nMillions of companies of all sizes use Stripe online and in person to accept payments, send payouts, automate financial processes, and ultimately grow revenue.\n\n## Products\n\n- Payments: Accept online and in-person payments\n- Billing: Build and scale your recurring business model\n- Connect: Set up multi-party payments for platforms\n- Terminal: Build in-person payment experiences\n...","charCount": 4823},"tech": {"detected": [{ "name": "React", "category": "framework" },{ "name": "Next.js", "category": "framework" },{ "name": "Cloudflare", "category": "infra" },{ "name": "Google Analytics", "category": "analytics" },{ "name": "Google Tag Manager", "category": "analytics" },{ "name": "Segment", "category": "analytics" },{ "name": "Stripe", "category": "payment" }]},"seo": {"score": 82,"title": {"text": "Stripe | Financial Infrastructure for the Internet","length": 51,"optimal": true},"metaDescription": {"text": "Stripe powers online and in-person payment processing and financial solutions for businesses of all sizes.","length": 106,"optimal": true},"headings": {"h1": ["Financial Infrastructure for the Internet"],"h2": ["Products", "Use cases", "Integrations & custom solutions", "Get started"]},"canonical": "https://stripe.com","robots": "","viewport": true,"favicon": true,"images": { "total": 34, "withAlt": 30, "withoutAlt": 4 },"links": { "internal": 87, "external": 12 },"wordCount": 2341},"contacts": {"emails": ["sales@stripe.com", "support@stripe.com"],"phones": [],"socialProfiles": {"twitter": "https://twitter.com/stripe","linkedin": "https://www.linkedin.com/company/stripe","github": "https://github.com/stripe","youtube": "https://www.youtube.com/stripe"}},"structured": {"openGraph": {"title": "Stripe | Financial Infrastructure for the Internet","description": "Stripe powers online and in-person payment processing...","image": "https://images.ctfassets.net/fzn2n1nzq965/og-image.png","type": "website","url": "https://stripe.com"},"twitterCard": {"card": "summary_large_image","site": "@stripe","title": "Stripe | Financial Infrastructure for the Internet"},"jsonLd": [{"@context": "https://schema.org","@type": "Organization","name": "Stripe","url": "https://stripe.com"}]},"aiContent": {"score": 28,"verdict": "likely_human","factors": [{ "name": "sentence_uniformity", "score": 30, "weight": 15 },{ "name": "ai_phrase_density", "score": 10, "weight": 20 },{ "name": "vocabulary_diversity", "score": 50, "weight": 10 },{ "name": "readability", "score": 50, "weight": 8 },{ "name": "paragraph_uniformity", "score": 20, "weight": 8 }]},"domain": {"hostname": "stripe.com","dns": {"a": ["3.18.12.63", "3.18.1.20"],"aaaa": [],"mx": [{ "priority": 1, "exchange": "aspmx.l.google.com" },{ "priority": 5, "exchange": "alt1.aspmx.l.google.com" }],"ns": ["ns-423.awsdns-52.com", "ns-1408.awsdns-48.org"],"hasSPF": true,"hasDMARC": true},"age": {"created": "2009-09-16T04:00:00Z","registrar": "SafeNames Ltd.","daysOld": 6032}}},"processingTimeMs": 1843}
Use Cases
mindmaproot((WebSight API))AI AgentsResearch assistantsContent summarizersCompetitive analysis botsLead qualification agentsLead GenerationContact extraction at scaleTech stack targetingCompany enrichmentSEO ToolsSite auditingCompetitor benchmarkingContent quality scoringCompetitive IntelligenceTech stack monitoringDomain age verificationSocial presence mappingContent AnalysisAI content detectionReadability scoringStructured data validationSecurity & TrustDomain reputation checksSPF/DMARC verificationSSL & DNS auditing
Pricing
$0.005 per page analyzed (half a cent).
Cached responses within 1 hour are free and return in <500ms.
| Volume | Cost | Effective Price |
|---|---|---|
| 100 pages | $0.50 | $0.005/page |
| 1,000 pages | $5.00 | $0.005/page |
| 10,000 pages | $50.00 | $0.005/page |
| 100,000 pages | $500.00 | $0.005/page |
Cost comparison for analyzing 1,000 pages:
| Solution | Monthly Cost |
|---|---|
| BuiltWith + Hunter + Moz + GPTZero + ... | $120+ |
| WebSight API (all 7 modules) | $5.00 |
| Savings | 96% |
Free tier available on Apify. Start with $5 of free platform credits.
Features at a Glance
| Feature | Details |
|---|---|
| Modules | 7 intelligence modules, all in one call |
| Content formats | Markdown (default), plain text, HTML |
| Tech detection | 40+ technologies across 7 categories |
| SEO scoring | 0-100 weighted score, 12 ranking factors |
| AI detection | 5-factor heuristic analysis, 5 confidence levels |
| DNS analysis | A, AAAA, MX, NS, SPF, DMARC records |
| WHOIS | Domain age, registrar, creation date |
| Caching | 1-hour TTL, LRU eviction, cached = free |
| Bulk | Up to 20 URLs per request |
| SSRF protection | Private IP blocking built-in |
| CORS | Enabled for all origins |
| Timeout | 15s per URL fetch, 300s actor timeout |
| Memory | 2GB default allocation |
FAQ
Links
- Apify Store: george.the.developer/websight-api
- API Endpoint:
https://george-the-developer--websight-api.apify.actor - Author: george.the.developer on Apify
Built by george.the.developer