Technology Profiling Engine

Identify the technologies used by any website: React, WordPress, Shopify, Google Analytics, and 7,000+ more. Deep 8-tier inspection with confidence scores. Like Wappalyzer but 10x cheaper: $5 per 1,000 URLs.

Developer: Benoit Eveillard

Maintained by Community

Why Use This Actor?

  • 7,000+ Technology Signatures - The most comprehensive detection database
  • 8-Tier Deep Inspection - Goes beyond HTML: headers, DOM, JavaScript, network, storage, PWA, WebSocket, cookies
  • Confidence Scoring - Know how certain each detection is
  • 10x Cheaper - Just $5 per 1,000 URLs (vs $295+/month for BuiltWith)
  • Batch Processing - Scan hundreds of URLs in one run
  • Bot Protection Bypass - Stealth browser handles Cloudflare and other protections

What It Detects

| Category | Examples |
| --- | --- |
| Frameworks | React, Vue, Angular, Next.js, Nuxt.js, Svelte, Astro |
| CMS & Builders | WordPress, Webflow, Shopify, Wix, Squarespace, Drupal |
| Analytics | Google Analytics, Plausible, PostHog, Mixpanel, Amplitude |
| Marketing | HubSpot, Intercom, Google Tag Manager, Mailchimp, Segment |
| UI Frameworks | Tailwind CSS, Bootstrap, Material-UI, Chakra UI |
| CDN | Cloudflare, Fastly, Akamai, AWS CloudFront, Vercel |
| Payment | Stripe, PayPal, Square, Braintree |
| And more... | 100+ categories, 7,000+ technologies |

Use Cases

  • Competitive Analysis - See what tech your competitors use
  • Lead Generation - Find companies using specific technologies
  • Sales Intelligence - Qualify leads by their tech sophistication
  • Market Research - Analyze technology trends across industries
  • Security Audits - Identify outdated or vulnerable technologies

Input

{
    "urls": ["https://example.com", "https://another-site.com"],
    "timeout": 60,
    "proxyType": "datacenter"
}

Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| urls (required) | array/string | - | URLs to analyze |
| timeout | integer | 30 | Page timeout in seconds (10-120) |
| proxyType | string | datacenter | datacenter or residential (for protected sites) |
| concurrency | integer | 5 | Concurrent pages (1-50) |
| minConfidence | integer | 50 | Minimum confidence score (0-100) |
| includeEvidence | boolean | false | Include detection evidence in output |
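
The ranges in the table above can be checked on the client before a run is started. The following is a minimal sketch, not part of the Actor: `build_input`, `LIMITS`, and `PROXY_TYPES` are hypothetical names, and the limits are simply copied from the table.

```python
from typing import Any

# Allowed ranges copied from the Parameters table (assumed to be inclusive).
LIMITS = {
    "timeout": (10, 120),
    "concurrency": (1, 50),
    "minConfidence": (0, 100),
}
PROXY_TYPES = {"datacenter", "residential"}


def build_input(urls: list[str], **options: Any) -> dict[str, Any]:
    """Return a run input dict, raising ValueError on unknown or out-of-range options."""
    if not urls:
        raise ValueError("urls is required and must not be empty")
    for name, value in options.items():
        if name in LIMITS:
            low, high = LIMITS[name]
            if not low <= value <= high:
                raise ValueError(f"{name} must be between {low} and {high}, got {value}")
        elif name == "proxyType":
            if value not in PROXY_TYPES:
                raise ValueError(f"proxyType must be one of {sorted(PROXY_TYPES)}")
        elif name != "includeEvidence":
            raise ValueError(f"unknown parameter: {name}")
    # Options that were not passed are left out so the Actor applies its defaults.
    return {"urls": urls, **options}
```

For example, `build_input(["https://example.com"], timeout=60, proxyType="residential")` returns a dict that can be passed as `run_input`, while `concurrency=51` raises a `ValueError` before any run is billed.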

URL Formats

All these formats work:

// JSON array
["https://example.com", "https://example.org"]
// Line-separated
"https://example.com\nhttps://example.org"
// Comma-separated
"https://example.com, https://example.org"
// Single URL
"https://example.com"
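
If you prefer to send one consistent shape, the formats above can be normalized to a plain list on the client side. This is a minimal sketch of one way to do that; `normalize_urls` is a hypothetical helper, and it is not a description of how the Actor parses its input.

```python
import re
from typing import Union


def normalize_urls(urls: Union[list, str]) -> list:
    """Turn a list, or a newline- or comma-separated string, into a clean URL list."""
    if isinstance(urls, str):
        # Split on commas and newlines; a single URL yields a one-item list.
        parts = re.split(r"[,\n]", urls)
    else:
        parts = list(urls)
    # Trim surrounding whitespace and drop empty entries.
    return [part.strip() for part in parts if part.strip()]
```

Splitting on commas is a simplification: a URL that itself contains a comma would be cut in two, so pass such URLs as a JSON array instead.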

Output

{
    "url": "https://www.example.com",
    "scan_timestamp": 1704067200,
    "status": "success",
    "technologies": [
        {
            "name": "React",
            "category": "JavaScript frameworks",
            "categories": ["JavaScript frameworks"],
            "groups": [{"id": 9, "name": "Web development"}],
            "confidence_score": 85,
            "confidence_tier": "Medium",
            "version": "18.2.0"
        },
        {
            "name": "Cloudflare",
            "category": "CDN",
            "categories": ["CDN"],
            "groups": [{"id": 7, "name": "Servers"}],
            "confidence_score": 60,
            "confidence_tier": "Medium"
        }
    ],
    "metadata": {
        "duration_ms": 8500
    }
}

Output Fields

| Field | Description |
| --- | --- |
| url | Scanned URL |
| status | success, failed, or timeout |
| technologies[].name | Technology name |
| technologies[].category | Primary category |
| technologies[].confidence_score | Detection confidence (0-100) |
| technologies[].confidence_tier | High (90+), Medium (50-89), Low (<50) |
| technologies[].version | Version if detected |
| metadata.duration_ms | Scan duration |
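
The tier thresholds and fields above lend themselves to simple post-processing, for example to find sites that use a given technology. The sketch below assumes items shaped like the sample output; `confidence_tier` and `sites_using` are hypothetical helpers, not functions provided by the Actor.

```python
def confidence_tier(score: int) -> str:
    """Map a 0-100 confidence score to the tier names from the Output Fields table."""
    if score >= 90:
        return "High"
    if score >= 50:
        return "Medium"
    return "Low"


def sites_using(items: list, name: str, min_score: int = 50) -> list:
    """Return URLs of successful scans where `name` was detected at or above min_score."""
    matches = []
    for item in items:
        # Skip failed or timed-out scans; they carry no reliable detections.
        if item.get("status") != "success":
            continue
        for tech in item.get("technologies", []):
            if tech["name"] == name and tech["confidence_score"] >= min_score:
                matches.append(item["url"])
                break  # one match per URL is enough
    return matches
```

Applied to the sample output, `sites_using(items, "React")` would include `https://www.example.com`, while `sites_using(items, "Cloudflare", min_score=90)` would not, because Cloudflare was detected with a score of 60.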

How It Works

8-Tier Deep Inspection

Unlike simple HTML scrapers, this Actor inspects 8 different layers:

| Tier | What It Checks | Example Detections |
| --- | --- | --- |
| 1 | HTTP Headers | Cloudflare, nginx, security headers |
| 2 | DOM/HTML | Meta tags, script sources, CSS classes |
| 3 | JavaScript | Global variables (React, Vue, jQuery) |
| 4 | Network | XHR/Fetch to analytics, CDNs |
| 5 | Storage | localStorage keys (Redux, auth) |
| 6 | PWA | Service workers, manifest.json |
| 7 | WebSocket | Real-time connections (Socket.io) |
| 8 | Cookies | Cookie names (tracking, session) |

This catches technologies that surface-level scrapers miss.
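
To make the idea of signature matching concrete, here is a minimal sketch of what checks at tier 1 (HTTP headers) and tier 8 (cookie names) could look like. It is an illustration only: the three signatures are hand-written assumptions, and the Actor's real database, matching logic, and confidence scoring are not shown here.

```python
import re

# Hypothetical signatures for illustration; not the Actor's signature database.
HEADER_SIGNATURES = {
    "Cloudflare": ("server", re.compile(r"^cloudflare$", re.I)),
    "nginx": ("server", re.compile(r"^nginx", re.I)),
}
COOKIE_SIGNATURES = {
    "Google Analytics": re.compile(r"^_ga"),
}


def detect(headers: dict, cookie_names: list) -> list:
    """Match response headers (tier 1) and cookie names (tier 8) against the signatures."""
    # Header names are case-insensitive, so compare them in lowercase.
    lowered = {key.lower(): value for key, value in headers.items()}
    found = []
    for tech, (header, pattern) in HEADER_SIGNATURES.items():
        value = lowered.get(header)
        if value is not None and pattern.search(value):
            found.append({"name": tech, "tier": 1, "evidence": f"{header}: {value}"})
    for tech, pattern in COOKIE_SIGNATURES.items():
        for cookie in cookie_names:
            if pattern.search(cookie):
                found.append({"name": tech, "tier": 8, "evidence": f"cookie {cookie}"})
                break  # report each technology once
    return found
```

A site that answers with `Server: cloudflare` and sets a `_ga` cookie would yield two detections, one per tier. Evidence found in several tiers is presumably what lets a detector report higher confidence, but how this Actor weights tiers is not documented here.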

API Usage

Python

from apify_client import ApifyClient

# Authenticate with your Apify API token
client = ApifyClient("YOUR_API_TOKEN")

# Start the Actor and wait for the run to finish
run = client.actor("justa/technology-profiling-engine").call(
    run_input={"urls": ["https://example.com"]}
)

# Each dataset item is the scan result for one URL
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(f"{item['url']}: {len(item['technologies'])} technologies found")
    for tech in item["technologies"]:
        print(f"  - {tech['name']} ({tech['confidence_score']}%)")

JavaScript

import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

const run = await client.actor('justa/technology-profiling-engine').call({
    urls: ['https://example.com']
});

const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach(item => {
    console.log(`${item.url}: ${item.technologies.length} technologies`);
});

cURL

curl -X POST "https://api.apify.com/v2/acts/justa~technology-profiling-engine/runs?token=YOUR_API_TOKEN" \
    -H "Content-Type: application/json" \
    -d '{"urls": ["https://example.com"]}'

Tips

For Protected Sites (Cloudflare, etc.)

Use a residential proxy and a longer timeout:

{
    "urls": ["https://protected-site.com"],
    "proxyType": "residential",
    "timeout": 60
}
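
One way to apply this tip without paying residential rates for every URL is to scan with the default datacenter proxy first and rerun only the URLs that did not succeed. The sketch below assumes that failed and timed-out items still carry a `url` field; `build_retry_input` is a hypothetical helper.

```python
def build_retry_input(items: list, timeout: int = 60) -> dict:
    """Collect URLs whose scan did not succeed and build a residential-proxy rerun input."""
    retry_urls = [
        item["url"]
        for item in items
        if item.get("status") in ("failed", "timeout")
    ]
    return {"urls": retry_urls, "proxyType": "residential", "timeout": timeout}
```

If the returned `urls` list is empty, there is nothing to retry and no second run is needed.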

For Batch Processing

Scan hundreds of URLs efficiently:

{
    "urls": ["url1", "url2", "...hundreds more..."],
    "concurrency": 10,
    "timeout": 30
}
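
For very large lists you may want to split the work into several runs, so that one problematic batch does not hold up the rest. This is a minimal sketch under that assumption; `batched_inputs` is a hypothetical helper, and the `concurrency` and `timeout` defaults mirror the example above.

```python
from typing import Iterator


def batched_inputs(urls: list, size: int, concurrency: int = 10, timeout: int = 30) -> Iterator[dict]:
    """Yield one run input per chunk of at most `size` URLs."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(urls), size):
        yield {
            "urls": urls[start:start + size],
            "concurrency": concurrency,
            "timeout": timeout,
        }
```

Each yielded dict can be passed as the input of a separate run. The last batch is simply shorter when the list does not divide evenly.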

Pricing

  • $0.005 per URL ($5 per 1,000 URLs)
  • Plus Apify platform costs (compute time)
  • No subscriptions - pay only for what you use
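
The per-URL fee is easy to estimate before a run. The sketch below uses the $0.005 rate listed above and assumes every submitted URL is billed; it deliberately leaves out Apify platform compute costs, which depend on the run. `estimate_actor_fee` is a hypothetical helper.

```python
from decimal import Decimal

PRICE_PER_URL = Decimal("0.005")  # USD, from the pricing list above


def estimate_actor_fee(url_count: int) -> Decimal:
    """Return the per-URL fee in USD, excluding Apify platform compute costs."""
    if url_count < 0:
        raise ValueError("url_count must not be negative")
    # Decimal avoids the rounding noise that binary floats add to currency math.
    return PRICE_PER_URL * url_count
```

For example, 250 URLs come to $1.25 and 1,000 URLs to $5 in Actor fees.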

Support

Questions or issues?

  • Open an issue on GitHub
  • Contact via Apify

Built with Playwright and Camoufox.