Technology Profiling Engine
Pricing: from $5.00 / 1,000 results
Developer: Benoit Eveillard
Last modified: 3 days ago
Identify technologies used by any website - React, WordPress, Shopify, Google Analytics & 7,000+ more. Deep 8-tier inspection with confidence scores. Like Wappalyzer but 10x cheaper: $5/1,000 URLs.
Why Use This Actor?
- 7,000+ Technology Signatures - The most comprehensive detection database
- 8-Tier Deep Inspection - Goes beyond HTML: headers, DOM, JavaScript, network, storage, PWA, WebSocket, cookies
- Confidence Scoring - Know how certain each detection is
- 10x Cheaper - Just $5 per 1,000 URLs (vs $295+/month for BuiltWith)
- Batch Processing - Scan hundreds of URLs in one run
- Bot Protection Bypass - Stealth browser handles Cloudflare and other protections
What It Detects
| Category | Examples |
|---|---|
| Frameworks | React, Vue, Angular, Next.js, Nuxt.js, Svelte, Astro |
| CMS & Builders | WordPress, Webflow, Shopify, Wix, Squarespace, Drupal |
| Analytics | Google Analytics, Plausible, PostHog, Mixpanel, Amplitude |
| Marketing | HubSpot, Intercom, Google Tag Manager, Mailchimp, Segment |
| UI Frameworks | Tailwind CSS, Bootstrap, Material-UI, Chakra UI |
| CDN | Cloudflare, Fastly, Akamai, AWS CloudFront, Vercel |
| Payment | Stripe, PayPal, Square, Braintree |
| And more... | 100+ categories, 7,000+ technologies |
Use Cases
- Competitive Analysis - See what tech your competitors use
- Lead Generation - Find companies using specific technologies
- Sales Intelligence - Qualify leads by their tech sophistication
- Market Research - Analyze technology trends across industries
- Security Audits - Identify outdated or vulnerable technologies
Input
```json
{
  "urls": ["https://example.com", "https://another-site.com"],
  "timeout": 60,
  "proxyType": "datacenter"
}
```
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| urls (required) | array/string | - | URLs to analyze |
| timeout | integer | 30 | Page timeout in seconds (10-120) |
| proxyType | string | datacenter | datacenter or residential (for protected sites) |
| concurrency | integer | 5 | Concurrent pages (1-50) |
| minConfidence | integer | 50 | Minimum confidence score (0-100) |
| includeEvidence | boolean | false | Include detection evidence in output |
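As a rough sketch, the defaults and ranges in the table above can be checked on the client side before starting a run. `build_input` is a hypothetical helper, not part of the Actor or of the Apify client, and the Actor may validate its input differently.

```python
def build_input(urls, timeout=30, proxy_type="datacenter",
                concurrency=5, min_confidence=50, include_evidence=False):
    """Return an Actor input dict, rejecting values outside the documented ranges."""
    if not urls:
        raise ValueError("urls is required")
    if not 10 <= timeout <= 120:
        raise ValueError("timeout must be between 10 and 120 seconds")
    if proxy_type not in ("datacenter", "residential"):
        raise ValueError("proxyType must be 'datacenter' or 'residential'")
    if not 1 <= concurrency <= 50:
        raise ValueError("concurrency must be between 1 and 50")
    if not 0 <= min_confidence <= 100:
        raise ValueError("minConfidence must be between 0 and 100")
    # Keys use the parameter names from the table above
    return {
        "urls": urls,
        "timeout": timeout,
        "proxyType": proxy_type,
        "concurrency": concurrency,
        "minConfidence": min_confidence,
        "includeEvidence": include_evidence,
    }


run_input = build_input(["https://example.com"], timeout=60)
print(run_input)
```

Failing early on a bad value avoids paying for a run that would be rejected or misconfigured.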
URL Formats
All these formats work:
```
// JSON array
["https://example.com", "https://example.org"]

// Line-separated
"https://example.com\nhttps://example.org"

// Comma-separated
"https://example.com, https://example.org"

// Single URL
"https://example.com"
```
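If you prefer to send one consistent shape, the formats above can be collapsed into a plain list before calling the Actor. This is a minimal sketch; `normalize_urls` is a hypothetical helper and not the Actor's own parser.

```python
import re


def normalize_urls(value):
    """Turn a list, or a newline/comma-separated string, into a list of URLs."""
    if isinstance(value, str):
        # Split on newlines and commas; a single URL yields one part
        parts = re.split(r"[\n,]", value)
    else:
        parts = list(value)
    # Drop surrounding whitespace and empty entries
    return [part.strip() for part in parts if part.strip()]


print(normalize_urls("https://example.com, https://example.org"))
```

Note that this sketch also splits URLs that contain a literal comma, so pass such URLs as a list instead.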
Output
```json
{
  "url": "https://www.example.com",
  "scan_timestamp": 1704067200,
  "status": "success",
  "technologies": [
    {
      "name": "React",
      "category": "JavaScript frameworks",
      "categories": ["JavaScript frameworks"],
      "groups": [{"id": 9, "name": "Web development"}],
      "confidence_score": 85,
      "confidence_tier": "Medium",
      "version": "18.2.0"
    },
    {
      "name": "Cloudflare",
      "category": "CDN",
      "categories": ["CDN"],
      "groups": [{"id": 7, "name": "Servers"}],
      "confidence_score": 60,
      "confidence_tier": "Medium"
    }
  ],
  "metadata": {"duration_ms": 8500}
}
```
Output Fields
| Field | Description |
|---|---|
| url | Scanned URL |
| status | success, failed, or timeout |
| technologies[].name | Technology name |
| technologies[].category | Primary category |
| technologies[].confidence_score | Detection confidence (0-100) |
| technologies[].confidence_tier | High (90+), Medium (50-89), Low (<50) |
| technologies[].version | Version if detected |
| metadata.duration_ms | Scan duration in milliseconds |
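The tier thresholds above map directly onto the score, which is handy when post-processing results. The sketch below reproduces that mapping and filters one result item by score. Both function names are hypothetical, and the sample item is trimmed from the output example in this README.

```python
def confidence_tier(score):
    """Map a 0-100 score to the tier names listed in the Output Fields table."""
    if score >= 90:
        return "High"
    if score >= 50:
        return "Medium"
    return "Low"


def filter_technologies(item, min_score=50):
    """Keep only detections at or above min_score; 'version' may be absent."""
    return [
        tech for tech in item.get("technologies", [])
        if tech["confidence_score"] >= min_score
    ]


item = {
    "url": "https://www.example.com",
    "status": "success",
    "technologies": [
        {"name": "React", "confidence_score": 85, "version": "18.2.0"},
        {"name": "Cloudflare", "confidence_score": 60},
    ],
}

for tech in filter_technologies(item, min_score=80):
    version = tech.get("version", "unknown version")
    print(tech["name"], confidence_tier(tech["confidence_score"]), version)
```

Reading `version` with `.get` matters because the field only appears when a version was detected.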
How It Works
8-Tier Deep Inspection
Unlike simple HTML scrapers, this Actor inspects 8 different layers:
| Tier | What It Checks | Example Detections |
|---|---|---|
| 1 | HTTP Headers | Cloudflare, nginx, security headers |
| 2 | DOM/HTML | Meta tags, script sources, CSS classes |
| 3 | JavaScript | Global variables (React, Vue, jQuery) |
| 4 | Network | XHR/Fetch to analytics, CDNs |
| 5 | Storage | localStorage keys (Redux, auth) |
| 6 | PWA | Service workers, manifest.json |
| 7 | WebSocket | Real-time connections (Socket.io) |
| 8 | Cookies | Cookie names (tracking, session) |
This catches technologies that surface-level scrapers miss.
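To make the idea concrete, here is a toy sketch of signature matching for two of the tiers (HTTP headers and cookie names). The signature tables below are hypothetical and hold only a few well-known markers; they are not the Actor's database, and this page does not describe how the Actor combines evidence into a confidence score.

```python
# Hypothetical signatures for illustration only
HEADER_SIGNATURES = {
    "Cloudflare": ("server", "cloudflare"),
    "nginx": ("server", "nginx"),
}
COOKIE_SIGNATURES = {
    "Google Analytics": "_ga",
    "Cloudflare": "__cf_bm",
}


def detect(headers, cookie_names):
    """Return {technology: [evidence, ...]} from headers (tier 1) and cookies (tier 8)."""
    # Header names are case-insensitive, so compare in lowercase
    lowered = {name.lower(): value.lower() for name, value in headers.items()}
    found = {}
    for tech, (header, needle) in HEADER_SIGNATURES.items():
        if needle in lowered.get(header, ""):
            found.setdefault(tech, []).append(f"header {header}")
    for tech, cookie in COOKIE_SIGNATURES.items():
        if cookie in cookie_names:
            found.setdefault(tech, []).append(f"cookie {cookie}")
    return found


print(detect({"Server": "cloudflare"}, {"__cf_bm", "_ga"}))
```

A technology matched in more than one tier collects several pieces of evidence, which is the intuition behind inspecting multiple layers rather than HTML alone.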
API Usage
Python
```python
from apify_client import ApifyClient

client = ApifyClient("YOUR_API_TOKEN")

# Start the Actor and wait for the run to finish
run = client.actor("justa/technology-profiling-engine").call(
    run_input={"urls": ["https://example.com"]}
)

# Read the results from the run's default dataset
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(f"{item['url']}: {len(item['technologies'])} technologies found")
    for tech in item['technologies']:
        print(f" - {tech['name']} ({tech['confidence_score']}%)")
```
JavaScript
```javascript
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_API_TOKEN' });

// Start the Actor and wait for the run to finish
const run = await client.actor('justa/technology-profiling-engine').call({
    urls: ['https://example.com'],
});

// Fetch the results from the run's default dataset
const { items } = await client.dataset(run.defaultDatasetId).listItems();
items.forEach((item) => {
    console.log(`${item.url}: ${item.technologies.length} technologies`);
});
```
cURL
```bash
curl -X POST "https://api.apify.com/v2/acts/justa~technology-profiling-engine/runs?token=YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://example.com"]}'
```
Tips
For Protected Sites (Cloudflare, etc.)
Use a residential proxy and a longer timeout:
```json
{
  "urls": ["https://protected-site.com"],
  "proxyType": "residential",
  "timeout": 60
}
```
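One way to apply this tip selectively is to run everything on the datacenter proxy first and retry only the URLs that did not succeed. The sketch below assumes that failed and timed-out scans still appear in the dataset with their `url` and `status` fields, which the Output Fields table suggests but this page does not state outright. `build_retry_input` is a hypothetical helper.

```python
def build_retry_input(items):
    """Build a residential-proxy input for every URL whose status is not 'success'."""
    failed = [item["url"] for item in items if item.get("status") != "success"]
    if not failed:
        return None  # nothing to retry
    return {"urls": failed, "proxyType": "residential", "timeout": 60}


first_pass = [
    {"url": "https://example.com", "status": "success"},
    {"url": "https://protected-site.com", "status": "failed"},
    {"url": "https://slow-site.com", "status": "timeout"},
]
print(build_retry_input(first_pass))
```

The returned dict can be passed as `run_input` to a second run, so the more expensive proxy is used only where it is needed.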
For Batch Processing
Raise concurrency to scan hundreds of URLs in a single run:
```json
{
  "urls": ["url1", "url2", "...hundreds more..."],
  "concurrency": 10,
  "timeout": 30
}
```
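For large lists it may be worth removing duplicates before submitting, on the assumption that every entry in `urls` is scanned and billed separately (this page does not confirm how duplicates are handled). `build_batch_input` is a hypothetical helper that produces an input like the one above.

```python
def build_batch_input(urls, concurrency=10, timeout=30):
    """Strip whitespace, drop blanks and duplicates (keeping order), and build the input."""
    cleaned = (url.strip() for url in urls)
    # dict.fromkeys keeps the first occurrence of each URL in its original position
    unique = list(dict.fromkeys(url for url in cleaned if url))
    return {"urls": unique, "concurrency": concurrency, "timeout": timeout}


batch = build_batch_input([
    "https://example.com",
    "https://example.org ",
    "https://example.com",
    "",
])
print(batch)
```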
Pricing
- $0.005 per URL ($5 per 1,000 URLs)
- Plus Apify platform costs (compute time)
- No subscriptions - pay only for what you use
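The Actor fee is simple to estimate ahead of a run. This sketch covers only the per-URL price listed above; Apify platform compute costs vary and are not included. `estimate_cost` is a hypothetical helper, and `Decimal` is used to avoid floating-point rounding in currency amounts.

```python
from decimal import Decimal

PRICE_PER_URL = Decimal("0.005")  # USD, from the pricing list above


def estimate_cost(url_count):
    """Return the Actor fee in USD for url_count URLs, excluding platform costs."""
    if url_count < 0:
        raise ValueError("url_count must not be negative")
    return PRICE_PER_URL * url_count


print(f"${estimate_cost(1000):.2f} for 1,000 URLs")
print(f"${estimate_cost(250):.2f} for 250 URLs")
```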
Support
Questions or issues?
- Open an issue on GitHub
- Contact via Apify
Built with Playwright and Camoufox.