Website Metadata Bulk Extractor

Pricing

$1.00 / 1,000 results

Extract SEO metadata, Open Graph tags, Twitter cards, and detect 50+ technologies from any website URL in bulk. Fast concurrent processing with retry logic.

Developer

BotFlowTech

Maintained by Community

Actor stats

Bookmarked: 3
Total users: 31
Monthly active users: 7
Last modified: 14 days ago

Extract metadata, detect technology stacks, and analyze any website at scale — CMS, frameworks, analytics, CDN, hosting, and 50+ technologies.

What does Website Metadata Bulk Extractor do?

Website Metadata Bulk Extractor is a powerful Apify actor that analyzes websites to detect their full technology stack. It identifies content management systems, JavaScript frameworks, analytics tools, CDN providers, hosting platforms, e-commerce solutions, and marketing/CRM tools. Process hundreds or thousands of URLs in a single run.

Perfect for: SEO audits, competitive research, sales prospecting, market research, lead generation, and technology landscape analysis.

Key Features

  • 50+ technology detections — CMS, frameworks, analytics, CDN, hosting, e-commerce, and marketing tools
  • Bulk processing — analyze thousands of websites in a single run
  • CSS framework detection — Bootstrap, Tailwind CSS, Bulma, Foundation
  • Server & infrastructure info — server headers, powered-by, HTTPS status
  • Mobile responsiveness check — viewport meta tag detection
  • Language detection — HTML lang attribute extraction
  • Script analysis — counts and lists external scripts
  • PPE monetization — pay-per-event pricing support for Apify marketplace
  • Error resilience — automatic retries with graceful failure handling
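The page-level checks listed above (viewport meta tag, HTML lang attribute, external script counting, HTTPS status) can be illustrated with a minimal sketch using only Python's standard library. This is not the actor's actual code: the class name `PageSignals`, the function `extract_signals`, and the cut-off of five scripts for `topScripts` are assumptions made for illustration. Only the output field names are taken from the Output section.

```python
from html.parser import HTMLParser


class PageSignals(HTMLParser):
    """Collects the lang attribute, the viewport meta tag, and external script URLs."""

    def __init__(self):
        super().__init__()
        self.language = None
        self.is_mobile_responsive = False
        self.scripts = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "html" and attr.get("lang"):
            self.language = attr["lang"]
        elif tag == "meta" and (attr.get("name") or "").lower() == "viewport":
            self.is_mobile_responsive = True
        elif tag == "script" and attr.get("src"):
            # Only scripts with a src attribute count as external.
            self.scripts.append(attr["src"])


def extract_signals(html, url):
    """Hypothetical helper: derive page-level fields from raw HTML and the URL."""
    parser = PageSignals()
    parser.feed(html)
    return {
        "language": parser.language,
        "isMobileResponsive": parser.is_mobile_responsive,
        "isHttps": url.lower().startswith("https://"),
        "scriptCount": len(parser.scripts),
        "topScripts": parser.scripts[:5],  # assumption: keep the first five
    }
```

Inline scripts without a `src` attribute are ignored here, which matches the "external scripts" wording of the feature list. Whether the actor counts them the same way is not stated.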

Technologies Detected

Content Management Systems (CMS)

WordPress, Shopify, Wix, Squarespace, Webflow, Drupal, Joomla, Magento, PrestaShop, BigCommerce

JavaScript Frameworks

React, Vue.js, Angular, Next.js, Nuxt.js, Svelte, jQuery

Analytics & Tracking

Google Analytics, Google Tag Manager, Facebook Pixel, Hotjar, Mixpanel, Segment

CDN & Infrastructure

Cloudflare, Fastly, Akamai, Amazon CloudFront, jsDelivr, unpkg

Hosting Providers

Vercel, Netlify, AWS, GitHub Pages, Heroku

E-commerce

WooCommerce, Stripe, PayPal

Marketing & CRM

HubSpot, Mailchimp, Intercom, Drift, Zendesk

Additional

CSS Frameworks (Bootstrap, Tailwind, Bulma, Foundation), server software, mobile responsiveness, HTTPS status, language detection
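As a rough illustration of how category-based detection like the lists above could work, the sketch below matches a few well-known markers against the HTML and response headers. The `SIGNATURES` table and the `detect` function are hypothetical. The actor's real signature list is not published on this page, and the handful of patterns here covers only a small subset of the technologies named above.

```python
import re

# Hypothetical signatures for illustration only; not the actor's real pattern list.
SIGNATURES = {
    "cms": {
        "WordPress": [r"/wp-content/", r"/wp-includes/"],
        "Shopify": [r"cdn\.shopify\.com"],
    },
    "frameworks": {
        "Next.js": [r"/_next/static/", r'id="__NEXT_DATA__"'],
        "jQuery": [r"jquery[.\-\w]*\.js"],
    },
    "analytics": {
        "Google Tag Manager": [r"googletagmanager\.com/gtm\.js"],
    },
}


def detect(html, headers=None):
    """Return detected technologies per category plus selected header values."""
    headers = {k.lower(): v for k, v in (headers or {}).items()}
    result = {category: [] for category in SIGNATURES}
    for category, techs in SIGNATURES.items():
        for name, patterns in techs.items():
            # A technology is reported if any one of its patterns matches.
            if any(re.search(p, html, re.IGNORECASE) for p in patterns):
                result[category].append(name)
    result["server"] = headers.get("server")
    result["poweredBy"] = headers.get("x-powered-by")
    return result
```

Listing several patterns per technology means one missing marker does not hide a technology. Requiring more than one match would trade recall for fewer false positives.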

Input

{
  "urls": [
    "https://www.shopify.com",
    "https://www.apple.com",
    "https://www.stripe.com"
  ],
  "maxRequestsPerCrawl": 1000
}

Input Parameters

| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| urls | array | Yes | - | List of website URLs to analyze |
| maxRequestsPerCrawl | integer | No | 1000 | Maximum number of websites to process |
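When the URL list comes from a spreadsheet or CRM export, it can help to clean it before building the input object. The sketch below is one possible way to do that, not part of the actor. The helper name `build_input` and the choice to default scheme-less entries to HTTPS are assumptions. Only the two parameter names come from the table above.

```python
from urllib.parse import urlparse


def build_input(raw_urls, max_requests=1000):
    """Hypothetical helper: return an input dict with cleaned, de-duplicated URLs."""
    seen = set()
    urls = []
    for raw in raw_urls:
        candidate = raw.strip()
        if not candidate:
            continue
        if "://" not in candidate:
            candidate = "https://" + candidate  # assumption: default to HTTPS
        parsed = urlparse(candidate)
        # Skip anything that is not a plain http(s) URL with a host.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            continue
        if candidate not in seen:
            seen.add(candidate)
            urls.append(candidate)
    if not urls:
        raise ValueError("urls is required and must contain at least one valid URL")
    return {"urls": urls, "maxRequestsPerCrawl": max_requests}
```

The returned dictionary has the same shape as the JSON input shown above and can be serialized with `json.dumps` before it is passed to a run.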

Output

Each URL produces a result object in the dataset:

{
  "url": "https://www.shopify.com",
  "cms": ["Shopify"],
  "frameworks": ["React", "Next.js"],
  "analytics": ["Google Analytics", "Google Tag Manager"],
  "cdn": ["Cloudflare", "Amazon CloudFront"],
  "hosting": [],
  "ecommerce": ["Stripe"],
  "marketing": ["HubSpot"],
  "cssFrameworks": ["Tailwind CSS"],
  "server": "cloudflare",
  "poweredBy": null,
  "generator": null,
  "language": "en",
  "isMobileResponsive": true,
  "isHttps": true,
  "scriptCount": 24,
  "topScripts": ["https://cdn.shopify.com/..."],
  "detectedCount": 7,
  "scrapedAt": "2026-03-16T12:00:00.000Z"
}

If an error occurs for a URL:

{
  "url": "https://example-down.com",
  "error": "Failed to fetch website",
  "cms": [],
  "frameworks": [],
  "analytics": [],
  "detectedCount": 0,
  "scrapedAt": "2026-03-16T12:00:00.000Z"
}
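Because the error record shown above carries fewer fields than a successful one (for example, it has no "cdn" or "hosting" key), downstream code may want to normalize items before using them. A minimal sketch, assuming the dataset items have already been loaded as Python dictionaries. The helper names `normalize` and `split_results` are hypothetical.

```python
# Technology list fields taken from the successful output sample.
LIST_FIELDS = (
    "cms", "frameworks", "analytics", "cdn",
    "hosting", "ecommerce", "marketing", "cssFrameworks",
)


def normalize(item):
    """Return a copy of the item in which every technology list is present."""
    row = dict(item)
    for field in LIST_FIELDS:
        row.setdefault(field, [])
    return row


def split_results(items):
    """Separate successfully analyzed sites from records that carry an error."""
    ok, failed = [], []
    for item in items:
        row = normalize(item)
        (failed if row.get("error") else ok).append(row)
    return ok, failed
```

The failed list can then be fed back into a later run as a retry batch, while the successful rows all share the same set of list fields.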

Use Cases

  • SEO Audits — Analyze technology stacks across multiple client websites
  • Sales Prospecting — Find companies using specific CMS or e-commerce platforms
  • Competitive Research — Discover which technologies competitors are using
  • Market Research — Analyze technology adoption trends across industries
  • Lead Generation — Build targeted prospect lists filtered by tech stack
  • Data Enrichment — Add technology metadata to existing URL or company datasets
  • Migration Planning — Identify websites on legacy platforms for upgrade pitches
  • Content Marketing — Research technology landscape for industry reports
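Several of these use cases come down to filtering or counting result items by technology. The sketch below shows one way to do that on items shaped like the Output section. The function names `sites_using` and `adoption` are hypothetical and not part of the actor.

```python
from collections import Counter


def sites_using(items, category, technology):
    """Return URLs whose given category list contains the technology (case-insensitive)."""
    wanted = technology.lower()
    return [
        item["url"]
        for item in items
        if any(t.lower() == wanted for t in item.get(category, []))
    ]


def adoption(items, category):
    """Count how often each technology appears in one category across all items."""
    return Counter(t for item in items for t in item.get(category, []))
```

For example, `sites_using(items, "cms", "Shopify")` would yield a prospect list for sales outreach, and `adoption(items, "analytics").most_common(10)` would give a simple ranking for a market report.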

Pricing

This actor supports Pay-Per-Event (PPE) pricing on the Apify marketplace. You are charged per successfully analyzed website. The actor automatically stops processing when your budget limit is reached, so you never overspend.

Free tier usage is also supported — the actor works the same way without PPE, storing all results in the dataset.
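For budgeting, the listed rate of $1.00 per 1,000 results translates into simple arithmetic. The sketch below assumes every charged result costs the same listed rate and ignores any other platform charges. The function names are hypothetical.

```python
from decimal import Decimal

# Listed rate from the Pricing section: $1.00 per 1,000 results.
PRICE_PER_1000 = Decimal("1.00")


def estimated_cost(result_count):
    """Estimated charge in USD for a given number of charged results."""
    return PRICE_PER_1000 * result_count / 1000


def max_results_for_budget(budget_usd):
    """Largest number of charged results that fits within the budget."""
    return int(Decimal(str(budget_usd)) * 1000 / PRICE_PER_1000)
```

Under these assumptions, 2,500 charged results would come to about $2.50, and a $5 budget would cover roughly 5,000 results.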

FAQ

Q: How many URLs can I process at once?
A: The default limit is 1,000 URLs per run, but you can increase maxRequestsPerCrawl for larger batches.

Q: What if a website blocks the request?
A: The actor uses realistic browser headers and automatic retries. Most websites respond successfully.

Q: Does it work with Single Page Applications (SPAs)?
A: Yes! The actor detects SPA frameworks (React, Vue, Angular, Next.js) through HTML markers, script references, and meta tags present in the initial response.

Q: How accurate is the detection?
A: Detection uses multiple signature patterns per technology to minimize false positives. Accuracy is typically 90%+ for major platforms.

Q: How is PPE pricing calculated?
A: Each website analyzed (success or error) counts as one metadata-extracted event. You set your budget limit, and the actor stops when it's reached.

Keywords

website metadata extractor, bulk SEO metadata, og tags scraper, meta tags extractor, website title description extractor, technology stack detector, CMS detector, framework detection, analytics detection, website technology analyzer, tech stack finder, bulk website analysis, competitive intelligence tool