Website Metadata Bulk Extractor
Pricing
$1.00 / 1,000 results
Website Metadata Bulk Extractor
Extract SEO metadata, Open Graph tags, Twitter cards, and detect 50+ technologies from any website URL in bulk. Fast concurrent processing with retry logic.
Pricing
$1.00 / 1,000 results
Rating
0.0
(0)
Developer
BotFlowTech
Actor stats
3
Bookmarked
31
Total users
7
Monthly active users
14 days ago
Last modified
Categories
Share
Extract metadata, detect technology stacks, and analyze any website at scale — CMS, frameworks, analytics, CDN, hosting, and 50+ technologies.
What does Website Metadata Bulk Extractor do?
Website Metadata Bulk Extractor is a powerful Apify actor that analyzes websites to detect their full technology stack. It identifies content management systems, JavaScript frameworks, analytics tools, CDN providers, hosting platforms, e-commerce solutions, and marketing/CRM tools. Process hundreds or thousands of URLs in a single run.
Perfect for: SEO audits, competitive research, sales prospecting, market research, lead generation, and technology landscape analysis.
Key Features
- 50+ technology detections — CMS, frameworks, analytics, CDN, hosting, e-commerce, and marketing tools
- Bulk processing — analyze thousands of websites in a single run
- CSS framework detection — Bootstrap, Tailwind CSS, Bulma, Foundation
- Server & infrastructure info — server headers, powered-by, HTTPS status
- Mobile responsiveness check — viewport meta tag detection
- Language detection — HTML lang attribute extraction
- Script analysis — counts and lists external scripts
- PPE monetization — pay-per-event pricing support for Apify marketplace
- Error resilience — automatic retries with graceful failure handling
Technologies Detected
Content Management Systems (CMS)
WordPress, Shopify, Wix, Squarespace, Webflow, Drupal, Joomla, Magento, PrestaShop, BigCommerce
JavaScript Frameworks
React, Vue.js, Angular, Next.js, Nuxt.js, Svelte, jQuery
Analytics & Tracking
Google Analytics, Google Tag Manager, Facebook Pixel, Hotjar, Mixpanel, Segment
CDN & Infrastructure
Cloudflare, Fastly, Akamai, Amazon CloudFront, jsDelivr, unpkg
Hosting Providers
Vercel, Netlify, AWS, GitHub Pages, Heroku
E-commerce
WooCommerce, Stripe, PayPal
Marketing & CRM
HubSpot, Mailchimp, Intercom, Drift, Zendesk
Additional
CSS Frameworks (Bootstrap, Tailwind, Bulma, Foundation), server software, mobile responsiveness, HTTPS status, language detection
Input
{"urls": ["https://www.shopify.com","https://www.apple.com","https://www.stripe.com"],"maxRequestsPerCrawl": 1000}
Input Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
urls | array | Yes | — | List of website URLs to analyze |
maxRequestsPerCrawl | integer | No | 1000 | Maximum number of websites to process |
Output
Each URL produces a result object in the dataset:
{"url": "https://www.shopify.com","cms": ["Shopify"],"frameworks": ["React", "Next.js"],"analytics": ["Google Analytics", "Google Tag Manager"],"cdn": ["Cloudflare", "Amazon CloudFront"],"hosting": [],"ecommerce": ["Stripe"],"marketing": ["HubSpot"],"cssFrameworks": ["Tailwind CSS"],"server": "cloudflare","poweredBy": null,"generator": null,"language": "en","isMobileResponsive": true,"isHttps": true,"scriptCount": 24,"topScripts": ["https://cdn.shopify.com/..."],"detectedCount": 7,"scrapedAt": "2026-03-16T12:00:00.000Z"}
If an error occurs for a URL:
{"url": "https://example-down.com","error": "Failed to fetch website","cms": [],"frameworks": [],"analytics": [],"detectedCount": 0,"scrapedAt": "2026-03-16T12:00:00.000Z"}
Use Cases
- SEO Audits — Analyze technology stacks across multiple client websites
- Sales Prospecting — Find companies using specific CMS or e-commerce platforms
- Competitive Research — Discover which technologies competitors are using
- Market Research — Analyze technology adoption trends across industries
- Lead Generation — Build targeted prospect lists filtered by tech stack
- Data Enrichment — Add technology metadata to existing URL or company datasets
- Migration Planning — Identify websites on legacy platforms for upgrade pitches
- Content Marketing — Research technology landscape for industry reports
Pricing
This actor supports Pay-Per-Event (PPE) pricing on the Apify marketplace. You are charged per successfully analyzed website. The actor automatically stops processing when your budget limit is reached, so you never overspend.
Free tier usage is also supported — the actor works the same way without PPE, storing all results in the dataset.
FAQ
Q: How many URLs can I process at once?
A: The default limit is 1,000 URLs per run, but you can increase maxRequestsPerCrawl for larger batches.
Q: What if a website blocks the request? A: The actor uses realistic browser headers and automatic retries. Most websites respond successfully.
Q: Does it work with Single Page Applications (SPAs)? A: Yes! The actor detects SPA frameworks (React, Vue, Angular, Next.js) through HTML markers, script references, and meta tags present in the initial response.
Q: How accurate is the detection? A: Detection uses multiple signature patterns per technology to minimize false positives. Accuracy is typically 90%+ for major platforms.
Q: How is PPE pricing calculated?
A: Each website analyzed (success or error) counts as one metadata-extracted event. You set your budget limit, and the actor stops when it's reached.
Keywords
website metadata extractor, bulk SEO metadata, og tags scraper, meta tags extractor, website title description extractor, technology stack detector, CMS detector, framework detection, analytics detection, website technology analyzer, tech stack finder, bulk website analysis, competitive intelligence tool