Sitemap Analyzer & Structure Auditor
Pricing
from $4.00 / 1,000 results
Sitemap Analyzer & Structure Auditor
Parse and analyze sitemap.xml files. Discover URL structure, update frequencies, dead links, and site architecture at scale.
Pricing
from $4.00 / 1,000 results
Rating
0.0
(0)
Developer

太郎 山田
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
8 days ago
Last modified
Categories
Share
Parse, analyze, and audit sitemap.xml files at scale. Discover URL structure, content update patterns, dead links, and site architecture — essential for technical SEO audits.
What does this actor do?
Fetches and parses sitemap.xml files (both standard urlsets and sitemap indexes), then provides deep structural analysis. Auto-discovers /sitemap.xml if you provide just a domain URL.
Key Features
- 🗺️ Sitemap index support — Handles nested sitemap structures
- 📊 Structure analysis — Top directories, depth distribution, file extensions
- 📅 Update pattern detection — lastmod distribution, changefreq analysis
- 🔗 Dead link checker — Optional HEAD request sampling to find broken URLs
- 🏗️ Architecture insights — Understand site structure from sitemap alone
- 📋 Bulk processing — Analyze multiple sitemaps per run
Use Cases
| Who | Why |
|---|---|
| SEO agencies | Technical SEO audits — sitemap completeness and structure |
| Content strategists | Identify content update patterns and stale pages |
| Web developers | Verify sitemap structure before search engine submission |
| Competitive analysts | Map competitor site architecture from public sitemaps |
Input Example
{"sitemapUrls": ["https://apify.com/sitemap.xml"],"maxUrls": 5000,"checkStatus": false,"concurrency": 3}
Output Example
{"sitemapUrl": "https://apify.com/sitemap.xml","status": "ok","analysis": {"type": "urlset","totalUrls": 1247,"structure": {"topDirectories": [{ "path": "/store", "count": 890, "percentage": 71 },{ "path": "/blog", "count": 156, "percentage": 13 }],"depthDistribution": { "1": 45, "2": 890, "3": 312 }},"updateFrequency": {"lastModRange": { "oldest": "2023-01-15", "newest": "2026-02-20" },"urlsWithLastmod": 1100}}}
Cost
Zero external costs. Fetches public XML files — no API keys, no proxies. Analyzing a 5,000-URL sitemap takes ~10 seconds.
Commercial Ops
Set up .env first:
$cp -n .env.example .env
Cloud Task/Schedule setup (idempotent):
$npm run apify:cloud:setup
Daily reliability checks:
npm run canary:checknpm run contract:test:live
OpenClaw cron commands:
openclaw-cron-commands.md