Sitemap Analyzer & Structure Auditor avatar

Sitemap Analyzer & Structure Auditor

Pricing

from $4.00 / 1,000 results

Go to Apify Store
Sitemap Analyzer & Structure Auditor

Sitemap Analyzer & Structure Auditor

Parse and analyze sitemap.xml files. Discover URL structure, update frequencies, dead links, and site architecture at scale.

Pricing

from $4.00 / 1,000 results

Rating

0.0

(0)

Developer

太郎 山田

太郎 山田

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

8 days ago

Last modified

Share

Parse, analyze, and audit sitemap.xml files at scale. Discover URL structure, content update patterns, dead links, and site architecture — essential for technical SEO audits.

What does this actor do?

Fetches and parses sitemap.xml files (both standard urlsets and sitemap indexes), then provides deep structural analysis. Auto-discovers /sitemap.xml if you provide just a domain URL.

Key Features

  • 🗺️ Sitemap index support — Handles nested sitemap structures
  • 📊 Structure analysis — Top directories, depth distribution, file extensions
  • 📅 Update pattern detection — lastmod distribution, changefreq analysis
  • 🔗 Dead link checker — Optional HEAD request sampling to find broken URLs
  • 🏗️ Architecture insights — Understand site structure from sitemap alone
  • 📋 Bulk processing — Analyze multiple sitemaps per run

Use Cases

WhoWhy
SEO agenciesTechnical SEO audits — sitemap completeness and structure
Content strategistsIdentify content update patterns and stale pages
Web developersVerify sitemap structure before search engine submission
Competitive analystsMap competitor site architecture from public sitemaps

Input Example

{
"sitemapUrls": ["https://apify.com/sitemap.xml"],
"maxUrls": 5000,
"checkStatus": false,
"concurrency": 3
}

Output Example

{
"sitemapUrl": "https://apify.com/sitemap.xml",
"status": "ok",
"analysis": {
"type": "urlset",
"totalUrls": 1247,
"structure": {
"topDirectories": [
{ "path": "/store", "count": 890, "percentage": 71 },
{ "path": "/blog", "count": 156, "percentage": 13 }
],
"depthDistribution": { "1": 45, "2": 890, "3": 312 }
},
"updateFrequency": {
"lastModRange": { "oldest": "2023-01-15", "newest": "2026-02-20" },
"urlsWithLastmod": 1100
}
}
}

Cost

Zero external costs. Fetches public XML files — no API keys, no proxies. Analyzing a 5,000-URL sitemap takes ~10 seconds.

Commercial Ops

Set up .env first:

$cp -n .env.example .env

Cloud Task/Schedule setup (idempotent):

$npm run apify:cloud:setup

Daily reliability checks:

npm run canary:check
npm run contract:test:live

OpenClaw cron commands:

  • openclaw-cron-commands.md