Pricing

from $5.00 / 1,000 results

Sitemap Generator - Creates sitemap.xml for any domain

Generate a clean, standards-compliant sitemap.xml for a website. This actor crawls a single website, discovers all indexable pages, and produces: ✅ A ready-to-submit sitemap.xml (Google-compliant) ✅ A structured JSON dataset of discovered URLs (for auditing, reporting, and billing)

Pricing

from $5.00 / 1,000 results

Rating

0.0

(0)

Developer

Chris Xavier

Actor stats

Bookmarked

Total users

Monthly active users

3 months ago

Last modified

🗺️ Sitemap Generator (Apify Actor)

Generate a clean, standards-compliant sitemap.xml for a website — automatically, reliably, and without manual cleanup.

This actor crawls a single website, discovers all indexable pages, and produces:

✅ A ready-to-submit sitemap.xml (Google-compliant)
✅ A structured JSON dataset of discovered URLs (for auditing, reporting, and billing)

Built for SEO professionals, agencies, and site owners who want accuracy, transparency, and results they can trust.

✅ What This Actor Does

Crawls one website per run (no mixed domains, no confusion)
Discovers internal pages by following links
Excludes junk/system URLs automatically (e.g. Cloudflare, admin endpoints)
Respects robots.txt (optional)
Removes duplicate URLs and URL fragments
Optionally strips query strings to prevent sitemap bloat
Extracts real <lastmod> dates when available:
- From HTTP Last-Modified headers
- From blog/article meta tags when headers are missing
Outputs a fully valid sitemap.xml

📦 Outputs (Where to Find Your Files)

Run → Storage → Key-value store → sitemap.xml

This file is:

Ready to upload to Google Search Console
Ready to host at /sitemap.xml
Standards-compliant (no reconstruction required)

🟢 JSON Results (Dataset)

Every discovered page is also saved to the Dataset.

Each row includes:

url – discovered page URL
depth – crawl depth from the homepage
lastmod – modification date (when available)
lastmodSource – "header", "meta", or null

This dataset is useful for:

Auditing and QA
URL counts and reporting
Monetization and billing logic
Previewing results before download

🔒 Important Design Decisions (On Purpose)

One Website per Run

This actor enforces a single start URL.

Why?

A sitemap must not mix domains
One site = one sitemap = one clean result
Prevents invalid or rejected sitemaps
Enables clear pricing per site

Honest `<lastmod>` Values

The actor does not fake modification dates.

Uses real server headers when available
Falls back to article metadata for blog posts
Omits <lastmod> when no trustworthy source exists

This avoids misleading search engines and protects SEO integrity.

⚙️ Inputs

Required

Start URL
The root URL of the website (example: https://example.com)

Optional

Max crawl depth
Max number of pages
Concurrency
Headless browser (for JavaScript-heavy sites)
Strip query strings
Respect robots.txt
Advanced include/exclude URL patterns (regex)

Most users can run the actor with just a Start URL.

🧠 Who This Is For

SEO professionals
Agencies managing multiple client sites
Developers who need clean sitemaps programmatically
Site owners preparing for Google Search Console
AI-first websites optimizing crawlability

💡 Why Use This Actor Instead of Online Sitemap Tools?

No URL limits
No fake results
No mixed domains
No guessing which pages were included
Full transparency (XML + JSON)
Automation-ready and API-friendly

🔐 PPE (Paid / Private / Enterprise)

This actor is designed for PPE use:

Consistent, auditable outputs
Dataset always populated (even if XML is downloaded)
Clear value per run
Suitable for client-facing and internal workflows

Run it. Download sitemap.xml. Submit. Done.

🟢 `sitemap.xml` (Primary Output)

Your sitemap is written as a real XML file.

Location in Apify UI:

Sitemap Generator - Crawl Website & Create XML Sitemap

scrappy_garden/sitemap-generator

Generate an XML sitemap for any website. Crawls internal pages from start URLs (with depth + page limits), deduplicates URLs, and stores a ready-to-submit sitemap.xml plus a structured dataset and summary for SEO audits.

Bikram Adhikari

Sitemap URL Extractor

onescales/sitemap-url-extractor

Provide a website link to a sitemap.xml and the app will extract and list all URLs in the sitemap as well as additional data in the sitemap (i.e. https://onescales.com/sitemap.xml).

One Scales

420

5.0

Sitemap Scraper

pvillalva/sitemap-scraper

The Sitemap Scraper extracts and outputs all URLs from a given sitemap.

Percival Villalva

241

Sitemap Generator

himalyancoder/Sitemap-generator

Sameer Pun

Sitemap API

vivid_astronaut/sitemap

Fabio Suizu

Sitemap Analyzer API | sitemap.xml SEO Audit

taroyamada/sitemap-analyzer

Analyze sitemap.xml files for structure, freshness, broken URLs, and crawl-ready SEO insights at scale.

太郎山田

Find Sitemap from url

eesti/find-sitemap-from-url

A powerful [Apify Actor] that finds sitemap URLs for any website. This Actor helps you discover XML sitemaps by checking common locations, robots.txt files, and analyzing HTML content for sitemap links.

ando

206

1.0

XML Sitemap URL Extractor

andok/sitemap-extractor

Recursively crawl and extract every single URL from a website’s sitemap.xml. Automate your SEO audits and scraping queues.

Andok

Xml Sitemap Validator

zerobreak/xml-sitemap-validator

XML sitemap validator that crawls every URL in your sitemap and flags broken links, redirect chains, and structural errors — so SEO teams can audit sitemap health in seconds.

ZeroBreak

Sitemap Generator

datawinder/sitemap-generator

Automatically crawl a website and generate an SEO-ready sitemap in XML, HTML, or TXT format. Supports crawl depth limits, URL include/exclude patterns, and optional merging with an existing sitemap.xml. Ideal for SEO audits, site migrations, and automation.