Sitemap Generator
Maintained by Community · Developer: Salman Bareesh · Pricing: $2.00 / 1,000 results

Sitemap Generator Actor

A powerful Apify actor that generates XML sitemaps for websites. Perfect for SEO optimization and website indexing.

Features

  • Automatic URL Discovery: Use the built-in web crawler to automatically discover all URLs on your website
  • Manual URL Input: Provide a list of URLs directly for sitemap generation
  • Standard Compliance: Generates XML sitemaps compliant with the sitemaps.org protocol
  • Configurable Metadata: Set change frequency and priority for each URL
  • Flexible Output: Output as JSON or save as sitemap.xml file
  • Domain Validation: Automatically filters URLs to stay within the specified domain

How It Works

The actor can operate in two modes:

Mode 1: Web Crawler (Automatic URL Discovery)

When useWebCrawler is enabled, the actor uses Apify's Web Crawler to follow links on your website up to the configured maxCrawlDepth. This is ideal when you want every reachable page included without listing URLs by hand.

Mode 2: Direct URL Input

Provide a list of URLs directly, and the actor will include them in the sitemap. This is useful when you already have a known list of URLs.
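
The crawling mode above amounts to a breadth-first traversal that stays on the base domain. Here is a stdlib-only sketch of that idea (the real actor uses Apify's crawler with beautifulsoup4 and requests; the `fetch` callback and function names here are illustrative):

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collect absolute hrefs from <a> tags, resolved against the page URL."""

    def __init__(self, page_url):
        super().__init__()
        self.page_url = page_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(urljoin(self.page_url, value))


def same_domain(url, base_url):
    return urlparse(url).netloc == urlparse(base_url).netloc


def crawl(base_url, fetch, max_depth=2, max_pages=1000):
    """BFS up to max_depth levels; fetch(url) -> HTML string is injected
    so the traversal logic stays independent of the HTTP client."""
    seen = {base_url}
    frontier = [base_url]
    for _ in range(max_depth):
        next_frontier = []
        for url in frontier:
            parser = LinkExtractor(url)
            parser.feed(fetch(url))
            for link in parser.links:
                if link in seen or not same_domain(link, base_url):
                    continue  # duplicates and cross-domain links are dropped
                if len(seen) >= max_pages:
                    return sorted(seen)
                seen.add(link)
                next_frontier.append(link)
        frontier = next_frontier
    return sorted(seen)
```

Injecting `fetch` also makes the traversal easy to test against an in-memory map of pages instead of live HTTP.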

Input Configuration

Required Parameters

  • base_url (string): The root URL of your website (e.g., https://example.com)

Optional Parameters

  • urls (array): Array of specific URLs to include in the sitemap

    • Default: []
    • Example: ["https://example.com/page1", "https://example.com/page2"]
  • useWebCrawler (boolean): Enable automatic URL discovery

    • Default: false
    • When true, the actor crawls your website to find all links
  • maxCrawlDepth (integer): Maximum depth for the web crawler

    • Default: 2
    • Valid range: 0-10
    • Only applies when useWebCrawler is true
  • maxPages (integer): Maximum number of pages to include in the sitemap

    • Default: 1000
    • Valid range: 1-50000
  • changeFrequency (string): Default change frequency for all URLs

    • Default: weekly
    • Valid options: always, hourly, daily, weekly, monthly, yearly, never
  • priority (number): Default priority for all URLs

    • Default: 0.8
    • Valid range: 0.0-1.0
    • Indicates importance relative to other URLs on your site
  • saveToStorage (boolean): Save the sitemap as a file to Apify storage

    • Default: false
    • When true, output includes /tmp/sitemap.xml
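
Taken together, the defaults and valid ranges above imply input validation along these lines (a sketch built only from the documented values; the actor's actual validation code may differ):

```python
# Documented defaults for the optional parameters.
DEFAULTS = {
    "urls": [],
    "useWebCrawler": False,
    "maxCrawlDepth": 2,
    "maxPages": 1000,
    "changeFrequency": "weekly",
    "priority": 0.8,
    "saveToStorage": False,
}
FREQUENCIES = {"always", "hourly", "daily", "weekly", "monthly", "yearly", "never"}


def normalize_input(actor_input):
    """Merge documented defaults into the input and enforce documented ranges."""
    if not actor_input.get("base_url"):
        raise ValueError("base_url is required")
    merged = {**DEFAULTS, **actor_input}
    if not 0 <= merged["maxCrawlDepth"] <= 10:
        raise ValueError("maxCrawlDepth must be in the range 0-10")
    if not 1 <= merged["maxPages"] <= 50000:
        raise ValueError("maxPages must be in the range 1-50000")
    if merged["changeFrequency"] not in FREQUENCIES:
        raise ValueError("changeFrequency must be one of: " + ", ".join(sorted(FREQUENCIES)))
    if not 0.0 <= merged["priority"] <= 1.0:
        raise ValueError("priority must be in the range 0.0-1.0")
    return merged
```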

Example Inputs

Example 1: Simple Web Crawl

{
  "base_url": "https://example.com",
  "useWebCrawler": true,
  "maxCrawlDepth": 2,
  "maxPages": 500
}

Example 2: Direct URL List

{
  "base_url": "https://example.com",
  "urls": [
    "https://example.com/",
    "https://example.com/about",
    "https://example.com/services",
    "https://example.com/contact"
  ],
  "changeFrequency": "monthly",
  "priority": 0.9,
  "saveToStorage": true
}

Example 3: Custom Configuration

{
  "base_url": "https://blog.example.com",
  "useWebCrawler": true,
  "maxCrawlDepth": 3,
  "maxPages": 2000,
  "changeFrequency": "daily",
  "priority": 0.7,
  "saveToStorage": true
}

Output

The actor outputs a dataset item with the following structure:

{
  "sitemap": "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n<urlset xmlns=\"http://www.sitemaps.org/schemas/sitemap/0.9\">...</urlset>",
  "total_urls": 150,
  "base_url": "https://example.com",
  "generated_at": "2024-01-15T10:30:45.123456"
}
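
Since the sitemap field is an XML string, downstream code usually parses it back into individual URLs. A sketch of that, assuming only the fields shown above:

```python
import xml.etree.ElementTree as ET

# The sitemaps.org namespace used by the generated <urlset>.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_locs(item):
    """Return the <loc> values from a dataset item's 'sitemap' string."""
    # Encode to bytes first: ET.fromstring rejects str input that carries
    # an encoding declaration like <?xml ... encoding="UTF-8"?>.
    root = ET.fromstring(item["sitemap"].encode("utf-8"))
    return [el.text for el in root.findall("sm:url/sm:loc", NS)]
```

A quick consistency check is that `len(extract_locs(item))` should equal the item's `total_urls`.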

Sample Sitemap XML

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/</loc>
    <lastmod>2024-01-15</lastmod>
    <changefreq>weekly</changefreq>
    <priority>0.8</priority>
  </url>
  <url>
    <loc>https://example.com/about</loc>
    <lastmod>2024-01-15</lastmod>
    <changefreq>weekly</changefreq>
    <priority>0.8</priority>
  </url>
</urlset>
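
XML of this shape can be produced with the standard library alone. A minimal sketch (the actor lists lxml as a dependency, but xml.etree.ElementTree suffices for this structure):

```python
import xml.etree.ElementTree as ET
from datetime import date


def build_sitemap(urls, changefreq="weekly", priority=0.8):
    """Build a sitemaps.org-compliant <urlset> document as a string."""
    urlset = ET.Element("urlset", xmlns="http://www.sitemaps.org/schemas/sitemap/0.9")
    today = date.today().isoformat()
    for loc in urls:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        ET.SubElement(url, "lastmod").text = today
        ET.SubElement(url, "changefreq").text = changefreq
        ET.SubElement(url, "priority").text = str(priority)
    # Prepend the declaration explicitly to match the sample above.
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + ET.tostring(urlset, encoding="unicode")
```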

Use Cases

  • SEO Optimization: Submit sitemaps to Google Search Console and Bing Webmaster Tools
  • Website Indexing: Help search engines discover all pages on your website
  • Site Structure Analysis: Understand your website's URL structure
  • Link Validation: Identify all crawlable pages before migration
  • Multi-language Sites: Generate sitemaps for international websites

Technical Details

  • Language: Python 3.11+
  • Runtime: Apify Actor
  • Key Dependencies:
    • apify: Apify SDK for actor development
    • apify-client: Client for Apify API
    • beautifulsoup4: HTML parsing for link extraction
    • requests: HTTP requests
    • lxml: XML processing

Error Handling

The actor handles various error scenarios:

  • Invalid URLs are automatically filtered
  • Cross-domain URLs are excluded
  • Malformed URLs are skipped with logging
  • Missing base_url parameter raises a clear error message
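
The first three rules amount to a URL filter. A sketch of that filtering logic, assuming the actor compares hostnames (the function name is illustrative):

```python
from urllib.parse import urlparse


def filter_urls(urls, base_url):
    """Keep only well-formed http(s) URLs on the same domain as base_url."""
    base_netloc = urlparse(base_url).netloc
    kept = []
    for url in urls:
        parsed = urlparse(url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            continue  # malformed or non-web URL: skip
        if parsed.netloc != base_netloc:
            continue  # cross-domain URL: exclude
        kept.append(url)
    return kept
```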

Performance Considerations

  • Large Sites: For sites with 10,000+ pages, increase maxPages (up to the 50,000 limit) and consider splitting very large sites across multiple runs
  • Crawl Depth: Each depth level increases crawl time exponentially (use 2-3 for most sites)
  • API Limits: Apify actor runs are subject to platform resource limits

Troubleshooting

No URLs Found

  • Verify the base_url is correct and accessible
  • Check that useWebCrawler is enabled if expecting automatic discovery
  • Ensure the website doesn't block crawlers with robots.txt
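
The robots.txt check can be done locally with the standard library before running the actor. A sketch using urllib.robotparser (here the rules are passed in as text so the check is network-free):

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if the given robots.txt rules permit crawling url."""
    rp = RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return rp.can_fetch(user_agent, url)
```

In practice you would download `<base_url>/robots.txt` first (for example with requests) and feed its body to this check.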

Too Few URLs

  • Increase maxCrawlDepth to discover deeper pages
  • Verify pages are linked and not isolated
  • Check for JavaScript-rendered content: links inserted by client-side JavaScript are not followed and may require a browser-based crawler

Sitemap File Not Created

  • Ensure saveToStorage is set to true
  • Check actor logs for file write errors
  • Verify sufficient storage quota available

License

This actor is provided under the MIT License. Feel free to modify and distribute as needed.

Support

For issues, questions, or feature requests, please contact the development team or open an issue on the repository.