Sitemap Generator

Pricing

from $0.01 / 1,000 results

Generate XML sitemaps by crawling any website. Discover all pages, images, and videos with configurable crawl depth, URL filters, and multiple output formats. Features: full site crawling, image sitemap, video sitemap, multiple output formats, URL filtering, configurable depth, last-modified detection, and webhook integration.

Developer

The Howlers

Maintained by Community

Actor stats

Bookmarked: 0
Total users: 12
Monthly active users: 1
Last modified: 7 days ago

Generate XML sitemaps by crawling any website. Discover all pages, images, and videos with configurable crawl depth, URL filters, and output formats. Built by John Rippy (https://www.linkedin.com/in/johnrippy/ | https://johnrippy.link/).


Quick Start

Test with Demo Mode (free, no API key needed)

{
  "demoMode": true,
  "startUrl": "https://example.com",
  "urlPatterns": ["https://example.com"]
}

Run with real data

{
  "demoMode": false,
  "startUrl": "https://example.com",
  "maxPages": 1000,
  "maxDepth": 10,
  "includeImages": true,
  "includeVideos": false,
  "includeLastmod": true,
  "respectRobotsTxt": true,
  "urlPatterns": [],
  "excludePatterns": [
    "\\.(jpg|jpeg|png|gif|svg|webp|pdf|zip|exe)$",
    "/cdn-cgi/",
    "\\?",
    "#"
  ],
  "outputFormat": "xml",
  "changefreq": "weekly",
  "priority": 0.5
}
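The exclude patterns in the configuration above are regular expressions, so it can help to try them against a few URLs before starting a crawl. The following is a minimal sketch, not the actor's implementation: the function name `should_include` is hypothetical, and it assumes patterns are applied with search semantics (a match anywhere in the URL) and that an exclude match takes precedence over an include match.

```python
import re


def should_include(url, url_patterns, exclude_patterns):
    """Return True if url passes the include and exclude filters.

    Assumptions (not confirmed by the actor's documentation):
    patterns match anywhere in the URL, and excludes win over includes.
    """
    if any(re.search(pattern, url) for pattern in exclude_patterns):
        return False
    # An empty include list is treated as "include all".
    if not url_patterns:
        return True
    return any(re.search(pattern, url) for pattern in url_patterns)


# The patterns as they look after JSON decoding ("\\." in JSON is "\." in the regex).
EXCLUDE = [r"\.(jpg|jpeg|png|gif|svg|webp|pdf|zip|exe)$", r"/cdn-cgi/", r"\?", r"#"]

if __name__ == "__main__":
    for candidate in [
        "https://example.com/about",
        "https://example.com/logo.png",
        "https://example.com/search?q=x",
        "https://example.com/docs#intro",
    ]:
        print(candidate, should_include(candidate, [], EXCLUDE))
```

Under these assumptions only the first URL is kept: the others are rejected by the file-extension, query-string, and fragment patterns respectively.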

Input Parameters

| Parameter | Type | Default | Required | Description |
|---|---|---|---|---|
| startUrl | string | - | No | The website URL to crawl and generate a sitemap for |
| maxPages | integer | 1000 | No | Maximum number of pages to crawl (0 = unlimited) |
| maxDepth | integer | 10 | No | Maximum link depth from start URL (0 = only start page) |
| includeImages | boolean | true | No | Include image URLs in the sitemap (creates image sitemap extension) |
| includeVideos | boolean | false | No | Include video URLs in the sitemap (creates video sitemap extension) |
| includeLastmod | boolean | true | No | Try to detect last modified dates from HTTP headers or meta tags |
| respectRobotsTxt | boolean | true | No | Follow robots.txt rules when crawling |
| urlPatterns | array | [] | No | Only include URLs matching these patterns (regex). Leave empty to include all. |
| excludePatterns | array | See the excludePatterns list under "Run with real data" | … | … |
| outputFormat | string | "xml" | No | Format of the generated sitemap |
| changefreq | string | "weekly" | No | Default change frequency for all URLs |
| priority | number | 0.5 | No | Default priority for all URLs (0.0 to 1.0) |
| webhookUrl | string | - | No | URL to receive a POST request when sitemap generation is complete |
| demoMode | boolean | true | No | Run with sample data instead of actual crawling (for testing) |
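To illustrate how `changefreq`, `priority`, and detected last-modified dates end up in an XML sitemap, here is a rough sketch that follows the public sitemaps.org 0.9 schema. It is not the actor's code: `build_sitemap` and the entry dictionaries are hypothetical, the actor's exact output may differ, and the image and video extensions are left out.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries, changefreq="weekly", priority=0.5):
    """Build a sitemap string from dicts with "loc" and optional "lastmod".

    changefreq and priority are applied to every URL, mirroring the
    "default for all URLs" wording of the input parameters.
    """
    if not 0.0 <= priority <= 1.0:
        raise ValueError("priority must be between 0.0 and 1.0")
    ET.register_namespace("", SITEMAP_NS)  # serialize without a prefix
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for entry in entries:
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = entry["loc"]
        # lastmod is optional: emit it only when a date was found.
        if entry.get("lastmod"):
            ET.SubElement(url, f"{{{SITEMAP_NS}}}lastmod").text = entry["lastmod"]
        ET.SubElement(url, f"{{{SITEMAP_NS}}}changefreq").text = changefreq
        ET.SubElement(url, f"{{{SITEMAP_NS}}}priority").text = f"{priority:.1f}"
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


if __name__ == "__main__":
    print(build_sitemap([
        {"loc": "https://example.com/", "lastmod": "2024-01-01"},
        {"loc": "https://example.com/about"},
    ]))
```

Each `<url>` element carries the same `changefreq` and `priority` because the actor exposes them as single defaults rather than per-page settings.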

Pricing

This actor uses pay-per-event billing:

| Event | Description | Price |
|---|---|---|
| URL Discovered | Each URL discovered during crawl | $0.01 |

Demo mode is free -- no charges for sample data.
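For budgeting, the per-event model can be turned into a quick estimate. This sketch assumes the charge is simply the number of discovered URLs multiplied by a unit price. The $0.01 default is taken from the event table, but the listing header quotes "from $0.01 / 1,000 results", so the unit price is left as a parameter to adjust. `estimate_cost` is a hypothetical helper, not part of the actor.

```python
from decimal import Decimal


def estimate_cost(urls_discovered, price_per_url=Decimal("0.01"), demo_mode=False):
    """Estimate the charge for a run in USD.

    Assumes cost = discovered URLs x unit price, and that demo mode
    incurs no charge, as the pricing section states.
    """
    if urls_discovered < 0:
        raise ValueError("urls_discovered must not be negative")
    if demo_mode:
        return Decimal("0")
    return urls_discovered * price_per_url


if __name__ == "__main__":
    print(estimate_cost(1000))                  # prints 10.00
    print(estimate_cost(1000, demo_mode=True))  # prints 0
```

Setting `maxPages` caps how many pages are crawled, which in turn bounds the number of URLs that can be discovered and billed.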


Troubleshooting

"API error 429" or "Rate limit"

Too many requests were sent in a short period. Wait a minute and try again, or lower maxPages to reduce the number of pages crawled per run.
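If runs are started from a script, the wait-and-retry advice can be automated. The sketch below is generic and makes no claims about the actor's or Apify's API: `RateLimitError` and `start_run` are hypothetical stand-ins for whatever error and call your own client code uses, and the one-minute starting delay simply mirrors the advice above.

```python
import time


class RateLimitError(Exception):
    """Hypothetical error standing in for a 429 / rate-limit failure."""


def run_with_retry(start_run, max_attempts=4, initial_delay=60.0, sleep=time.sleep):
    """Call start_run(), waiting and retrying when it is rate limited.

    The delay doubles after each failed attempt; the last failure is re-raised.
    """
    delay = initial_delay
    for attempt in range(1, max_attempts + 1):
        try:
            return start_run()
        except RateLimitError:
            if attempt == max_attempts:
                raise
            sleep(delay)
            delay *= 2
```

The `sleep` parameter exists so the waiting can be replaced in tests; in normal use the default `time.sleep` is sufficient.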

No results or empty dataset

Check the run log for error messages. Common causes:

  • Invalid input format (check the examples above)
  • The target data doesn't exist or is too small to track

How do I test without an API key?

Enable Demo Mode in the input. This returns realistic sample data so you can verify the output format works for your workflow.


Built by John Rippy | Actor Arsenal