XML Sitemap Checker
Pricing
$3.99/month + usage
XML Sitemap Checker
Verify if your website has a properly configured XML sitemap. Checks robots.txt and common paths, validates accessibility, XML structure, content type, and URL count — ensuring search engines can easily crawl and index your site.
Pricing
$3.99/month + usage
Rating
0.0
(0)
Developer

Luffy
Actor stats
0
Bookmarked
2
Total users
1
Monthly active users
7 days ago
Last modified
Categories
Share
Verify if your website has a properly configured XML sitemap, ensuring search engines can easily crawl and index your site's pages. This actor checks both robots.txt and common sitemap paths, then validates each discovered sitemap for accessibility, XML validity, correct content type, and URL count.
Features
- Discovers sitemaps from
robots.txtand common fallback paths (/sitemap.xml,/sitemap_index.xml, etc.) - Validates each sitemap: accessibility, valid XML structure, correct content type header
- Counts the number of URLs/entries in each sitemap
- Reports whether each sitemap is listed in
robots.txt - Outputs a flat list — one row per sitemap for easy filtering and export
- Supports single or bulk URL checking
- Fast and lightweight (async with aiohttp)
- Built-in proxy support
Input
The actor accepts the following JSON input:
{"url": "example.com","timeout": 30}
| Parameter | Type | Required | Description |
|---|---|---|---|
url | string | No* | A single website URL to check |
urls | array | No* | Multiple website URLs (one per line) |
timeout | integer | No | Request timeout in seconds (default: 30) |
*At least one of url or urls must be provided.
The input will be automatically normalized, so partial domains like example.com or www.example.com are acceptable.
Output
Each discovered sitemap produces one row in the output dataset:
| Field | Type | Description |
|---|---|---|
source_website | string | The normalized website URL that was checked |
sitemap_url | string | The discovered sitemap URL |
is_accessible | boolean | Whether the sitemap returned HTTP 200 |
http_status | number | The HTTP status code returned |
is_valid_xml | boolean | Whether the content is valid XML sitemap format |
content_type | string | The Content-Type header returned by the server |
is_xml_content_type | boolean | Whether the Content-Type contains "xml" |
url_count | number | Number of <url> or <sitemap> entries found |
found_in_robots_txt | boolean | Whether this sitemap was listed in robots.txt |
Example Output
{"source_website": "https://example.com","sitemap_url": "https://example.com/sitemap.xml","is_accessible": true,"http_status": 200,"is_valid_xml": true,"content_type": "application/xml","is_xml_content_type": true,"url_count": 142,"found_in_robots_txt": true}
If no sitemaps are found for a website, a single row is returned with sitemap_url set to "None found" and all check fields set to false/0.