Sitemap URL Extractor - List All URLs in a Sitemap
Pricing
$0.50 / 1,000 url extracteds
Sitemap URL Extractor - List All URLs in a Sitemap
Extract every URL from any XML sitemap, with lastmod, changefreq and priority. Resolves sitemap indexes recursively. Pass a sitemap.xml or just a site root to auto-discover its sitemaps. Pure HTTP, no browser — fast and cheap.
Pricing
$0.50 / 1,000 url extracteds
Rating
0.0
(0)
Developer
Walid
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 days ago
Last modified
Categories
Share
Sitemap URL Extractor — List All URLs in a Sitemap
⭐ Bookmark this actor so pulling every URL from a sitemap is one click away on your next crawl or audit.
Extract every URL from any XML sitemap — along with its lastmod, changefreq and priority. The actor resolves sitemap indexes recursively, so one nested sitemap_index.xml expands into all its child sitemaps automatically. Pass a direct sitemap.xml URL, or just a site root and it auto-discovers sitemaps from robots.txt.
Pure HTTP + XML parsing — no browser, fast and cheap.
What it does
- Reads a sitemap.xml, sitemap index, or site root (auto-discovery)
- Walks nested sitemap indexes recursively (depth-limited, dedup-safe)
- Returns one record per URL with loc, lastmod, changefreq, priority
- Tracks which source sitemap each URL came from
Input
| Field | Type | Description |
|---|---|---|
url | string | A sitemap.xml / sitemap index URL, or a site root to auto-discover sitemaps |
maxUrls | integer | Max URLs to extract (1–50000, default 10000) |
Example input
{ "url": "https://www.theverge.com/sitemaps", "maxUrls": 5000 }
You can also pass a site root like https://apify.com — the actor finds the sitemaps listed in robots.txt (and common locations) and extracts from them.
Output
One record per URL:
{"loc": "https://www.example.com/blog/post-1","lastmod": "2026-05-20","changefreq": "weekly","priority": "0.8","source_sitemap": "https://www.example.com/sitemap-posts.xml"}
Use cases
- SEO crawls — get the canonical URL list for a site before auditing it.
- Content inventory — export every page with its last-modified date.
- Migration QA — diff old vs new sitemaps to catch dropped URLs.
- Feed a scraper — pipe the URL list into another actor for full-page extraction.
Pricing
Pay-per-event: $0.0005 per URL extracted ($0.50 per 1,000 URLs). Up to 50,000 URLs per run.
Next step: make those URLs AI-discoverable
Have the full URL list? Turn it into a standards-compliant llms.txt so ChatGPT, Claude and Perplexity understand your site — the AI-era complement to your sitemap.
Built by dltik. Pure HTTP — no browser, fast and cheap.