Sitemap URL Extractor - List All URLs in a Sitemap avatar

Sitemap URL Extractor - List All URLs in a Sitemap

Pricing

$0.50 / 1,000 url extracteds

Go to Apify Store
Sitemap URL Extractor - List All URLs in a Sitemap

Sitemap URL Extractor - List All URLs in a Sitemap

Extract every URL from any XML sitemap, with lastmod, changefreq and priority. Resolves sitemap indexes recursively. Pass a sitemap.xml or just a site root to auto-discover its sitemaps. Pure HTTP, no browser — fast and cheap.

Pricing

$0.50 / 1,000 url extracteds

Rating

0.0

(0)

Developer

Walid

Walid

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

2 days ago

Last modified

Share

Sitemap URL Extractor — List All URLs in a Sitemap

Bookmark this actor so pulling every URL from a sitemap is one click away on your next crawl or audit.

Extract every URL from any XML sitemap — along with its lastmod, changefreq and priority. The actor resolves sitemap indexes recursively, so one nested sitemap_index.xml expands into all its child sitemaps automatically. Pass a direct sitemap.xml URL, or just a site root and it auto-discovers sitemaps from robots.txt.

Pure HTTP + XML parsing — no browser, fast and cheap.

What it does

  • Reads a sitemap.xml, sitemap index, or site root (auto-discovery)
  • Walks nested sitemap indexes recursively (depth-limited, dedup-safe)
  • Returns one record per URL with loc, lastmod, changefreq, priority
  • Tracks which source sitemap each URL came from

Input

FieldTypeDescription
urlstringA sitemap.xml / sitemap index URL, or a site root to auto-discover sitemaps
maxUrlsintegerMax URLs to extract (1–50000, default 10000)

Example input

{ "url": "https://www.theverge.com/sitemaps", "maxUrls": 5000 }

You can also pass a site root like https://apify.com — the actor finds the sitemaps listed in robots.txt (and common locations) and extracts from them.

Output

One record per URL:

{
"loc": "https://www.example.com/blog/post-1",
"lastmod": "2026-05-20",
"changefreq": "weekly",
"priority": "0.8",
"source_sitemap": "https://www.example.com/sitemap-posts.xml"
}

Use cases

  • SEO crawls — get the canonical URL list for a site before auditing it.
  • Content inventory — export every page with its last-modified date.
  • Migration QA — diff old vs new sitemaps to catch dropped URLs.
  • Feed a scraper — pipe the URL list into another actor for full-page extraction.

Pricing

Pay-per-event: $0.0005 per URL extracted ($0.50 per 1,000 URLs). Up to 50,000 URLs per run.

Next step: make those URLs AI-discoverable

Have the full URL list? Turn it into a standards-compliant llms.txt so ChatGPT, Claude and Perplexity understand your site — the AI-era complement to your sitemap.


Built by dltik. Pure HTTP — no browser, fast and cheap.