Pricing

Pay per usage

Sitemap & URL Extractor — Get Every URL of a Website

Gets every URL of a website by parsing sitemap.xml and sitemap indexes (discovered via robots.txt or the default location), with a same-site crawl fallback when there is no sitemap. Returns each URL with its lastmod. No API key required.

Developer

Daniel Brenner

Actor stats

Last modified

18 days ago

What you get (per URL)

url: the page URL (absolute, deduplicated)
lastmod: last-modified date from the sitemap when present, null otherwise
source: "sitemap" or "crawl" (how the URL was found)
discoveredAt: when the URL was discovered
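Because lastmod can be null, downstream code should treat it as optional. The sketch below shows one way to filter records by date. It is not part of the actor: `UrlRecord` and `modified_since` are hypothetical names, and it assumes lastmod begins with a YYYY-MM-DD date and that discoveredAt is a string timestamp.

```python
from datetime import datetime
from typing import Optional, TypedDict


class UrlRecord(TypedDict):
    url: str
    lastmod: Optional[str]  # None when the sitemap gave no lastmod
    source: str             # "sitemap" or "crawl"
    discoveredAt: str       # assumed to be a string timestamp


def modified_since(records: list[UrlRecord], cutoff: str) -> list[str]:
    """Return URLs whose lastmod is on or after cutoff (YYYY-MM-DD)."""
    cutoff_date = datetime.strptime(cutoff, "%Y-%m-%d").date()
    fresh = []
    for rec in records:
        if rec["lastmod"] is None:
            continue  # unknown date: skip it rather than guess
        # Assumption: lastmod starts with YYYY-MM-DD; any time part is ignored.
        rec_date = datetime.strptime(rec["lastmod"][:10], "%Y-%m-%d").date()
        if rec_date >= cutoff_date:
            fresh.append(rec["url"])
    return fresh
```

Skipping null dates keeps the filter consistent with the "never guessed" rule: a page with no lastmod is neither fresh nor stale, just unknown.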

How to use it

{ "startUrls": ["https://example.com"], "maxResults": 5000 }

Pass a site URL (the sitemap is found automatically) or a direct sitemap URL. It handles sitemap-indexes (sites that split their sitemap into many files) by following each child sitemap, and if there's no sitemap at all it falls back to a polite, same-site crawl. It respects robots.txt, identifies itself, and fetches one request at a time.
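To make the sitemap-index handling concrete, here is a minimal sketch of how following child sitemaps could work. It is an illustration under stated assumptions, not the actor's code: `collect_urls` and the injected `fetch` callable are hypothetical, the caps only mirror the maxSitemaps and maxResults names used on this page, and the default of 50 sitemaps is arbitrary.

```python
import xml.etree.ElementTree as ET
from typing import Callable

# Namespace defined by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def collect_urls(
    sitemap_url: str,
    fetch: Callable[[str], str],  # hypothetical: returns the XML text for a URL
    max_sitemaps: int = 50,
    max_results: int = 5000,
) -> list[dict]:
    """Walk a sitemap or sitemap index and return url/lastmod records."""
    queue = [sitemap_url]
    seen_sitemaps: set[str] = set()
    seen_urls: set[str] = set()
    records: list[dict] = []

    while queue and len(seen_sitemaps) < max_sitemaps:
        current = queue.pop(0)
        if current in seen_sitemaps:
            continue
        seen_sitemaps.add(current)
        root = ET.fromstring(fetch(current))

        # A <sitemapindex> lists child sitemaps: queue each one.
        for child in root.findall("sm:sitemap/sm:loc", NS):
            if child.text:
                queue.append(child.text.strip())

        # A <urlset> lists pages: record each one with its optional lastmod.
        for entry in root.findall("sm:url", NS):
            loc = entry.findtext("sm:loc", default="", namespaces=NS).strip()
            if not loc or loc in seen_urls:
                continue
            seen_urls.add(loc)
            lastmod = entry.findtext("sm:lastmod", namespaces=NS)
            records.append({
                "url": loc,
                "lastmod": lastmod.strip() if lastmod else None,
                "source": "sitemap",
            })
            if len(records) >= max_results:
                return records
    return records
```

Passing `fetch` in as a parameter keeps the traversal logic separate from networking, so the same function can be exercised against an in-memory dictionary of XML strings.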

Pair it: discover → extract → audit

This is the discover step of a clean "feed-your-AI" toolkit by dataquarry:

Discover — this actor: every URL of a site.
Extract — dataquarry/website-to-markdown: turn those URLs into clean, LLM-ready Markdown.
Audit — dataquarry/website-seo-metadata-checker: SEO & metadata for each page.

Also see the dataquarry OSM place-data scrapers and free guides at openplacedata.com.

Clean & honest

Reads only public sitemap.xml/robots.txt and (in fallback) public pages; respects robots.txt; sends a descriptive User-Agent; no logins, no PII. Missing values are null, never guessed.
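For readers who want to see what respecting robots.txt involves, the sketch below uses Python's standard `urllib.robotparser` to read Sitemap directives and check whether a path may be fetched. It is an illustration, not the actor's implementation: the User-Agent string, the `read_robots` helper, and the sample robots.txt are all made up, and `site_maps()` needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical descriptive User-Agent; the actor's real string is not documented here.
USER_AGENT = "example-sitemap-bot/0.1 (+https://example.com/bot-info)"


def read_robots(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


# Illustrative robots.txt content.
robots = read_robots(
    "User-agent: *\n"
    "Disallow: /private/\n"
    "Sitemap: https://example.com/sitemap.xml\n"
)

# site_maps() returns None when no Sitemap directive exists,
# so fall back to the default location in that case.
sitemaps = robots.site_maps() or ["https://example.com/sitemap.xml"]

allowed = robots.can_fetch(USER_AGENT, "https://example.com/blog/post")
blocked = robots.can_fetch(USER_AGENT, "https://example.com/private/x")
```

Here `allowed` is True and `blocked` is False, because only paths under /private/ are disallowed for all user agents.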

FAQ

Do I need an API key? No — give it a URL and run it. It's free.

What if the site has no sitemap? It crawls the site's own links (same-domain, bounded) so you still get a URL list.

Does it handle huge sitemap-indexes? Yes — it follows child sitemaps up to the maxSitemaps and maxResults caps you set.
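The fallback crawl mentioned above can be pictured as a bounded, same-host, breadth-first walk over links. The following is a rough sketch only: `crawl_same_site`, `LinkParser`, and the injected `fetch` callable are hypothetical names, the default cap is arbitrary, and a real crawler would also check robots.txt before each request, which this sketch omits.

```python
from collections import deque
from html.parser import HTMLParser
from typing import Callable
from urllib.parse import urldefrag, urljoin, urlparse


class LinkParser(HTMLParser):
    """Collect the href value of every <a> tag."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def crawl_same_site(
    start_url: str,
    fetch: Callable[[str], str],  # hypothetical: returns the HTML text for a URL
    max_results: int = 100,
) -> list[str]:
    """Breadth-first crawl limited to the start URL's host and a result cap."""
    host = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    found: list[str] = []

    while queue and len(found) < max_results:
        page = queue.popleft()  # one page at a time, in discovery order
        found.append(page)
        parser = LinkParser()
        parser.feed(fetch(page))
        for href in parser.hrefs:
            # Resolve relative links and drop #fragments so duplicates collapse.
            absolute = urldefrag(urljoin(page, href)).url
            parts = urlparse(absolute)
            if (
                parts.scheme in ("http", "https")
                and parts.netloc == host
                and absolute not in seen
            ):
                seen.add(absolute)
                queue.append(absolute)
    return found
```

The `seen` set is what keeps the output deduplicated, and the `max_results` check is what keeps the crawl bounded on sites with very many pages.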

⭐ Found this useful? A quick rating on this actor's Store page helps others discover it — and if something is off or you wish it did more, open an issue on the actor. I read every one.
