Pricing

Pay per usage

Sitemap Url Extractor

Developer

Donny Nguyen

Last modified: 29 minutes ago

What does Sitemap URL Extractor do?

Sitemap URL Extractor is an Apify actor that parses XML sitemaps and extracts all listed URLs along with their metadata such as last modification date, priority, and change frequency. It handles standard sitemaps, sitemap indexes, and nested sitemaps automatically. Feed it any sitemap URL like https://crawlee.dev/sitemap.xml and get a clean, structured list of every page the site exposes.
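The parsing described above can be illustrated with a minimal sketch. This is not the actor's own code: it uses only Python's standard library, the `parse_sitemap` function and the in-memory `PAGES` dictionary are hypothetical, and recording the child sitemap as `sitemapSource` is an assumption about how that field is filled.

```python
from __future__ import annotations

import xml.etree.ElementTree as ET
from typing import Callable, Iterator

# Namespace defined by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def parse_sitemap(xml_text: str, source: str, fetch: Callable[[str], str],
                  seen: set[str] | None = None) -> Iterator[dict]:
    """Yield one record per <url>; follow <sitemap> entries of an index."""
    seen = set() if seen is None else seen
    seen.add(source)
    root = ET.fromstring(xml_text)

    if root.tag.endswith("sitemapindex"):
        for loc in root.findall("sm:sitemap/sm:loc", NS):
            child = (loc.text or "").strip()
            if child and child not in seen:  # guard against index loops
                yield from parse_sitemap(fetch(child), child, fetch, seen)
        return

    for node in root.findall("sm:url", NS):
        loc = node.findtext("sm:loc", default="", namespaces=NS).strip()
        if not loc:
            continue
        record = {"url": loc, "sitemapSource": source}
        # Optional metadata is copied only when the source XML has it.
        for field in ("lastmod", "priority", "changefreq"):
            value = node.findtext(f"sm:{field}", namespaces=NS)
            if value is not None:
                record[field] = value.strip()
        yield record


# Hypothetical in-memory "site" so the sketch runs without network access.
PAGES = {
    "https://example.com/sitemap.xml": (
        '<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
        "<sitemap><loc>https://example.com/docs.xml</loc></sitemap>"
        "</sitemapindex>"
    ),
    "https://example.com/docs.xml": (
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
        "<url><loc>https://example.com/docs/intro</loc>"
        "<lastmod>2025-11-15</lastmod><priority>0.8</priority></url>"
        "<url><loc>https://example.com/docs/api</loc></url>"
        "</urlset>"
    ),
}

start = "https://example.com/sitemap.xml"
records = list(parse_sitemap(PAGES[start], start, PAGES.__getitem__))
```

Passing the fetcher in as a function keeps the parsing logic testable offline; a real run would swap in an HTTP client.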

Why use Sitemap URL Extractor?

Bulk URL discovery -- Instantly extract thousands of URLs from any XML sitemap without manual parsing.
Complete metadata -- Get lastmod, priority, and changefreq values alongside every URL for SEO analysis.
Sitemap index support -- Automatically follows sitemap indexes to extract URLs from all child sitemaps.
API integration -- Retrieve results programmatically via the Apify API for use in crawling pipelines and SEO tools.
Proxy support -- Leverages Apify Proxy to access sitemaps behind geo-restrictions or rate limits.

How to use Sitemap URL Extractor

Visit the Apify Store and find Sitemap URL Extractor.
Click Try for free to open the actor configuration page.
Enter one or more XML sitemap URLs in the Sitemap URLs field.
Optionally set a Max URLs limit to cap the number of extracted URLs.
Click Start and, once the run finishes, download the dataset as JSON, CSV, or Excel.

Input configuration

Field	Type	Description	Default
`sitemapUrls`	Array of strings	XML sitemap URLs to parse	`["https://crawlee.dev/sitemap.xml"]`
`maxUrls`	Integer	Maximum number of URLs to extract	`10000`
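As a rough sketch, the two input fields can be assembled and sent to the Apify API's synchronous run endpoint as shown below. The actor ID `username~sitemap-url-extractor` and the token are placeholders, and the validation rules in `build_input` are assumed constraints rather than documented ones. The request is only prepared here, not sent.

```python
import json
import urllib.request

API_BASE = "https://api.apify.com/v2"


def build_input(sitemap_urls: list[str], max_urls: int = 10000) -> dict:
    """Assemble the two input fields, with assumed sanity checks."""
    if not sitemap_urls:
        raise ValueError("sitemapUrls needs at least one URL")
    for url in sitemap_urls:
        if not url.startswith(("http://", "https://")):
            raise ValueError(f"not an absolute HTTP(S) URL: {url}")
    if max_urls < 1:
        raise ValueError("maxUrls must be a positive integer")
    return {"sitemapUrls": list(sitemap_urls), "maxUrls": max_urls}


def build_run_request(actor_id: str, token: str,
                      run_input: dict) -> urllib.request.Request:
    """Prepare (but do not send) a run-and-return-items request."""
    return urllib.request.Request(
        f"{API_BASE}/acts/{actor_id}/run-sync-get-dataset-items",
        data=json.dumps(run_input).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


run_input = build_input(["https://crawlee.dev/sitemap.xml"], max_urls=500)
# Placeholder actor ID and token; replace both with real values.
request = build_run_request(
    "username~sitemap-url-extractor", "MY_APIFY_TOKEN", run_input
)
# To execute the run: items = json.load(urllib.request.urlopen(request))
```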

Output data

Each record in the output dataset represents a single URL found in the sitemap. Metadata fields are included when available in the source XML.

{
  "url": "https://crawlee.dev/docs/introduction",
  "lastmod": "2025-11-15",
  "priority": "0.8",
  "changefreq": "weekly",
  "sitemapSource": "https://crawlee.dev/sitemap.xml"
}

Cost of usage

Sitemap URL Extractor uses pay-per-event (PPE) pricing at the Utility tier:

Tier	Cost per 1,000 events	Free events per month
Utility	$0.30	~16,600

Parsing a single sitemap with 500 URLs costs approximately 1 event. At $0.30 per 1,000 events, one event costs about $0.0003, so the ~16,600 free events are worth roughly $5 per month. If the ratio of about 500 URLs per event holds, extracting tens of thousands of URLs consumes on the order of a hundred events, well within the free allowance.
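The pricing arithmetic can be checked with a few lines of Python. This sketch assumes the free allowance is simply subtracted from the monthly event count and takes both constants from the table above; how many events a given run actually generates is not modeled.

```python
PRICE_PER_1000_EVENTS = 0.30    # USD, Utility tier
FREE_EVENTS_PER_MONTH = 16_600  # approximate figure from the table


def monthly_cost(events: int) -> float:
    """Estimated USD charge for a month's events after the free allowance."""
    billable = max(0, events - FREE_EVENTS_PER_MONTH)
    return round(billable * PRICE_PER_1000_EVENTS / 1000, 2)


# Dollar value of the free allowance at the Utility-tier rate.
free_value = round(FREE_EVENTS_PER_MONTH * PRICE_PER_1000_EVENTS / 1000, 2)

within_free_tier = monthly_cost(100)      # well under the allowance
above_free_tier = monthly_cost(20_000)    # 3,400 billable events
```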

Tips and advanced usage

Feed into a crawler -- Use extracted URLs as the start URL list for another Apify actor or a Crawlee-based scraper.
Schedule weekly extractions -- Set up scheduled runs to track when new pages appear on a competitor's site.
Filter by priority -- After extraction, filter the dataset by priority to focus on the most important pages.
Monitor sitemap freshness -- Compare lastmod dates across runs to detect stale or outdated content.
Combine with SEO tools -- Pair the output with a keyword rank checker or backlink analyzer for a comprehensive SEO audit.
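Two of the tips above, filtering by priority and comparing lastmod across runs, can be sketched in plain Python over downloaded dataset items. The helper names and sample records are hypothetical; the only assumption about the data is the output format shown earlier.

```python
def filter_by_priority(items, minimum=0.8):
    """Keep records whose priority is present and at least `minimum`."""
    kept = []
    for item in items:
        try:
            if float(item["priority"]) >= minimum:
                kept.append(item)
        except (KeyError, TypeError, ValueError):
            continue  # no usable priority on this record
    return kept


def diff_runs(previous, current):
    """Compare two runs by URL and report added, removed, and updated pages."""
    prev = {item["url"]: item.get("lastmod") for item in previous}
    curr = {item["url"]: item.get("lastmod") for item in current}
    shared = curr.keys() & prev.keys()
    return {
        "added": sorted(curr.keys() - prev.keys()),
        "removed": sorted(prev.keys() - curr.keys()),
        "updated": sorted(u for u in shared if curr[u] != prev[u]),
    }


# Hypothetical items from two scheduled runs.
last_week = [
    {"url": "https://example.com/a", "lastmod": "2025-11-01", "priority": "0.9"},
    {"url": "https://example.com/b", "lastmod": "2025-11-01", "priority": "0.3"},
    {"url": "https://example.com/old"},
]
this_week = [
    {"url": "https://example.com/a", "lastmod": "2025-11-15", "priority": "0.9"},
    {"url": "https://example.com/b", "lastmod": "2025-11-01", "priority": "0.3"},
    {"url": "https://example.com/new", "priority": "0.8"},
]

important = filter_by_priority(this_week)
changes = diff_runs(last_week, this_week)
```

Records without a priority are dropped by the filter rather than treated as a default value, which keeps the result limited to pages the site explicitly ranked.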

Built with Crawlee and Apify SDK. See more scrapers by consummate_mandala on Apify Store.
