Pricing

from $0.20 / 1,000 results

URLs List - Extract ALL website urls

Automatically discovers and extracts ALL URLs from any website. Perfect for SEO analysis, content inventory, and bulk URL extraction from multiple websites. Get complete URL lists with metadata including last modified dates and priority levels.

Rating

5.0

(2)

Developer

Lofomachines

Actor stats

Last modified

3 months ago

✨ Key Features

🔍 Automatic Discovery: Intelligently finds all available URLs from any website structure.
💨 Fast & Efficient: Optimized for speed to handle large sites (50k+ URLs).
📦 Bulk Processing: Accepts multiple domain roots to process simultaneously.
🏷️ Rich Metadata: Extracts last modified dates, priority levels, and update frequency (where available).
🗜️ Smart Handling: Works with standard sitemaps, recursive crawling, and standard web formats.
🛡️ Resilient: Automatic retries on temporary errors and infinite loop prevention.
🎯 Result Limiting: Control the maximum number of URLs extracted with maxResults or enable returnAll for complete extraction.
🔎 Keyword Filtering: Filter URLs by keywords - only URLs containing all specified keywords will be returned.
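
For sites that publish a standard sitemap, the metadata listed above (last modified date, update frequency, priority) corresponds to the `lastmod`, `changefreq`, and `priority` tags of the sitemaps.org protocol. The following is a minimal sketch of reading those tags with Python's standard library. It is not the Actor's actual implementation; the `parse_sitemap` helper and the sample XML are illustrative assumptions.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical sample sitemap used only for illustration.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/</loc>
    <lastmod>2024-05-01</lastmod>
    <changefreq>weekly</changefreq>
    <priority>1.0</priority>
  </url>
  <url>
    <loc>https://example.com/blog/first-post</loc>
  </url>
</urlset>
"""


def parse_sitemap(xml_text):
    """Return one dict per <url> entry; optional tags that are absent become None."""
    root = ET.fromstring(xml_text)
    entries = []
    for url_el in root.findall("sm:url", SITEMAP_NS):
        entries.append({
            "url": url_el.findtext("sm:loc", default=None, namespaces=SITEMAP_NS),
            "lastmod": url_el.findtext("sm:lastmod", default=None, namespaces=SITEMAP_NS),
            "changefreq": url_el.findtext("sm:changefreq", default=None, namespaces=SITEMAP_NS),
            "priority": url_el.findtext("sm:priority", default=None, namespaces=SITEMAP_NS),
        })
    return entries


if __name__ == "__main__":
    for entry in parse_sitemap(SAMPLE_SITEMAP):
        print(entry)
```

Because `lastmod`, `changefreq`, and `priority` are optional in the protocol, the sketch returns `None` for missing tags, which matches the "where available" caveat above.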

🎯 Use Cases

Use Case	Description
SEO Audit	Extract all URLs to analyze site architecture and identify orphan pages.
Content Inventory	Create a comprehensive list of all existing pages for migration planning.
Monitoring	Track `lastmod` dates to identify which content has been updated recently.
Data Pipelines	Feed the output URLs into other scrapers (e.g., Scrape HTML, Google Sheets export).
Targeted Extraction	Use keyword filtering to extract only specific sections (e.g., all blog posts, product pages).
Sampling	Use `maxResults` to extract a sample of URLs for quick analysis without processing entire sites.
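
As a rough illustration of the Monitoring use case, the sketch below filters extracted records down to those modified within the last N days. The record shape (dicts with `url` and `lastmod` keys holding ISO 8601 strings) is an assumption, since the output schema is not described here; adjust the field names to the Actor's real output.

```python
from datetime import datetime, timedelta, timezone


def recently_updated(records, days, now=None):
    """Return URLs whose `lastmod` falls within the last `days` days.

    `records` is assumed to be a list of dicts with hypothetical
    `url` and `lastmod` keys; entries without a usable date are skipped.
    """
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=days)
    fresh = []
    for rec in records:
        raw = rec.get("lastmod")
        if not raw:
            continue
        try:
            # Accept a trailing "Z" as UTC for older Python versions.
            stamp = datetime.fromisoformat(raw.replace("Z", "+00:00"))
        except ValueError:
            continue  # Unparseable date: skip rather than guess.
        if stamp.tzinfo is None:
            # Date-only or naive values are treated as UTC.
            stamp = stamp.replace(tzinfo=timezone.utc)
        if stamp >= cutoff:
            fresh.append(rec["url"])
    return fresh


if __name__ == "__main__":
    sample = [
        {"url": "https://example.com/new", "lastmod": "2024-05-01"},
        {"url": "https://example.com/old", "lastmod": "2024-01-01T00:00:00Z"},
        {"url": "https://example.com/unknown"},
    ]
    reference = datetime(2024, 5, 10, tzinfo=timezone.utc)
    print(recently_updated(sample, days=30, now=reference))
```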

💰 Cost of Usage

This scraper is designed to be lightweight. It parses URL structures without rendering full page JavaScript (unless necessary), keeping costs low.

Small Sites (< 1,000 URLs): Cents per run.
Medium Sites (10,000 URLs): Typically < $1.00.
Large Sites: Efficiency scales well, but usage depends on the complexity of the target site's architecture.

Tip: Enable Apify Proxy (`"useApifyProxy": true`, as in the example input) to ensure consistent access and avoid blocking.

📥 Input Configuration

The Actor expects a JSON input defining the websites to scan.

Example Input

{
  "startUrls": [
    { "url": "https://apify.com" },
    { "url": "https://crawlee.dev" }
  ],
  "proxyConfiguration": {
    "useApifyProxy": true
  },
  "returnAll": true,
  "maxResults": 1000,
  "keywords": ["blog", "article"]
}
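
One possible way to send this input from code is the `apify-client` Python package. The sketch below assumes that package is installed and that you have an API token; the `build_run_input` and `run_actor` helpers are illustrative, and the Actor ID placeholder is hypothetical because the Actor's store ID is not given here.

```python
import json


def build_run_input(sites, keywords=None, max_results=None):
    """Build the Actor's JSON input from plain Python values.

    With no `max_results`, `returnAll` is set to true; otherwise the
    limit is applied via `maxResults`.
    """
    run_input = {
        "startUrls": [{"url": site} for site in sites],
        "proxyConfiguration": {"useApifyProxy": True},
        "keywords": list(keywords or []),
    }
    if max_results is None:
        run_input["returnAll"] = True
    else:
        run_input["returnAll"] = False
        run_input["maxResults"] = max_results
    return run_input


def run_actor(token, actor_id, run_input):
    """Start the Actor, wait for it to finish, and return its dataset items."""
    # Imported lazily so building the input does not require the package.
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


if __name__ == "__main__":
    payload = build_run_input(
        ["https://apify.com", "https://crawlee.dev"],
        keywords=["blog"],
        max_results=100,
    )
    print(json.dumps(payload, indent=2))
    # Hypothetical placeholders: substitute a real token and Actor ID.
    # items = run_actor("<YOUR_API_TOKEN>", "<username>/<actor-name>", payload)
```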

Input Parameters

Parameter	Type	Required	Default	Description
startUrls	Array	✅ Yes	`[{ url: "https://apify.com" }]`	List of website URLs to extract pages from.
proxyConfiguration	Object	❌ No	`{ useApifyProxy: false }`	Proxy settings for reliable access.
returnAll	Boolean	❌ No	`true`	If `true`, extracts all available URLs regardless of `maxResults`. If `false`, applies the `maxResults` limit.
maxResults	Integer	❌ No	`1000`	Maximum number of URLs to extract. No limit is applied if `returnAll` is `true` or if `maxResults` is `0`.
keywords	Array	❌ No	`[]`	Filter URLs to only include those containing ALL specified keywords. Case-insensitive matching. Example: `["blog"]` returns only URLs containing "blog" (e.g., `https://example.com/blog/article`).
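
The interaction between `keywords`, `returnAll`, and `maxResults` described in the table can be sketched as follows. This is an illustration of the documented behavior, not the Actor's code: the `select_urls` function is hypothetical, it reads "set to `0`" as `maxResults` being `0`, and it assumes the limit is applied after keyword filtering, which the table does not specify.

```python
def select_urls(urls, keywords=(), return_all=True, max_results=1000):
    """Filter URLs by keywords, then apply the result limit.

    A URL is kept only if it contains ALL keywords (case-insensitive).
    The limit is skipped when `return_all` is true or `max_results` is 0.
    """
    lowered = [kw.lower() for kw in keywords]
    matched = [u for u in urls if all(kw in u.lower() for kw in lowered)]
    if return_all or max_results == 0:
        return matched
    return matched[:max_results]


if __name__ == "__main__":
    sample = [
        "https://example.com/blog/article",
        "https://example.com/Blog/news",
        "https://example.com/about",
    ]
    print(select_urls(sample, keywords=["blog"]))
    print(select_urls(sample, keywords=["blog", "article"]))
    print(select_urls(sample, return_all=False, max_results=1))
```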

Sitemap to URL Crawler

logiover/sitemap-to-url-crawler

Instantly extract all public URLs from any website's sitemap.xml recursively. Handles nested sitemap indexes automatically. The fastest & cheapest way to build URL lists for RAG pipelines, LLM training, and SEO audits. Zero-config & blazing fast.

Logiover

Extract Website With URL

mrahil/extract-website-with-url

The Extract Website with URL API allows users to extract structured data from any webpage by providing a URL. It retrieves HTML, metadata, tables, and images, returning data in JSON format. Ideal for web scraping, SEO analysis, and content extraction. Use it for e-commerce data, news scraping

Mohammed Rahil

207

Fast Sitemap Generator

eunit/sitemap-generator

Boost SEO with this automatic Sitemap Generator. Crawl any site to create XML, HTML, & TXT sitemaps. Supports custom depth, regex filters, & robots.txt. Compatible with Google Search Console.

Emmanuel Uchenna

5.0

Fast URL Content Crawler

6sigmag/fast-url-content-crawler

A high-performance web scraper that rapidly extracts and analyzes content from multiple URLs simultaneously. Perfect for competitive research, content aggregation, and website structure analysis.

David Deng

274

5.0

Deep URL Content Crawler

6sigmag/deep-url-content-crawler

Scrape Failed Killer! A high-performance web scraper that rapidly extracts and analyzes content from multiple URLs simultaneously. Perfect for competitive research, content aggregation, and website structure analysis.

David Deng

Find Sitemap from url

eesti/find-sitemap-from-url

A powerful Apify Actor that finds sitemap URLs for any website. This Actor helps you discover XML sitemaps by checking common locations, robots.txt files, and analyzing HTML content for sitemap links.

ando

200

1.0

Sitemap URL Extractor

onescales/sitemap-url-extractor

Provide a link to a sitemap.xml (e.g., https://onescales.com/sitemap.xml) and the app will extract and list all URLs in the sitemap, along with any additional data the sitemap contains.

One Scales

395

5.0

Sitemap Detector

coder_zoro/sitemap-detector

Find sitemap URLs fast with our free Sitemap Finder tool. Instantly detect sitemaps from any website for SEO audits, indexing checks, and crawl planning. Improve visibility, site structure insights, and search engine performance in just seconds.

Zoro

164

5.0

Get URLs from link

boring_code/get-urls-from-link

Extracts URLs from a sitemap or webpage with intuitive path matching. Use comma-separated patterns to include or exclude URL paths with smart matching: '/tags/' for exact paths, '/product' for paths that start with that prefix, or plain text for substring matches.