Sitemap Keyword Extractor avatar
Sitemap Keyword Extractor

Pricing

from $3.00 / 1,000 results

Go to Apify Store
Sitemap Keyword Extractor

Sitemap Keyword Extractor

๐Ÿ›’ Extract comprehensive product data from Amazon product pages with structured metadata, pricing, reviews, and availability information. Fast, reliable, and production-ready.

Pricing

from $3.00 / 1,000 results

Rating

0.0

(0)

Developer

SimplifySME Toolbox

SimplifySME Toolbox

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

6 days ago

Last modified

Share

๐Ÿ—บ๏ธ Extract all pages from XML sitemaps and detect keywords from URL structures for SEO analysis and content planning. Fast, efficient, and production-ready.


๐Ÿ“บ What It Extracts

  • Sitemap Data: All URLs, last modification dates, change frequencies, priorities
  • Keyword Detection: Automatically extracts keywords from URL paths
  • Page Metadata: Complete sitemap information for each page
  • Statistics: Total page count and keyword analysis

๐Ÿš€ Key Features

FeatureDescription
๐Ÿ—บ๏ธ Sitemap ParsingSupports standard XML sitemap format
๐Ÿ” Keyword DetectionAutomatically extracts keywords from URL paths
๐Ÿ“Š Structured OutputClean JSON format with page and keyword data
โšก Fast ProcessingEfficient parsing of large sitemaps
๐Ÿ”„ Error HandlingGracefully handles malformed sitemaps
๐Ÿ“ˆ SEO InsightsProvides keyword counts and patterns

๐Ÿ“ฅ Input

Required

  • sitemapUrl (string): The URL of the sitemap.xml file
    • Example: "https://example.com/sitemap.xml"
    • Supports standard XML sitemap format

๐Ÿ“ค Output

Returns structured sitemap data:

{
"sitemapUrl": "https://example.com/sitemap.xml",
"totalPages": 150,
"pages": [
{
"url": "https://example.com/products/widget",
"lastmod": "2024-01-15",
"changefreq": "weekly",
"priority": "0.8",
"detectedKeywords": ["products", "widget"],
"keywordCount": 2
}
],
"_metadata": {
"runId": "abc123",
"processedAt": "2024-01-15T12:00:00.000Z",
"processingTimeMs": 2500
}
}

๐Ÿ’ก Use Cases

  • โœ… SEO Audits - Analyze site structure and keyword distribution
  • โœ… Content Planning - Identify content gaps and opportunities
  • โœ… Competitor Analysis - Study competitor site structures
  • โœ… Site Mapping - Generate comprehensive site maps
  • โœ… Keyword Research - Extract keywords from URL patterns
  • โœ… Content Strategy - Plan content based on existing structure

โš™๏ธ Technical Details

  • Parser: Uses Cheerio for efficient XML parsing
  • Keyword Detection: Extracts meaningful keywords from URL path segments
  • Error Handling: Validates sitemap format and handles errors gracefully
  • Performance: Optimized for processing large sitemaps efficiently

๐Ÿ“ Example Usage

Basic Extraction

{
"sitemapUrl": "https://example.com/sitemap.xml"
}

Large Sitemaps

{
"sitemapUrl": "https://large-site.com/sitemap.xml"
}

โš ๏ธ Important Notes

  • Supports standard XML sitemap format
  • Automatically filters out non-meaningful URL segments for keyword detection
  • Handles sitemaps with thousands of URLs efficiently
  • Keywords are extracted from URL path segments (e.g., /products/widget โ†’ ["products", "widget"])