Sitemap Keyword Extractor
Under maintenance

Pricing

Pay per usage

Developer

SimplifySME Toolbox

Maintained by Community

🗺️ Extract all pages from XML sitemaps and detect keywords from URL structures for SEO analysis and content planning. Fast, efficient, and production-ready.


📺 What It Extracts

  • Sitemap Data: All URLs, last modification dates, change frequencies, priorities
  • Keyword Detection: Automatically extracts keywords from URL paths
  • Page Metadata: Complete sitemap information for each page
  • Statistics: Total page count and keyword analysis

🚀 Key Features

| Feature | Description |
| --- | --- |
| 🗺️ Sitemap Parsing | Supports standard XML sitemap format |
| 🔍 Keyword Detection | Automatically extracts keywords from URL paths |
| 📊 Structured Output | Clean JSON format with page and keyword data |
| ⚡ Fast Processing | Efficient parsing of large sitemaps |
| 🔄 Error Handling | Gracefully handles malformed sitemaps |
| 📈 SEO Insights | Provides keyword counts and patterns |

📥 Input

Required

  • sitemapUrl (string): The URL of the sitemap.xml file
    • Example: "https://example.com/sitemap.xml"
    • Supports standard XML sitemap format

📤 Output

Returns structured sitemap data:

{
  "sitemapUrl": "https://example.com/sitemap.xml",
  "totalPages": 150,
  "pages": [
    {
      "url": "https://example.com/products/widget",
      "lastmod": "2024-01-15",
      "changefreq": "weekly",
      "priority": "0.8",
      "detectedKeywords": ["products", "widget"],
      "keywordCount": 2
    }
  ],
  "_metadata": {
    "runId": "abc123",
    "processedAt": "2024-01-15T12:00:00.000Z",
    "processingTimeMs": 2500
  }
}
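
Once a run has finished, the per-page `detectedKeywords` arrays can be rolled up into site-wide counts. The following is a minimal Python sketch of that post-processing step, assuming the result has already been downloaded as JSON in the shape shown above. The sample data and the `keyword_frequencies` helper are illustrative and not part of the Actor.

```python
import json
from collections import Counter

# Illustrative result in the output shape documented above (trimmed to two pages).
result_json = """
{
  "sitemapUrl": "https://example.com/sitemap.xml",
  "totalPages": 2,
  "pages": [
    {"url": "https://example.com/products/widget",
     "detectedKeywords": ["products", "widget"], "keywordCount": 2},
    {"url": "https://example.com/products/gadget",
     "detectedKeywords": ["products", "gadget"], "keywordCount": 2}
  ]
}
"""

def keyword_frequencies(result: dict) -> Counter:
    """Count how many pages each detected keyword appears on."""
    counts = Counter()
    for page in result.get("pages", []):
        # A set ensures a keyword repeated within one URL counts once per page.
        counts.update(set(page.get("detectedKeywords", [])))
    return counts

result = json.loads(result_json)
frequencies = keyword_frequencies(result)
print(frequencies.most_common(3))
```

Counting pages rather than raw occurrences keeps a single URL such as `/blog/blog/` from inflating a keyword's weight, which may or may not suit a given audit.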

💡 Use Cases

  • ✅ SEO Audits - Analyze site structure and keyword distribution
  • ✅ Content Planning - Identify content gaps and opportunities
  • ✅ Competitor Analysis - Study competitor site structures
  • ✅ Site Mapping - Generate comprehensive site maps
  • ✅ Keyword Research - Extract keywords from URL patterns
  • ✅ Content Strategy - Plan content based on existing structure

⚙️ Technical Details

  • Parser: Uses Cheerio for efficient XML parsing
  • Keyword Detection: Extracts meaningful keywords from URL path segments
  • Error Handling: Validates sitemap format and handles errors gracefully
  • Performance: Optimized for processing large sitemaps efficiently
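
To make the parsing and validation steps above concrete, here is a rough Python sketch of reading a standard sitemap into page records. It is not the Actor's code: the Actor uses Cheerio, whereas this sketch swaps in Python's standard-library `xml.etree.ElementTree`. The `parse_sitemap` function name and the inline sample are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

def parse_sitemap(xml_text: str) -> list[dict]:
    """Return one record per <url> entry; raise ValueError on malformed XML."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        raise ValueError(f"Malformed sitemap XML: {exc}") from exc

    pages = []
    for url_el in root.findall("sm:url", NS):
        loc = url_el.findtext("sm:loc", default="", namespaces=NS).strip()
        if not loc:
            continue  # skip entries that have no location
        pages.append({
            "url": loc,
            # Optional fields stay None when the sitemap omits them.
            "lastmod": url_el.findtext("sm:lastmod", default=None, namespaces=NS),
            "changefreq": url_el.findtext("sm:changefreq", default=None, namespaces=NS),
            "priority": url_el.findtext("sm:priority", default=None, namespaces=NS),
        })
    return pages

# Hypothetical two-entry sitemap used only to exercise the function.
sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/products/widget</loc>
    <lastmod>2024-01-15</lastmod>
    <changefreq>weekly</changefreq>
    <priority>0.8</priority>
  </url>
  <url>
    <loc>https://example.com/about</loc>
  </url>
</urlset>"""

pages = parse_sitemap(sample)
print(len(pages), pages[0]["priority"])
```

Values such as `priority` are kept as strings, matching the `"priority": "0.8"` field in the documented output.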

📝 Example Usage

Basic Extraction

{
  "sitemapUrl": "https://example.com/sitemap.xml"
}

Large Sitemaps

{
  "sitemapUrl": "https://large-site.com/sitemap.xml"
}

⚠️ Important Notes

  • Supports standard XML sitemap format
  • Automatically filters out non-meaningful URL segments for keyword detection
  • Handles sitemaps with thousands of URLs efficiently
  • Keywords are extracted from URL path segments (e.g., /products/widget → ["products", "widget"])
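
The keyword rule in the last note can be approximated in a few lines of Python. This is only a sketch under stated assumptions: the Actor's actual filter for non-meaningful segments is not documented here. The `IGNORED_SEGMENTS` stop list, the numeric-segment rule, and the extension stripping are guesses chosen for illustration.

```python
import re
from urllib.parse import urlparse

# Assumed stop list; the Actor's real filter is not documented.
IGNORED_SEGMENTS = {"index", "www", "html", "htm", "php"}

def detect_keywords(url: str) -> list[str]:
    """Split the URL path into lowercase keywords, dropping noise segments."""
    path = urlparse(url).path
    keywords = []
    for segment in path.split("/"):
        # Strip a trailing file extension such as ".html".
        segment = re.sub(r"\.[a-z0-9]+$", "", segment.lower())
        # Skip empty, purely numeric, or stop-listed segments.
        if not segment or segment.isdigit() or segment in IGNORED_SEGMENTS:
            continue
        keywords.append(segment)
    return keywords

print(detect_keywords("https://example.com/products/widget"))  # ['products', 'widget']
```

Hyphenated segments such as `blue-widget` are kept whole in this sketch; whether the Actor splits them further is not stated.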