SEO Sitemap & Broken Link Auditor avatar

SEO Sitemap & Broken Link Auditor

Under maintenance

Pricing

Pay per usage

Go to Apify Store
SEO Sitemap & Broken Link Auditor

SEO Sitemap & Broken Link Auditor

Under maintenance

Comprehensive SEO audit: crawl sitemaps, find broken links, check redirects, analyze meta tags, validate schema markup, and detect indexability issues.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Prince Raj

Prince Raj

Maintained by Community

Actor stats

0

Bookmarked

1

Total users

0

Monthly active users

5 days ago

Last modified

Categories

Share

SEO Sitemap & Indexability Auditor

Comprehensive SEO audit tool that crawls websites to detect broken links, indexability issues, meta tag problems, and structured data gaps. Built for SEO agencies, website owners, and SaaS founders.

What It Extracts

  • Broken links: Internal and external broken links with status codes
  • Meta audit: Title tags, meta descriptions, heading structure (H1/H2)
  • Indexability: Canonical tags, robots directives, noindex/nofollow flags
  • Schema markup: JSON-LD presence, schema types detected
  • Social previews: OpenGraph tags, Twitter Card tags
  • Sitemap analysis: Automatic sitemap discovery and URL extraction

Why It's Better

Combines sitemap discovery, broken link detection, and full meta/schema audit in one run. Automatically discovers sitemaps from robots.txt and crawls up to 500 pages. Generates a priority fixes report.

Input

FieldTypeDefaultDescription
startUrlsstring[]requiredURLs or sitemap URLs to audit
maxPagesinteger50Maximum pages to crawl (1-500)
checkExternalLinksbooleanfalseCheck external links for broken status
includeMetaAuditbooleantrueAudit titles, descriptions, headings
includeSchemaAuditbooleantrueCheck JSON-LD presence
includeSocialPreviewAuditbooleantrueCheck OG/Twitter tags

Output Example

{
"url": "https://example.com/pricing",
"statusCode": 200,
"title": "Pricing Plans",
"description": "Choose the right plan for your team",
"canonical": "https://example.com/pricing",
"robots": null,
"hasNoindex": false,
"h1Count": 1,
"h2Count": 4,
"internalLinks": 15,
"externalLinks": 3,
"hasSchema": true,
"schemaTypes": ["Product", "Offer"],
"hasOgTags": true,
"hasTwitterCards": true,
"issues": [],
"brokenLinks": [],
"crawledAt": "2025-01-15T10:30:00Z"
}

Use Cases

  • Client SEO audits: Generate comprehensive audit reports for clients
  • Pre-launch checks: Verify SEO setup before site launch
  • Broken link monitoring: Regular checks for broken links
  • Schema validation: Ensure structured data is present on key pages
  • Migration audits: Verify SEO elements after site migration

PPE Pricing

EventDescriptionSuggested Price
url-checkedOne page crawled and analyzed$0.002
site-auditedComplete site audit report generated$0.01

Limitations

  • Only crawls pages reachable from provided URLs
  • External link checking is optional (adds time)
  • JavaScript-rendered content may not be fully captured
  • Rate limited to respect target sites

Legal/Ethical Use

This actor crawls publicly accessible pages. It respects robots.txt directives. Users are responsible for compliance with applicable laws and website terms of service.

Local Run

cd actors/seo-sitemap-indexability-auditor
apify run --input-file .actor/sample_input.json

Deploy

cd actors/seo-sitemap-indexability-auditor
apify push

FAQ

Q: How does sitemap discovery work? A: It checks robots.txt for Sitemap directives, then tries common sitemap locations (/sitemap.xml).

Q: Can it crawl JavaScript-rendered pages? A: It uses CheerioCrawler which handles most server-rendered pages. For heavy JS sites, consider using a browser-based crawler.

Q: What's the maximum number of pages? A: Up to 500 pages per run. Set maxPages to control crawl depth and cost.

Tags

SEO audit, broken link checker, sitemap analyzer, indexability audit, meta tag checker, schema validator, OpenGraph audit, technical SEO, website health check