Broken Link Finder

Maintain your website's health with the Broken Link Finder. Scanning massive sites can be time-consuming, so this actor is designed to efficiently check small parts at a time. Quickly identify and fix dead links without the long wait. Perfect for incremental maintenance!

Pricing

Pay per usage

Developer

Shahid Irfan

Maintained by Community

Find and fix broken links on any website. This tool crawls your pages, checks every link, and reports which ones are broken — helping you maintain SEO rankings and provide a better user experience.

Broken Link Finder automatically scans your website to detect:

  • 404 errors — Pages that no longer exist
  • Server errors (5xx) — Links to pages with server problems
  • Timeout issues — Links that take too long to respond
  • Invalid anchors — Fragment links (#section) that don't exist on the page
  • External broken links — Dead links pointing to other websites

Broken links hurt your website in multiple ways:

  1. SEO Impact — Search engines penalize sites with broken links
  2. User Experience — Visitors leave when they hit dead ends
  3. Lost Revenue — Broken product or checkout links cost sales
  4. Credibility — Broken links make your site look unmaintained

Features

  • Deep crawling — Check links inside article pages, not just the homepage
  • Configurable depth — Control how deep the crawler goes (1-10 levels)
  • Fast parallel checking — Scan up to 50 pages simultaneously
  • External link checking — Verify links to other websites work
  • Smart content detection — Focuses on main content, skips navigation menus
  • Email notifications — Get reports sent directly to your inbox
  • Detailed reports — HTML and JSON reports with all findings

How to use

  1. Click Try for free to open the actor
  2. Enter your Website URL (e.g., https://example.com/blog)
  3. Set Crawl depth to control how deep to check (default: 3)
  4. Set Max pages to limit the crawl size (default: 100)
  5. Click Start and wait for results
  6. Download the report or view it online

Input options

| Field | Description | Default |
| --- | --- | --- |
| Website URL | The starting URL to crawl | Required |
| Max pages | Maximum pages to crawl | 100 |
| Crawl depth | How many levels deep to check links | 3 |
| Max concurrency | Pages to check in parallel | 10 |
| Check external links | Also verify links to other sites | Yes |
| Save only broken links | Only save broken links to dataset | Yes |
| Crawl subdomains | Include subdomains in the crawl | No |
| Notification emails | Email addresses for reports | None |

Input example

Check a blog for broken links, going 3 levels deep:

{
  "baseUrl": "https://example.com/blog",
  "maxPages": 500,
  "maxCrawlDepth": 3,
  "maxConcurrency": 10,
  "checkExternalLinks": true,
  "saveOnlyBrokenLinks": true
}

Check an entire e-commerce site including subdomains:

{
  "baseUrl": "https://shop.example.com",
  "maxPages": 2000,
  "maxCrawlDepth": 4,
  "crawlSubdomains": true,
  "notificationEmails": ["webmaster@example.com"]
}
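If you build the input programmatically, a small helper can fill in the defaults and catch obvious mistakes before a run starts. The sketch below is illustrative only: `build_input` is a hypothetical helper, the field names come from the examples above, the defaults come from the Input options table, and the range checks (depth 1-10, concurrency up to 50) are assumptions based on the feature list rather than documented validation rules.

```python
from urllib.parse import urlparse

# Defaults from the "Input options" table; field names follow the input examples.
DEFAULTS = {
    "maxPages": 100,
    "maxCrawlDepth": 3,
    "maxConcurrency": 10,
    "checkExternalLinks": True,
    "saveOnlyBrokenLinks": True,
    "crawlSubdomains": False,
    "notificationEmails": [],
}


def build_input(base_url, **overrides):
    """Return an actor input dict with defaults filled in (hypothetical helper)."""
    parsed = urlparse(base_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"baseUrl must be an absolute http(s) URL: {base_url!r}")

    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")

    actor_input = {"baseUrl": base_url, **DEFAULTS, **overrides}
    # Copy the list so callers never share the module-level default.
    actor_input["notificationEmails"] = list(actor_input["notificationEmails"])

    # Assumed limits, taken from the feature list (1-10 levels, up to 50 pages).
    if not 1 <= actor_input["maxCrawlDepth"] <= 10:
        raise ValueError("maxCrawlDepth must be between 1 and 10")
    if not 1 <= actor_input["maxConcurrency"] <= 50:
        raise ValueError("maxConcurrency must be between 1 and 50")
    if actor_input["maxPages"] < 1:
        raise ValueError("maxPages must be at least 1")
    return actor_input
```

For example, `build_input("https://example.com/blog", maxPages=500)` yields the same settings as the blog example above, plus the remaining defaults.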

Output

Results are saved in two places:

Dataset (structured data)

Each broken link is saved as a record:

{
  "sourceUrl": "https://example.com/blog/old-post",
  "sourceTitle": "My Old Blog Post",
  "targetUrl": "https://example.com/deleted-page",
  "linkText": "Click here",
  "linkType": "internal",
  "httpStatus": 404,
  "status": "Not Found",
  "isBroken": true,
  "severity": "high",
  "issueType": "404_not_found",
  "checkedAt": "2024-01-15T10:30:00Z"
}
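Once the records are exported, it is often easier to fix links page by page. The following sketch groups records by `sourceUrl` and puts the most severe issues first. It assumes the records have already been loaded as a list of dicts shaped like the one above; `group_by_source` and the severity ordering are illustrative choices, not part of the actor.

```python
from collections import defaultdict

# Assumed ordering for sorting; the values mirror the "severity" field.
SEVERITY_RANK = {"high": 0, "medium": 1, "low": 2}


def group_by_source(records):
    """Map each sourceUrl to its broken links, most severe first."""
    grouped = defaultdict(list)
    for record in records:
        if record.get("isBroken"):
            grouped[record["sourceUrl"]].append(record)
    for links in grouped.values():
        # Unknown or missing severities sort after the known ones.
        links.sort(
            key=lambda r: SEVERITY_RANK.get(r.get("severity"), len(SEVERITY_RANK))
        )
    return dict(grouped)
```

Each key in the result is a page to edit, and each value lists the links on that page that need attention.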

Key-Value Store

  • OUTPUT — JSON summary with statistics and all broken links
  • OUTPUT.html — Visual HTML report for easy viewing

Understanding the results

| Status | HTTP Code | Severity | Meaning |
| --- | --- | --- | --- |
| OK | 200 | None | Link works correctly |
| Redirect | 301/302 | Low | Link redirects (usually fine) |
| Not Found | 404 | High | Page doesn't exist |
| Forbidden | 403 | Medium | Access denied |
| Server Error | 500+ | High | Server problem |
| Timeout | n/a | High | Page didn't respond |
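The table above can be expressed as a small lookup, which is handy if you check a few links yourself and want to label them the same way. This is a sketch of the mapping as documented, not the actor's internal logic. `classify` is a hypothetical function, using `None` to mean "no response" is an assumption, and codes the table does not list fall into a catch-all.

```python
def classify(http_status):
    """Return (status, severity) for a response code, following the table.

    Pass None when the request timed out and no response was received.
    """
    if http_status is None:
        return ("Timeout", "high")
    if http_status == 200:
        return ("OK", "none")
    if http_status in (301, 302):
        return ("Redirect", "low")
    if http_status == 403:
        return ("Forbidden", "medium")
    if http_status == 404:
        return ("Not Found", "high")
    if http_status >= 500:
        return ("Server Error", "high")
    # Codes the table does not cover (for example 410) are left unclassified.
    return ("Other", "unknown")
```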

How crawl depth works

The Crawl depth setting controls how deep the crawler goes:

| Depth | What gets checked |
| --- | --- |
| 1 | Only links on the starting page |
| 2 | Starting page + one level of linked pages |
| 3 | Two levels deep (recommended for most sites) |
| 4+ | Deeper crawling for large content sites |

Example with depth 3:

  1. Crawls category page /blog/tutorials
  2. Finds 20 article links, crawls each article
  3. Checks all links inside each article (images, downloads, related posts)
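To make the depth setting concrete, here is a minimal breadth-first sketch over an in-memory link graph. It follows the depth table (depth 1 covers only the starting page, each extra level follows links one hop further) and is an assumption about the semantics, not the actor's actual crawler. `pages_to_crawl` and the `links` mapping are hypothetical.

```python
from collections import deque


def pages_to_crawl(links, start, depth):
    """Return the pages whose links get checked, in breadth-first order.

    `links` maps a page to the pages it links to. The starting page is
    level 1; pages at level `depth` are crawled but their links are not
    followed any further.
    """
    crawled = [start]
    seen = {start}
    queue = deque([(start, 1)])
    while queue:
        page, level = queue.popleft()
        if level >= depth:
            continue  # depth limit reached: do not follow this page's links
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                crawled.append(target)
                queue.append((target, level + 1))
    return crawled
```

With `links = {"/blog": ["/blog/a", "/blog/b"], "/blog/a": ["/blog/c"]}`, depth 1 returns only `/blog`, depth 2 adds `/blog/a` and `/blog/b`, and depth 3 also reaches `/blog/c`.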

Use cases

Blog and content sites

Find broken links in old articles that reference deleted pages or outdated external resources.

E-commerce stores

Detect broken product links, missing images, and dead checkout paths before customers do.

Documentation sites

Ensure all internal links between docs work and external API references are valid.

Site migrations

Verify all old URLs properly redirect after moving to a new domain or platform.

Regular SEO audits

Schedule weekly or monthly checks to catch broken links before search engines do.

Cost estimation

Costs depend on pages crawled and resources used:

| Site Size | Pages | Estimated Cost |
| --- | --- | --- |
| Small | 100 | ~$0.10-0.25 |
| Medium | 1,000 | ~$1.00-2.50 |
| Large | 10,000 | ~$10.00-25.00 |
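The three rows in the table scale linearly, which works out to roughly $0.001-0.0025 per page. A rough estimator for other crawl sizes might look like the sketch below. The per-page rates are derived from the table, `estimate_cost` is a hypothetical helper, and actual costs depend on the resources a run uses.

```python
# Per-page rates implied by the table: about $0.10-0.25 per 100 pages.
LOW_RATE_USD = 0.001
HIGH_RATE_USD = 0.0025


def estimate_cost(pages):
    """Return a rough (low, high) cost estimate in USD for `pages` pages."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return (round(pages * LOW_RATE_USD, 2), round(pages * HIGH_RATE_USD, 2))
```

For instance, `estimate_cost(2000)` returns `(2.0, 5.0)`, which sits between the medium and large rows.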

Tips for best results

  1. Start small — Test with 50-100 pages first to verify settings
  2. Use appropriate depth — Depth 3 works for most sites
  3. Lower concurrency — Reduce to 5 if you get rate-limited
  4. Schedule regular checks — Use Apify schedules for weekly monitoring
  5. Check external links — Many broken links point to other sites

Integrations

Export results to:

  • Google Sheets
  • Slack notifications
  • Email reports
  • Webhooks for custom integrations
  • Any tool via Apify API
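For the API route, one option is to read a run's dataset over HTTP and pass the records to your own tooling. The sketch below uses only the Python standard library and the Apify API's dataset items endpoint. The dataset ID and token are placeholders you supply, and the function names are illustrative.

```python
import json
import urllib.request
from urllib.parse import quote, urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(dataset_id, fmt="json"):
    """Build the URL for a dataset's items in the requested format."""
    query = urlencode({"format": fmt})
    return f"{API_BASE}/datasets/{quote(dataset_id, safe='')}/items?{query}"


def fetch_broken_links(dataset_id, token):
    """Download the dataset records as a list of dicts."""
    request = urllib.request.Request(
        dataset_items_url(dataset_id),
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `fetch_broken_links("<your dataset ID>", "<your API token>")` should return records shaped like the dataset example in the Output section.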

FAQ

How long does a crawl take?

A 100-page site typically completes in 2-5 minutes. Larger sites take proportionally longer.

Will this slow down my website?

The crawler includes rate limiting and respects server responses. Reduce concurrency if needed.

Can I check competitor websites?

Yes, but respect their terms of service and use reasonable crawl limits.

What's the difference between internal and external links?

Internal links point to pages on your site. External links point to other websites.

How do I fix broken links?

Update the link to the correct URL, set up a redirect, or remove the link entirely.

Support