Broken Link Checker โ€” Full Website Crawler avatar

Broken Link Checker โ€” Full Website Crawler

Pricing

from $3.50 / 1,000 results

Go to Apify Store
Broken Link Checker โ€” Full Website Crawler

Broken Link Checker โ€” Full Website Crawler

Crawl an entire website and check every internal & external link for broken (404/5xx) responses. Get the source page, anchor text and status code for each link. Schedule it to monitor any site for dead links and SEO errors.

Pricing

from $3.50 / 1,000 results

Rating

0.0

(0)

Developer

Logiover

Logiover

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

18 hours ago

Last modified

Share

Broken Link Checker โ€” Crawl a Whole Website for Dead Links ๐Ÿ”—

Find every broken link on a website automatically. This broken link checker crawls an entire site, follows internal links, and checks every internal and external link for 404, 410 and 5xx errors. For each link you get the source page it was found on, the anchor text, the link type and the exact HTTP status code.

Give it one URL โ€” it discovers and audits thousands of pages and links for you. No login, no proxy setup, no headless browser. Fast, cheap and schedulable.

Looking for a dead link checker, 404 finder, broken link finder or a website link validator? This actor does all of it, site-wide.


โœจ Key features

  • ๐Ÿ•ท๏ธ Full-site crawl โ€” start from one URL and automatically follow internal links across the whole domain.
  • ๐Ÿ”— Checks every link โ€” internal and external (outbound) links, not just a flat list.
  • ๐Ÿšฆ Exact HTTP status โ€” 200, 301, 404, 410, 500, plus network errors (timeouts, unreachable hosts, DNS failures).
  • ๐Ÿ“„ Source page + anchor text โ€” know exactly where each broken link lives so you can fix it fast.
  • โšก Fast & cheap โ€” pure HTTP (HEAD with GET fallback), no browser. Each unique URL is checked only once and cached.
  • ๐Ÿ—“๏ธ Schedulable monitoring โ€” run it daily/weekly to catch dead links the moment they appear.
  • ๐ŸŽ›๏ธ Flexible output โ€” export only broken links or a full audit of every link on the site.

๐Ÿ’ก Use cases

  • SEO audits โ€” broken internal links waste crawl budget and hurt rankings. Find and fix them across the whole site.
  • Website migrations & redesigns โ€” catch every 404 before and after a launch or domain move.
  • Content & docs maintenance โ€” keep large blogs, documentation and knowledge bases link-clean.
  • Affiliate & outbound link monitoring โ€” make sure the external links you earn from still resolve.
  • Ongoing link monitoring โ€” schedule the actor and get alerted when something breaks.

๐Ÿ“ฆ What you get

One row per link checked:

FieldDescription
sourceUrlThe page the link was found on
linkUrlThe link target (absolute URL)
anchorTextThe clickable text of the link
linkTypeinternal or external
statusCodeHTTP status code (e.g. 200, 404, 500)
oktrue if the link works, false if it's broken
errorNetwork error (timeout, unreachable, โ€ฆ) when there is no status code
checkedAtISO 8601 timestamp of the check

Example output

{
"sourceUrl": "https://example.com/blog/post-1",
"linkUrl": "https://example.com/old-page",
"anchorText": "see our old guide",
"linkType": "internal",
"statusCode": 404,
"ok": false,
"error": null,
"checkedAt": "2026-05-25T14:08:46.306Z"
}

Turn on Only output broken links for a clean list of just the dead links, or leave it off to export a complete link audit of the entire website.

  1. Click Try for free / Start.
  2. Paste one or more website URLs into Start URLs.
  3. (Optional) Set Max pages to crawl โ€” use 0 to crawl the whole site.
  4. (Optional) Toggle Check external links and Only output broken links.
  5. Click Save & Start.
  6. Download your results as JSON, CSV, Excel, HTML or via API โ€” or schedule the run to repeat.

โš™๏ธ Input

OptionDescriptionDefault
startUrlsWebsites to crawlโ€“ (required)
maxPagesToCrawlMax pages to crawl per run (0 = crawl the whole site)1000
checkExternalLinksAlso check links pointing to other domainstrue
onlyBrokenSave only broken links (4xx / 5xx / unreachable)false
maxConcurrencyNumber of parallel requests10

Example input

{
"startUrls": [{ "url": "https://example.com" }],
"maxPagesToCrawl": 2000,
"checkExternalLinks": true,
"onlyBroken": false,
"maxConcurrency": 10
}

๐Ÿ” How it works

The crawler starts from your Start URLs and follows internal <a> links within the same domain (skipping asset files like images, PDFs and scripts). On every page it extracts all links and verifies each one with a lightweight HEAD request, automatically falling back to GET for servers that don't support HEAD. Every unique URL is checked only once and the result is reused across pages, so even very large sites are audited efficiently.

๐Ÿงฐ Tips & best practices

  • Set maxPagesToCrawl to 0 for full-site coverage; keep it lower for quick spot-checks.
  • Lower maxConcurrency if the target site starts rate-limiting or returning 429.
  • Use Only output broken links for fix-lists, and the full export for complete link inventories.
  • Schedule the actor (daily/weekly) so broken links never sit unnoticed.

โ“ FAQ

What counts as a broken link? Any link returning a 4xx or 5xx status code, or one that can't be reached (timeout, DNS failure, connection refused). These have ok: false.

Does it crawl other websites too? It checks external links (their status) but only crawls deeper within your own domain, so you stay in control of scope.

Will it find every page on my site? It follows internal links up to maxPagesToCrawl. Increase that limit (or set it to 0) for complete coverage.

Does it use a real browser? No โ€” it's pure HTTP, which makes it fast and inexpensive. This works for the vast majority of websites.

Can I check several sites at once? Yes โ€” add multiple Start URLs.

What output formats are supported? JSON, CSV, Excel, HTML table and a full REST API, like every Apify actor.

  • Bulk URL Status Checker โ€” check a flat list of URLs for status codes & redirects.
  • Website Link Graph & Outbound Links Crawler โ€” map internal & outbound links site-wide.
  • Website SEO Audit Crawler โ€” full on-page SEO audit for every page.
  • Sitemap to URL Crawler โ€” extract all URLs from any sitemap.xml.

Changelog

  • 2026-05-25 โ€” Maintenance & reliability pass: pulled the latest source and rebuilt the Actor on the current base image; build verified.

Last reviewed: 2026-05-25.