Broken Link Checker โ Full Website Crawler
Pricing
from $3.50 / 1,000 results
Broken Link Checker โ Full Website Crawler
Crawl an entire website and check every internal & external link for broken (404/5xx) responses. Get the source page, anchor text and status code for each link. Schedule it to monitor any site for dead links and SEO errors.
Pricing
from $3.50 / 1,000 results
Rating
0.0
(0)
Developer
Logiover
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
18 hours ago
Last modified
Categories
Share
Broken Link Checker โ Crawl a Whole Website for Dead Links ๐
Find every broken link on a website automatically. This broken link checker crawls an entire site, follows internal links, and checks every internal and external link for 404, 410 and 5xx errors. For each link you get the source page it was found on, the anchor text, the link type and the exact HTTP status code.
Give it one URL โ it discovers and audits thousands of pages and links for you. No login, no proxy setup, no headless browser. Fast, cheap and schedulable.
Looking for a dead link checker, 404 finder, broken link finder or a website link validator? This actor does all of it, site-wide.
โจ Key features
- ๐ท๏ธ Full-site crawl โ start from one URL and automatically follow internal links across the whole domain.
- ๐ Checks every link โ internal and external (outbound) links, not just a flat list.
- ๐ฆ Exact HTTP status โ
200,301,404,410,500, plus network errors (timeouts, unreachable hosts, DNS failures). - ๐ Source page + anchor text โ know exactly where each broken link lives so you can fix it fast.
- โก Fast & cheap โ pure HTTP (HEAD with GET fallback), no browser. Each unique URL is checked only once and cached.
- ๐๏ธ Schedulable monitoring โ run it daily/weekly to catch dead links the moment they appear.
- ๐๏ธ Flexible output โ export only broken links or a full audit of every link on the site.
๐ก Use cases
- SEO audits โ broken internal links waste crawl budget and hurt rankings. Find and fix them across the whole site.
- Website migrations & redesigns โ catch every
404before and after a launch or domain move. - Content & docs maintenance โ keep large blogs, documentation and knowledge bases link-clean.
- Affiliate & outbound link monitoring โ make sure the external links you earn from still resolve.
- Ongoing link monitoring โ schedule the actor and get alerted when something breaks.
๐ฆ What you get
One row per link checked:
| Field | Description |
|---|---|
sourceUrl | The page the link was found on |
linkUrl | The link target (absolute URL) |
anchorText | The clickable text of the link |
linkType | internal or external |
statusCode | HTTP status code (e.g. 200, 404, 500) |
ok | true if the link works, false if it's broken |
error | Network error (timeout, unreachable, โฆ) when there is no status code |
checkedAt | ISO 8601 timestamp of the check |
Example output
{"sourceUrl": "https://example.com/blog/post-1","linkUrl": "https://example.com/old-page","anchorText": "see our old guide","linkType": "internal","statusCode": 404,"ok": false,"error": null,"checkedAt": "2026-05-25T14:08:46.306Z"}
Turn on Only output broken links for a clean list of just the dead links, or leave it off to export a complete link audit of the entire website.
๐ How to use the Broken Link Checker
- Click Try for free / Start.
- Paste one or more website URLs into Start URLs.
- (Optional) Set Max pages to crawl โ use
0to crawl the whole site. - (Optional) Toggle Check external links and Only output broken links.
- Click Save & Start.
- Download your results as JSON, CSV, Excel, HTML or via API โ or schedule the run to repeat.
โ๏ธ Input
| Option | Description | Default |
|---|---|---|
startUrls | Websites to crawl | โ (required) |
maxPagesToCrawl | Max pages to crawl per run (0 = crawl the whole site) | 1000 |
checkExternalLinks | Also check links pointing to other domains | true |
onlyBroken | Save only broken links (4xx / 5xx / unreachable) | false |
maxConcurrency | Number of parallel requests | 10 |
Example input
{"startUrls": [{ "url": "https://example.com" }],"maxPagesToCrawl": 2000,"checkExternalLinks": true,"onlyBroken": false,"maxConcurrency": 10}
๐ How it works
The crawler starts from your Start URLs and follows internal <a> links within the same domain (skipping asset files like images, PDFs and scripts). On every page it extracts all links and verifies each one with a lightweight HEAD request, automatically falling back to GET for servers that don't support HEAD. Every unique URL is checked only once and the result is reused across pages, so even very large sites are audited efficiently.
๐งฐ Tips & best practices
- Set
maxPagesToCrawlto0for full-site coverage; keep it lower for quick spot-checks. - Lower
maxConcurrencyif the target site starts rate-limiting or returning429. - Use Only output broken links for fix-lists, and the full export for complete link inventories.
- Schedule the actor (daily/weekly) so broken links never sit unnoticed.
โ FAQ
What counts as a broken link? Any link returning a 4xx or 5xx status code, or one that can't be reached (timeout, DNS failure, connection refused). These have ok: false.
Does it crawl other websites too? It checks external links (their status) but only crawls deeper within your own domain, so you stay in control of scope.
Will it find every page on my site? It follows internal links up to maxPagesToCrawl. Increase that limit (or set it to 0) for complete coverage.
Does it use a real browser? No โ it's pure HTTP, which makes it fast and inexpensive. This works for the vast majority of websites.
Can I check several sites at once? Yes โ add multiple Start URLs.
What output formats are supported? JSON, CSV, Excel, HTML table and a full REST API, like every Apify actor.
๐ Related actors by the same author
- Bulk URL Status Checker โ check a flat list of URLs for status codes & redirects.
- Website Link Graph & Outbound Links Crawler โ map internal & outbound links site-wide.
- Website SEO Audit Crawler โ full on-page SEO audit for every page.
- Sitemap to URL Crawler โ extract all URLs from any sitemap.xml.
Changelog
- 2026-05-25 โ Maintenance & reliability pass: pulled the latest source and rebuilt the Actor on the current base image; build verified.
Last reviewed: 2026-05-25.