Website Monitoring: Broken Links & Resources Checker avatar

Website Monitoring: Broken Links & Resources Checker

Pricing

from $5.00 / 1,000 deep link checks

Go to Apify Store
Website Monitoring: Broken Links & Resources Checker

Website Monitoring: Broken Links & Resources Checker

Monitor your website by detecting broken resources (images, scripts, styles) and broken deep links (404s, timeouts).

Pricing

from $5.00 / 1,000 deep link checks

Rating

0.0

(0)

Developer

Budi Arsana

Budi Arsana

Maintained by Community

Actor stats

1

Bookmarked

2

Total users

1

Monthly active users

13 hours ago

Last modified

Share

Monitor your website health by automatically detecting broken resources (images, scripts, styles) and broken deep links (404s, timeouts, etc.). Ensure your users have a seamless experience by identifying and fixing these issues before they notice them.

What does Website Monitoring do?

This Actor crawls a list of provided URLs and checks every resource loaded by the page and every hyperlink found on the page. It identifies:

  • Broken Resources: Images, JavaScript files, CSS stylesheets, and other assets that fail to load (4xx/5xx status codes).
  • Broken Deep Links: Hyperlinks on the page that lead to non-existent pages or result in server errors.

Why use Website Monitoring?

  • SEO & UX: Broken links and missing images negatively impact search engine rankings and user trust.
  • Automated QA: Run this Actor periodically to ensure new deployments haven't broken existing pages.
  • Save Time: Manually checking every link and asset on a large site is impossible; this Actor does it in minutes.

How to use Website Monitoring

  1. Enter URLs: Provide a list of URLs you want to monitor in the input section.
  2. Set Limits: Optionally set the maximum number of requests to limit the crawl depth.
  3. Run the Actor: Click "Start" and wait for the results.
  4. Review Results: Check the "Dataset" tab to see a detailed table of every broken item found, including the source page, the broken URL, and the HTTP status code.

Input

  • urls: (Required) List of URLs to start monitoring.
  • maxRequestsPerCrawl: (Optional) Maximum number of pages the crawler will visit. Default is 100.

Example input:

{
"urls": [
"https://apify.com",
"https://crawlee.dev"
],
"maxRequestsPerCrawl": 50
}

Output

The Actor stores its results in a dataset. Each item represents a broken resource or link.

Example output:

{
"sourceUrl": "https://apify.com",
"issueType": "broken_resource",
"targetUrl": "https://apify.com/non-existent-image.png",
"status": 404
}

Data table

FieldDescription
sourceUrlThe URL of the page where the issue was found.
issueTypebroken_resource or broken_deeplink.
targetUrlThe URL of the broken resource or link.
statusThe HTTP status code (e.g., 404). -1 indicates a network failure.
messageDetailed error message for network failures (e.g., "Connection reset").

Pricing / Cost estimation

How much does it cost to monitor your site? This Actor uses the PlaywrightCrawler which requires a browser. It typically consumes about 0.05 - 0.1 Compute Units per 100 pages checked, depending on the number of resources and links on those pages.

FAQ, disclaimers, and support

  • Disclaimer: This tool is intended for monitoring your own websites. Please respect the robots.txt and Terms of Service of any site you do not own.
  • Support: If you encounter any issues or have feature requests, please open an issue in the "Issues" tab.