Checks the provided website using Playwright. This is a low level runner, most likely you want to use the high level master actor - https://apify.com/lukaskrivka/website-checker
Will save HTML for Cheerio and HTML + screenshot for Puppeteer/Playwright
Link Selector
linkSelectorstringOptional
A CSS selector saying which links on the page (<a> elements with href attribute) shall be followed and added to the request queue. This setting only applies if Use request queue is enabled. To filter the links added to the queue, use the Pseudo-URLs setting.
If Link selector is empty, the page links are ignored.
Additional check to make sure that only link related to the same domain are enqueued.
Pseudo-URLs
pseudoUrlsarrayOptional
Specifies what kind of URLs found by Link selector should be added to the request queue. A pseudo-URL is a URL with regular expressions enclosed in [] brackets, e.g. http://www.example.com/[.*]. This setting only applies if the Use request queue option is enabled.
If Pseudo-URLs are omitted, the actor enqueues all links matched by the Link selector.
Will access each URL multiple times. Useful to test the same URL or bypass blocking of the first page.
Max number of pages checked per domain
maxNumberOfPagesCheckedPerDomainintegerOptional
The maximum number of pages that the checker will load. The checker will stop when this limit is reached. It's always a good idea to set this limit in order to prevent excess platform usage for misconfigured scrapers. Note that the actual number of pages loaded might be slightly higher than this value.
If set to 0, there is no limit.
Default value of this property is 100
Maximum concurrent pages checked per domain
maxConcurrentPagesCheckedPerDomainintegerOptional
Specifies the maximum number of pages that can be processed by the checker in parallel for one domain. The checker automatically increases and decreases concurrency based on available system resources. This option enables you to set an upper limit, for example to reduce the load on a target website.
Default value of this property is 50
Maximum number of concurrent domains checked
maxConcurrentDomainsCheckedintegerOptional
Specifies the maximum number of domains that should be checked at a time. This setting is relevant when passing in more than one URL to check.