Pricing

Pay per usage

Website Checker Runner Playwright

Checks the provided website using Playwright. This is a low level runner, most likely you want to use the high level master actor - https://apify.com/lukaskrivka/website-checker

Pricing

Pay per usage

Rating

0.0

(0)

Developer

Lukáš Křivka

Actor stats

Bookmarked

171

Total users

Monthly active users

8 months ago

Last modified

Categories

Developer tools

Open source

URLs to check

urlsToCheck

Required

A static list of URLs to check for captchas. To be able to add new URLs on the fly, enable the Use request queue option.

For details, see Start URLs in README.

Type:array

Proxy Configuration

proxyConfiguration

Optional

Specifies proxy servers that will be used by the scraper in order to hide its origin.

For details, see Proxy configuration in README.

Type:object

Default:

{}

Enabled

saveSnapshot

Optional

Will save HTML for Cheerio and HTML + screenshot for Puppeteer/Playwright

Type:boolean

Link Selector

linkSelector

Optional

A CSS selector saying which links on the page (<a> elements with href attribute) shall be followed and added to the request queue. This setting only applies if Use request queue is enabled. To filter the links added to the queue, use the Pseudo-URLs setting.

If Link selector is empty, the page links are ignored.

For details, see Link selector in README.

Type:string

Min. length:1

Allow only links from the same domain

allowOnlyLinksFromSameDomain

Optional

Additional check to make sure that only link related to the same domain are enqueued.

Type:boolean

Pseudo-URLs

pseudoUrls

Optional

Specifies what kind of URLs found by Link selector should be added to the request queue. A pseudo-URL is a URL with regular expressions enclosed in [] brackets, e.g. http://www.example.com/[.*]. This setting only applies if the Use request queue option is enabled.

If Pseudo-URLs are omitted, the actor enqueues all links matched by the Link selector.

For details, see Pseudo-URLs in README.

Type:array

Default:

[]

Repeat checks on provided URLs

repeatChecksOnProvidedUrls

Optional

Will access each URL multiple times. Useful to test the same URL or bypass blocking of the first page.

Type:integer

Max number of pages checked per domain

maxNumberOfPagesCheckedPerDomain

Optional

The maximum number of pages that the checker will load. The checker will stop when this limit is reached. It's always a good idea to set this limit in order to prevent excess platform usage for misconfigured scrapers. Note that the actual number of pages loaded might be slightly higher than this value.

If set to 0, there is no limit.

Type:integer

Default:100

Maximum concurrent pages checked per domain

maxConcurrentPagesCheckedPerDomain

Optional

Specifies the maximum number of pages that can be processed by the checker in parallel for one domain. The checker automatically increases and decreases concurrency based on available system resources. This option enables you to set an upper limit, for example to reduce the load on a target website.

Type:integer

Minimum:1

Default:50

Maximum number of concurrent domains checked

maxConcurrentDomainsChecked

Optional

Specifies the maximum number of domains that should be checked at a time. This setting is relevant when passing in more than one URL to check.

Type:integer

Minimum:1

Maximum:10

Default:5

Retire browser instance after request count

retireBrowserInstanceAfterRequestCount

Optional

How often will the browser itself rotate. Pick a higher number for smaller consumption, pick a lower number to rotate (test) more proxies.

Type:integer

Minimum:1

Default:10

Chrome

playwright.chrome

Optional

Use Chrome when checking

Type:boolean

Default:true

Firefox

playwright.firefox

Optional

Use Firefox when checking

Type:boolean

Safari (Webkit)

playwright.webkit

Optional

Use Safari when checking

Type:boolean

Use Chrome instead of Chromium

playwright.useChrome

Optional

Only works for Playwright type! Be careful that Chrome is not guaranteed to work with Playwright.

Type:boolean

Headfull browser (XVFB)

playwright.headfull

Optional

If the browser should be headfull or not

Type:boolean

Wait for

playwright.waitFor

Optional

Only works for playwright type. Will wait on each page. You can provide number in ms or a selector.

Type:string

Playwright MCP Server

jiri.spilka/playwright-mcp-server

A Model Context Protocol (MCP) server that provides browser automation capabilities using Playwright

Jiří Spilka

212

Playwright Scraper

apify/playwright-scraper

Crawls websites with the headless Chromium, Chrome, or Firefox browser and Playwright library using a provided server-side Node.js code. Supports both recursive crawling and a list of URLs. Supports login to a website.

Apify

6.7K

3.9

Website Checker Runner Cheerio

lukaskrivka/website-checker-cheerio

Checks the provided website using cheerio. This is a low level runner, most likely you want to use the high level master actor - https://apify.com/lukaskrivka/website-checker

Lukáš Křivka

322

Website Checker Runner Puppeteer

lukaskrivka/website-checker-puppeteer

Checks the provided website using Puppeteer. This is a low level runner, most likely you want to use the high level master actor - https://apify.com/lukaskrivka/website-checker

Lukáš Křivka

245

Advanced Website Domain Name Validator

saswave/advanced-website-domain-name-validator

Advanced domain scraper. Determine if a domain is still valid or has moved. We test multiple scenario before flagging the domain as invalid. Extract technologies stack, social account, emails

SASWAVE

Brave Search MCP Server

agentify/brave-search-mcp-server

The Brave Search MCP Server powers query processing and data handling for Brave Search, enabling fast and private search results.

agentify

267

Voip Call API

vivid_astronaut/voip-call

Fabio Suizu

Moz DA PA Spam Checker

callapi/moz-da-pa-spam-checker

Moz DA PA Spam Checker with Ref Domains, Backlinks count & Ranking Keywords count.

CallAPI

106

5.0

Event Lead Extractor — Speakers & Attendees

ryanclinton/event-lead-extractor

Turn any conference, trade show, or event page into a qualified lead list — complete with emails, phone numbers, social profiles, and lead scores. Paste in event URLs from Eventbrite, Lu.ma, Sched, Bizzabo, or any custom conference website and the actor does the rest.