Actor picture

Website-checker-starter

vaclavrut/website-checker-starter

Works with lukaskrivka/website-checker. The idea is that this actor manages more URLs on the input, will start website-checker with 10 runs at a time and store all data to one datasets.

No credit card required

Author's avatarVaclav Rut
  • Modified
  • Users4
  • Runs22
Actor picture

Website-checker-starter

Start URLs

startUrls

Required

array

Enter array of strings.

Type of Crawler

type

Optional

string

Which type of SDK Crawler will be used

Options:

"cheerio", "puppeteer"

Proxy configuration

proxyConfiguration

Optional

object

Specifies proxy servers that will be used by the scraper in order to hide its origin. For details, see Proxy configuration in README.

Max pages per run

maxPagesPerCrawl

Optional

integer

The maximum number of pages that the scraper will load. The scraper will stop when this limit is reached. It's always a good idea to set this limit in order to prevent excess platform usage for misconfigured scrapers. Note that the actual number of pages loaded might be slightly higher than this value. If set to 0, there is no limit.

Max concurrency

maxConcurrency

Optional

integer

Specified the maximum number of pages that can be processed by the scraper in parallel. The scraper automatically increases and decreases concurrency based on available system resources. This option enables you to set an upper limit, for example to reduce the load on a target website.

Save Snapshots

saveSnapshots

Optional

boolean

Will save HTML for Cheerio and HTML + screenshot for Puppeteer