Website Changes Detector

Pricing

Pay per usage

Developed by

Tri⟁angle

Maintained by Apify

This Actor uses Apify’s Website Content Crawler to track website changes. It compares the new crawl results with the previous ones and highlights only the relevant updates, which saves time and resources.

Total users

3

Monthly users

3

Runs succeeded

83%

Last modified

3 days ago

Website Content Crawler Input

wccInput (object, optional)

Input properties passed to the Website Content Crawler. For the available properties, see the documentation: https://apify.com/apify/website-content-crawler#input

Memory

wccMemory (enum, optional)

Amount of memory (RAM) allocated for the Website Content Crawler run in megabytes.

Value options (each a string):

"32768", "16384", "8192", "4096", "2048", "1024", "512"

Default value of this property is "4096"

Timeout

wccTimeout (integer, optional)

Timeout for the Website Content Crawler Actor run, in seconds. A value of zero disables the timeout, so the Actor runs until it completes, which may be indefinitely. The default is 360,000 seconds (100 hours).

Default value of this property is 360000
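
As a rough illustration of the two run settings above, the following sketch checks a candidate input before it is submitted. The helper name `check_run_settings` is hypothetical and not part of the Actor; only the field names, allowed values, and defaults are taken from the descriptions above.

```python
# Allowed wccMemory values as listed above (strings, in megabytes).
ALLOWED_MEMORY = {"512", "1024", "2048", "4096", "8192", "16384", "32768"}


def check_run_settings(actor_input: dict) -> dict:
    """Return wccMemory and wccTimeout with the documented defaults applied.

    Hypothetical helper for illustration; the Actor's own validation may differ.
    """
    memory = actor_input.get("wccMemory", "4096")
    timeout = actor_input.get("wccTimeout", 360000)

    if memory not in ALLOWED_MEMORY:
        options = sorted(ALLOWED_MEMORY, key=int)
        raise ValueError(f"wccMemory must be one of {options}")
    if not isinstance(timeout, int) or timeout < 0:
        raise ValueError("wccTimeout must be a non-negative integer (0 means no timeout)")

    return {"wccMemory": memory, "wccTimeout": timeout}
```

Calling `check_run_settings({})` returns the documented defaults, while a value such as `"4000"` for `wccMemory` is rejected because it is not among the listed options.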

WCC Dataset Prefix

websiteContentDatasetNamePrefix (string, optional)

Prefix for naming the Website Content Crawler datasets (e.g., 'projectX-prod'). If empty, the default prefix ('wcc-changes-detector') is used.

Include URLs From Previous Website Content Crawler Run

includePreviousRunUrls (boolean, optional)

If checked, all URLs successfully crawled in the most recent (previous) dataset (with the same prefix) will be added to the start URLs for the new WCC crawl, alongside any URLs provided in the 'Website Content Crawler Input'. This effectively re-crawls pages found last time.

Default value of this property is false
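
The merging behaviour described above could look roughly like the sketch below. It assumes start URLs are given as objects with a `url` key and that duplicates are dropped; both are assumptions for illustration, and `merge_start_urls` is a hypothetical name, not a function of the Actor.

```python
def merge_start_urls(input_start_urls: list[dict], previous_urls: list[str]) -> list[dict]:
    """Combine configured start URLs with URLs crawled in the previous run.

    Assumption: entries look like {"url": "..."} and each URL is kept once,
    with the explicitly configured entries taking precedence.
    """
    merged = []
    seen = set()
    candidates = list(input_start_urls) + [{"url": url} for url in previous_urls]
    for entry in candidates:
        url = entry["url"]
        if url not in seen:
            seen.add(url)
            merged.append(entry)
    return merged
```

With this approach, a page that appears both in the configured start URLs and in the previous dataset is requested only once.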

Filter by Keywords

filterKeywords (array, optional)

Only include pages whose text content contains at least one of these keywords. Leave empty to disable keyword filtering.

Default value of this property is []
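
A minimal sketch of the "at least one keyword" rule follows. Whether the Actor matches case-sensitively or on whole words is not stated above, so the case-insensitive substring match here is an assumption, and `matches_keywords` is a hypothetical helper.

```python
def matches_keywords(text: str, keywords: list[str]) -> bool:
    """Return True if the page text should be kept.

    An empty keyword list disables filtering. Matching is a case-insensitive
    substring test, which is an assumption rather than documented behaviour.
    """
    if not keywords:
        return True
    lowered = text.lower()
    return any(keyword.lower() in lowered for keyword in keywords)
```

For example, a page containing "New Pricing page" passes a filter of `["pricing"]`, while a page containing only "About us" does not.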

Skip Crawling

skipCrawl (boolean, optional)

If checked, no new WCC crawl is run. Instead, the two most recent existing datasets with the specified prefix are compared.

Default value of this property is false

Max WCC Datasets to Keep

websiteContentDatasetMaxCount (integer, required)

Maximum number of historical WCC datasets with this prefix to retain. Older ones will be deleted. Minimum is 2. Maximum is 1000.

Default value of this property is 5
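
The retention rule above can be sketched as follows. The dataset records here are plain dictionaries with hypothetical `name` and `createdAt` keys (ISO 8601 timestamps); the Actor's real bookkeeping is not documented on this page, so treat this only as an illustration of "keep the newest N, delete the rest".

```python
def datasets_to_delete(datasets: list[dict], prefix: str, max_count: int = 5) -> list[str]:
    """Return names of datasets with the given prefix that exceed max_count.

    Assumption: each record has "name" and "createdAt" (ISO 8601 string),
    so sorting the timestamps as strings orders them chronologically.
    """
    if not 2 <= max_count <= 1000:
        raise ValueError("max_count must be between 2 and 1000")

    matching = [d for d in datasets if d["name"].startswith(prefix)]
    # Newest first, so everything past the first max_count entries is surplus.
    matching.sort(key=lambda d: d["createdAt"], reverse=True)
    return [d["name"] for d in matching[max_count:]]
```

The lower bound of 2 reflects that a comparison needs at least the current and the previous dataset.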

Change Item Types to Return (NEW, UPDATED, REMOVED, SAME)

returnChangeTypes (array, optional)

Select the types of changes to return. If none are selected, 'NEW' and 'UPDATED' will be returned.

Default value of this property is ["NEW","UPDATED"]
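
To make the four change types concrete, here is a small sketch of how two crawl results might be compared. It assumes pages are matched by URL and compared by their text, and that the type names mean what they suggest; the function name `classify_changes` and the output keys `url` and `changeType` are hypothetical and not taken from the Actor's output format.

```python
from typing import Optional


def classify_changes(
    previous: dict[str, str],
    current: dict[str, str],
    return_types: Optional[list[str]] = None,
) -> list[dict]:
    """Label each URL as NEW, UPDATED, REMOVED, or SAME and filter by type.

    `previous` and `current` map URL -> page text (an assumed representation).
    If no types are selected, NEW and UPDATED are returned, as described above.
    """
    wanted = set(return_types) if return_types else {"NEW", "UPDATED"}
    results = []
    for url in sorted(set(previous) | set(current)):
        if url not in previous:
            change = "NEW"
        elif url not in current:
            change = "REMOVED"
        elif previous[url] != current[url]:
            change = "UPDATED"
        else:
            change = "SAME"
        if change in wanted:
            results.append({"url": url, "changeType": change})
    return results
```

Passing an empty list or `None` for `return_types` falls back to `NEW` and `UPDATED`, mirroring the default described for `returnChangeTypes`.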