

Duplications Checker
Check your dataset for duplications. Accept only the highest quality data!
Pricing
Pay per usage
Last modified
4 years ago
datasetId
string, optional
ID of the dataset where the data is located. To use other input types, such as a key-value store or raw JSON, see Other data sources.
checkOnlyCleanItems
boolean, optional
If the datasetId option is provided, only clean dataset items will be loaded and used for duplication checking.
Default value of this property is false
fields
array, required
List of fields in each item that will be checked for duplicates. Each field must not be nested and should contain only a simple value (string or number). You can prepare your data with preCheckFunction.
Default value of this property is []
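As a rough illustration of what checking the listed fields for duplicates involves, the sketch below groups items by the value of each field and records the indexes where each value occurs. It is not the actor's own implementation; the function name group_by_field and the sample items are hypothetical.

```python
from collections import defaultdict

def group_by_field(items, fields):
    """Map each field name to {value: [indexes of items holding that value]}."""
    groups = {field: defaultdict(list) for field in fields}
    for index, item in enumerate(items):
        for field in fields:
            value = item.get(field)
            # Only simple values (strings or numbers) are compared;
            # booleans are skipped even though bool subclasses int in Python.
            if isinstance(value, (str, int, float)) and not isinstance(value, bool):
                groups[field][value].append(index)
    return {field: dict(values) for field, values in groups.items()}

# Hypothetical sample data: two items share the same "url".
items = [
    {"url": "https://example.com/a", "sku": 1},
    {"url": "https://example.com/b", "sku": 2},
    {"url": "https://example.com/a", "sku": 3},
]
groups = group_by_field(items, ["url", "sku"])
print(groups["url"])  # {'https://example.com/a': [0, 2], 'https://example.com/b': [1]}
```

Any value whose index list has more than one entry is a duplicate candidate for that field.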
preCheckFunction
string, optional
You can specify which fields should be displayed in the debug OUTPUT to identify bad items. By default, all fields are shown, which may make the output unnecessarily large.
minDuplications
integer, optional
Minimum number of occurrences for a value to be included in the report.
Default value of this property is 2
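The effect of a minimum-occurrence threshold can be sketched as follows. This is a minimal illustration under assumed semantics (a value is reported when it occurs at least minDuplications times), not the actor's code; report_duplicates is a hypothetical helper.

```python
from collections import Counter

def report_duplicates(values, min_duplications=2):
    """Return {value: count} for values seen at least min_duplications times."""
    counts = Counter(values)
    return {value: count for value, count in counts.items()
            if count >= min_duplications}

# Hypothetical field values collected from a dataset.
values = ["a", "b", "a", "c", "a", "b"]
print(report_duplicates(values))     # {'a': 3, 'b': 2}
print(report_duplicates(values, 3))  # {'a': 3}
```

Raising the threshold narrows the report to the most frequently repeated values.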
showIndexes
boolean, optional
Indexes of the duplicate items will be shown in the OUTPUT report. Set to false if you don't need them.
Default value of this property is true
showItems
boolean, optional
Duplicate items will be pushed to a dataset. Set to false if you don't need them.
Default value of this property is true
showMissing
boolean, optional
Items where the value of a checked field is missing, null, or '' will be included in the report.
Default value of this property is true
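The three "missing" cases named above (absent key, null, empty string) can be checked as in this sketch. The helper is_missing and the sample items are hypothetical, and Python's None stands in for JSON null.

```python
def is_missing(item, field):
    """True if the field is absent, None (JSON null), or an empty string."""
    return item.get(field) in (None, "")

# Hypothetical items: note that 0 is a real value, not a missing one.
items = [
    {"id": 1, "name": "x"},
    {"id": 2},
    {"id": 3, "name": None},
    {"id": 4, "name": ""},
    {"id": 5, "name": 0},
]
missing_indexes = [i for i, item in enumerate(items) if is_missing(item, "name")]
print(missing_indexes)  # [1, 2, 3]
```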
offset
integer, optional
Index of the item at which checking starts. Use with limit to check a specific range of items.
batchSize
integer, optional
Number of items loaded and processed in each batch. You only need to change this if your items are very large.
Default value of this property is 1000
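How offset, limit, and batchSize might interact can be sketched with an in-memory list. This assumes limit means the maximum number of items to check, which the text above does not spell out; iter_batches is a hypothetical helper, and the real actor presumably pages through a dataset rather than a list.

```python
def iter_batches(items, offset=0, limit=None, batch_size=1000):
    """Yield (start_index, batch) pairs covering items[offset:offset + limit]."""
    # Assumption: limit is a maximum item count; None means "until the end".
    end = len(items) if limit is None else min(len(items), offset + limit)
    for start in range(offset, end, batch_size):
        yield start, items[start:min(start + batch_size, end)]

# Hypothetical data: check 5 items starting at index 2, two per batch.
items = list(range(10))
batches = list(iter_batches(items, offset=2, limit=5, batch_size=2))
print(batches)  # [(2, [2, 3]), (4, [4, 5]), (6, [6])]
```

Smaller batches keep less data in memory at once, which is the reason to lower the batch size when individual items are very large.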