Duplications Checker
No credit card required
Duplications Checker
No credit card required
Check your dataset for duplications. Accept only the highest quality data!
Dataset ID
datasetId
stringOptional
Id of dataset where the data are located. If you need to use other input types like Key value store or raw JSON, look at Other data sources
Check only clean dataset items
checkOnlyCleanItems
booleanOptional
Only clean dataset items will be loaded and use for duplications checking if datasetId
option is provided.
Default value of this property is false
Fields
fields
arrayRequired
List of fields in each item that will be checked for duplicates. Each given field must not be nested and it should contain only simple value (string or number). You can prepare your data with preCheckFunction.
Default value of this property is []
Pre-check function
preCheckFunction
stringOptional
You can specify which fields should display in the debug OUTPUT to identify bad items. By default it shows all fields which may make it unnecessary big.
Minimum duplications
minDuplications
integerOptional
Minimum occurences to be included in the report. Defaults to 2
Default value of this property is 2
Show indexes
showIndexes
booleanOptional
Indexes of the duplicate items will be shown in the OUTPUT report. Set to false if you don't need them.
Default value of this property is true
Show items
showItems
booleanOptional
Duplicate items will be pushed to a dataset. Set to false if you don't need them.
Default value of this property is true
Show missing fields
showMissing
booleanOptional
Items where the values for the field
is missing or is null
or ''
will be included in the report.
Default value of this property is true
Offset
offset
integerOptional
From which item the checking will start. Use with limit to check specific items.
Batch Size
batchSize
integerOptional
You can change number of loaded and processed items in each batch. This is only needed if you have really huge items.
Default value of this property is 1000
- 3 monthly users
- 11 stars
- 100.0% runs succeeded
- Created in Aug 2019
- Modified over 3 years ago