Syntax: checkboxes to choose fields
These are the fields which will be checked for duplicate prevention (if Prevent Duplicates is enabled). The concatenation of these fields is hashed for each incoming document, and if the hash is the same as an existing document, the incoming document will be discarded as a duplicate.
By default, all fields are included, so any differences in the content of two documents will cause them to not be seen as duplicates.
If this profile has parametric fields, a Parametric Fields
checkbox will
also be offered, so those fields can be included in what is considered a
duplicate.
Note: Changing Duplicate Check Fields after a walk has completed (i.e. before a later Refresh type walk) may cause new documents to not be removed as duplicates as expected, since the pre-existing documents' hashes are now for a different set of fields. This will not cause errors or corruption; it just might leave some newly-duplicate documents in the database.