Determines whether hash values are generated for n on-email items in an Outlook PST file during the ED Loader import.
Creates a record in the database but does not copy the native file.
Creates a record for the duplicate in the database and copies the native file into the case folder. This setting determines the action to take once a duplicate is located. If record is considered a duplicate then (Action).Deduplicates documents against records with identical custodian values. Deduplicates documents against the entire incoming collection and against existing records in the LAW case. During the import process, deduplication can be performed at one of two levels: This option identifies the scope for deduplication. The hash values are obtained through metadata fields (e-mail) or by hashing the entire file (e-docs). A hash value can be thought of as the DNA of a file. The working digest is the method of hashing that will be conducted to determine duplicates. Enables duplicate checking for the current session. Click the Settings tab and then click Deduplication. On the File menu click Import and then click Electronic Discovery. The scope of the project will determine whether or not deduplication will be performed and which methods will be used.ġ. You can set the encryption key in the deduplication settings. In the case of electronic documents, the file is hashed. An exact copy of a file will yield the same hash value. In essence, the file is subjected to an encryption process that yields a unique value. A hash is a numerical representation of a file whose value is based on the file contents or other attributes. For example, in electronic discovery sets containing e-mail archives for an organization, it is not uncommon for multiple e-mail accounts to contain the exact same widely distributed e-mail or file attachment.ĬloudNine™ LAW identifies duplicate files by comparing hashes of files. Deduplication is necessary in many situations involving electronic documents because multiple identical documents are a typical feature of large record sets. Deduplication is a necessary step in managing the volume of data that must be analyzed.Ī duplicate file is an exact copy of another file. Deduplication is the process of identifying duplicate files during the discovery process and removing them from further processing and analysis.