Calendar duplicate detector

Determines whether hash values are generated for n on-email items in an Outlook PST file during the ED Loader import.

Enable hashing of non-email Outlook items.

Also, before running the internal deduplication, it is recommended that the Deduplication Status Reset command is executed to clear the values assigned by the Inter-Case Deduplication utility to prevent the mixture of internal and external duplicates. At this point, the current case should be removed from the external database.

Proceeding with the ED Loader deduplication after the case has been deduplicated with the Inter-Case Deduplication utility will result in the external deduplication database being placed in Rebuild/Flush mode.

Doing so will present a mixture of internal and external duplicates and could cause problems when purging, filtering, or reviewing duplicate records.

Use of the ED Loader deduplication on imported records after the case has already been deduplicated against other cases using the Inter-Case Deduplication utility is not recommended.

If the current case has already been deduplicated via the Inter-Case Deduplication utility, a warning will appear (see below) when starting the ED Loader import if deduplication is enabled.

This setting was not available in versions prior to 5.5.07. The desired state of this setting should be determined prior to the first import into new cases and should not be changed.

While enabling the Include attachment hashes in e-mail metadata hash setting is recommended, it is not advisable to change this setting during the course of a case as it will alter the e-mail hashing schema, as noted in the interface.

Note the following warnings prior to running a deduplication session: When disabled, the Attach field is incorporated in with the metadata hash which only contains the file names of attached files. When enabled, the ED Loader will include the hashes of attached files in the parent e-mail's metadata hash.

Include attachment hashes in e-mail metadata hash.

Does not create a record, no text is extracted, and the native file is not copied to the case folder.

Creates a record in the database but does not copy the native file.

Creates a record for the duplicate in the database and copies the native file into the case folder. This setting determines the action to take once a duplicate is located. If record is considered a duplicate then (Action).Deduplicates documents against records with identical custodian values. Deduplicates documents against the entire incoming collection and against existing records in the LAW case. During the import process, deduplication can be performed at one of two levels: This option identifies the scope for deduplication. The hash values are obtained through metadata fields (e-mail) or by hashing the entire file (e-docs). A hash value can be thought of as the DNA of a file. The working digest is the method of hashing that will be conducted to determine duplicates. Enables duplicate checking for the current session. Click the Settings tab and then click Deduplication. On the File menu click Import and then click Electronic Discovery. The scope of the project will determine whether or not deduplication will be performed and which methods will be used.ġ. You can set the encryption key in the deduplication settings. In the case of electronic documents, the file is hashed. An exact copy of a file will yield the same hash value. In essence, the file is subjected to an encryption process that yields a unique value. A hash is a numerical representation of a file whose value is based on the file contents or other attributes. For example, in electronic discovery sets containing e-mail archives for an organization, it is not uncommon for multiple e-mail accounts to contain the exact same widely distributed e-mail or file attachment.ĬloudNine™ LAW identifies duplicate files by comparing hashes of files. Deduplication is necessary in many situations involving electronic documents because multiple identical documents are a typical feature of large record sets. Deduplication is a necessary step in managing the volume of data that must be analyzed.Ī duplicate file is an exact copy of another file. Deduplication is the process of identifying duplicate files during the discovery process and removing them from further processing and analysis.