CUSTOM DEDUPLICATION
Best for large-scale document reviews and/or messy data imports.
Identifying Duplicates:
Nextpoint would run an internal report identifying all duplicate MD5Hash and MessageID values across the entire database. The MD5Hash is a unique thumbprint for all electronic documents, whereas MessageID is a unique thumbprint for all emails. Duplicates are brought into the database when the exact same document is attached to varying emails or the documents have slight variances in their metadata.
Isolating Duplicates:
Once we identify the duplicative documents, we will isolate the MD5Hash dupes into a separate folder labeled, “Duplicates,” with one copy of said duplicates isolated into another “Masters” folder. These “master” documents will be the only copy that will need to be reviewed for relevance and privilege. Once these documents are complete, we will automatically apply the coding to their respective duplicates, so you only have to code the document once. This will alleviate your review time by a significant amount.