What Is Culling?
When someone uses the term "culling" I often have the image of a miner with a pick. That's exactly what this is. You are looking for the important stuff. To do that, you must chip away the stuff you don't need to get to it. Chances are that over 75% of the data you collect will never be produced. Hundreds of hours are spent every year reviewing data.
Culling helps you in the front end to narrow down to data that you truly need.
There are generally three types of culling:
- DeNISTing - Yes, this is a big time techie term but this is the method of removing all of the junk data such as systems files or other file formats which aren't generated by the user.
- Deduplication - Here's another tech term but this method identifies and separates out duplicate documents and emails. This is either done globally (the entire data collection) or by custodians. You don't want to review 10 copies of the same exact document in your reviews. Each document has its own DNA and there are ways to carve out the duplicates based upon the DNA.
- Search terms - Once your are familiar with the case, you can create search terms to include (or exclude) data.