Today Index Engines released its Catalyst Unstructured Data Profiling Engine. You can find the press release here.
Basically the Catalyst Data Profiling Engine processes all forms of unstructured storage, email and document types, creating a searchable index of what exists, where it is located, who owns it, when it was last accessed and what key terms are in it. Through this process, unknown – dark – or lost data is found and decisions can be made on its disposition.
But what this really does is provide a knowledge of what data exists and gives different departments a chance to have a balanced discussion about their data. Before data profiling it was nearly impossible to understand what exists, where and for how long.
Data profiling allows conversation to take place between IT and legal. These conversations allow disposition to be decided. Aged data that has no business value and not been accessed in more than a decade is easily classified and purged. Sensitive email such as PSTs that are hidden on the network can be easily uncovered and monitored in order to determine the best course of action. PII can be searched for and encrypted before a breach happens. Systems can be audited for compliance.
Legal can now view and profile data and collaborate with IT to determine the next step. Even when the next eDiscovery event occurs, legal can just ask IT where is “John Doe’s” email and IT can provide an quick answer and preserve the data on legal hold.
As legal and IT begin to collaborate and discuss polices and information governance strategies they will find that much of the data that they are spending significant money to store and maintain is of no value.