Essential Guide

Powering a more efficient enterprise search engine

A comprehensive collection of articles, videos and more, hand-picked by our editors

Content analytics vs. predictive coding and beyond

Records management software needs to mature. But is the way forward through predictive coding methods, content analytics, search-based applications or some combination?

This article can also be found in the Premium Editorial Download: Business Information: New technologies put fresh spin on records management.

Some say records management tools are outmoded and unfit for today's information explosion, but technologies are emerging that may give them a much-needed boost. Predictive coding and content analytics address the ever-increasing volume of data that all organizations wrestle with.

"We do not go to work looking forward to tagging objects for purposes of compliance or records management," said Jason R. Baron, an attorney in the information governance and e-discovery practice at Drinker, Biddle and Reath LLP in Washington, D.C., and the former director of litigation at the U.S. National Archives and Records Administration.

Predictive coding software tags and categorizes documents, which reduces the time and cost of manually sorting through millions of files. The software creates a sample cluster of tagged documents, and employees review those tags for accuracy. Subsequent rounds of coding may be required to "teach" the software further.
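To make the tag, review and re-teach cycle concrete, here is a minimal sketch in Python. It is not how any commercial predictive coding product works: the `TinyTagger` class, the `review_round` helper, the labels and the sample texts are all hypothetical, and a small naive Bayes word-count model stands in for whatever classifier a real tool uses.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase a document and split it on whitespace."""
    return text.lower().split()


class TinyTagger:
    """Illustrative naive Bayes tagger (an assumption, not a vendor's method)."""

    def __init__(self):
        self.word_counts = {}        # label -> Counter of words seen under that label
        self.doc_counts = Counter()  # label -> number of documents taught

    def teach(self, text, label):
        """Record one human-reviewed document and its tag."""
        self.doc_counts[label] += 1
        self.word_counts.setdefault(label, Counter()).update(tokenize(text))

    def predict(self, text):
        """Return the most likely tag, or None if nothing has been taught yet."""
        vocab = set().union(*self.word_counts.values())
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, counts in self.word_counts.items():
            # Log prior plus add-one smoothed log likelihood of each word.
            score = math.log(self.doc_counts[label] / total_docs)
            denom = sum(counts.values()) + len(vocab)
            for word in tokenize(text):
                score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def review_round(tagger, sample, reviewer):
    """Have a reviewer check the tagger's guesses on a sample.

    `reviewer` is a stand-in for the human: a function from text to the
    correct tag. Every reviewed document is fed back to the tagger.
    Returns how many guesses the reviewer had to correct.
    """
    corrections = 0
    for text in sample:
        if tagger.predict(text) != reviewer(text):
            corrections += 1
        tagger.teach(text, reviewer(text))
    return corrections


if __name__ == "__main__":
    tagger = TinyTagger()
    tagger.teach("contract breach damages lawsuit", "relevant")
    tagger.teach("lunch menu cafeteria friday", "not_relevant")
    fixed = review_round(tagger, ["friday deposition schedule"],
                         lambda text: "relevant")
    print("corrections this round:", fixed)
    print("new guess:", tagger.predict("deposition schedule"))
```

Each call to `review_round` plays the part of one round of coding: the reviewer's corrections become new training examples, so later guesses on similar documents should improve.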

But legal challengers worry that the process -- a brute-force chunking method of sorts -- separates wheat from chaff at the risk of introducing errors: Relevant documents may be missed, or irrelevant ones included. Even so, predictive coding is far superior to the alternative, manual review, which is unmanageable given the growing volume of information. Having legal professionals devote hours to tagging documents is a "terrible state of affairs," Baron said.
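The two kinds of error described here, relevant documents missed and irrelevant ones included, are commonly measured as recall and precision. The short Python sketch below shows the arithmetic; the function name and the document IDs are made up for illustration and do not come from any real review.

```python
def precision_recall(tagged_relevant, truly_relevant):
    """Compare the software's 'relevant' set with the reviewers' answer key.

    Both arguments are sets of document IDs (hypothetical values here).
    Precision falls when irrelevant documents are included;
    recall falls when relevant documents are missed.
    """
    hits = len(tagged_relevant & truly_relevant)
    precision = hits / len(tagged_relevant) if tagged_relevant else 0.0
    recall = hits / len(truly_relevant) if truly_relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    software_says = {1, 2, 3, 4}      # documents the software tagged relevant
    reviewers_say = {2, 3, 4, 5, 6}   # documents humans judged relevant
    precision, recall = precision_recall(software_says, reviewers_say)
    print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this made-up case the software includes one irrelevant document (ID 1) and misses two relevant ones (IDs 5 and 6), so precision is 3 of 4 and recall is 3 of 5.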

Still others argue that the real goal is more sophisticated technologies, such as content analytics.

Sandra Serkes, founder and president of Valora Technologies Inc., a content analytics services provider in Bedford, Mass., said content analytics can address real trends and conduct business intelligence on all enterprise information, regardless of format.

"We look at everything there is to know about a file," Serkes said about Valora's PowerHouse content analytics software. "What language is it in, where did it come from, how does it match typical versions of the same genre, are there unique attributes?"

The future of records management may require a combination of content analytics, predictive coding and intelligent search. But adoption may take time: Single-purpose uses like e-discovery and compliance already address enterprise pain points, while the return on investment in broader information management has been harder to establish. Then there are human obstacles.

"Technically, this is not that difficult," Serkes said. "It is the emotional side of information sharing that becomes problematic. Records managers have built empires for themselves because they are the only ones who know where information is."
