Predictive Coding Helps Companies Reduce Discovery Costs
|
|
|
- Jody Holland
- 10 years ago
- Views:
Transcription
1 Predictive Coding Helps Companies Reduce Discovery Costs Recent Court Decisions Open Door to Wider Use by Businesses to Cut Costs in Document Discovery By John Tredennick As companies struggle to manage exploding volumes of electronic content, legal and IS departments have had to find new ways to comply with data discovery obligations. Increasingly, global litigation and regulatory investigations require the production of a range of company documents and data. Courts have stepped up requirements to preserve and share these files, imposing multi-million dollar sanctions for noncompliance. This has spawned an enormous industry built around electronic discovery. Companies spend billions on collecting, organizing and reviewing their data. Fully 73 percent of that spend is on reviewing documents to find those relevant to the matter and protect those that might be subject to the attorney-client privilege, RAND recently found. Attorneys have long felt it necessary to manually review each document being considered for production. When data volumes were still relatively small, this expense was a rounding error on much bigger legal budgets. However, as document volumes increased into the millions, attorney review costs mushroomed to become the largest part of the discovery spend. New Approaches to Reducing Review Costs Faced with increasing volumes, corporate counsel began looking for new ways to cut review burdens. They began using search techniques designed to better target relevant documents and exclude junk. Then they employed clustering to help group documents with similar content. Some sent review work offshore. All the while, volumes kept increasing, outpacing counsel s ability to keep up. Recently, a more-promising approach to reduce document volumes has emerged. Called variously predictive coding, computer-assisted review and technology-assisted review, the technology has been proven to streamline the [email protected]
2 review process dramatically, reducing both the quantity of data involved and the time required to sift through it. As promising as this technology is, however, lawyers and their clients have been slow to adopt it. Their reluctance is often attributed to risk management or, in lawyer talk, what is called defensibility. Lawyers fear that, if called on in court to defend their data discovery, predictive coding would be too untested a technology to hold up. Seemingly overnight, that mindset changed. On Feb. 24, 2012, a federal magistrate-judge in New York City, Andrew J. Peck, issued the first judicial opinion anywhere to endorse the use of predictive coding in electronic discovery. It was the shot heard round the litigation world, prompting headlines declaring the ruling a breakthrough and the judge a trailblazer. As Judge Peck himself had observed just two months earlier in a legal trade magazine, While anecdotally it appears that some lawyers are using predictive coding technology, it also appears that many lawyers (and their clients) are waiting for a judicial decision approving of computer-assisted review. Without doubt, Judge Peck ushered in the next generation of e-discovery search and review. In the months since his decision, the business world has shown a heightened interest in predictive coding. Other judges have picked up the mantle and considered its use in their cases. Industry vendors have scrambled to add predictive coding to their product rosters. As important as was the ruling s outcome, so too was its reasoning. For business leaders, IT professionals and in-house lawyers, the opinion provides a primer on the reasons for using predictive coding and the processes underlying it. How Big Data is Changing Litigation To understand the importance of predictive coding, it helps to step back and consider how big data is changing the nature of litigation. Before a trial, court rules allow litigants to discover each other s evidence. Litigants must identify their potentially relevant documents and hand them over to their opponents. They can withhold documents protected by attorney-client privilege. As for all the documents that are neither relevant nor privileged, they can be ignored. For decades, document review was done manually, with attorneys eyeballing each document and deciding whether it should be produced. But as paper turned 2
3 into data and data grew into big data, manual review was no longer feasible. Today, major corporate lawsuits routinely involve gigabytes of data and sometimes even terabytes. To help sift through it all, attorneys turned to technology. Their objective, as Judge Peck observed, was to identify as many relevant documents as possible while reviewing as few non-relevant documents as possible. Attorneys most commonly use keyword searching. But keyword searches are imprecise. The way lawyers choose keywords, Judge Peck wrote, is the equivalent of the child s game of Go Fish. Further, keywords are often overinclusive, he noted, finding relevant documents but also large numbers of irrelevant ones. Judge Peck cited a 1985 study by scholars David Blair and M.E. Maron. Experienced searchers were instructed to use keywords to retrieve at least 75 percent of relevant documents from a collection of 40,000. Although the searchers believed they had succeeded, their actual recall was just 20 percent. By comparison, predictive coding is more effective and less expensive, Judge Peck said. In fact, studies have shown predictive coding to be at least as accurate as human review, if not more accurate, he noted. Computer-assisted review appears to be better than the available alternatives, and thus should be used in appropriate cases, Judge Peck concluded. While computer-assisted review is not perfect, he added, court rules do not require perfection. The overarching goal is to secure the just, speedy, and inexpensive determination of lawsuits. How Predictive Coding Works Unlike manual review, which is mostly done by junior staff, computer-assisted coding involves senior members of the litigation team who review and code a seed set of documents. Software uses that seed set to code other documents. The process is iterative, with the lawyers providing feedback on the computer s coding and the computer then coding additional documents. The case decided by Judge Peck, Da Silva Moore v. Publicis Groupe, illustrates the process. To review a collection of some 3 million s, the parties proposed a predictive-coding protocol. Judge Peck approved the protocol and included it with his written ruling. While there is no single right way to do computer-assisted review, the protocol provides insight into how the process can work. 3
4 To begin, the attorneys would identify a small number of s to serve as seed sets representative of the categories to be reviewed and coded. The seed sets one per issue would then be used to begin training the software. The software uses each seed set to identify and prioritize all similar documents within the collection. The lawyers would then review at least 500 of the computer-selected documents to further calibrate the system. Once the system is trained, it would proceed to identify relevant documents. Documents it identifies would be reviewed manually before they are produced to the opposing side. As a final quality-control step, the accuracy of the process would be verified using both judgmental and statistical sampling. Training the Software and Checking Results Predictive coding is an iterative process. After the software identifies the first set of potentially relevant documents, the parties review the results. The software then analyzes the new tagging and finds a second set for testing, then a third and a fourth. Here, the protocol suggested that the process be repeated seven times. The key was to watch the change in the number of relevant documents predicted by the system after each round of testing. Once that number dropped below a delta of 5 percent, the parties could stop. That would indicate the system was stable, with further review unlikely to bear much fruit. At that point, the process would move from computer-assisted to humanpowered review. The producing party would review all of the potentially responsive documents and produce accordingly. As a final check, the parties would focus on the documents the system said to ignore. Of those, they would check a random sample (2,399 again) to see how many were, in fact, responsive. What this Means to Business Leaders Predictive coding techniques can reduce document populations dramatically, often by more than 50 percent. This translates into substantial savings on review costs, easily shaving millions off total annual corporate e-discovery costs. It is not ideal for every case it particularly shines in the larger ones but it should be considered by every corporation facing litigation and regulatory inquiries. 4
5 The process outlined in Judge Peck s ruling is not the bible of predictive coding. There are a variety of viable approaches. However, law is a profession built on precedent. First opinions carry significant weight. For business leaders, the significance of the case is its seal of approval for predictive coding technology. That should lead to its wider acceptance and use. With data volumes increasing dramatically, predictive coding is the best weapon businesses have to cut the cost and time of review. In their battle to conquer big data, that is a significant victory. John Tredennick is the founder and CEO of Catalyst Repository Systems, an international provider of multi-lingual document repositories and technology for electronic discovery and complex litigation. Formerly a nationally known trial lawyer, he was editor-in-chief of the best-selling book, Winning with Computers: Trial Practice in the Twenty-First Century. Recently, he was named to the Fastcase 50 as one of the legal profession s smartest, most courageous innovators. -END- 5
E-discovery Taking Predictive Coding Out of the Black Box
E-discovery Taking Predictive Coding Out of the Black Box Joseph H. Looby Senior Managing Director FTI TECHNOLOGY IN CASES OF COMMERCIAL LITIGATION, the process of discovery can place a huge burden on
The United States Law Week
The United States Law Week Source: U.S. Law Week: News Archive > 2012 > 04/24/2012 > BNA Insights > Under Fire: A Closer Look at Technology- Assisted Document Review E-DISCOVERY Under Fire: A Closer Look
REDUCING COSTS WITH ADVANCED REVIEW STRATEGIES - PRIORITIZATION FOR 100% REVIEW. Bill Tolson Sr. Product Marketing Manager Recommind Inc.
REDUCING COSTS WITH ADVANCED REVIEW STRATEGIES - Bill Tolson Sr. Product Marketing Manager Recommind Inc. Introduction... 3 Traditional Linear Review... 3 Advanced Review Strategies: A Typical Predictive
Predictive Coding Defensibility and the Transparent Predictive Coding Workflow
Predictive Coding Defensibility and the Transparent Predictive Coding Workflow Who should read this paper Predictive coding is one of the most promising technologies to reduce the high cost of review by
Predictive Coding Defensibility and the Transparent Predictive Coding Workflow
WHITE PAPER: PREDICTIVE CODING DEFENSIBILITY........................................ Predictive Coding Defensibility and the Transparent Predictive Coding Workflow Who should read this paper Predictive
How It Works and Why It Matters for E-Discovery
Continuous Active Learning for Technology Assisted Review How It Works and Why It Matters for E-Discovery John Tredennick, Esq. Founder and CEO, Catalyst Repository Systems Peer-Reviewed Study Compares
UNDERSTANDING E DISCOVERY A PRACTICAL GUIDE. 99 Park Avenue, 16 th Floor New York, New York 10016 www.devoredemarco.com
UNDERSTANDING E DISCOVERY A PRACTICAL GUIDE 1 What is ESI? Information that exists in a medium that can only be read through the use of computers Examples E-mail Word Documents Databases Spreadsheets Multimedia
Predictive Coding Defensibility
Predictive Coding Defensibility Who should read this paper The Veritas ediscovery Platform facilitates a quality control workflow that incorporates statistically sound sampling practices developed in conjunction
Technology Assisted Review of Documents
Ashish Prasad, Esq. Noah Miller, Esq. Joshua C. Garbarino, Esq. October 27, 2014 Table of Contents Introduction... 3 What is TAR?... 3 TAR Workflows and Roles... 3 Predictive Coding Workflows... 4 Conclusion...
PREDICTIVE CODING: SILVER BULLET OR PANDORA S BOX?
Vol. 46 No. 3 February 6, 2013 PREDICTIVE CODING: SILVER BULLET OR PANDORA S BOX? The high costs of e-discovery have led to the development of computerized review technology by which the user may search
How Good is Your Predictive Coding Poker Face?
How Good is Your Predictive Coding Poker Face? SESSION ID: LAW-W03 Moderator: Panelists: Matthew Nelson ediscovery Counsel Symantec Corporation Hon. Andrew J. Peck US Magistrate Judge Southern District
case 3:12-md-02391-RLM-CAN document 396 filed 04/18/13 page 1 of 7 UNITED STATES DISTRICT COURT NORTHERN DISTRICT OF INDIANA SOUTH BEND DIVISION
case 3:12-md-02391-RLM-CAN document 396 filed 04/18/13 page 1 of 7 UNITED STATES DISTRICT COURT NORTHERN DISTRICT OF INDIANA SOUTH BEND DIVISION IN RE: BIOMET M2a MAGNUM HIP IMPLANT PRODUCTS LIABILITY
Technology- Assisted Review 2.0
LITIGATION AND PRACTICE SUPPORT Technology- Assisted Review 2.0 by Ignatius Grande of Hughes Hubbard & Reed LLP and Andrew Paredes of Epiq Systems Legal teams and their outside counsel must deal with an
SAMPLING: MAKING ELECTRONIC DISCOVERY MORE COST EFFECTIVE
SAMPLING: MAKING ELECTRONIC DISCOVERY MORE COST EFFECTIVE Milton Luoma Metropolitan State University 700 East Seventh Street St. Paul, Minnesota 55337 651 793-1246 (fax) 651 793-1481 [email protected]
The case for statistical sampling in e-discovery
Forensic The case for statistical sampling in e-discovery January 2012 kpmg.com 2 The case for statistical sampling in e-discovery The sheer volume and unrelenting production deadlines of today s electronic
E-DISCOVERY: BURDENSOME, EXPENSIVE, AND FRAUGHT WITH RISK
E-DISCOVERY: BURDENSOME, EXPENSIVE, AND FRAUGHT WITH RISK If your company is involved in civil litigation, the Federal Rules of Civil Procedure regarding preservation and production of electronic documents
ACADEMIC AFFAIRS COUNCIL ******************************************************************************
ACADEMIC AFFAIRS COUNCIL AGENDA ITEM: 8.D DATE: March 15, 2007 ****************************************************************************** SUBJECT: Electronic Records Discovery Electronic records management
MANAGING BIG DATA IN LITIGATION
David Han 2015 MANAGING BIG DATA IN LITIGATION DAVID HAN Associate, Morgan Lewis & Bockius, edata Practice Group MANAGING BIG DATA Data volumes always increasing New data sources Mobile Internet of Things
Quality Control for predictive coding in ediscovery. kpmg.com
Quality Control for predictive coding in ediscovery kpmg.com Advances in technology are changing the way organizations perform ediscovery. Most notably, predictive coding, or technology assisted review,
www.pwc.nl Review & AI Lessons learned while using Artificial Intelligence April 2013
www.pwc.nl Review & AI Lessons learned while using Artificial Intelligence Why are non-users staying away from PC? source: edj Group s Q1 2013 Predictive Coding Survey, February 2013, N = 66 Slide 2 Introduction
DSi Pilot Program: Comparing Catalyst Insight Predict with Linear Review
case study DSi Pilot Program: Comparing Catalyst Insight Predict with Linear Review www.dsicovery.com 877-797-4771 414 Union St., Suite 1210 Nashville, TN 37219 (615) 255-5343 Catalyst Insight Predict
Recent Developments in the Law & Technology Relating to Predictive Coding
Recent Developments in the Law & Technology Relating to Predictive Coding Presented by Paul Neale CEO Presented by Gene Klimov VP & Managing Director Presented by Gerard Britton Managing Director 2012
The Truth About Predictive Coding: Getting Beyond The Hype
www.encase.com/ceic The Truth About Predictive Coding: Getting Beyond The Hype David R. Cohen Reed Smith LLP Records & E-Discovery Practice Group Leader David leads a group of more than 100 lawyers in
Navigating Information Governance and ediscovery
Navigating Information Governance and ediscovery Implementing Processes & Technology to Reduce Downstream ediscovery Cost and Risk Shannon Smith General Counsel, Globanet March 11 12, 2013 Agenda 1 Overview
Navigating E-Discovery, And The
Navigating E-Discovery, And The l f C S Role of ACEDS 1 IDF Conference December 2012 Overview Introduction to US E-Discovery Important E-Discovery Trends Role of ACEDS Mission of ACEDS in Japan 2 E-Discovery
Case 1:11-cv-01279-ALC-AJP Document 96 Filed 02/24/12 Page 1 of 49
Case 1:11-cv-01279-ALC-AJP Document 96 Filed 02/24/12 Page 1 of 49 UNITED STATES DISTRICT COURT SOUTHERN DISTRICT OF NEW YORK - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
DOCSVAULT WhitePaper. Concise Guide to E-discovery. Contents
WhitePaper Concise Guide to E-discovery Contents i. Overview ii. Importance of e-discovery iii. How to prepare for e-discovery? iv. Key processes & issues v. The next step vi. Conclusion Overview E-discovery
3 "C" Words You Need to Know: Custody - Control - Cloud
3 "C" Words You Need to Know: Custody - Control - Cloud James Christiansen Chief Information Security Officer Evantix, Inc. Bradley Schaufenbuel Director of Information Security Midland States Bank Session
How to Manage Costs and Expectations for Successful E-Discovery: Best Practices
How to Manage Costs and Expectations for Successful E-Discovery: Best Practices Mukesh Advani, Esq., Advisory Board Member, UBIC North America, Inc. UBIC North America, Inc. 3 Lagoon Dr., Ste. 180, Redwood
EnCase ediscovery. Automatically search, identify, collect, preserve, and process electronically stored information across the network.
TM GUIDANCE SOFTWARE EnCASE ediscovery EnCase ediscovery Automatically search, identify, collect, preserve, and process electronically stored information across the network. GUIDANCE SOFTWARE EnCASE ediscovery
Take an Enterprise Approach to E-Discovery. Streamline Discovery and Control Review Cost Using a Central, Secure E-Discovery Cloud Platform
Take an Enterprise Approach to E-Discovery Streamline Discovery and Control Review Cost Using a Central, Secure E-Discovery Cloud Platform A Smarter Approach Catalyst s e-discovery cloud platform provides
Ethics in Technology and ediscovery Stuff You Know, But Aren t Thinking About
Ethics in Technology and ediscovery Stuff You Know, But Aren t Thinking About Kelly H Twigger, Esq. Oil and Gas Symposium Arkansas Law Review October 16-17, 2014 Overview In the last two decades, business
Natalie Price. Natalie Price. April 15, 2008
Natalie Price 195 East 600 North #14 Provo, Utah 84606 801-368-5418 [email protected] April 15, 2008 J. Michael Pemberton, Ph.D., CRM, FAI Information Management Journal 1345 Circle Park Dr. Knoxville,
E-Discovery: A Common Sense Approach. In order to know how to handle and address ESI issues, the preliminary and
Jay E. Heidrick Polsinelli [email protected] (913) 234-7506 E-Discovery: A Common Sense Approach In order to know how to handle and address ESI issues, the preliminary and obvious question must
Discussion of Electronic Discovery at Rule 26(f) Conferences: A Guide for Practitioners
Discussion of Electronic Discovery at Rule 26(f) Conferences: A Guide for Practitioners INTRODUCTION Virtually all modern discovery involves electronically stored information (ESI). The production and
The Case for Technology Assisted Review and Statistical Sampling in Discovery
The Case for Technology Assisted Review and Statistical Sampling in Discovery Position Paper for DESI VI Workshop, June 8, 2015, ICAIL Conference, San Diego, CA Christopher H Paskach The Claro Group, LLC
Managed Services: Maximizing Transparency and Minimizing Expense and Risk in ediscovery and Information Governance
Managed Services: Maximizing Transparency and Minimizing Expense and Risk in ediscovery and Information Governance January 18, 2013 Andrew Bayer, Director of Business Development Adam Wells, VP, Business
Mastering Predictive Coding: The Ultimate Guide
Mastering Predictive Coding: The Ultimate Guide Key considerations and best practices to help you increase ediscovery efficiencies and save money with predictive coding 4.5 Validating the Results and Producing
A Practitioner s Guide to Statistical Sampling in E-Discovery. October 16, 2012
A Practitioner s Guide to Statistical Sampling in E-Discovery October 16, 2012 1 Meet the Panelists Maura R. Grossman, Counsel at Wachtell, Lipton, Rosen & Katz Gordon V. Cormack, Professor at the David
ABA SECTION OF LITIGATION 2012 SECTION ANNUAL CONFERENCE APRIL 18-20, 2012: PREDICTIVE CODING
ABA SECTION OF LITIGATION 2012 SECTION ANNUAL CONFERENCE APRIL 18-20, 2012: PREDICTIVE CODING Predictive Coding SUBMITTED IN SUPPORT OF THE PANEL DISCUSSION INTRODUCTION Technology has created a problem.
Predictive Coding, TAR, CAR NOT Just for Litigation
Predictive Coding, TAR, CAR NOT Just for Litigation February 26, 2015 Olivia Gerroll VP Professional Services, D4 Agenda Drivers The Evolution of Discovery Technology Definitions & Benefits How Predictive
Minimizing ediscovery risks. What organizations need to know in today s litigious and digital world.
What organizations need to know in today s litigious and digital world. The main objective for a corporation s law department is to mitigate risk throughout the company, while keeping costs under control.
PICTERA. What Is Intell1gent One? Created by the clients, for the clients SOLUTIONS
PICTERA SOLUTIONS An What Is Intell1gent One? Created by the clients, for the clients This white paper discusses: Understanding How Intell1gent One Saves Time and Money Using Intell1gent One to Save Money
Case 1:04-cv-01639-RJL Document 1092-20 Filed 08/16/13 Page 1 of 6. EXHIBIT 1 s
Case 1:04-cv-01639-RJL Document 1092-20 Filed 08/16/13 Page 1 of 6 EXHIBIT 1 s Case 1:04-cv-01639-RJL Document 1092-20 Filed 08/16/13 Page 2 of 6 UNITED STATES DISTRICT COURT DISTRICT OF COLUMBIA In re
BEYOND THE HYPE: Understanding the Real Implications of the Amended Federal Rules of Civil Procedure. A Clearwell Systems White Paper
BEYOND THE HYPE: UNDERSTANDING THE REAL IMPLICATIONS OF THE AMENDED FRCP PA G E : 1 BEYOND THE HYPE: Understanding the Real Implications of the Amended Federal Rules of Civil Procedure A Clearwell Systems
In my article Search, Forward: Will manual document review and keyword searches
UNITED STATES DISTRICT COURT SOUTHERN DISTRICT OF NEW YORK - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - x MONIQUE DA SILVA MOORE, et al., Plaintiffs, -against- PUBLICIS GROUPE
Introduction to Predictive Coding
Introduction to Predictive Coding Herbert L. Roitblat, Ph.D. CTO, Chief Scientist, OrcaTec Predictive coding uses computers and machine learning to reduce the number of documents in large document sets
Predictive Coding in Multi-Language E-Discovery
Comprehending the Challenges of Technology Assisted Document Review Predictive Coding in Multi-Language E-Discovery UBIC North America, Inc. 3 Lagoon Dr., Ste. 180, Redwood City, CA 94065 877-321-8242
Meeting E-Discovery Challenges with Confidence
Meeting E-Discovery Challenges with Confidence Meeting today s e-discovery and information governance challenges while setting the foundation for tomorrow s requirements is the goal of every legal team.
the e-discovery how-to Guide page : 1 The E-Discovery Practical Recommendations for Streamlining Corporate E-Discovery A Clearwell White Paper
the e-discovery how-to Guide page : 1 The E-Discovery How-To Guide: Practical Recommendations for Streamlining Corporate E-Discovery A Clearwell White Paper the e-discovery how-to Guide page : 2 Table
ANALYSIS OF ORIGINAL BILL
Franchise Tax Board ANALYSIS OF ORIGINAL BILL Author: Evans Analyst: Deborah Barrett Bill Number: AB 5 See Legislative Related Bills: History Telephone: 845-4301 Introduced Date: December 1, 2008 Attorney:
A Brief Overview of ediscovery in California
What is ediscovery? Electronic discovery ( ediscovery ) is discovery of electronic information in litigation. ediscovery in California is governed generally by the Civil Discovery Act. In 2009, the California
E-Discovery in Michigan. Presented by Angela Boufford
E-Discovery in Michigan ESI Presented by Angela Boufford DISCLAIMER: This is by no means a comprehensive examination of E-Discovery issues. You will not be an E-Discovery expert after this presentation.
grouped into five different subject areas relating to: 1) planning for discovery and initial disclosures; 2)
ESI: Federal Court An introduction to the new federal rules governing discovery of electronically stored information In September 2005, the Judicial Conference of the United States unanimously approved
Solving Key Management Problems in Lotus Notes/Domino Environments
Solving Key Management Problems in Lotus Notes/Domino Environments An Osterman Research White Paper sponsored by Published April 2007 sponsored by Osterman Research, Inc. P.O. Box 1058 Black Diamond, Washington
Three Methods for ediscovery Document Prioritization:
Three Methods for ediscovery Document Prioritization: Comparing and Contrasting Keyword Search with Concept Based and Support Vector Based "Technology Assisted Review-Predictive Coding" Platforms Tom Groom,
Information Governance Challenges and Solutions
Challenges and Solutions In this modern information age, organizations struggle with two things: the problem of too much electronic data and how to govern the data. Each year, the speed of information
E-DISCOVERY & PRESERVATION OF ELECTRONIC EVIDENCE. Ana Maria Martinez April 14, 2011
E-DISCOVERY & PRESERVATION OF ELECTRONIC EVIDENCE Ana Maria Martinez April 14, 2011 This presentation does not present the views of the U.S. Department of Justice. This presentation is not legal advice.
