Predictive Coding, TAR, CAR NOT Just for Litigation
|
|
|
- Geoffrey Marshall
- 10 years ago
- Views:
Transcription
1 Predictive Coding, TAR, CAR NOT Just for Litigation February 26, 2015 Olivia Gerroll VP Professional Services, D4
2 Agenda Drivers The Evolution of Discovery Technology Definitions & Benefits How Predictive Coding Works The Ripple Effect Data and Information Governance Implications Use Cases & Considerations Selection & Technologies How Do Lawyers and Court View PC? Resources Q&A
3 Drivers Big Data - Growing FAST HOW MUCH DATA IS A PETABYTE
4 Drivers - The Technology is Not Unique As you select, the application zeroes in on what you like Song 1 Song 2 Song 3
5 Evolution of Discovery Technology Stand Alone Review Apps Document Imaging OCR Client/Server Review Tools Computer Forensics Automated Litigation Support Tape Restoration Web Based Review ASP or SaaS Auto Coding Thread & Near Dupe Detection Conceptual Search Visualization Systems Clustering & Categorization Legal Hold Management All in One Litigation Support Platforms Social Media and Cloud Collection Managed Services Predictive Coding & Assisted Review Natural Language Applications Artificial Intelligence?
6 TAR and CAR Technology-Assisted Review (TAR), or Computer- Assisted Review (CAR), is the use of advanced information retrieval technology that helps make the identification and review process more efficient TAR/CAR uses components of existing technologies to organize and sort documents by priority or relevance What differentiates TAR/CAR from other technologies are concept-based search engines and application of quantitative analysis.
7 Predictive Coding Defined Predictive Coding is one type of TAR/CAR Combines the efficiencies of concept search and statistics with the knowledge of human beings Uses an active machine learning approach or sometimes a support vector machine to distinguish relevant from non-relevant documents, based on decision made by a subject matter expert Uses established statistical principles to measure status and accuracy The technology can be used for applying Information Governance (IG) within a firm to both structured and unstructured data. One key component of predictive coding that differs from searching analytics is the methodology for training the technology that is used to automatically classify records and improve the accuracy and self-learning of predictive coding technology.
8 Benefits In the normal course of business documents are not organized by relevance With a predictive coding approach a Subject Matter Expert trains the software by coding individual documents responsive or not responsive, as the system samples the population Software calculates relevance scores for each document based on relevance
9 How It Works Matter expert is assigned to train the engine. The software initially selects a random sample of documents. The expert identifies relevant documents in the sample. The software analyzes the expert s input and creates a profile for relevant and irrelevant documents. The software generates new samples, each time learning more from the expert s input. The process repeats until the software determines it has sufficient information to scores all of the documents. The scores are then used to make informed decisions about the data management.
10 Predictive Coding Workflow - Discovery
11 Data Environment
12 The Ripple Effect Early use of predictive coding can be used to confidently impact settlement before heads-down legal review. -Kroll Predictive coding is a natural way to assess and detect risk patterns, and stop them from developing further. Predictive coding can be utilized to enforce and create record retention policies.
13 Data and Information Governance Key problems for organizations Find information they need, when needed and in a cheap and efficient manner Have to have the information Must keep it till needed Find valuable information Destroy worthless or unessential information What is valuable? What is worthless?
14 Implications? Chucking Daisies Ten Rules For Taking Control of Your Organization s Digital Debris Kahn & Datskovsky ARMA International (2013) Ch 1: Stop Keeping Everything Forever Ch 2: Clean Up the Past to Gain Business Efficiency But how? Since people are storing yet even more, predictive coding can help separate the debris: from what is required to be kept. Backup tape reduction. Early Case Assessment. Big Data mining. Compliance investigations.
15 Considerations All document-related information governance and RIM initiatives rest on and depend upon consistent, comprehensive document classification. Without consistent, comprehensive classification, an organization can't determine what to keep, how long to keep it, who should have access to it, and where to store it. Replace manual classification decision making processes with technology Use predictive technology to create classification schemas for identifying and categorizing data currently in unstructured systems Predictive technology can identify areas of conflict in existing classifications and ensure consistency and uniformity going forward
16 Considerations Use your skilled experts for creating the appropriate data sets (or seed sets). The data sets should represent content from all information repositories. Product must be able to meet your end-user, IT and legal compliance requirements. Oversight and a comprehensive remediation plan, agreed upon by all stakeholders. Deployment should include a process to audit the application s decisions. Ideally - leverage internal ediscovery resources to help guide the deployment. Litigation technology experts have been working with this technology for years and can provide valuable insight into its usability and functionalities. The hybrid approach You do not have to choose between upstream or downstream data movement. Predictive coding is not a panacea, so any project needs to start with the establishment of an IG framework. See Slide18 Item 3 for resource content location
17 Information Management IG Data Governance Shared drives Local hard drives SharePoint DM Systems Extranets, intranets RIM Data Control & Management official record Retention and disposition Onboarding client file intake Off-boarding client file transfer Identify vital and/or historical records Legal Hold/preservation Security, conflict and risk remediation
18 How can Predictive Coding be Applied? Seed Set Context Human Interaction Validate and Automate Identify Existing Information Data that has been classified in accordance with the organizations RIM policies Leverage Context Use existing resources such as the DM or financial system to provide context to the process Manual Verification Records staff interact with the technology to validate findings and ensure validity of predictive coding assessments Validate & Fully Automate After the manual verification has been validated and/or corrections made the system can be let loose
19 Application - Information Governance and Records Retention: How to Start Three Key Steps Executive sponsorship that supports Information Governance Form a steering committee of key stakeholders across multiple departments IT Legal Records Management Compliance Security & Privacy, etc. Define global policies Committee must focus on the business processes, laws & regulations, departmental requirements needed to define the global policies needed to govern information within the organization.
20 Selection Factors Ensure that your environment is ready to implement the technology. Factor in the learning curve necessary to fully understand and effectively use the technology. Skilled resources: The tools are best used by people skilled in big data information analysis understanding the analysis and patterns and how to interpret the results. Ensure that the technology and environment are correctly secured especially when dealing with the cloud and internet access. Understand the technology: Dig under the hood How good are the algorithms inside the software at doing what we tell it to do in finding information?
21 Some Technologies Information Management: Equivio Recommind Autonomy Symantec IBM EMC CommVault Discovery Nuix Relativity IPRO Autonomy Recommind FTI Catalyst
22 How Does the Judiciary View PC? Da Silva Moore v. Publicis Groupe Court okayed parties agreement to use; 3.3M s) Kleen Products v. Packaging Corp. of America Plaintiffs abandoned arguments in favor of PC and went Boolean Global Aerospace Inc. v. Landow Aviation, L.P. Court approved defendant use of PC over objections (2M s) Actos (Pioglitazone) Products Liability Litigation Court affirmatively approved using PC for review and production EORHB, Inc., et al v. HOA Holdings, LLC Court orders parties to use PC and share an ediscovery vendor
23 Defensible Predictive Coding Using Da Silva is a Map: Senior attorneys must be involved Cooperate in devising approach Have a written protocol Share the Seed Set (maybe!) Refine repeatedly for accuracy Be transparent Bottom Line for Defensibility: Sampling, transparency, documentation
24 Resources 1. D4 Knowledge Center 2. The Grossman-Cormack Glossary of Technology-Assisted Review 3. Chucking Daisies Ten Rules For Taking Control of Your Organization s Digital Debris ARMA Publication 4. Predictive Coding for Information Governance 5. The Electronic Discovery Reference Model The Sedona Conference Thesedonaconference.com
25 Questions? On behalf of everyone at D4 thank you ARMA Iowa for this opportunity to present. Olivia Gerroll VP, Professional Services Group o m
Recent Developments in the Law & Technology Relating to Predictive Coding
Recent Developments in the Law & Technology Relating to Predictive Coding Presented by Paul Neale CEO Presented by Gene Klimov VP & Managing Director Presented by Gerard Britton Managing Director 2012
MANAGING BIG DATA IN LITIGATION
David Han 2015 MANAGING BIG DATA IN LITIGATION DAVID HAN Associate, Morgan Lewis & Bockius, edata Practice Group MANAGING BIG DATA Data volumes always increasing New data sources Mobile Internet of Things
Predictive Coding Defensibility and the Transparent Predictive Coding Workflow
WHITE PAPER: PREDICTIVE CODING DEFENSIBILITY........................................ Predictive Coding Defensibility and the Transparent Predictive Coding Workflow Who should read this paper Predictive
IBM Unstructured Data Identification & Management An on ramp to reducing information costs and risk
Amir Jaibaji - Product Management Program Director IBM Information Lifecycle Governance IBM Unstructured Data Identification & Management An on ramp to reducing information costs and risk Enterprise big
Document Review Costs
Predictive Coding Gain Earlier Insight and Reduce Document Review Costs Tom Groom Vice President, Discovery Engineering [email protected] 303.840.3601 D4 LLC Litigation support service provider since
www.pwc.nl Review & AI Lessons learned while using Artificial Intelligence April 2013
www.pwc.nl Review & AI Lessons learned while using Artificial Intelligence Why are non-users staying away from PC? source: edj Group s Q1 2013 Predictive Coding Survey, February 2013, N = 66 Slide 2 Introduction
Predictive Coding Defensibility and the Transparent Predictive Coding Workflow
Predictive Coding Defensibility and the Transparent Predictive Coding Workflow Who should read this paper Predictive coding is one of the most promising technologies to reduce the high cost of review by
Architecting Our Future
2011 CIO ROUNDTABLE RETREAT March 6 8, 2011 The Arizona Biltmore Phoenix, Arizona LITIGATIONHOLD AND EDISCOVERYINDUSTRY BRIEFS PRESENTED BY OLIVIA GERROLL Agenda What are law firm IT departments facing
Technology- Assisted Review 2.0
LITIGATION AND PRACTICE SUPPORT Technology- Assisted Review 2.0 by Ignatius Grande of Hughes Hubbard & Reed LLP and Andrew Paredes of Epiq Systems Legal teams and their outside counsel must deal with an
Auto-Classification for Document Archiving and Records Declaration
Auto-Classification for Document Archiving and Records Declaration Josemina Magdalen, Architect, IBM November 15, 2013 Agenda IBM / ECM/ Content Classification for Document Archiving and Records Management
Fundamentals of Information Governance:
Fundamentals of Information Governance: More than just records management PETER KURILECZ CRM CA IGP Hard as I try, I simply cannot make myself understand how Information Governance isn t just a different
Miguel Ortiz, Sr. Systems Engineer. Globanet
Miguel Ortiz, Sr. Systems Engineer Globanet Agenda Who is Globanet? Archiving Processes and Standards How Does Data Archiving Help Data Management? Data Archiving to Meet Downstream ediscovery Needs Timely
Quality Control for predictive coding in ediscovery. kpmg.com
Quality Control for predictive coding in ediscovery kpmg.com Advances in technology are changing the way organizations perform ediscovery. Most notably, predictive coding, or technology assisted review,
Guide to Information Governance: A Holistic Approach
E-PAPER DECEMBER 2014 Guide to Information Governance: A Holistic Approach A comprehensive strategy allows agencies to create more reliable processes for ediscovery, increase stakeholder collaboration,
Nuix bolsters its e-discovery team and continues its push to information governance
Nuix bolsters its e-discovery team and continues its push to information governance Analyst: David Horrigan 5 Sep, 2013 Over the past 12-18 months, many e-discovery vendors and thought leaders have jumped
Proactive Data Management for ediscovery
Proactive Data Management for ediscovery Simon Taylor Snr. Director Information Management CommVault Systems Inc. Why ediscovery sucks for IT The US Federal Rules of Civil Procedure Rule 34(a), (b) Definition
PREDICTIVE CODING: SILVER BULLET OR PANDORA S BOX?
Vol. 46 No. 3 February 6, 2013 PREDICTIVE CODING: SILVER BULLET OR PANDORA S BOX? The high costs of e-discovery have led to the development of computerized review technology by which the user may search
How Good is Your Predictive Coding Poker Face?
How Good is Your Predictive Coding Poker Face? SESSION ID: LAW-W03 Moderator: Panelists: Matthew Nelson ediscovery Counsel Symantec Corporation Hon. Andrew J. Peck US Magistrate Judge Southern District
Technology Assisted Review of Documents
Ashish Prasad, Esq. Noah Miller, Esq. Joshua C. Garbarino, Esq. October 27, 2014 Table of Contents Introduction... 3 What is TAR?... 3 TAR Workflows and Roles... 3 Predictive Coding Workflows... 4 Conclusion...
ZEROING IN DATA TARGETING IN EDISCOVERY TO REDUCE VOLUMES AND COSTS
ZEROING IN DATA TARGETING IN EDISCOVERY TO REDUCE VOLUMES AND COSTS WELCOME Thank you for joining Numerous diverse attendees Today s topic and presenters This is an interactive presentation You will receive
WHITE PAPER Practical Information Governance: Balancing Cost, Risk, and Productivity
WHITE PAPER Practical Information Governance: Balancing Cost, Risk, and Productivity Sponsored by: EMC Corporation Laura DuBois August 2010 Vivian Tero EXECUTIVE SUMMARY Global Headquarters: 5 Speen Street
Data Sheet: Archiving Symantec Enterprise Vault Discovery Accelerator Accelerate e-discovery and simplify review
Accelerate e-discovery and simplify review Overview provides IT/Legal liaisons, investigators, lawyers, paralegals and HR professionals the ability to search, preserve and review information across the
Navigating Information Governance and ediscovery
Navigating Information Governance and ediscovery Implementing Processes & Technology to Reduce Downstream ediscovery Cost and Risk Shannon Smith General Counsel, Globanet March 11 12, 2013 Agenda 1 Overview
Autonomy Consolidated Archive
Autonomy Consolidated Archive Dennis Wild Director SME, Information Governance and Archiving POWER PROTECT PROMOTE Meaning-Based Governance Files IM Audio Email Social Video SharePoint Archiving = Gain
Intelligent Information Management: Archive & ediscovery
Intelligent Information Management: Archive & ediscovery Byron Chang Senior Systems Engineer / Symantec Hong Kong Agenda 1 Today s Information Management Challenges 2 Why Information Management? 3 The
Three Methods for ediscovery Document Prioritization:
Three Methods for ediscovery Document Prioritization: Comparing and Contrasting Keyword Search with Concept Based and Support Vector Based "Technology Assisted Review-Predictive Coding" Platforms Tom Groom,
Integrated email archiving: streamlining compliance and discovery through content and business process management
Make better decisions, faster March 2008 Integrated email archiving: streamlining compliance and discovery through content and business process management 2 Table of Contents Executive summary.........
The Truth About Predictive Coding: Getting Beyond The Hype
www.encase.com/ceic The Truth About Predictive Coding: Getting Beyond The Hype David R. Cohen Reed Smith LLP Records & E-Discovery Practice Group Leader David leads a group of more than 100 lawyers in
Breaking Down the Silos: A 21st Century Approach to Information Governance. May 2015
Breaking Down the Silos: A 21st Century Approach to Information Governance May 2015 Introduction With the spotlight on data breaches and privacy, organizations are increasing their focus on information
What We ll Cover. Defensible Disposal of Records and Information Litigation Holds Information Governance the future of records management programs
What We ll Cover Foundations of Records and Information Management Creating a Defensible Retention Schedule Paper v. Electronic Records Organization and Retrieval of Records and Information Records Management
THE PREDICTIVE CODING CASES A CASE LAW REVIEW
THE PREDICTIVE CODING CASES A CASE LAW REVIEW WELCOME Thank you for joining Numerous diverse attendees Please feel free to submit questions Slides, recording and survey coming tomorrow SPEAKERS Matthew
Reduce Cost, Time, and Risk ediscovery and Records Management in SharePoint
Reduce Cost, Time, and Risk ediscovery and Records Management in SharePoint David Tappan SharePoint Consultant C/D/H [email protected] Twitter @cdhtweetstech Don Miller Vice President of Sales Concept Searching
Managed Services: Maximizing Transparency and Minimizing Expense and Risk in ediscovery and Information Governance
Managed Services: Maximizing Transparency and Minimizing Expense and Risk in ediscovery and Information Governance January 18, 2013 Andrew Bayer, Director of Business Development Adam Wells, VP, Business
Nuix continues rapid growth, expands e-discovery into information governance
Nuix continues rapid growth, expands e-discovery into information governance Analyst: David Horrigan 8 Mar, 2012 Australian e-discovery vendor Nuix has embarked on a busy 2012, releasing three new components
PICTERA. What Is Intell1gent One? Created by the clients, for the clients SOLUTIONS
PICTERA SOLUTIONS An What Is Intell1gent One? Created by the clients, for the clients This white paper discusses: Understanding How Intell1gent One Saves Time and Money Using Intell1gent One to Save Money
Information Archiving
Information Archiving Drinking from the firehose. Raymond Lambie Product Marketing Manager, HP Autonomy AP/J Archive or Backup What is the difference? Ctrl-X or Ctrl-C An archive is a primary copy of inactive
Considering Third Generation ediscovery? Two Approaches for Evaluating ediscovery Offerings
Considering Third Generation ediscovery? Two Approaches for Evaluating ediscovery Offerings Developed by Orange Legal Technologies, Providers of the OneO Discovery Platform. Considering Third Generation
ediscovery Solutions
The Radicati Group, Inc. www.radicati.com ediscovery Solutions A Radicati Group, Inc. Webconference The Radicati Group, Inc. Copyright November 2010, Reproduction Prohibited 9:30 am, PT November 4, 2010
SMART ARCHIVING. The need for a strategy around archiving. Peter Van Camp
SMART ARCHIVING The need for a strategy around archiving Peter Van Camp I.R.I.S. mission I.R.I.S. mission : Increase our customers productivity and knowledge through helping them better manage their documents,
PRESENTATION TOPICS 2/27/2014. Why Update Policies? 21st Century Best Practices for Information Governance & Policies. Why update policies??
21st Century Best Practices for Information Governance & Policies Presented by: John Isaza, CEO- Information Governance Solutions, LLC Partner - Rimon PC ARMA NOVA Chapter Friday, February 28, 2014 12:30
Traditionally, the gold standard for identifying potentially
istockphoto.com/alexandercreative Predictive Coding: It s Here to Stay Predictive coding programs are poised to become a standard practice in e-discovery in the near future. As more courts weigh in on
Brochure. ECM without borders. HP Enterprise Content Management (ECM)
Brochure ECM without borders HP Enterprise Content Management (ECM) HP Enterprise Content Management (ECM) Without question, the volume, variety, and velocity of data across your enterprise create new
Data Sheet: Archiving Symantec Enterprise Vault Store, Manage, and Discover Critical Business Information
Store, Manage, and Discover Critical Business Information Managing millions of mailboxes for thousands of customers worldwide, Enterprise Vault, the industry leader in email and content archiving, enables
Viewpoint ediscovery Services
Xerox Legal Services Viewpoint ediscovery Platform Technical Brief Viewpoint ediscovery Services Viewpoint by Xerox delivers a flexible approach to ediscovery designed to help you manage your litigation,
W H I T E P A P E R E X E C U T I V E S U M M AR Y S I T U AT I O N O V E R V I E W. Sponsored by: EMC Corporation. Laura DuBois May 2010
W H I T E P A P E R E n a b l i n g S h a r e P o i n t O p e r a t i o n a l E f f i c i e n c y a n d I n f o r m a t i o n G o v e r n a n c e w i t h E M C S o u r c e O n e Sponsored by: EMC Corporation
CA Records Manager. Benefits. CA Advantage. Overview
PRODUCT BRIEF: CA RECORDS MANAGER CA RECORDS MANAGER HELPS YOU CONTROL AND MANAGE PHYSICAL, ELECTRONIC AND EMAIL RECORDS ACROSS THE ENTERPRISE FOR PROACTIVE COMPLIANCE WITH REGULATORY, LEGISLATIVE AND
Functions & Importance of a Strategic Business Plan
Functions & Importance of a Strategic Business Plan Komal A Gulich, CRM, IGP Manager, Enterprise Records Management FirstEnergy Service Co April 15, 2014 Agenda Brief recap of Workshop Look at Function
Meeting E-Discovery Challenges with Confidence
Meeting E-Discovery Challenges with Confidence Meeting today s e-discovery and information governance challenges while setting the foundation for tomorrow s requirements is the goal of every legal team.
The Case for Technology Assisted Review and Statistical Sampling in Discovery
The Case for Technology Assisted Review and Statistical Sampling in Discovery Position Paper for DESI VI Workshop, June 8, 2015, ICAIL Conference, San Diego, CA Christopher H Paskach The Claro Group, LLC
ILM: Tiered Services & The Need For Classification
ILM: Tiered Services & The Need For Classification Edgar StPierre, EMC 2 SNW San Diego April 2007 SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies
CA Message Manager. Benefits. Overview. CA Advantage
PRODUCT BRIEF: CA MESSAGE MANAGER CA Message Manager THE PROACTIVE MANAGEMENT OF EMAIL AND INSTANT MESSAGES IS INTEGRAL TO THE OVERALL STRATEGY OF INFORMATION GOVERNANCE. THERE ARE MANY COMPLEX CHALLENGES
Electronically Stored Information in Litigation
Electronically Stored Information in Litigation Volume 69, November 2013 By Timothy J. Chorvat and Laura E. Pelanek* I. Introduction Recent developments in the use of electronically stored information
Successful Implementation of Enterprise-Wide Information Governance
Successful Implementation of Enterprise-Wide Information Governance ARMA Austin Monthly Meeting November 13, 2014 TAD C. HOWINGTON, CRM, FAI Manager, E- Records and Information Governance Kinder- Morgan
Technology Assisted Review: The Disclosure of Training Sets and Related Transparency Issues Whitney Street, Esq. 1
Technology Assisted Review: The Disclosure of Training Sets and Related Transparency Issues Whitney Street, Esq. 1 The potential cost savings and increase in accuracy afforded by technology assisted review
IBM Unstructured Data Identification and Management
IBM Unstructured Data Identification and Management Discover, recognize, and act on unstructured data in-place Highlights Identify data in place that is relevant for legal collections or regulatory retention.
Metrics-Based Information Governance
Metrics-Based Information Governance Five Ways to Measure Program Effectiveness Sponsored by: Abstract Measuring the effectiveness and business impact of information governance has always been a difficult
Lowering E-Discovery Costs Through Enterprise Records and Retention Management. An Oracle White Paper March 2007
Lowering E-Discovery Costs Through Enterprise Records and Retention Management An Oracle White Paper March 2007 Lowering E-Discovery Costs Through Enterprise Records and Retention Management Exponential
Predictive Coding Helps Companies Reduce Discovery Costs
Predictive Coding Helps Companies Reduce Discovery Costs Recent Court Decisions Open Door to Wider Use by Businesses to Cut Costs in Document Discovery By John Tredennick As companies struggle to manage
IBM ediscovery Identification and Collection
IBM ediscovery Identification and Collection Turning unstructured data into relevant data for intelligent ediscovery Highlights Analyze data in-place with detailed data explorers to gain insight into data
Litigation Solutions. insightful interactive culling. distributed ediscovery processing. powering digital review
Litigation Solutions insightful interactive culling distributed ediscovery processing powering digital review TECHNOLOGY ASSISTED REVIEW Eclipse combines advanced analytic technology with machine learning
This Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
The Future of Records Management. Senior Director, Loss Prevention Project Manager/Developer
The Future of Records Management Ann Ostrander Jimmy Lam Senior Director, Loss Prevention Project Manager/Developer Kirkland & Ellis LLP Loeb & Loeb LLP Agenda What is driving the change? People Technology
From Chaos to Clarity.
LITIGATION READINESS 3 PRESERVATION & COLLECTION 3 PROCESSING 3 DATA ANALYTICS 3 DOCUMENT REVIEW 3 PRODUCTION 3 POST PRODUCTION From Chaos to Clarity. The AlixPartners Difference Experienced. AlixPartners
Information governance is old news at Nuix
Information governance is old news at Nuix Analyst: David Horrigan 18 Jul, 2014 Sydney-based software developer Nuix was one of the early tech proponents of information governance (IG), after being known
DOCUMENT RETENTION STRATEGIES FOR HEALTHCARE ORGANIZATIONS
Overview. DOCUMENT RETENTION STRATEGIES FOR HEALTHCARE ORGANIZATIONS A comprehensive and consistently applied document retention policy is necessary to reduce the risk of being charged with spoliation
