Using Artificial Intelligence to Manage Big Data for Litigation



Similar documents
Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

Machine Learning: Overview

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Reduce Cost and Risk during Discovery E-DISCOVERY GLOSSARY

Machine Learning using MapReduce

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

Data Isn't Everything

An Introduction to Data Mining

MA2823: Foundations of Machine Learning

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

DATA MINING TECHNIQUES AND APPLICATIONS

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

MS1b Statistical Data Mining

A Review of Data Mining Techniques

Viewpoint ediscovery Services

Machine Learning with MATLAB David Willingham Application Engineer

Introduction to Pattern Recognition

An Overview of Knowledge Discovery Database and Data mining Techniques

Introduction to Predictive Coding

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Three Methods for ediscovery Document Prioritization:

Machine Learning Introduction

Learning is a very general term denoting the way in which agents:

Data Mining Applications in Higher Education

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Machine Learning and Data Mining. Fundamentals, robotics, recognition

How To Use Neural Networks In Data Mining

Data Sheet: Archiving Symantec Enterprise Vault Discovery Accelerator Accelerate e-discovery and simplify review

The Business Case for ECA

Machine Learning for Data Science (CS4786) Lecture 1

Statistics for BIG data

Principles of Data Mining by Hand&Mannila&Smyth

IBM ediscovery Identification and Collection

Master s Program in Information Systems

Using Data Mining for Mobile Communication Clustering and Characterization

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

Predictive Coding, TAR, CAR NOT Just for Litigation

From Chaos to Clarity.

Bagged Ensemble Classifiers for Sentiment Classification of Movie Reviews

The Data Mining Process

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Course 395: Machine Learning

MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL

Understanding How Service Providers Charge for ediscovery Services

ZEROING IN DATA TARGETING IN EDISCOVERY TO REDUCE VOLUMES AND COSTS

Introduction. A. Bellaachia Page: 1

Chapter 12 Discovering New Knowledge Data Mining

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre

Maschinelles Lernen mit MATLAB

Information Management course

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

Statistical Validation and Data Analytics in ediscovery. Jesse Kornblum

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

Data Mining Techniques

A HYBRID RULE BASED FUZZY-NEURAL EXPERT SYSTEM FOR PASSIVE NETWORK MONITORING

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

Review & AI Lessons learned while using Artificial Intelligence April 2013

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

PICTERA. What Is Intell1gent One? Created by the clients, for the clients SOLUTIONS

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

Machine Learning for natural language processing

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

NEURAL NETWORKS IN DATA MINING

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Healthcare Measurement Analysis Using Data mining Techniques

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Software Engineering of NLP-based Computer-assisted Coding Applications

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

E- Discovery in Criminal Law

LVQ Plug-In Algorithm for SQL Server

Data Mining Part 5. Prediction

Page 1 of 5. (Modules, Subjects) SENG DSYS PSYS KMS ADB INS IAT

Machine Learning and Statistics: What s the Connection?

Mining. Practical. Data. Monte F. Hancock, Jr. Chief Scientist, Celestech, Inc. CRC Press. Taylor & Francis Group

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

A Content based Spam Filtering Using Optical Back Propagation Technique

Social Media Mining. Data Mining Essentials

Classification algorithm in Data mining: An Overview

Doctor of Philosophy in Computer Science

Data Mining. Concepts, Models, Methods, and Algorithms. 2nd Edition

Introduction to Data Mining

Spam Detection Using Customized SimHash Function

Machine Learning for Cyber Security Intelligence

Data Mining Solutions for the Business Environment

IBM Unstructured Data Identification and Management

Proactive Data Management for ediscovery

Transcription:

FEBRUARY 3 5, 2015 / THE HILTON NEW YORK Using Artificial Intelligence to Manage Big Data for Litigation Understanding Artificial Intelligence to Make better decisions Improve the process Allay the fear And Keep Screen-Punching to a Minimum

Introduction Stuart Miles "Maybe the only significant difference between a really smart [AI] simulation and a human being was the noise they made when you punched them." Terry Pratchett, The Long Earth

Introduction Define your terms definitions matter Confirm that everyone has the same definition other parties and fact finders as well when making agreements If your counsel or expert cannot explain a concept or its application, start to question the counsel or expertise if you can't teach an idea, how well do you really know it?

Definitions Conventional Artificial Intelligence Deductive AI Formal and statistical analysis of human behavior Computational Intelligence Inductive AI Development or interactive learning using empirical data

Definitions "Big Data" Massive amounts of an organization's diverse data, ranging from email messages, to file servers, specialized databases, social media, online business transactions (e.g., Amazon), etc. Unstructured data (e.g., emails) Structured data (e.g., specialized databases, archives)

Important Big Data Considerations Anonymization Ownership Permissions Privacy Records Retention Security Tokenization or R n (Redaction)

State of ediscovery Technology Know the current state of ediscovery technologies General, non-ai applications of technology that may be applicable singly or in concert with AI technologies De-NISTing De-duplication (Hash) Parametric Boolean Key Word searches Date Ranges Custodial Filtering

State of ediscovery AI Predictive Coding, Assisted Review, and New Technologies have changed the game Unsupervised Learning Algorithms use what the data can provide without attorney input: Clustering Near-Duplicate Detection Concept Search Other types of input Linguistic Analysis

State of ediscovery AI Supervised Learning Algorithms use attorney review in combination with the back-end math: Active Learning Language Modeling Logistic Regression LSA & Probabilistic LSA Naïve Bayesian Classifiers Nearest Neighbor Relevance Feedback Support Vector Machines

State of ediscovery AI Active Learning: Learning algorithm that reduces human effort by selecting the most informative data for training Language Modeling: Seeks out ideas in context in large data collections rather than using keywords Latent Semantic Analysis: An extraction, identification and categorization of a large set of documents by using statistical analysis to identify meaning based on the contexts in which words appear

State of ediscovery AI Support Vector Machines: Machine learning technique for classifying images, text, and other data into groups Uses human-classified data ("training") to categorize unknown data based on its resemblance to training data Relevance Feedback: In an iterative process, human user identifies search results as "relevant" and the identified "relevant" information is used as the basis for the next search

New Data Sources Understand how big data represents new sources and associated challenges for litigation preservation, collection, and use Automated collection Database complexity "Known" and "unknown" unknowns Third-party data mining and related services

Alternative Uses of AI Don't limit considerations to ediscovery and consider other types of AI Use of non-tar and Learning Algorithms for nonproduction activities: Client data analysis Early case assessment Opposing and third-party productions Prior party productions and representations

Alternative Uses of AI A* ("A-star") Pathfinding algorithm for application to underpinning contentions Considerations of mapping point-by-point uses of "found" information to most directly support points for a fact finder

Alternative Uses of AI Neural Networks Modeled on the brain; weighs connection strength Applies to handwriting analysis and better OCR Applications for speech recognition in matters involving financial institutions or caches of like data Some even theorize that they might be applied to basic legal analysis D. Hunter, Looking for Law in all the Wrong Places: Legal Theory and Legal Neural Networks, in: A. Soeteman (eds.), Legal knowledge based systems JURIX 94: The Foundation for Legal Knowledge Systems, Lelystad: Koninklijke Vermande, 1994, pp. 55-64,

Alternative Uses of AI Genetic Algorithms A population of candidate solutions to an optimization problem evolves toward better solutions Applications for settlement considerations Pattern litigation considerations Some even theorize that they might be applied to basic legal analysis A.S. Pannu, Using Genetic Algorithms to Inductively Reason with Cases in the Legal Domain, Intelligent Systems Program, University of Pittsburgh, in: Proceedings of the Fifth International Conference on Artificial Intelligence and Law (ICAIL-95)

Miscellaneous AI is not replacing Subject Matter Experts Despite reliance on AI and associated technologies in many other areas Despite scholarship in this direction Future Considerations within AI Quantum computing Outputs from AI as evidence Pattern recognition that indicates a "truth" that doesn't exist

Glossary Anonymization A* ("A-star") Big Data Computational Intelligence Conventional Artificial Intelligence De-NISTing Deductive AI Genetic Algorithms Hash Inductive AI Language Modeling Latent Semantic Analysis ("LSA") Logistic Regression Naïve Bayesian Classifiers Neural Networks Parametric Boolean Permissions Relevance Feedback Structured data Support Vector Machines Tokenization Unstructured data Unsupervised Learning Algorithms