OPTIMIZATION AND OPERATIONS RESEARCH Vol. IV - Decision Trees and Influence Diagrams - Prakash P. Shenoy

Similar documents
SEQUENTIAL VALUATION NETWORKS: A NEW GRAPHICAL TECHNIQUE FOR ASYMMETRIC DECISION PROBLEMS

ANALYZING SYSTEM MAINTAINABILITY USING ENTERPRISE ARCHITECTURE MODELS

I I I I I I I I I I I I I I I I I I I

COMPUTING DURATION, SLACK TIME, AND CRITICALITY UNCERTAINTIES IN PATH-INDEPENDENT PROJECT NETWORKS

INTEGRATED GLOBAL MODELS OF SUSTAINABLE DEVELOPMENT - Vol. II - Global Modeling and Reasoning Support Tools - Sree N. Sreenath

An Influence Diagram Model for Detecting Credit Card Fraud

Another Look at Sensitivity of Bayesian Networks to Imprecise Probabilities

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION - Vol. I - Basic Elements of Control Systems - Ganti Prasada Rao

In Proceedings of the Eleventh Conference on Biocybernetics and Biomedical Engineering, pages , Warsaw, Poland, December 2-4, 1999

The Certainty-Factor Model

Decision Trees and Networks

Probabilistic Evidential Reasoning with Symbolic Argumentation for Space Situation Awareness

Real Time Decision Support System for Portfolio Management

A Belief-Function Based Decision Support System

DETERMINING THE CONDITIONAL PROBABILITIES IN BAYESIAN NETWORKS

DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES

Degree Reduction of Interval SB Curves

Chapter 14 Managing Operational Risks with Bayesian Networks

ARTIFICIAL INTELLIGENCE: DEFINITION, TRENDS, TECHNIQUES, AND CASES

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XVI - Fault Accomodation Using Model Predictive Methods - Jovan D. Bošković and Raman K.

Effective Analysis and Predictive Model of Stroke Disease using Classification Methods

Patent Big Data Analysis by R Data Language for Technology Management

Compression algorithm for Bayesian network modeling of binary systems

3. The Junction Tree Algorithms

THIS paper presents a decision analysis approach for

AnalysisofData MiningClassificationwithDecisiontreeTechnique

Chapter 7.30 Retrieving Medical Records Using Bayesian Networks

University of Portsmouth PORTSMOUTH Hants UNITED KINGDOM PO1 2UP

2010 Master of Science Computer Science Department, University of Massachusetts Amherst

A Special Session on. Handling Uncertainties in Big Data by Fuzzy Systems

Decision-Theoretic Troubleshooting: A Framework for Repair and Experiment

EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT S ACADEMIC PERFORMANCE

Simulation: An Effective Marketing Tool

Decision trees DECISION NODES. Kluwer Academic Publishers / X_220. Stuart Eriksen 1, Candice Huynh 1, and L.

Course Outline Department of Computing Science Faculty of Science. COMP Applied Artificial Intelligence (3,1,0) Fall 2015

Locating and Decoding EAN-13 Barcodes from Images Captured by Digital Cameras

THE WHE TO PLAY. Teacher s Guide Getting Started. Shereen Khan & Fayad Ali Trinidad and Tobago

SYSM 6304: Risk and Decision Analysis Lecture 5: Methods of Risk Analysis

Doctor of Philosophy in Computer Science

Fault Localization in a Software Project using Back- Tracking Principles of Matrix Dependency

KNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

POLYNOMIAL FUNCTIONS

Unit 2.1. Data Analysis 1 - V Data Analysis 1. Dr Gordon Russell, Napier University

INFORMATION SECURITY RISK ASSESSMENT UNDER UNCERTAINTY USING DYNAMIC BAYESIAN NETWORKS

REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES

International Journal of Emerging Technology & Research

Psychological Pricing in Mergers & Acquisitions using Game Theory

Include Requirement (R)

Incorporating Evidence in Bayesian networks with the Select Operator

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Multi-ultrasonic sensor fusion for autonomous mobile robots

Healthcare Data Mining: Prediction Inpatient Length of Stay

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification

NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR

Advances in Health Care Predictive Analytics

THE DEVELOPMENT OF AN EXPERT CAR FAILURE DIAGNOSIS SYSTEM WITH BAYESIAN APPROACH

Optimal Replacement of Underground Distribution Cables

RISK MANAGEMENT HEALTH CARE

COMPLEXITY RISING: FROM HUMAN BEINGS TO HUMAN CIVILIZATION, A COMPLEXITY PROFILE. Y. Bar-Yam New England Complex Systems Institute, Cambridge, MA, USA

Game Theory 1. Introduction

Software Development Cost and Time Forecasting Using a High Performance Artificial Neural Network Model

Prediction of Heart Disease Using Naïve Bayes Algorithm

Information processing for new generation of clinical decision support systems

How To Create A Data Transformation And Data Visualization Tool In Java (Xslt) (Programming) (Data Visualization) (Business Process) (Code) (Powerpoint) (Scripting) (Xsv) (Mapper) (

A SURVEY ON GENETIC ALGORITHM FOR INTRUSION DETECTION SYSTEM

DATA MINING AND REPORTING IN HEALTHCARE

EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH

Tracking Groups of Pedestrians in Video Sequences

Switching Cost, Competition, and Pricing in the Property/Casualty Insurance Market for Large Commercial Accounts

Modeling and Design of Intelligent Agent System

What is the state of the world?

bdepartment of Computer Science, Stanford University, Stanford, California

SAMPLE CHAPTERS UNESCO EOLSS PID CONTROL. Araki M. Kyoto University, Japan

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

HISTORICAL DEVELOPMENTS AND THEORETICAL APPROACHES IN SOCIOLOGY Vol. I - Social Network Analysis - Wouter de Nooy

A Bayesian Network Model for Diagnosis of Liver Disorders Agnieszka Onisko, M.S., 1,2 Marek J. Druzdzel, Ph.D., 1 and Hanna Wasyluk, M.D.,Ph.D.

Game Theory Perspectives on Client Vendor relationships in offshore software outsourcing

Linguistic Preference Modeling: Foundation Models and New Trends. Extended Abstract

Third School on Belief Functions and Their Applications. Final report

Course: Decision Analysis Models and Applications (PhD in Engineering and Management, PhD in Engineering and Public Policy)

Use Decision Trees to Make Important Project Decisions 1

Course Syllabus For Operations Management. Management Information Systems

Different Conceptions in Software Project Risk Assessment

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

Bayesian Nash Equilibrium

An ACO Approach to Solve a Variant of TSP

Randomly Encryption Using Genetic Algorithm

A Probabilistic Causal Model for Diagnosis of Liver Disorders

ANANDHI S. BHARADWAJ

INTERNATIONAL ECONOMICS, FINANCE AND TRADE Vol.I - Economics of Scale and Imperfect Competition - Bharati Basu

SYSTEMS ANALYSIS AND MODELING OF INTEGRATED WORLD SYSTEMS - Vol. I - Simulation Software - Yu. I. Brodsky

SAMPLE CHAPTERS UNESCO EOLSS DIGITAL INSTRUMENTS. García J. and García D.F. University of Oviedo, Spain

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

1 Representation of Games. Kerschbamer: Commitment and Information in Games

UNIVERSITY OF MUMBAI

A Contribution to Expert Decision-based Virtual Product Development

IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES

Transcription:

DECISION TREES AND INFLUENCE DIAGRAMS Prakash P. Shenoy University of Kansas, Lawrence, USA Keywords: Decision trees, influence diagrams, decision making under uncertainty Contents 1. Introduction 2. A Medical Diagnosis Problem 3. Decision Trees 3.1. Decision Tree Representation 3.2. Decision Tree Solution 3.3. Strengths and Weaknesses of the Decision Tree Representation Technique 3.4. Strengths and Weaknesses of the Decision Tree Solution Technique 4. Influence Diagrams 4.1. Influence Diagram Representation 4.2. The Arc-Reversal Technique for Solving Influence Diagrams 4.3. Strengths and Weaknesses of the Influence Diagram Representation Technique 4.4. Strengths and Weaknesses of the Arc-Reversal Solution Technique 5. Summary and Conclusions Glossary Bibliography Biographical Sketch Summary This chapter describes decision trees and influence diagrams. We start with a small decision problem called Medical Diagnosis. Next we describe the decision tree representation and solution technique and illustrate it using the Medical Diagnosis problem. Then we state some strengths and weaknesses of the decision tree representation and solution technique. Next we describe the influence diagram representation and solution technique and illustrate it using the Medical Diagnosis problem. Then we describe some strengths and weaknesses of the influence diagram representation technique. Finally we conclude with a summary. 1. Introduction The main goal of this chapter is to describe decision trees and influence diagrams, both of which are formal mathematical techniques for representing and solving one-person decision problems under uncertainty. Decision trees have their genesis in the pioneering work of von Neumann and Morgenstern (1944) on extensive form games. Decision trees graphically depict all possible scenarios. The decision tree representation allows computation of an optimal strategy by the backward recursion method of dynamic programming. Howard Raiffa (1968) calls the dynamic programming method for solving decision trees averaging out and folding back.

Influence diagram is another method for representing and solving decision problems. Influence diagrams were initially proposed by Ron Howard and Jim Matheson (1981) as a method only for representing decision problems. The motivation behind the formulation of influence diagrams was to find a method for representing decision problems without any preprocessing. Subsequently, Scott Olmsted (1983) and Ross Shachter (1986) devised methods for solving influence diagrams directly, without first having to convert influence diagrams to decision trees. In the last decade, influence diagrams have become popular for representing and solving decision problems. 2. A Medical Diagnosis Problem In this section, we will state a simple symmetric decision problem that involves Bayesian revision of probabilities (a decision problem is said to be asymmetric if there exists a decision tree representation such that the number of scenarios in the representation is less than the product of the cardinalities of the state spaces of the decision and chance variables in the representation, and a decision problem is said to be symmetric if it is not asymmetric). This will enable us to show the strengths and weaknesses of the various methods for such problems. A physician is trying to decide on a policy for treating patients suspected of suffering from a disease D. D causes a pathological state P that in turn causes symptom S to be exhibited. The physician first observes whether or not a patient is exhibiting symptom S. Based on this observation, he/she either treats the patient (for D and P) or not. The physician s utility function depends on his/her decision to treat or not, the presence or absence of disease D, and the presence or absence of pathological state P. The prior probability of disease D is 10%. For patients known to suffer from D, 80% suffer from pathological state P. On the other hand, for patients known not to suffer from D, 15% suffer from P. For patients known to suffer from P, 70% exhibit symptom S. And for patients known not to suffer from P, 20% exhibit symptom S. We assume D and S are probabilistically conditionally independent given P. Table 1 shows the physician s utility function. Physician s States Utilities Has pathological state (p) No pathological state (~p) (υ) Has disease (d) No disease (~d) Has disease (d) No disease (~d) Treat (t) 10 6 8 4 Acts Not treat (~t) 0 2 1 10 Table 1: The physician s utility function for all act state pairs 3. Decision Trees In this section, we describe a decision tree representation and solution of the Medical Diagnosis problem. Also, we describe the strengths and weaknesses of the decision tree representation and solution techniques.

3.1. Decision Tree Representation Figure 1 shows the preprocessing of probabilities that has to be done before we can complete a decision tree representation of the Medical Diagnosis problem. In the probability tree on the left, we compute the joint probability distribution by multiplying the conditionals. For example, Pr( d, p, s) = Pr( d) Pr( pd ) Pr( s p) = (0.10)(0.80)(0.70) = 0.0560. (1) In the probability tree on the right, we compute the desired conditionals by additions and divisions. For example, Pr( s) = Pr( s, pd, ) + Pr( s, p,~ d) + Pr( s,~ pd, ) + Pr( s,~ p,~ d) = 0.0560 + 0.0945 + 0.0040 + 0.1530 = 0.3075, Pr( ps ) = Pr( s, p) Pr( s, p, d) + Pr( s, p,~ d) 0.0560 + 0.0945 = = = 0.4894, Pr( s) Pr( s) 0.3075 and Pr( s, p, d) Pr( s, p, d) 0.0560 Pr( ds, p) = = = = 0.3721. Pr( s, p) Pr( s, p, d) + Pr( s, p,~ d) 0.0560 + 0.0945 (2) (3) Figure 1: The preprocessing of probabilities in the Medical Diagnosis problem

Figure 2 shows a complete decision tree representation of the Medical Diagnosis problem. Each path from the root node to a leaf node represents a scenario. This tree has 16 scenarios. A decision problem is said to be asymmetric if there exists a decision tree representation such that the number of scenarios in the decision tree representation is less than the product of the cardinalities of the states spaces of the chance and decision variables in the problem. The Medical Diagnosis problem is symmetric since the number of scenarios is16= ΘS ΘT ΘP Θ D = 2222 ( Θ S denotes the set of all possible values of S and Θ S denotes its cardinality). Figure 2: A decision tree representation of the Medical Diagnosis problem

Figure 3: A decision tree solution of the Medical Diagnosis problem using coalescence - - - TO ACCESS ALL THE 19 PAGES OF THIS CHAPTER, Visit: http://www.eolss.net/eolss-sampleallchapter.aspx

Bibliography Bertele U. and Brioschi F. (1972). Nonserial Dynamic Programming. 235 pp. New York: Academic Press. [This is the basic reference for dynamic programming, an optimization technique based on divide and conquer philosophy.] Bielza C. and Shenoy P.P. (1999). A comparison of graphical techniques for asymmetric decision problems. Management Science. 45 (11), 1552 1569. [This paper compares four different graphical techniques for representation and solution of asymmetric decision problems.] Howard R. A. and Matheson J.E. (1981). Influence diagrams. The Principles and Applications of Decision Analysis. (ed. R.A. Howard and J.E. Matheson. 1984). 2, 719 762. Menlo Park, CA: Strategic Decisions Group, [This is the seminal paper that introduced the influence diagrams representation technique.] Olmsted S.M. (1983). On Representing And Solving Decision Problems. Ph.D. dissertation, 112 pp., Department of Engineering Economic Systems, Stanford University. [This Ph.D. dissertation is the first one to suggest a technique for solving influence diagrams directly, without first converting an influence diagram to a decision tree.] Pearl J. (1986). Fusion, propagation and structuring in belief networks. Artificial Intelligence. 29, 241 288. [This paper describes Bayes nets and a d-separation property of Bayes nets that can be used for detecting conditional independence properties of the joint probability distribution encoded by it.] Raiffa H. (1968). Decision Analysis: Introductory Lectures on Choices Under Uncertainty, 312 pp. Reading, MA: Addison-Wesley. [This is the basic reference for decision trees.] Shachter R.D. (1986). Evaluating influence diagrams. Operations Research. 34, 871 882. [This paper is widely cited for describing Olmsted s influence diagram solution technique on a sound mathematical footing.] Shenoy P.P. (1992). Valuation-based systems for Bayesian decision analysis. Operations Research. 40 (3), 463 484. [This paper describes a valuation-based systems approach to representing and solving decision problems.] Shenoy P.P. (1994). A comparison of graphical techniques for decision analysis. European Journal of Operational Research. 78 (1), 1 21. [This paper compares decision trees, influence diagrams and valuation networks.] Tatman J.A. and Shachter R.D. (1990). Dynamic programming and influence diagrams. IEEE Transactions on Systems, Man, and Cybernetics. 20 (2), 365 379. [This paper describes solution of influence diagram when the joint utility function factors into several smaller functions.] von Neumann J. and Morgenstern O. (1944). Theory of Games and Economic Behavior. 1st edition. 648 pp. New York: John Wiley & Sons. [This path-breaking treatise on game theory is the foundation for decision trees representation technique.] Biographical Sketch Prakash P. Shenoy is the Ronald G. Harper Distinguished Professor of Artificial Intelligence in Business, University of Kansas at Lawrence. He received a B.Tech.. in Mechanical Engineering from the Indian Institute of Technology, Bombay, India, in 1973, and an M.S. and a Ph.D. in Operations Research from Cornell University in 1975 and 1977 respectively. His research interests are in the areas of artificial intelligence and decision sciences. He has published many articles on management of uncertainty in expert systems, decision analysis, and the mathematical theory of games. His articles have appeared in journals such as Operations Research, Management Science, Artificial Intelligence, and International Journal of Approximate Reasoning. He has received several research grants from the Database and Expert Systems (DES), and Decision, Risk and Management Science (DRMS) programs of the National Science Foundation, the Research Opportunities in Auditing program of the Peat Marwick Main Foundation, the Higher Education Academic Development Donations program of Apple Computer, Inc., and the Information Sciences Department of Hughes Research Laboratories. He served as the North-American editor of International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, and as an

associate editor of Operations Research, Management Science, and International Journal of Approximate Reasoning, and as an ad-hoc referee for over 30 journals and conferences in Artificial Intelligence and Management Science/Operations Research. He served as Program Co-Chair of the Thirteenth Conference on Uncertainty in Artificial Intelligence held at Brown University, Providence, 1997, and as Conference Chair of the Fourteenth Conference on Uncertainty in Artificial Intelligence held at University of Wisconsin-Madison in 1998.