Decision making in the presence of uncertainty II



Similar documents
Decision & Risk Analysis Lecture 6. Risk and Utility

Decision Making under Uncertainty

Informatics 2D Reasoning and Agents Semester 2,

Choice Under Uncertainty

= = 106.

Choice under Uncertainty

Risk and Uncertainty. Vani K Borooah University of Ulster

Introduction to Game Theory IIIii. Payoffs: Probability and Expected Utility

Economics of Insurance

Chapter 14 Risk Analysis

Choice Under Uncertainty Insurance Diversification & Risk Sharing AIG. Uncertainty

Insurance. Michael Peters. December 27, 2013

Lecture 13: Risk Aversion and Expected Utility

Economics 1011a: Intermediate Microeconomics

STA 371G: Statistics and Modeling

Lecture 11 Uncertainty

Is a monetary incentive a feasible solution to some of the UK s most pressing health concerns?

Valuation of the Surrender Option Embedded in Equity-Linked Life Insurance. Brennan Schwartz (1976,1979) Brennan Schwartz

a. Again, this question will be answered on the basis of subjective judgment. Here are some possible types :

Absolute Value Equations and Inequalities

1 Introduction to Option Pricing

1 Uncertainty and Preferences

Lecture 15. Ranking Payoff Distributions: Stochastic Dominance. First-Order Stochastic Dominance: higher distribution

7: The CRR Market Model

Profit Maximization. 2. product homogeneity

Simple Decision Making

Chapter 5 Uncertainty and Consumer Behavior

Prospect Theory Ayelet Gneezy & Nicholas Epley

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games

DECISION MAKING UNDER UNCERTAINTY:

Chapter 2 Time value of money

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

One Period Binomial Model

Lecture 10 - Risk and Insurance

Intermediate Microeconomics (22014)

We never talked directly about the next two questions, but THINK about them they are related to everything we ve talked about during the past week:

Combining decision analysis and portfolio management to improve project selection in the exploration and production firm

Health Economics. University of Linz & Demand and supply of health insurance. Gerald J. Pruckner. Lecture Notes, Summer Term 2010

Lecture notes for Choice Under Uncertainty

Unit 4 DECISION ANALYSIS. Lesson 37. Decision Theory and Decision Trees. Learning objectives:

Chapter 4 DECISION ANALYSIS

Economics 1011a: Intermediate Microeconomics

Options Markets: Introduction

: Corporate Finance. Financial Decision Making

3 Introduction to Assessing Risk

Investment Appraisal INTRODUCTION

CAN BONUS-MALUS ALLIEVIATE INSURANCE FRAUD?

Math 120 Final Exam Practice Problems, Form: A

Minimax Strategies. Minimax Strategies. Zero Sum Games. Why Zero Sum Games? An Example. An Example

Next Tuesday: Amit Gandhi guest lecture on empirical work on auctions Next Wednesday: first problem set due

Economics 2020a / HBS 4010 / HKS API-111 Fall 2011 Practice Problems for Lectures 1 to 11

Holding Period Return. Return, Risk, and Risk Aversion. Percentage Return or Dollar Return? An Example. Percentage Return or Dollar Return? 10% or 10?

2. Information Economics

Regret and Rejoicing Effects on Mixed Insurance *

6.825 Exercise Solutions, Decision Theory

Chapter 11 Monte Carlo Simulation

An Overview of Asset Pricing Models

Mathematical finance and linear programming (optimization)

Tools for Project Evaluation. Nathaniel Osgood 1.040/1.401J 2/11/2004

LINEAR INEQUALITIES. less than, < 2x + 5 x 3 less than or equal to, greater than, > 3x 2 x 6 greater than or equal to,

Econ 132 C. Health Insurance: U.S., Risk Pooling, Risk Aversion, Moral Hazard, Rand Study 7

Answer: The relationship cannot be determined.

MATH 60 NOTEBOOK CERTIFICATIONS

Principal Agent Model. Setup of the basic agency model. Agency theory and Accounting. 2 nd session: Principal Agent Problem

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

Lecture 3: Put Options and Distribution-Free Results

LECTURES ON REAL OPTIONS: PART I BASIC CONCEPTS

Follow links for Class Use and other Permissions. For more information send to:

Lecture 4: Derivatives

I. Derivatives, call and put options, bounds on option prices, combined strategies

Chapter 25: Exchange in Insurance Markets

Solving Rational Equations

Table 1: Field Experiment Dependent Variable Probability of Donation (0 to 100)

2 SYSTEM DESCRIPTION TECHNIQUES

Understanding Options: Calls and Puts

Advanced Microeconomics

1 Lecture: Integration of rational functions by decomposition

3.1 Solving Systems Using Tables and Graphs

5.1 Radical Notation and Rational Exponents

Decision trees DECISION NODES. Kluwer Academic Publishers / X_220. Stuart Eriksen 1, Candice Huynh 1, and L.

Lecture 11. Sergei Fedotov Introduction to Financial Mathematics. Sergei Fedotov (University of Manchester) / 7

is identically equal to x 2 +3x +2

Problem Set 1 Solutions

Chapter 4 Online Appendix: The Mathematics of Utility Functions

Consumer Theory. The consumer s problem

Version Spaces.

Price Elasticity of Supply; Consumer Preferences

Utility. M. Utku Ünver Micro Theory. M. Utku Ünver Micro Theory Utility 1 / 15

CHAPTER 6 RISK AND RISK AVERSION

Formal decision analysis in the face of uncertainty frequently occurs. Decision Making Under Uncertainty CHAPTER 10

Chapter 5 Risk and Return ANSWERS TO SELECTED END-OF-CHAPTER QUESTIONS

Problem Set 9 Solutions

CHAPTER 20: OPTIONS MARKETS: INTRODUCTION

Econ 101: Principles of Microeconomics

COMMERCE MENTORSHIP PROGRAM COMM295: MANAGERIAL ECONOMICS FINAL EXAM REVIEW SOLUTION KEY

Competitive Screening in Credit Markets with Adverse Selection: A Cover Version of Bester s Model

Second Degree Price Discrimination - Examples 1

UC Berkeley Haas School of Business Economic Analysis for Business Decisions (EWMBA 201A)

Transcription:

CS 274 Knowledge representation Lecture 23 Decision making in the presence of uncertainty II Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott Square Information-gathering actions Many actions and their outcomes irreversibly change the world Information-gathering (exploratory) actions: make an inquiry about the world Key benefit: reduction in the uncertainty Example: medicine Assume a patient is admitted to the hospital with some set of initial complaints We are uncertain about the underlying problem and consider a surgery, or a medication to treat them But there are often lab tests or observations that can help us to determine more closely the disease the patient suffers from Goal of lab tests: Reduce the uncertainty of outcomes of treatments so that better treatment option can be chosen

Decision-making with exploratory actions In decision trees: Exploratory actions can be represented and reasoned about the same way as other actions. How do we capture the effect of exploratory actions in the decision tree model? Information obtained through exploratory actions may affect the probabilities of later outcomes Recall that the probabilities on later outcomes can be conditioned on past observed outcomes and past actions Sequence of past actions and outcomes is remembered within the decision tree branch Oil wildcatter problem. An oil wildcatter has to make a decision of whether to drill or not to drill on a specific site Chance of hitting an oil deposit: Oil: 4% Oil = T ) =.4 No-oil: 6% Oil = F ) =.6 Cost of drilling: 7K Payoffs: Oil: 22K No-oil: K.4 22-7=15.6-7 1.

Oil wildcatter problem. An oil wildcatter has to make a decision of whether to drill or not to drill on a specific site Chance of hitting an oil deposit: Oil: 4% Oil = T ) =.4 No-oil: 6% Oil = F ) =.6 Cost of drilling: 7K Payoffs: Oil: 22K No-oil: K.4 22-7=15.6-7 1. Oil cost: 1K Oil wildcatter problem Assume that in addition to the drill/no-drill choices we have an option to run the seismic resonance test Seismic resonance test results: Closed pattern (more likely when the hole holds the oil) Diffuse pattern (more likely when empty) Oil Seismic resonance test) Seismic resonance test pattern closed diffuse True False.8.3.2.7

Decision tree Oil wildcatter problem. (closed).5.64.36 1..16.5.84 (diffuse) 1..4 22-7=15.6-7 1. 22-7-1=14-7-1=-8-1=-1 22-7-1=14-7-1=-8-1=-1 Alternative model Oil wildcatter problem. (closed).5.64.36 1..16.5.84 (diffuse) 1. No.4.6 1. 22-7-1=14-7-1=-8-1=-1 22-7-1=14-7-1=-8-1=-1 22-7=15-7

Oil wildcatter problem. Decision tree probabilities.64 22-7-1=14.5 (closed).36 1. -7-1=-8-1=-1.5 No Oil = closed) P ( Oil = T = closed ) = P ( = closed Oil = T ) Oil = closed ) = T ) Oil = F = closed ) = = closed T Oil = F ) Oil = closed ) = F ) P ( = closed) = = closed Oil = F) Oil = F) + = closed Oil = T ) Oil = T) Oil wildcatter problem. Decision tree probabilities.64 22-7-1=14.5 (closed).36 1. -7-1=-8-1=-1 No.5 (diffuse) ) P ( = closed) = = closed Oil = F) Oil = F) + = closed Oil = T ) Oil = T) P ( = diff ) = = diff Oil = F) Oil = F) + = diff Oil = T ) Oil = T )

Decision tree 25.4 No Oil wildcatter problem. 6.8 (closed).5 6.8.64.36-1 1. -44.8.16.5-1 (diffuse) -1.84 1..4.6 1. 22-7-1=14-7-1=-8-1=-1 22-7-1=14-7-1=-8-1=-1 22-7=15-7 Decision tree Oil wildcatter problem. 6.8.36 (closed) -7-1=-8-1.5 1. 25.4-1=-1-44.8.16 22-7-1=14.5-1.84 (diffuse) -7-1=-8-1 1. -1=-1 No.4 The presence of the test and its result affected 22-7=15 our decision:.6-7 if test =closed then drill if test=diffuse then do not drill 1. 6.8.64 22-7-1=14

Value of information When the test makes sense? Only when its result makes the decision maker to change his mind, that is he decides not to drill. Value of information: Measure of the goodness of the information from the test Difference between the expected value with and without the test information Oil wildcatter example: Expected value without the test = Expected value with the test =25.4 Value of information for the seismic test = 7.4 Utilities

Selection based on expected values Until now: The optimal action choice was the option that maximized the expected monetary value. But is the expected monetary value always the quantity we want to optimize? Stock 1 Stock 2 Bank Home 12 14 11 1.6.4.4.6 1. 1. (up) 11 (down) 9 (up) 14 (down) 8 11 1 Selection based on expected values Is the expected monetary value always the quantity we want to optimize? Answer: Yes, but only if we are risk-neutral. But what if we do not like the risk (we are risk-averse)? In that case we may want to get the premium for undertaking the risk (of loosing the money) Example: we may prefer to get $11 for sure against $12 in expectation but with the risk of loosing the money Problem: How to model decisions and account for the risk? Solution: use utility function, and utility theory

Utility function Utility function (denoted U) Quantifies how we value outcomes, i.e., it reflects our preferences Can be also applied to value outcomes other than money and gains (e.g. utility of a patient being healthy, or ill) Decision making: uses expected utilities (denoted EU) EU ( X ) = X = x) U ( X = x) x Ω X U ( X = x) the utility of outcome x Important!!! Under some conditions on preferences we can always design the utility function that fits our preferences Utility theory Defines axioms on preferences that involve uncertainty and ways to manipulate them. Uncertainty is modeled through lotteries Lottery: [ p : A; (1 p) : C ] Outcome A with probability p Outcome C with probability (1-p) The following six constraints are known as the axioms of utility theory. The axioms are the most obvious semantic constraints on preferences with lotteries. Notation: f ~ - preferable - indifferent (equally preferable)

Axioms of the utility theory Orderability: Given any two states, the a rational agent prefers one of them, else the two as equally preferable. ( A f B) ( B f A) ( A ~ B) Transitivity: Given any three states, if an agent prefers A to B and prefers B to C, agent must prefer A to C. ( A f B) ( B f C ) ( A f C ) Continuity: If some state B is between A and C in preference, then there is a p for which the rational agent will be indifferent between state B and the lottery in which A comes with probability p, C with probability (1-p). ( A f B f C ) p [ p : A; (1 p) : C ] ~ B Axioms of the utility theory Substitutability: If an agent is indifferent between two lotteries, A and B, then there is a more complex lottery in which A can be substituted with B. ( A ~ B) [ p : A; (1 p) : C ] ~ [ p : B; (1 p) : C ] Monotonicity: If an agent prefers A to B, then the agent must prefer the lottery in which A occurs with a higher probability ( A f B) ( p > q [ p : A; (1 p) : B] f [ q : A; (1 q) : B]) Decomposability: Compound lotteries can be reduced to simpler lotteries using the laws of probability. [ p : [ p : A; (1 p) : [ q : B; (1 q) : C ]] A; (1 p) q : B; (1 p)(1 q) : C ]

Utility theory If the agent obeys the axioms of the utility theory, then 1. there exists a real valued function U such that: U ( A) > U ( B) A f B U ( A) = U ( B) A ~ B 2. The utility of the lottery is the expected utility, that is the sum of utilities of outcomes weighted by their probability U [ p : A; (1 p) : B] = pu ( A) + (1 p) U ( B) 3. Rational agent makes the decisions in the presence of uncertainty by maximizing its expected utility Utility functions We can design a utility function that fits our preferences if they satisfy the axioms of utility theory. But how to design the utility function for monetary values so that they incorporate the risk? What is the relation between utility function and monetary values? Assume we loose or gain $1. Typically this difference is more significant for lower values (around $1-1) than for higher values (~ $1,,) What is the relation between utilities and monetary value for a typical person?

Utility functions What is the relation between utilities and monetary value for a typical person? Concave function that flattens at higher monetary values utility 1, Monetary value Utility functions Expected utility of a sure outcome of 75 is 75 utility EU(sure 75) U(x) 5 75 1 Monetary value

Utility functions Assume a lottery L [.5: 5,.5:1] Expected value of the lottery = 75 Expected utility of the lottery EU(L) is different: EU(L) =.5U(5) +.5*U(1) U(x) utility EU(lottery L) EU line for lotteries with outcomes 5 and 1 Lottery L: [.5: 5,.5:1] 5 75 1 Monetary value Utility functions Expected utility of the lottery EU(lottery L) < EU(sure 75) utility EU(sure 75) EU(lottery L) U(x) Lottery L: [.5: 5,.5:1] 5 75 1 Monetary value Risk aversion a bonus is required for undertaking the risk