Extensive-Form Games. Branislav Bošanský AE4M36MAS, Fall 2015

Similar documents
Théorie de la décision et théorie des jeux Stefano Moretti

Game Theory 1. Introduction

Backward Induction and Subgame Perfection

6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation

Sequential lmove Games. Using Backward Induction (Rollback) to Find Equilibrium

Microeconomic Theory Jamison / Kohlberg / Avery Problem Set 4 Solutions Spring (a) LEFT CENTER RIGHT TOP 8, 5 0, 0 6, 3 BOTTOM 0, 0 7, 6 6, 3

Economics 159: Introduction to Game Theory. Summer 2015: Session A. Online Course

Game Theory and Algorithms Lecture 10: Extensive Games: Critiques and Extensions

Equilibrium computation: Part 1

1 Representation of Games. Kerschbamer: Commitment and Information in Games

University of Oslo Department of Economics

Bayesian Nash Equilibrium

Perfect Bayesian Equilibrium

Week 7 - Game Theory and Industrial Organisation

Industry profit in an oligopoly (sum of all firms profits) < monopoly profit.

21. Unverifiable Investment, Hold Up, Options and Ownership

I. Noncooperative Oligopoly

Game Theory in Wireless Networks: A Tutorial

Competition and Regulation. Lecture 2: Background on imperfect competition

Individual security and network design

UCLA. Department of Economics Ph. D. Preliminary Exam Micro-Economic Theory

Modeling Internet Security Investments Tackling Topological Information Uncertainty

AN INTRODUCTION TO GAME THEORY

Best-response play in partially observable card games

6.254 : Game Theory with Engineering Applications Lecture 1: Introduction

How to Win Texas Hold em Poker

Infinitely Repeated Games with Discounting Ù

Psychology and Economics (Lecture 17)

Simon Fraser University Spring Econ 302 D200 Final Exam Solution Instructor: Songzi Du Tuesday April 21, 2015, 12 3 PM

Compact Representations and Approximations for Compuation in Games

Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi

The vertical differentiation model in the insurance market: costs structure and equilibria analysis

ECO 199 B GAMES OF STRATEGY Spring Term 2004 PROBLEM SET 4 B DRAFT ANSWER KEY

Universidad Carlos III de Madrid Game Theory Problem Set - Dynamic Games

Evolutionary dynamics and backward induction

Decision Trees and Networks

Climate-Change Treaties: A Game-Theoretic Approach Roy Radner Stern School, New York University

Working Paper Series

Imperfect monitoring in communication networks

Investing without credible inter-period regulations:

6.207/14.15: Networks Lectures 19-21: Incomplete Information: Bayesian Nash Equilibria, Auctions and Introduction to Social Learning

Department of Economics The Ohio State University Economics 817: Game Theory

Internet Advertising and the Generalized Second Price Auction:

6.3 Conditional Probability and Independence

0.0.2 Pareto Efficiency (Sec. 4, Ch. 1 of text)

Mistakes Allow Evolutionary Stability in the Repeated Prisoner's Dilemma Game. I. Introduction

Hacking-proofness and Stability in a Model of Information Security Networks

ECON 312: Oligopolisitic Competition 1. Industrial Organization Oligopolistic Competition

Techniques for Supporting Prediction of Security Breaches in. Critical Cloud Infrastructures Using Bayesian Network and. Markov Decision Process

Bad Reputation. March 26, 2002

Secure or Insure? Analyzing Network Security Games with Externalities

Dynamic Bayesian Spectrum Bargaining with Non-Myopic Users

3.2 Roulette and Markov Chains

When is Reputation Bad? 1

6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games

GAMES FOR BUSINESS AND ECONOMICS

Part 2: Community Detection

Game Theory. An introduction to the concepts of dominant strategies, Nash equilibrium and strategic commitment

Introduction Solvability Rules Computer Solution Implementation. Connect Four. March 9, Connect Four

Computational Learning Theory Spring Semester, 2003/4. Lecture 1: March 2

Oligopoly: How do firms behave when there are only a few competitors? These firms produce all or most of their industry s output.

Cournot s model of oligopoly

Games Manipulators Play

Development of dynamically evolving and self-adaptive software. 1. Background

IE 680 Special Topics in Production Systems: Networks, Routing and Logistics*

Game theory and AI: a unified approach to poker games

A game-theoretic analysis of preventing spam. over Internet Telephony via audio. CAPTCHA-based authentication

A Game Theoretical Framework for Adversarial Learning

SECTION 10-2 Mathematical Induction

Game Theory: Supermodular Games 1

Comparing Decision Support Approaches for Cyber Security Investment

Decision trees DECISION NODES. Kluwer Academic Publishers / X_220. Stuart Eriksen 1, Candice Huynh 1, and L.

Economics 431 Fall nd midterm Answer Key

Advanced Game Theory

Chapter 4 DECISION ANALYSIS

Labor Economics, Lecture 3: Education, Selection, and Signaling

Computing the Electricity Market Equilibrium: Uses of market equilibrium models

Bounding Rationality by Discounting Time

Rafael Witten Yuze Huang Haithem Turki. Playing Strong Poker. 1. Why Poker?

Ch. 13.3: More about Probability

Oligopoly: Cournot/Bertrand/Stackelberg

Measuring the Performance of an Agent

Christmas Gift Exchange Games

Switching Cost, Competition, and Pricing in the Property/Casualty Insurance Market for Large Commercial Accounts

On Nash-solvability of chess-like games

6.231 Dynamic Programming and Stochastic Control Fall 2008

The Plaintiff s Attorney in the Liability Insurance Claims Settlement Process: A Game Theoretic Approach

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification

ECON Game Theory Exam 1 - Answer Key. 4) All exams must be turned in by 1:45 pm. No extensions will be granted.

CeDEx Discussion Paper Series. Discussion Paper No J. Arin, V. Feltkamp and M. Montero July 2012

A Graph-Theoretic Network Security Game

SE351a: Software Project & Process Management

Goal Problems in Gambling and Game Theory. Bill Sudderth. School of Statistics University of Minnesota

Oligopoly. Oligopoly is a market structure in which the number of sellers is small.

Agent Applications in Network Security Monitoring

CS MidTerm Exam 4/1/2004 Name: KEY. Page Max Score Total 139

The Basics of Game Theory

Game Mining: How to Make Money from those about to Play a Game

Parallel Algorithm for Dense Matrix Multiplication

Network Security A Decision and Game-Theoretic Approach

Transcription:

Extensive-Form Games Branislav Bošanský AE4M36MAS, Fall 2015

Finite Sequential Games Different representations Normal-form Games visually represented as matrix games Extensive-form Games finite sequential games that evolve in time the concept of time is implicitly integrated into the model visually represented as finite trees general enough to represent different types of uncertainty

Finite Sequential Games

Finite Sequential Games

Extensive-Form Games Perfect-Information Games Perfect-Information Games with Chance Imperfect-Information Games Solving Zero-Sum Games Solving General-Sum Games Approximate Solutions to Large Zero-Sum Games

Game Theory: Extensive-Form Games States Players 1 2 Actions Utility

Game Theory: Extensive-Form Games

Game Theory: Extensive-Form Games Definition: A (finite) perfect-information game in the extensive form is defined as a tuple (N, A, H, Z, χ, ρ, σ, u), where: Players N = {1, 2, } Actions A Choice nodes and label for these nodes Choice nodes: H Action function: χ H 2 A Player function: ρ H N Terminal nodes Z Successor function σ H A H Z Utility function u = u 1,, u n ; u i Z R

EFGs: Actions and Strategies Pure strategy: an assignment of an action for each state S i = h H,ρ h =i χ(h) Action is uniquely identified by the state, in which it is taken. A 1 = {2 0, 1 1, 0 2} A 2 = {no 2 0, yes 2 0, no 1 1, yes 1 1, no 0 2, yes 0 2 } S 1 = {2 0, 1 1, 0 2} S 2 = { no 2 0, no 1 1, no 0 2, no 2 0, no 1 1, yes 0 2, }

Pure Strategies in EFGs

Induced Normal Form

Nash Equilibria in EFGs

Nash Equilibria in EFGs

Nash Equilibria in EFGs - threats

Solution Concepts in EFGs Sub-game perfect (SPE) sub-game of Grooted at some node h is the restriction of G to the descendants of h ( h ) Strategy profile is a sub-game perfect NE, if it is a NE for every sub-game of G Every SPE is NE Refinements of NE further assumptions on NE strategy profiles Sub-game perfect equilibria Sequential equilibria Quasi-perfect equilibria E. van Damme. Stability and Perfection of Nash Equilibria (1991)

Computing SPE: Backward Induction works for general-sum n-players games with perfect information computes pure strategy subgame-perfect equilibrium every perfect-information EFG has a pure (subgame-perfect) Nash equilibrium

Nash Equilibrium in EFGs SPE?

Nash Equilibrium in EFGs SPE?

Extensive-Form Games Perfect-Information Games Perfect-Information Games with Chance Imperfect-Information Games Solving Zero-Sum Games Solving General-Sum Games Approximate Solutions to Large Zero-Sum Games

EFGs with Chance Nodes c 0.7 0.3

Extensive-Form Games with Chance Definition: A (finite) perfect-information game in the extensive form is defined as a tuple (N, A, H, Z, χ, ρ, γ, σ, u), where: Players N = 1, 2, {c} Actions A Choice nodes and label for these nodes Choice nodes: H Action function: χ H 2 A Player function: ρ H N Stochastic transitions: γ χ(h) h H, ρ h = c} [0,1], a χ h γ a = 1 ( h H, ρ h = c) Terminal nodes Z Successor function σ H A H Z Utility function u = u 1,, u n ; u i Z R

Extensive-Form Games Perfect-Information Games Perfect-Information Games with Chance Imperfect-Information Games Solving Zero-Sum Games Solving General-Sum Games Approximate Solutions to Large Zero-Sum Games

Imperfect Information EFGs States Players 1 2 Information Set Actions Utility

Imperfect Information EFGs Definition: A (finite) imperfect-information game in the extensive form is defined as a tuple (N, A, H, Z, χ, ρ, σ, u, I), where: tuple N, A, H, Z, χ, ρ, σ, u is a perfect-information extensive-form game I = I 1,, I n, where I i = (I i,1,, I i,k ) is a set of equivalence classes on (i.e., a partition of) choice nodes of a player i with the property that χ h = χ(h ) and ρ h = ρ(h ), whenever h I i,j and h I i,j for some j we use χ(i i,j ) instead of χ(h) for some h I i,j

II EFGs: Actions and Strategies Pure strategy: an assignment of an action for each information set Action is uniquely identified by the information set, in which it is taken. A 1 = {2 0, 1 1, 0 2} A 2 = {no, yes} S 1 = {2 0, 1 1, 0 2} S 2 = {no, yes}

Strategies in EFGs Existence of a pure NE is no longer guaranteed for imperfect-information EFGs Mixed strategies Probabilistic distribution over pure strategies Behavioral Strategies Probabilistic distribution over actions to play for each information set There is a broad class of imperfect-information games in which the expressive power of mixed and behavioral strategies coincides. This is the class of games of perfect recall. Intuitively speaking, in these games no player forgets any information she previously knew.

Perfect Recall in EFGs Player i has perfect recall in an imperfect-information game G if for any two nodes h, h that are in the same information set for player i,for any path h 0, a 0, h 1, a 1, h 2,..., h n, a n, h from the root of the game tree to h (where the h j are decision nodes and the a j are actions) and for any path h 0, a 0, h 1, a 1, h 2,..., h m, a m, h from the root to h it must be the case that: 1. n = m 2. for all 0 j n, h j and h j are in the same equivalence class for player i 3. for all 0 j n, if ρ(h j ) = i (i.e., h j is a decision node of player i), then a j = a j.

Perfect vs. Imperfect Recall Remembering all information induces very large strategies Easier to solve Strategies can be compactly represented Smaller trees, unnecessary information can be forgotten Much harder to solve Equilibrium in behavior strategies might not exist

NE in Imperfect Recall Games Wichardt. Existence of Nash equilibria in finite extensive form games with imperfect recall: A counterexample (2008)