Extensive-Form Games with Imperfect Information

Similar documents
Perfect Bayesian Equilibrium

Games of Incomplete Information

Bayesian Nash Equilibrium

Competition and Regulation. Lecture 2: Background on imperfect competition

6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation

Simon Fraser University Spring Econ 302 D200 Final Exam Solution Instructor: Songzi Du Tuesday April 21, 2015, 12 3 PM

Game Theory and Algorithms Lecture 10: Extensive Games: Critiques and Extensions

Games of Incomplete Information

Cheap Talk : Multiple Senders and Multiple Receivers

6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games

6.207/14.15: Networks Lectures 19-21: Incomplete Information: Bayesian Nash Equilibria, Auctions and Introduction to Social Learning

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

AN INTRODUCTION TO GAME THEORY

Théorie de la décision et théorie des jeux Stefano Moretti

Midterm March (a) Consumer i s budget constraint is. c i b i c i H 12 (1 + r)b i c i L 12 (1 + r)b i ;

UCLA. Department of Economics Ph. D. Preliminary Exam Micro-Economic Theory

Backward Induction and Subgame Perfection

How to Win Texas Hold em Poker

Economics 159: Introduction to Game Theory. Summer 2015: Session A. Online Course

Lecture V: Mixed Strategies

2. Information Economics

How To Play A Static Bayesian Game

Infinitely Repeated Games with Discounting Ù

Oligopoly and Strategic Pricing

Universidad Carlos III de Madrid Game Theory Problem Set - Dynamic Games

Price competition with homogenous products: The Bertrand duopoly model [Simultaneous move price setting duopoly]

Sequential lmove Games. Using Backward Induction (Rollback) to Find Equilibrium

The Plaintiff s Attorney in the Liability Insurance Claims Settlement Process: A Game Theoretic Approach

Equilibrium computation: Part 1

Bad Reputation. March 26, 2002

Capital Structure. Itay Goldstein. Wharton School, University of Pennsylvania

When is Reputation Bad? 1

0.0.2 Pareto Efficiency (Sec. 4, Ch. 1 of text)

From the probabilities that the company uses to move drivers from state to state the next year, we get the following transition matrix:

EconS Advanced Microeconomics II Handout on Cheap Talk

1 Representation of Games. Kerschbamer: Commitment and Information in Games

Game Theory in Wireless Networks: A Tutorial

University of Oslo Department of Economics

Game theory and AI: a unified approach to poker games

Modeling Internet Security Investments Tackling Topological Information Uncertainty

ECON Game Theory Exam 1 - Answer Key. 4) All exams must be turned in by 1:45 pm. No extensions will be granted.

Lecture Note 1 Set and Probability Theory. MIT Spring 2006 Herman Bennett

Economics of Insurance

Labor Economics, Lecture 3: Education, Selection, and Signaling

6.254 : Game Theory with Engineering Applications Lecture 1: Introduction

Computational Learning Theory Spring Semester, 2003/4. Lecture 1: March 2

How to Solve Strategic Games? Dominant Strategies

Fairness with an Honest Minority and a Rational Majority

National Responses to Transnational Terrorism: Intelligence and Counterterrorism Provision

FINAL EXAM, Econ 171, March, 2015, with answers

ALMOST COMMON PRIORS 1. INTRODUCTION

Imperfect monitoring in communication networks

14.74 Lecture 12 Inside the household: Is the household an efficient unit?

Tiers, Preference Similarity, and the Limits on Stable Partners

Econ 101: Principles of Microeconomics

Computational Game Theory and Clustering

Game Mining: How to Make Money from those about to Play a Game

Internet Advertising and the Generalized Second Price Auction:

Why the government should do cost-effectiveness analysis in a health insurance market

On Reputation in Game Theory Application on Online Settings

Oligopoly markets: The price or quantity decisions by one rm has to directly in uence pro ts by other rms if rms are competing for customers.

Game Theory: Supermodular Games 1

Week 7 - Game Theory and Industrial Organisation

Co-ordination, Spillovers, and Cheap Talk 1

Repeated Games and Reputations

Christmas Gift Exchange Games

Oligopoly: Cournot/Bertrand/Stackelberg

The Basics of Game Theory

Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products

Not Only What But also When: A Theory of Dynamic Voluntary Disclosure

Dynamics and Equilibria

1 Nonzero sum games and Nash equilibria

Ch. 13.2: Mathematical Expectation

Microeconomic Theory Jamison / Kohlberg / Avery Problem Set 4 Solutions Spring (a) LEFT CENTER RIGHT TOP 8, 5 0, 0 6, 3 BOTTOM 0, 0 7, 6 6, 3

WORKED EXAMPLES 1 TOTAL PROBABILITY AND BAYES THEOREM

10 Evolutionarily Stable Strategies

The Prison S Dilemma and Its Connections

Spectrum Trading with Insurance in Cognitive Radio Networks

Credible Termination Threats, Reputation and Private Monitoring.

Knowledge and Equilibrium

Towards a compliance audit of SLAs for data replication in Cloud storage

Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets

Individual security and network design

Dynamic Bayesian Spectrum Bargaining with Non-Myopic Users

The vertical differentiation model in the insurance market: costs structure and equilibria analysis

Valuation Uncertainty and Imperfect Introspection in Second-Price Auctions

CPC/CPA Hybrid Bidding in a Second Price Auction

Game Theory and Nash Equilibrium

i/io as compared to 4/10 for players 2 and 3. A "correct" way to play Certainly many interesting games are not solvable by this definition.

Rent-Seeking and Corruption

5 Market Games For Teaching Economics

A Game Theoretical Framework for Adversarial Learning

Games Manipulators Play

On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information

A HYBRID GENETIC ALGORITHM FOR THE MAXIMUM LIKELIHOOD ESTIMATION OF MODELS WITH MULTIPLE EQUILIBRIA: A FIRST REPORT

Probabilistic Strategies: Solutions

When is Reputation Bad? The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters.

Common Knowledge: Formalizing the Social Applications

GAME THEORY. Thomas S. Ferguson

Course: Model, Learning, and Inference: Lecture 5

Transcription:

Extensive-Form Games with Imperfect Information Yiling Chen September 12, 2012

Perfect Information vs. Imperfect Information Perfect Information All players know the game structure. Each player, when making any decision, is perfectly informed of all the events that have previously occurred. Imperfect Information All players know the game structure. Each player, when making any decision, may not be perfectly informed about some (or all) of the events that have already occurred.

Roadmap Define Imperfect-Information Extensive-Form Game Introduce Sequential Equilibrium rather a lot of bodies are buried in this definition. (Kreps, 1990)

Example 1 1 (2, 2) L M R 2 a r a r (3,1) (0,0) (0,2) (1,1)

Example 2 L 1 R L R 0 0 1

Def. of Imperfect-Information Extensive-Form Games An imperfect-information extensive-form game is a tuple (N, H, P, I, u) (N, H, P, u) is a perfect-information extensive-form game I = {I1, I 2,..., I n } is the set of information partitions of all players I i = {I i,1,...i i,ki } is the information partition of player i I i,j is an information set of player i Action set A(h) = A(h ) if h and h are in I i,j, denote as A(I i,j ) P(I i,j ) be the player who plays at information set I i,j. I 1 = {{Φ}, {(L, A), (L, B)}}, I 2 = {{L}} 1 L 2 A B 1 a b a b (0,0) (1,2) (1,2) (0,0) R (2,1)

Pure Strategies: Example 3 S = {S 1, S 2 } S 1 = {(L, a), (L, b), (R, a), (R, b)} S 2 = {A, B} 1 L 2 A B 1 a b a b R (2,1) (0,0) (1,2) (1,2) (0,0)

Normal-Form Representation: Example 1 An imperfect-information extensive-form game A normal-form game 1 a r (2, 2) L M R 2 a r a r (3,1) (0,0) (0,2) (1,1) L M R 2,2 2,2 3,1 0,0 0,2 1,1 The Nash Equilibrium (both pure and mixed) concept remains the same for imperfect-information extensive-form games.

Normal-Form Representation: Example 1 An imperfect-information extensive-form game A normal-form game 1 a r (2, 2) L M R 2 a r a r (3,1) (0,0) (0,2) (1,1) L M R 2,2 2,2 3,1 0,0 0,2 1,1 The Nash Equilibrium (both pure and mixed) concept remains the same for imperfect-information extensive-form games.

Normal-Form Games A normal-form game An imperfect-information extensive-form game C D 1 C D 5, 5 0, 8 8, 0 1, 1 C 2 D C D C D Prisoner s Dilemma (5,5) (0,8) (8,0) (1,1)

Nash Equilibrium: Example 1 1 a r (2, 2) L M R 2 a r a r (3,1) (0,0) (0,2) (1,1) L M R 2,2 2,2 3,1 0,0 0,2 1,1 Suppose we want to generalize the idea of subgame perfect equilibrium. Consider the equilibrium (L, r). Is it subgame perfect?

Nash Equilibrium: Example 1 1 a r (2, 2) L M R 2 a r a r (3,1) (0,2) (0,2) (1,1) L M R 2,2 2,2 3,1 0,2 0,2 1,1 Suppose we want to generalize the idea of subgame perfect equilibrium. Consider the equilibrium (L, r). Is it subgame perfect?

Beliefs A belief µ is a function that assigns to every information set a probability measure on the set of histories in the information set. An assessment in an extensive-form game is a strategy-belief pair (s, µ). The assessment (s, µ) is sequentially rational if for every player i and every information set I i,j I i we have for any s i s i. E[u i (s i, s i I i,j )] E[u i (s i, s i I i,j )] 1 a r (2, 2) L M R 2 a r a r (3,1) (0,2) (0,2) (1,1) L M R 2,2 2,2 3,1 0,2 0,2 1,1

Restrictions to beliefs? In lieu of the Nash equilibrium concept, we require that beliefs are derived from equilibrium strategies according to Bays rule (as if players know each others strategies). But, what about beliefs for information sets that are off the equilibrium path? 1 a r (2, 2) L M R 2 a r a r (3,1) (0,2) (0,2) (1,1) L M R 2,2 2,2 3,1 0,2 0,2 1,1

Restrictions to Beliefs? We want beliefs for information sets that are off the equilibrium path to be reasonable. But what is reasonable? Consider the NE (L, r) again. Player 2 s information set will not be reached at the equilibrium, because player 1 will play L with probability 1. But assume that player 1 plays a completely mixed strategy, playing L, M, and R with probabilities 1 ɛ, 3ɛ 4, and ɛ 4. Then, the belief on player 2 s information set is well defined. Now, if ɛ 0, it s still well defined. 1 a r (2, 2) L M R 2 a r a r (3,1) (0,2) (0,2) (1,1) L M R 2,2 2,2 3,1 0,2 0,2 1,1

Consistent Assessment An assessment (s, µ) is consistent if there is a sequence ((s n, µ n )) n=1 of assessments that converges to (s, µ) and has the properties that each strategy profile s n is completely mixed and that each belief system µ n is derived from s n using Bayes rule.

Sequential Equilibrium An assessment (s, µ) is a sequential equilibrium of a finite extensive-form game with perfect recall if it is sequentially rational and consistent. Thm: Every finite extensive-form game with perfect recall has a sequential equilibrium. A sequential equilibrium is a Nash equilibrium. With perfect information, a subgame perfect equilibrium is a sequential equilibrium.

Bayesian Games Yiling Chen September 12, 2012

So far Up to this point, we have assumed that players know all relevant information about each other. Such games are known as games with complete information.

Games with Incomplete Information Bayesian Games = Games with Incomplete Information Incomplete Information: Players have private information about something relevant to his decision making. Incomplete information introduces uncertainty about the game being played. Imperfect Information: Players do not perfectly observe the actions of other players or forget their own actions. We will see that Bayesian games can be represented as extensive-form games with imperfect information.

Example 4: A Modified Prisoner s Dilemma Game With probability λ, player 2 has the normal preferences as before (type I), while with probability (1 λ), player 2 hates to rat on his accomplice and pays a psychic penalty equal to 6 years in prison for confessing (type II). λ 1 λ C D C D C 5, 5 0, 8 C 5, 5 0, 2 D 8, 0 1, 1 D 8, 0 1, -5 Type I Type II

Simultaneous-Move Bayesian Games A simultaneous-move Bayesian game is (N, A, Θ, F, u) N = {1,..., n} is the set of players A = {A1, A 2,..., A n } is the set of actions A i = {Cooperation, Defection}. Θ = {Θ1, Θ 2,..., Θ n } is the set of types. θ i Θ i is a realization of types for player i. Θ 2 = {I, II}. F : Θ [0, 1] is a joint probability distribution, according to which types of players are drawn p(θ 2 = type I) = λ u = {u1, u 2,..., u n } where u i : A Θ R is the utility function of player i Two assumptions All possible games have the same number of agents and the same action spaces for each agent Agents have common prior. The different beliefs of agents are posteriors.

Imperfect-Information Extensive-Form Representation of Bayesian Games Add a player Nature who has a unique strategy of randomizing in a commonly known way. Nature λ 1 1 λ C D C D 2 2 C D C D C D C D (5, 5) (0, 8) (8,0) (1, 1) (5, 5) (0, 2) (8, 0) (1, -5)

Strategies in Bayesian Games A pure strategy s i : Θ i A i of player i is a mapping from every type player i could have to the action he would play if he had that type. Denote the set of pure strategies of player i as S i. S 1 = {{C}, {D}} S 2 = {{C if type I, C if type II}, {C if type I, D if type II}, {D if type I, C if type II}, {D if type I, D if type II}} A mixed strategy σ i : S i [0, 1] of player i is a distribution over his pure strategies.

Best Response and Bayesian Nash Equilibrium We use pure strategies to illustrate the concepts. But they hold the same for mixed strategies. Player i s ex ante expected utility is E θ [u i (s(θ), θ)] = θ i Θ i p(θ i )E θ i [u i (s(θ), θ) θ i ] Player i s best responses to s i (θ i ) is BR i = arg max E θ [u i (s i (θ i ), s i (θ i ), θ)] s i (θ i ) S i = ( ) p(θ i ) arg max E θ i [u i (s i (θ i ), s i (θ i ), θ) θ i ] s i (θ i ) S i θ i Θ i A strategy profile s i (θ i ) is a Bayesian Nash Equilibrium iif i s i (θ i ) BR i.

Bayesian Nash Equilibrium: Example 4 Playing D is a dominant strategy for type I player 2; playing C is a dominant strategy for type II player 2. Player 1 s expected utility by playing C is λ 0 + (1 λ) 5 = 5 5λ. Player 1 s expected utility by playing D is λ 1 + (1 λ) 8 = 8 7λ > 5 5λ. (D, (D if type I, C if type II)) is a BNE of the game. C λ D C 1 λ D C 5, 5 0, 8 C 5, 5 0, 2 D 8, 0 1, 1 D 8, 0 1, -5 Type I Type II

Example 5: An Exchange Game Each of two players receives a ticket t on which there is a number in [0,1]. The number on a player s ticket is the size of a prize that he may receive. The two prizes are identically and independently distributed according to a uniform distribution. Each player is asked independently and simultaneously whether he wants to exchange his prize for the other player s prize. If both players agree then the prizes are exchanged; otherwise each player receives his own prize.

A Bayesian Nash Equilibrium for Example 5 Strategies of player 1 can be describe as Exchange if t 1 k Given player 1 plays such a strategy, what is the best response of player 2? If t 2 k, no exchange If t2 < k, exchange when t 2 k/2 Since players are symmetric, player 1 s best response is of the same form. Hence, at a Bayesian Nash equilibrium, both players are willing to exchange only when t i = 0.

Signaling (Sender-Receiver Games) There are two types of workers, bright and dull. Before entering the job market a worker can choose to get an education (i.e. go to college), or enjoy life (i.e. go to beach). The employer can observe the educational level of the worker but not his type. The employer can hire or reject the worker.

Example 6: Signaling Nature λ Bright Dull 1 λ Worker Worker C B C B Employer Employer H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0)

Bayesian Extensive Games with Observable Actions A Bayesian extensive game with observable actions is (N, H, P, Θ, F, u) (N, H, P) is the same as those in an extensive-form game with perfect information Θ = {Θ1, Θ 2,..., Θ n } is the set of types. θ i Θ i is a realization of types for player i. Θ 1 = {Bright, Dull}. F : Θ [0, 1] is a joint probability distribution, according to which types of players are drawn p(θ 1 = Bright) = λ u = {u1, u 2,..., u n } where u i : Z Θ R is the utility function of player i. Z H is the set of terminal histories.

Best Responses for Example 6 E.g. If the employer always plays H, then the best response for the worker is B. But how to define best responses for the employer? Beliefs on information sets Beliefs derived from strategies Nature λ Bright Dull 1 λ Worker Worker C B C B Employer Employer H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0)

A Bayesian Nash Equilibrium of Example 6 Nature λ Bright Dull 1 λ Worker Worker C B C B Employer Employer λ 1 λ H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0) p(bright Beach) = p(bright)σ(beach Bright) p(bright)σ(beach Bright)+p(Dull)σ(Beach Dull) = λ 1 λ 1+(1 λ) 1 = λ

Subgame Perfection The previous Bayesian Nash Equilibrium is not subgame perfect. When the information set College is reached, the employer should choose to hire no matter what belief he has. We need to require sequential rationality even for off-equilibrium-path information sets. Then, beliefs on off-equilibrium-path information sets matter.

Perfect Bayesian Equilibrium A strategy-belief pair, (σ, µ) is a perfect Bayesian equilibrium if (Beliefs) At every information set of player i, the player has beliefs about the node that he is located given that the information set is reached. (Sequential Rationality) At any information set of player i, the restriction of (σ, µ) to the continuation game must be a Bayesian Nash equilibrium. (On-the-path beliefs) The beliefs for any on-the-equilibrium-path information set must be derived from the strategy profile using Bayes Rule. (Off-the-path beliefs) The beliefs at any off-the-equilibrium-path information set must be determined from the strategy profile according to Bayes Rule whenever possible.

Perfect Bayesian Equilibrium Perfect Bayesian equilibrium is a similar concept to sequential equilibrium, both trying to achieve some sort of subgame perfection. Perfect Bayesian equilibrium is defined for all extensive-form games with imperfect information, not just for Bayesian extensive games with observable actions. Thm: For Bayesian extensive games with observable actions, every sequential equilibrium is a Perfect Bayesian equilibrium.

A Perfect Bayesian Equilibrium of Example 3 Nature λ Bright Dull 1 λ Worker Worker C B C B Employer Employer λ β 1 λ 1 β H R H R H R H R (2, 2) (-1, 0) (4,-1) (1, 0) (2, 1) (-1, 0) (4, -2) (1, 0) β [0, 1]

Summary of Equilibrium Concepts On-equ-path strategy σ on On-equ-path belief µ on Off-equ-path strategy σ off Off-equ-path belief µ off NE BR N/A N/A N/A BNE BR given µ on Consistent N/A N/A with σ on SPNE BR N/A BR N/A PBE BR given µ on Consistent with σ on BR given µ off Consistent with σ off SE BR given µ on Consistent with σ on BR given µ off Consistent with σ off