Backward Induction and Subgame Perfection



Similar documents
The Basics of Game Theory

Competition and Regulation. Lecture 2: Background on imperfect competition

Sequential lmove Games. Using Backward Induction (Rollback) to Find Equilibrium

21. Unverifiable Investment, Hold Up, Options and Ownership

Game Theory 1. Introduction

1 Representation of Games. Kerschbamer: Commitment and Information in Games

Week 7 - Game Theory and Industrial Organisation

I. Noncooperative Oligopoly

Managerial Economics & Business Strategy Chapter 9. Basic Oligopoly Models

Chapter 9 Basic Oligopoly Models

Perfect Bayesian Equilibrium

Oligopoly: How do firms behave when there are only a few competitors? These firms produce all or most of their industry s output.

ECON 312: Oligopolisitic Competition 1. Industrial Organization Oligopolistic Competition

Universidad Carlos III de Madrid Game Theory Problem Set - Dynamic Games

University of Oslo Department of Economics

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games

Oligopoly: Cournot/Bertrand/Stackelberg

Industry profit in an oligopoly (sum of all firms profits) < monopoly profit.

Game Theory and Algorithms Lecture 10: Extensive Games: Critiques and Extensions

Economics 159: Introduction to Game Theory. Summer 2015: Session A. Online Course

6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation

Price competition with homogenous products: The Bertrand duopoly model [Simultaneous move price setting duopoly]

Oligopoly and Strategic Pricing

Bayesian Nash Equilibrium

Economics 203: Intermediate Microeconomics I Lab Exercise #11. Buy Building Lease F1 = 500 F1 = 750 Firm 2 F2 = 500 F2 = 400

Chapter 11. T he economy that we. The World of Oligopoly: Preliminaries to Successful Entry Production in a Nonnatural Monopoly Situation

AN INTRODUCTION TO GAME THEORY

Théorie de la décision et théorie des jeux Stefano Moretti

Cournot s model of oligopoly

Chapter 13: Strategic Decision Making in Oligopoly Markets

Microeconomic Theory Jamison / Kohlberg / Avery Problem Set 4 Solutions Spring (a) LEFT CENTER RIGHT TOP 8, 5 0, 0 6, 3 BOTTOM 0, 0 7, 6 6, 3

Game Theory in Wireless Networks: A Tutorial

A Game Theoretical Framework for Adversarial Learning

Market Structure: Duopoly and Oligopoly

GAMES FOR BUSINESS AND ECONOMICS

Simon Fraser University Spring Econ 302 D200 Final Exam Solution Instructor: Songzi Du Tuesday April 21, 2015, 12 3 PM

How to Solve Strategic Games? Dominant Strategies

Game Theory: Supermodular Games 1

Oligopoly. Oligopoly is a market structure in which the number of sellers is small.

Capital Structure. Itay Goldstein. Wharton School, University of Pennsylvania

ECON Game Theory Exam 1 - Answer Key. 4) All exams must be turned in by 1:45 pm. No extensions will be granted.

Chapter 7. Sealed-bid Auctions

Infinitely Repeated Games with Discounting Ù

Oligopoly. Chapter 25

chapter: Oligopoly Krugman/Wells Economics 2009 Worth Publishers 1 of 35

Created by Paul Hallett

When is Reputation Bad? 1

Other explanations of the merger paradox. Industrial Economics (EC5020), Spring 2010, Sotiris Georganas, February 22, 2010

Lecture 28 Economics 181 International Trade

12 Monopolistic Competition and Oligopoly

Homework 3, Solutions Managerial Economics: Eco 685

CHAPTER 6 MARKET STRUCTURE

Competition between Apple and Samsung in the smartphone market introduction into some key concepts in managerial economics

Individual security and network design

MobLab Game Catalog For Business Schools Fall, 2015

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Summary of Doctoral Dissertation: Voluntary Participation Games in Public Good Mechanisms: Coalitional Deviations and Efficiency

Decision Making under Uncertainty

Table of Contents MICRO ECONOMICS

How To Understand The Theory Of Economic Theory

9 Repeated Games. Tomorrow, and tomorrow, and tomorrow, Creeps in this petty pace from day to day To the last syllable of recorded time Shakespeare

The Plaintiff s Attorney in the Liability Insurance Claims Settlement Process: A Game Theoretic Approach

Product Differentiation In homogeneous goods markets, price competition leads to perfectly competitive outcome, even with two firms Price competition

Why Plaintiffs Attorneys Use Contingent and Defense Attorneys Fixed Fee Contracts

Game Theory. CDAM Research Report LSE-CDAM October 8, What is game theory? 4. 2 Definitions of games 6.

Microeconomics. Lecture Outline. Claudia Vogel. Winter Term 2009/2010. Part III Market Structure and Competitive Strategy

Table 10.1: Elimination and equilibrium. 1. Is there a dominant strategy for either of the two agents?

Economics II: Micro Fall 2009 Exercise session 5. Market with a sole supplier is Monopolistic.

Computational Learning Theory Spring Semester, 2003/4. Lecture 1: March 2

6.254 : Game Theory with Engineering Applications Lecture 1: Introduction

Switching Cost, Competition, and Pricing in the Property/Casualty Insurance Market for Large Commercial Accounts

6.207/14.15: Networks Lectures 19-21: Incomplete Information: Bayesian Nash Equilibria, Auctions and Introduction to Social Learning

Mikroekonomia B by Mikolaj Czajkowski. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

One pile, two pile, three piles

Christmas Gift Exchange Games

Patent Litigation with Endogenous Disputes

Do not open this exam until told to do so.

Minimax Strategies. Minimax Strategies. Zero Sum Games. Why Zero Sum Games? An Example. An Example

Economics 431 Fall nd midterm Answer Key

When other firms see these potential profits they will enter the industry, causing a downward shift in the demand for a given firm s product.

Credible Termination Threats, Reputation and Private Monitoring.

Games of Incomplete Information

c. Given your answer in part (b), what do you anticipate will happen in this market in the long-run?

Introduction. Bargaining - whether over arms control, the terms of a peace settlement, exchange rate

Games and Strategic Behavior. Chapter 9. Learning Objectives

Financial Markets. Itay Goldstein. Wharton School, University of Pennsylvania

Lecture V: Mixed Strategies

13 MONOPOLISTIC COMPETITION AND OLIGOPOLY. Chapter. Key Concepts

Cuban Missile Crisis Lesson Plan. Central Historical Question: Why did the Russians pull their missiles out of Cuba?

Game Theory & Oligopoly Market. Decision making environments may be classified into four types:-

Transcription:

Backward Induction and Subgame Perfection In extensive-form games, we can have a Nash equilibrium profile of strategies where player 2 s strategy is a best response to player 1 s strategy, but where she will not want to carry out her plan at some nodes of the game tree. For example, consider the following game, given in both normal-form and extensive-form. player 2 L R player 1 A 1, 2 1, 2 B 0, 0 2, 1

1 A B 1,2 2 L R 0,0 2,1

This game has two Nash equilibria: (A,L) and (B,R). If we think of the players as selecting plans before the game starts, then the NE profile (A,L) makes some sense. If player 2 is absolutely convinced that player 1 will play A, then playing L is a best response. In fact, she would do everything in her power to convince player 1 that she will play L, so that he plays A. While player 2 might "threaten" to play L before the game starts, if the game proceeds and it is actually her turn to move, then L is not rational. The threat is not credible, and player 1 should realize this and play B. Thus, L might be a reasonable plan for player 2 if she thinks player 1 will play A, but it is not rational to carry out the plan after player 1 plays B. It is not sequentially rational forplayer2toplayl.

Definition: A player s strategy exhibits sequential rationality if it maximizes his or her expected payoff, conditional on every information set at which he or she has the move. That is, player i s strategy should specify an optimal action at each of player i s information sets, even those that player i does not believe will be reached in the game. If sequential rationality is common knowledge, then players will look ahead and realize that players will not take actions down the tree that are not rational at that point.

For games of perfect information (all singleton information sets), backward induction is the process of "looking aheadandworkingbackwards"tosolveagamebasedon common knowledge of sequential rationality: 1. Start at each node that is an immediate predecessor of a terminal node, find the optimal action for the player who moves at that node, and change that node into a terminal node with the payoffs from the optimal action. 2. Apply step 1 to smaller and smaller games until we canassignpayoffs to the initial node of the game.

1 A B 1,2 2 L R 0,0 2,1 Looking ahead to player 2 s decision node, her optimal choice is R, so we can convert her decision node into a terminal node with payoffs (2,1).

1 A B 1,2 2 L R 0,0 2,1 In the smaller game, player 1 can either choose A and reach the terminal node with payoffs (1,2), or B and reach the terminal node with payoffs (2,1). Player 1 s optimal choice is B. Backward induction leads to the strategy profile (B,R), with payoffs (2,1).

Here is a more complicated example, based on the Cuban Missile Crisis. During 1962, the Soviet Union installed nuclear missiles in Cuba. When the US found out, President Kennedy discussed the options (i) do nothing, (ii) air strike on the missiles, (iii) a naval blockade of Cuba. JFK decided on the naval blockade. Negotiations ensued, and Khrushchev threatened to escalate the situation; both sides believed that nuclear war was a possibility. Finally, the Soviet Union agreed to remove the missiles if the United States agreed not to invade Cuba. Privately, Kennedy agreed to remove some missiles based in Turkey.

First, Khrushchev must decide whether to place the missiles in Cuba or not. If the missiles are in place, JFK must decide on (i) nothing, (ii) air strike, or (iii) blockade. If JFK decides on air strike or blockade, Khrushchev must decide whether to acquiesce or escalate. Utility ranking of outcomes for Khrushchev: missiles allowed, status quo, acquiesce to blockade, acquiesce to air strike (lose resources), escalate after air strike (risk war but JFK is the initial aggressor), escalate after blockade (risk war and Khrushchev is the initial aggressor). Utility ranking for JFK: blockade and have Khrushchev acquiesce, air strike and have Khrushchev acquiesce (be tough but use resources), status quo, allow missiles, Khrushchev escalates after blockade, Khrushchev escalates after air strike.

Backward Induction (first payoff is Khrushchev s, second is JFK s): Khrushchev does not want to risk nuclear war and would optimally acquiesce to either a blockade or an air strike. Payoff to blockade is (3,5) and payoff toairstrikeis(2,4). JFK would optimally choose blockade, leading to payoff (3,5). Khrushchev s optimal choice is status quo (4,3), since he receives a higher payoff than placing the missiles (3,5).

Every finite game of perfect information can be solved using backward induction. If each player has a strict preference over his possible terminal node payoffs (no ties), then backward induction gives a unique sequentially rational strategy profile. Because no player has an incentive to deviate at any information set, the resulting strategy profile is a Nash equilibrium.

Related to backward induction is Zermelo s Theorem: In finite, two-player, "win-lose-draw" games of perfect information, then either one of the players has a strategy that guarantees a win, or both players have a strategy that guarantees a draw. Since Chess is essentially a finite game of perfect information, then either player 1 (playing the white pieces) can guarantee a win, player 2 (playing the black pieces) can guarantee a win, or each player can guarantee a draw.

The Game of Northeast: Draw a rectangular grid of dimension n k. Starting with player 1 on the full grid, the players take turns choosing a square remaining in the grid, removing that square and all the other squares to the northeast of that square. The player who is forced to choose the southwest square in the grid loses, and the other player wins. For grids of equal dimensions n n, we know a winning strategy for player 1. For grids of dimension n k, weknowthatplayer1has a winning strategy, even though we do not know what that winning strategy is.

Subgame Perfection In extensive-form games with imperfect information, backward induction can be problematic because a player s optimal action depends on which node she is at in her information set. Still, sequential rationality can be captured by the concept of subgame perfection. First we need to define what we mean by a subgame. Definition: Given an extensive-form game tree, a node x initiates a subgame if neither x nor any of its successors are in an information set containing nodes that are not successors of x. The tree defined by x and its successors is called a subgame.

Here is a game with 3 subgames.

Notice that: 1. Any game is a subgame of itself. Subgames other than the original game itself are called proper subgames. 2. For games of perfect information, every node other than a terminal node defines a subgame. 3. Any subgame is a game in its own right, satisfying all ofourrulesforgametrees. 4. A strategy for the original game also defines a strategy for each of its subgames, sometimes called a continuation strategy.

Definition: A strategy profile for an extensive-form game is a subgame perfect Nash equilibrium (SPNE) if it specifies a Nash equilibrium in each of its subgames. The idea behind SPNE is that even if a NE strategy profile dictates that certain subgames are not reached, we require that what the players would do conditional on reaching those subgames should constitute a NE. The "off-the-equilibrium-path" behavior can be important, because it affects the incentives of players to follow the equilibrium. Notice that every SPNE must also be a NE, because the full game is also a subgame. For finite games of perfect information, any backward induction solution is a SPNE and vice-versa. The advantage of SPNE is that it can be applied to games of imperfect information too.

TheChainStoreGame A chain store has branches in K cities, and in each city, k =1,...,K, there is a competitor. In period k, the competitor in city k decides whether to enter the market or stay out, and if firm k enters, the chain store must decide whether to fight or cooperate. This is a game of perfect information, with the payoffs in each city given in the figure (next slide). Firm k cares only about the actions taken in its city, but the chain store s payoff is thesumofthepayoffs it generates in each city.

city k Out In 5,1 Chain Store F C 0,0 2,2

Every path of the game in which the outcome in any period is either out or (in, C) is a Nash equilibrium outcome. There is a unique subgame perfect equilibrium, where each competitor chooses in and the chain store always chooses C. For K =1, subgame perfection eliminates the bad NE. For large K, isn t it more reasonable to think that the chain store will establish a reputation for being tough? Moreover,if we see the chain store fighting the first 10 competitors, is it reasonable for the next competitor to enter? That is, if we see that the assumption of common knowledge of sequential rationality is violated, does it make sense to continue to assume it?

The Centipede Game At each stage, a player can either stop the game, or continue the game, thereby sacrificing one dollar so that the other player can receive more than one dollar. There is a unique subgame perfect equilibrium, where each player stops the game after every history. There are several Nash equilibria, but all of them involve both players stopping the game at their first opportunity.

1 C 2 C 1 C 2 C 1 C S S S S S S 2 C 6,5 1,0 0,2 3,1 2,4 5,3 4,6 For a very long centipede, with payoffs in the hundreds, will player 1 stop immediately? Since player 1 starting with C isnotconsistentwithbackward induction, is it reasonable for player 2 to believe that player 1 will use backward induction in the future?

Stackelberg Competition With quantity competition, the timing of production matters. In the Stackelberg Game, firm 1 produces its output first. Then firm 2 observes q 1 before deciding its own output. Let us adapt the Cournot model covered earlier. Marginal production cost is equal to 100, and market inverse demand is given by p = 1000 q 1 q 2. (If the expression isnegative,wetakethepricetobezero.) Firm 1 s strategy set is S 1 = {q 1 : q 1 0}. Firm 2 s strategy set is more complicated, reflecting the extensive-form nature of the game. Firm 2 can let q 2 depend on q 1. S 2 = {functions q 2 (q 1 ):q 2 (q 1 ) 0 for each q 1 }.

A Nash equilibrium is a profile of strategies, (q 1,q 2 (q 1)). In any SPNE, the subgame following any q 1 must be a NE. In other words, q 2 (q 1) must be a best-response to q 1. q 2 (q 1)=BR 2 (q 1 ) = 450 q 1 2. Notice the distinction: Under simultaneous-move Cournot competition, firm 2 chooses a quantity, which in the NE is a best response to firm 1 s quantity q 1. Under Stackelberg competition, firm 2 chooses a function describing firm 2 s choice at each information set, and in the SPNE this function is BR 2 (q 1 ). Firm 2 must best respond to every possible output from firm 1, not just q 1.

Nowthatwehavefoundthestrategyoffirm 2 satisfying sequential rationality, we can work backwards and find.firm1 spayoff function is q 1 u 1 (q 1,q 2 )=[1000 q 1 q 2 ]q 1 100q 1. In a SPNE, firm 1 chooses a best response to firm 2 s strategy function, which we can substitute into the payoff function: u 1 (q 1,q 2 (q 1)) = [1000 q 1 (450 q 1 2 )]q 1 100q 1 = [550 q 1 2 ]q 1 100q 1 = [450 q 1 2 ]q 1. Taking the derivative with respect to q 1, setting it equal to zero, and solving for q 1 yields 450 q 1 = 0 q1 = 450.

The SPNE is (450, 450 q 1 2 ). Along the equilibrium path, we see that firm 1 chooses output of 450, and firm 2 chooses an output of 225. The corresponding price is 325. This is very different from the Cournot quantities, (300, 300), and price, 400. You can check that there is a first mover advantage. Firm 1 receives a higher payoff than firm 2, and a higher payoff than it would receive under simultaneous play. AlthoughthegamehasoneSPNE,therearemanyNE that violate sequential rationality, for example: q 1 = 64 q 2 (q 1 ) = 418 if q 1 =64 = 1000 otherwise.