Von Neumann and Newman poker with a flip of hand values
|
|
|
- Cori Welch
- 9 years ago
- Views:
Transcription
1 Von Neumann and Newman poker with a flip of hand values Nicla Bernasconi Julian Lorenz Reto Spöhel Institute of Theoretical Computer Science ETH Zürich, 8092 Zürich, Switzerland {nicla jlorenz rspoehel}@inf.ethz.ch Abstract. The von Neumann and Newman poker models are simplified two-person poker models in which hands are modeled by real values drawn uniformly at random from the unit interval. We analyze a simple extension of both models that introduces an element of uncertainty about the final strength of each player s own hand, as is present in real poker games. Whenever a showdown occurs, an unfair coin with fixed bias q is tossed, 0 q /2. With probability q, the higher hand value wins as usual, but with the remaining probability q, the lower hand wins. Both models favour the first player for q = 0 and are fair for q = /2. Our somewhat surprising result is that the first player s expected payoff increases with q as long as q is not too large. That is, the first player can exploit the additional uncertainty introduced by the coin toss and extract even more value from his opponent.. Introduction Simplified models of poker were first studied in a game theoretic framework by Borel [3] and von Neumann [2] in the 920s. Von Neumann s results were published later in the seminal book Theory of Games and Economic Behavior with Morgenstern [3]. He considered the following 0, ) twoplayer game: There is an initial pot of size P consisting of player s antes). Each player is dealt a hand, i.e., a real number x, respectively y, drawn independently and uniformly at random from the unit interval. After X has inspected x, he can make a bet of a chips or check. If he bets, Y can either call i.e., match X s bet of a chips) or fold i.e., concede the pot to X), where of course he can base this decision on his own hand y. If X checks or if his bet is called by Y, a showdown occurs, i.e., both hands are revealed and the player with the higher hand value wins the pot and all bets made. Note that only X is allowed to make a bet. Thus he has an advantage over Y, who can only react. In the above model, one can either prescribe the amount a that X bets if he opts to bet, or let X choose the size of his bet freely as a function of his hand x. The former case is the game originally studied by von Neumann. Here we assume a = P pot size bet) for simplicity. Von Neumann showed that in optimal play X has an expected payoff of 5/9 P. A key insight of his solution is that bluffing is a game-theoretic necessity: X will not only bet with his good hands, but also with very low-ranked hands. These bluff-bets aim at inducing his opponent to make more calls, even with only mediocre hands, which in turn enables X to extract more value from Y with his good hands. The case of variable bet size ax) was studied by Newman [9] in the 950s. Certainly, in this setup player X has an even greater advantage Newman proved an expected payoff of 4/7 P in optimal play. Several extensions and variants of these poker models have been studied in the literature. In particular, there are two intermediate models in which X can choose his bet more freely than The author was supported by UBS AG. The author was supported by Swiss National Science Foundation, grant
2 2 in von Neumann s model, but not completely unrestricted as in Newman s model: Karlin and Restrepo [8] see also [0]) analyzed the case where X may choose his bet a from a finite number of prescribed bet sizes a,..., a n, and Sakai [0] investigated the case where X is allowed to choose his bet freely in an interval [0, a max ]. Much research was also devoted to analyzing more complex bet structures, in which both X and Y are allowed to bet, raise, or re-raise [2, 5, 6, 7, 8]... The von Neumann and Newman model with flip. We consider the following extension of the above model: whenever a showdown occurs, an unfair coin with bias q is tossed, where 0 q /2 is a fixed parameter known to both players. With probability q, the higher hand value wins as usual, but with the remaining probability q, the lower hand wins. We shall refer to this as a flip of the hand values: initially strong hands become weak, and vice versa. This extension is motivated by the fact that in most real world poker variants, players are not only ignorant about the opponent s hand, but also about their own final hand value. In Texas Hold em for example, a player s final hand strength is a function of both his private hand cards and so-called community cards that are revealed over the course of the game; a weak starting hand thus may turn into a very strong hand at showdown if the right community cards appear. Obviously, setting q = 0 yields the original von Neumann and Newman models, and the expected payoffs of X in optimal play are as given above. In the other extreme q = /2, each player has probability /2 of having the better hand at showdown, regardless of the realizations of x and y. Thus, for q = /2 the game reduces to a fair coin toss and neither player will have an advantage: Y will call every bet, and regardless of X s strategy each player has expected payoff P/2 in both models. In light of these two extreme cases, one might be tempted to conclude that the uncertainty about player s own hands introduced by the flip is always bad for the bettor X, who is in control of the game recall that Y can only react). More specifically, one might think that player X s expected payoff as a function of q is decreasing on the entire interval [0, /2]. As we shall see, something altogether different is true..2. Our results. An equilibrium of a two-player game is a pair of possibly randomized) strategies such that no player can improve his expected payoff by unilaterally changing his strategy. In other words, each strategy is a best response to the other one. As it is well-known, in two-player constantsum games all equilibria have the same expected payoffs, and thus we may define the value of such a game as the expected payoff of the first player X in any equilibrium pair of strategies if one exists). Clearly, the poker games introduced above are constant-sum with sum P, and we shall present explicit equilibrium pairs of strategies for both games. Let V I q) and V II q) be the values of the von Neumann game with pot size bet) and the Newman game with flip probability q, respectively. Certainly, we have V II q) V I q) for all values of q, as in Newman s model player X is allowed more flexibility in his betting strategy than in von Neumann s. We first present our results for the von Neumann game. As mentioned above, we have V I 0) = 5/9 P P in the classical case without flip, and trivially V I /2) = P/2. Theorem. The value of von Neumann poker with pot size bet and flip probability q is V I q) = { 27q 2 32q+0 69q 2 0q+3) P, q [0, /3] 3 2q 4 P, q /3, /2]. ) We give a proof of Theorem in Section 2. Figure shows V I q) graphically. V I q) first increases as we increase q, and attains its unique maximum for q = /3 with a value of 7/2 P P. In [/3, /2], V I q) decreases linearly. In other words, as we increase the parameter q, player X
3 3 can exploit this additional uncertainty about the outcome of the game and extract even more value from his opponent at first! Only beyond q = /3, X gradually loses his advantage. This behaviour of the value function is related to von Neumann s observation about bluffing and can be explained as follows: As q increases, X has to make less and less bluff-bets, since gradually the flip q takes over and induces Y into calling enough of X s bets. At q = /3 something important happens as we shall show, at this point the optimal strategies change abruptly cf. Figure 2 in Section 2): For q > /3, Y calls every bet since his expected payoff is positive regardless of the hands x and y. Thus the need for X to bluff vanishes completely, and he can concentrate on extracting value from Y with his good hands. Due to the abrupt change of strategy, V I q) is continuous but not smooth at q = /3. The value /3 stems from our assumption that a = P ; in general, this change of strategies occurs at q satisfying q / q ) = a/p + a), i.e., at the point where the odds of a flip occurring equal Y s pot odds. 3/48 game values V I and V II 0.6 7/2 4/7 5/9 /2 0 /3 /2 flip probability q Figure. Value V I q) of von Neumann poker lower curve) and value V II q) of Newman poker upper curve) as function of the flip probability q. We set P = for simplicity. V II q) is discontinuous at q = /2. We now present our results for the Newman model. We find it quite remarkable that V II q) turns out to have the simple closed form below, as the calculations involved are far more complicated than in the von Neumann case. Theorem 2. The value of Newman poker with flip probability q is { 6 q 47 2q) P, q [0, /2) V II q) =. 2) P/2, q = /2 We give a proof of Theorem 2 in Section 3. As shown in Figure, V II q) increases from 4/7 P 0.57 P to 3/48 P P. Unlike in the von Neumann game, the value function increases in the entire interval [0, /2): As the outcome of the game becomes more and more uncertain, X s advantage gets larger and larger. We attribute the discontinuity at q = /2 to the fact that X s bet size a is unlimited. In fact, in the optimal strategies we present ax) diverges both at x = 0 and x = for all values of q, cf. Figure 3 in Section 3.) We believe that if a limit on the bet size is imposed as in [0], this discontinuity disappears and V II q) attains a maximum at some q 0 < /2 instead, similar to the behaviour of V I q). An interesting feature of the equilibrium pair of strategies we exhibit is that they are discontinuous at a satisfying q/ q) = a /P + a ) cf. Figure 3 in Section 3): Y will call all bets of at most
4 4 a since for such bets his expected payoff is nonnegative regardless of the hands x and y. Knowing this, X never bets an amount between 0 and a. Thus, again the equation q/ q) = a/p + a) determining the point where the odds of a flip equal Y s pot odds plays a vital role. 2. The von Neumann Model In this section we prove Theorem. We give an explicit equilibrium pair of deterministic strategies, and compute the value of the game from there. Recall that in the von Neumann poker model player X can decide whether to check or to bet a = P after inspecting his hand x. If X bets, Y either calls or folds depending on his hand y. Formally, a deterministic strategy for X is given by an arbitrary measurable) subset B 0, ) determining for which hand values x he will make a bet. Similarly, a deterministic strategy for Y is given by an arbitrary measurable) subset C 0, ) determining with which hands y he will call if X bets. To make the formulas more understandable we write a for the bet and P for the pot despite the assumption that a = P. Throughout, we neglect the case x = y, which occurs with probability The case /3 < q < /2. We show that for any fixed q /3, /2), an equilibrium pair of strategies is given by B = /2, ), 3) C = 0, ). 4) In words, X bets the better half of his hands, and Y calls every bet regardless of his hand. Claim 3. Y s calling strategy 4) is a best response to any betting strategy B of player X. Proof. Note that if Y folds, his payoff is 0, and if he calls with a hand of y, his expected payoff conditional on X having a hand x > y is while his expected payoff conditional on X having a hand x < y is P + a)q a q), 5) P + a) q) aq. 6) As for /2 > q > /3 = a/p + 2a) both 5) and 6) are positive, calling all bets of X indeed corresponds to Y s pointwise optimal behaviour, regardless of X s strategy. Claim 4. X s betting strategy 3) is a best response to Y s calling strategy 4). Proof. As Y always calls, X s expected payoff if he bets with a hand of x is given by x P + a) q) aq ) + x) P + a)q a q) ) and if he checks, by = P 2q)x + q ) + a 2q)2x ), 7) xp q) + x)p q = P 2q)x + q ). 8) Here x and x are the probabilities that Y s hand y is lower, resp. higher than x. Since 7) is larger than 8) for x > /2 and smaller for x < /2, 3) indeed is a pointwise best response to Y s calling strategy 4).
5 5 Let E x denote X s expected payoff when holding a hand of x, with X and Y playing strategies 3, 4). Combining 7, 8) and using that a = P, we obtain { P 2q)x + q), x 0, /2] E x = P 3 2q)x + 3q ), x /2, ). Integrating E x over 0, ) we obtain V I q) as stated in Theorem for the case /3 < q / The case 0 q /3. Before we state and verify an explicit equilibrium pair of strategies, we briefly outline how such strategies can be derived heuristically. We use the well-known principle of indifference, a more detailed description of which can be found e.g. in [4]. We assume that player X bets if and only if his hand x is in the interval 0, x 0 ) bluff bets) or in x, ) value bets) for some fixed numbers x 0, x 0, ). Similarly, we assume that Y calls a bet of X if and only if his hand y is in [y, ) for some fixed y, x 0 y x. Assume that Y s calling threshold y is fixed and known to X. If X bets with a hand x y, his payoff is given by y P + y ) P + a)q a q) ). Here the first term corresponds to the case where Y folds, and the second one to the case where Y calls. Clearly, X should bet if and only if this exceeds 8), which is his expected payoff if he checks. For continuity reasons, at his bluff threshold x 0 the two options should have the same expected payoff at x 0, X is indifferent between betting or checking ), which yields the indifference equation x 0 P q) + x 0 )P q! = y P + y ) P + a)q a q) ). 9) Similarly, X s indifference between checking and betting at x yields the equation x P q) + x )P q! = y P + x y ) P + a) q) aq ) 0) + x ) P + a)q a q) ), and from Y s indifference at y between calling and folding we obtain x 0 P + a) q) aq ) + x ) P + a)q a q) )! = 0. ) Solving the equation system 9, 0, ) observing that a = P yields x 0 = 3q)/D, 2) x = 2 3q)/D, 3) y = 2q)5 9q)/D, 4) where D = 3 9q 2 0q + 3). Note that x 0 < y < x for all values of q. Figure 2 illustrates these thresholds as q varies from 0 to /3. We prove that for any fixed q [0, /3], an equilibrium is given by the following pair of strategies. Player X bets if and only if his hand x is in and player Y calls a bet of X if and only if his hand y is in where x 0, x and y given by 2, 3, 4). B = 0, x 0 ) x, ), 5) C = [y, ), 6) The fact that the strategies 5, 6) are in equilibrium is proved by the two following claims. Claim 5. Any calling strategy C of player Y satisfying y > x = y C 7) y < x 0 = y / C 8)
6 6 x 0) = 7/9 y 0) = 5/9 /3 hand value x, y x 0 0) = /9 0 0 q* = /3 flip probability q /2 Figure 2. Equilibrium strategy pairs for players X and Y as function of parameter q. The hatched regions indicate with which hands x player X makes a bluff or value) bet, and the shaded region indicates with which hands y player Y calls X s bets. is a best response to X s betting strategy 5). Proof. Assume that Y has a hand of y and is faced with a bet. Clearly, if he folds, his payoff is 0. If he calls, his expected payoff is the weighted average of 5, 6), where the weights are the probabilities that x > y, resp. x < y, conditional on x B i.e., X making a bet). If Y s hand y is between x 0 and x, these conditional probabilities are proportional to x resp. x 0, and Y s expected payoff is some constant times the left hand side of ), which vanishes for x 0 and x as defined in 2, 3). Thus for hands between x 0 and x, calling and folding have the same expected payoff. For y < x 0, the conditional probabilities in question are proportional to x ) + x 0 y) and y respectively. Thus the smaller term 5) gets strictly larger weight, and Y s expected payoff when calling is negative. Analogously it follows that his expected payoff when calling with a hand y > x is positive. Together this shows that any strategy satisfying 7, 8) corresponds to pointwise optimal behaviour of Y, which proves the claim. Claim 6. X s betting strategy 5) is a best response to Y s calling strategy 6). Proof. The expected payoff for X if he checks with a hand of x is given by 8). On the other hand, if he bets with a hand x y, his expected payoff is y P + y ) P + a)q a q) ) = P 27q 3 24q 2 + 4q + )/D 9) independently of x, and if he bets with a hand with a hand x > y, y P + x y ) P + a) q) aq ) + x) P + a)q a q) ) = P 62q q 2 44q + 27) x/d) + 35q 3 92q 2 + 9q 4)/D ), 20) where we used that a = P and y as defined in 4). Straightforward calculation yields that 9) is larger than 8) for x < x 0 and that 20) is larger than 8) for x > x, reflecting equalities 9) and 0). Thus 5) is a best response to Y s calling strategy 6).
7 7 Let E x denote X s expected payoff when holding a hand of x, with X and Y playing strategies 5, 6). Combining 8, 9, 20), we obtain P 27q3 24q 2 +4q+ D, x 0, x 0 ] E x = P 2q)x + q), ) x x 0, x ). P 62q 3 +26q 2 44q+27 D x + 35q3 92q 2 +9q 4 D, x [x, ) Integrating E x over [0, ] we obtain V I q) as stated in Theorem for the case 0 q /3. 3. The Newman Model In this section we prove Theorem 2. As in the previous section, we give an equilibrium pair of deterministic strategies for players X and Y, and compute the value of the game from there. Throughout, we assume that 0 q < /2. As argued in the introduction, the case q = /2 is trivial. Recall that in the Newman poker model, X can choose the size of his bet a freely as a function of his hand x. In this formulation, if X wants to check, he does so by choosing bet size a = 0. Y sees the size of the bet, and can use this information together with his hand value y in deciding whether or not to call. Formally, a deterministic strategy for X is given by an arbitrary measurable) function ax) : 0, ) [0, ) determining his bet a as a function of his hand x. A deterministic strategy for Y is given by an arbitrary measurable) subset C 0, ) [0, ), which is interpreted as follows: Y calls a bet of a with a hand of y if and only if y, a) C. Before we state and verify an explicit equilibrium pair of strategies, we outline the heuristics that lead to these strategies. First of all, recall that the expected payoff of Y when calling a bet of a is given by 5, 6). Since for a at most a = q 2q P both terms are nonnegative, Y will call every bet of at most a. Similarly to Section 2., it follows that X should never bet an amount 0 < a < a with hands below /2, and should always bet at least a with hands above /2. In view of the findings in the von Neumann poker model about mixing value bets with one s best hands and bluff bets with one s worst hands, we assume that X plays as follows: He bets a > a if and only if his hand is either x 0 a) or x a) for some functions x 0, : [a, ) 0, ). We assume that x 0 a) is strictly decreasing with lim a x 0 a) = 0, and that x a) is strictly increasing with lim a x a) = cf. Figure 3 below). The function x 0 a) corresponds to X s bluff bets, and x a) to his value bets. For Y s calling strategy, we assume that he calls a bet of a > a if and only if his hand is greater or equal than y a) for some function y : [a, ) 0, ). We assume that x 0 a) y a) x a) for a [a, ). We will use the notations x 0 = x 0a ), x = x a ), y = y a ). Assume first that Y s strategy is fixed, and that X has a hand of x. Knowing Y s strategy y a), if X bets he will bet the amount which maximizes his expected payoff. For bluff bets this expected payoff is fa) = y a) P + y a)) P +a)q a q) ), independently of x. First-order condition yields fa) a = y a) P P + a)q a q) )) y a)) 2q) =! 0 2)
8 8 for a a, ). Similarly, for X s value bets the expected payoff is ga) = y a) P +x y a)) P + a) q) aq ) + x) P + a)q a q) ), and first-order condition yields ga) a = y a) P P + a) q) aq )) + x y a)) x) ) 2q) = 0, 22) from which we derive the indifference equation y a) P P + a) q) aq )) + 2x a) y a) ) 2q) =! 0 23) for a a, ) note that we replaced x by x a) after taking the derivative). Now suppose that X s strategy is fixed. If Y is faced with a bet of a > a, clearly his payoff is 0 if he folds. From a bet of a player Y can deduce that X either bluffs with the hand x 0 a) or bets for value with the hand x a). For a hand x 0 a) y x a), he will call if his expected payoff ha) is positive. To calculate ha), he needs to take into account the probability with which a bet of a is a bluff compared to a being a value bet. This ratio of bluff bets to value bets of a is given by the ratio of the slopes x 0 a) and x a), and with 5, 6) recall that we assumed x 0a) y a) x a)) we obtain the third equation ha) a for a a, ). = x 0a) P + a) q) aq ) + x a) P + a)q a q) )! = 0 24) Solving the system of differential equations 2, 23, 24) requires further boundary conditions. We require that at X s bluff threshold x 0 = x 0a ), checking and betting a should have the same expected payoff, i.e., x 0 P q) + x 0)P q! = y P + y ) P + a )q a q) ). 25) The last boundary condition is given by the assumption lim x 0a) =! 0. 26) a Solving the system of differential equations 2, 23, 24) under the boundary conditions 25, 26) yields x 0 a) = q 7 2q P 2 3q)P + 3 2q)a ) 3, 27) q)p + 2q)a x a) = 2q 7 2q q)4 q)p q)7 2q)aP + 2q)7 2q)a 2 q)p + 2q)a ) 2, 28) y a) = 2q q)p + 7 2q)a 7 2q q)p + 2q)a. 29) Moreover, straightforward calculation gives x 0 = x 0 a ) = q 7 2q, x = x a ) = 4 + q 7 2q, y = y a ) = + 4q 7 2q. Note that for all values of q we have x 0 < y < x. We shall prove that for 0 q < /2 an equilibrium is given by the following pair of strategies, where x 0 a), x a), y a), x 0, x, and a are as defined above. As a function of his hand x, player X bets an amount of a, x = x 0 a) 0, x 0 ) 0, x [x 0 ax) =, /2] a, x /2, x ]. 30) a, x = x a) x, )
9 9 Since both x 0 : [a, ) 0, x 0 ] and x : [a, ) [x, ) are bijective, ax) is indeed well-defined. Player Y, holding a hand of y, calls a bet of a if and only if y, a) is in { C = y, a) } a a y y a). 3) Figure 3 illustrates these strategies for q = 0 and q = /3. 5 q = 0 5 q = /3 bet size a bet size a a* = a* = 0 0 x * = /7 0 x * = 4/7 hand value x, y 0 0 x * y * /2 x * 0 hand value x, y Figure 3. Equilibrium strategy pair for q = 0 left) and q = /3 right). Player X s bet function ax) is given by the solid line, and Y s call region C is the shaded region. In order to verify that these strategies indeed form an equilibrium pair, we prove two stronger claims, showing that if one of the two strategies is fixed, there is in fact a large class of deterministic counterstrategies. Claim 7. Any calling strategy C of player Y satisfying is a best response to X s betting strategy 30). y > x a) = y, a) C 32) y < x 0 a) = y, a) / C 33) a a = y, a) C 34) Proof. We first show that the restrictions 32, 33, 34) reflect the pointwise optimal behaviour of Y when faced with a bet of a holding a hand of y. Recall that Y s expected payoff when calling is given by 5) or 6). If Y is faced with a bet of size a > a, there are exactly two hands X can possibly have: a value bet hand x a) and a bluff bet hand x 0 a), where x 0 a) < x a). Thus with any hand y < x 0 a), Y s expected payoff is given by 5) and is strictly less than P +a )q a q) = 0, since 5) is decreasing in a. Conversely, with any hand y > x a), Y s expected payoff is given by 6) and strictly larger than P +a ) q) a q = P > 0. As moreover for a a both 5) or 6) are nonnegative, his expected payoff will be nonnegative when calling such hands, regardless of his hand y. This proves that indeed the restrictions given by 32, 33, 34) correspond to Y s pointwise optimal behaviour. An alternative equilibrium pair is obtained by replacing x0a) with x 0 x 0a), as is done in [9]. Instead of betting arbitrarily large amounts a as x approaches 0, X s bet size then diverges as x x 0.
10 0 It remains to show that it is completely irrelevant what Y does in the remaining situations y, a). For a fixed hand y and any call strategy C satisfying 32, 33, 34), his expected payoff against X s strategy 30) conditional on no flip occurring is 30,34) = 32,33) = 0 a P + ax)) {y,ax)) C y>x} ax) {y,ax)) C y<x} dx + + a + + P + a) {y,a) C y>x0 a)} a ) {y,a) C y<x 0 a)} x 0 a) da /2 x 0 P {y>x} dx + x /2 P + a ) {y>x} a {y<x} dx P + a) {y,a) C y>x a)} a ) {y,a) C y<x a a)} x a) da P + a) {y,a) C} x 0a) da /2 x 0 = C 0 y) P {y>x} dx + x /2 P + a ) {y>x} a {y<x} dx a P + a) {y>x a)} a {y,a) C} {y>x a)}) ) x a) da a P + a) x 0 a) + a x a) ) {y,a) C} da 35) for some function C 0 y) independent from C. Similarly, conditional on a flip occurring the expected payoff of player Y is C y) + a x 0 a) + P + a) x a) ) {y,a) C} da 36) a for some other function C y) independent from C. Combining 35, 36) and using the fact that x 0 a), x a) as defined in 27, 28) satisfy 24) yield that the total expected payoff against X s strategy 30) for a fixed hand y is q) C 0 y) + q C y), independently from C. Claim 8. For any function ãx) : 0, x 0 ) a, ), X s betting strategy ãx), x 0, x 0 ) 0, x [x 0 ax) =, /2] a, x /2, x ] a, x = x a) x, ) is a best response to Y s calling strategy 3). Proof. The first decision X has to make is whether to bet an amount a > a or an amount 0 a a. As in the latter case Y will always call, X s expected payoff is given by 7) and thus maximized for a = 0 if x < /2, and for a = a if x > /2. The resulting expected payoffs are P 2q)x + q ) 37) and respectively. x P + a ) q) a q ) + x) P + a )q a q) ) = P x, 38)
11 We need to compare these values to the expected payoff X can achieve by betting more than a. Betting any amount a > a with a hand of x y a) yields an expected payoff of y a) P + y a)) P + a)q a q) ) = P + 4q)/7 2q), 39) independently of a and x. This independence of a is a consequence of the fact that y a) as defined in 29) satisfies 2).) Comparing 39) to 37) shows that for hands x < q)/7 2q) = x 0, this is indeed better than checking, reflecting the boundary condition 25). Similarly, betting any amount a > a with a hand of x > y a) yields an expected payoff of y a) P + x y a)) P + a) q) aq ) + x) P + a)q a q) ), 40) which is increasing for a near a and decreasing for a, as can be seen from the expression in 22). Thus the expected payoff is maximized for the unique a a, ) satisfying 22), which is a satisfying x a) = x cf. 23)). The resulting expected payoff is P 02 q)/7 2q) x 4 3 x) q)/7 2q) ), 4) as is found by inverting 28) and plugging the resulting function ax) into 40). Comparing this to 38) shows that for hands x > 4 + q)/7 2q) = x this is indeed better than the alternative of betting a. From the equilibrium pair of deterministic strategies 30, 3), we can easily calculate the value of the game. Let E x denote X s expected payoff when holding a hand of x, with X and Y playing strategies 30, 3). Combining 37, 38, 39, 4), we obtain P +4q 7 2q, x 0, x 0 ) P 2q)x + q ), x [x 0 E x =, /2] P x, x /2, x ]. ) P 02 q) 7 2q x 4 3 x) q) 7 2q, x x, ) Integrating E x over 0, ) we obtain V II q) as stated in Theorem 2. References [] S. Bargmann. On the theory of games of strategy. 40:3 42, 959. English translation of [2]. [2] R. Bellman and D. Blackwell. Some two-person games involving bluffing. Proceedings of the National Academy of Sciences of the United States of America, 350): , 949. [3] É. Borel. Traité du Calcul des Probabilités et ses Applications Volume IV, Fascicule 2, Applications aux jeux des hazard. Gautier-Villars, Paris, 938. [4] B. Chen and J. Ankenman. The Mathematics of Poker. ConJelCo, New Pittsburgh, PA, [5] W. H. Cutler. An optimal strategy for pot-limit poker. Amer. Math. Monthly, 82: , 975. [6] C. Ferguson, T. S. Ferguson, and C. Gawargy. U0,) two-person poker models. In Game theory and application, XII, volume 2 of Game Theory Appl., pages Nova Sci. Publ., Hauppauge, NY, [7] A. J. Goldman and J. J. Stone. A continuous poker game. Duke Math. J., 27:4 53, 960. [8] S. Karlin and R. Restrepo. Multistage poker models. In Contributions to the theory of games, vol. 3, Annals of Mathematics Studies, no. 39, pages Princeton University Press, Princeton, N. J., 957. [9] D. J. Newman. A model for real poker. Operations Research, 75): , 959. [0] S. Sakai. A model for real poker with an upper bound of assets. J. Optim. Theory Appl., 50):49 63, 986. [] A. H. Taub, editor. John von Neumann: Collected Works. Volume VI: Theory of Games, Astrophysics, Hydrodynamics and Meteorology [2] J. von Neumann. Zur Theorie der Gesellschaftsspiele in German). Mathematische Annalen, 00: , 928. English translation in []. Reprinted in [, Paper ]. [3] J. von Neumann and O. Morgenstern. Theory of Games and Economic Behavior. Princeton University Press, 944.
SYMMETRIC FORM OF THE VON NEUMANN POKER MODEL. Guido David 1, Pearl Anne Po 2
International Journal of Pure and Applied Mathematics Volume 99 No. 2 2015, 145-151 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu doi: http://dx.doi.org/10.12732/ijpam.v99i2.2
Minimax Strategies. Minimax Strategies. Zero Sum Games. Why Zero Sum Games? An Example. An Example
Everyone who has studied a game like poker knows the importance of mixing strategies With a bad hand, you often fold But you must bluff sometimes Lectures in Microeconomics-Charles W Upton Zero Sum Games
Game Theory and Poker
Game Theory and Poker Jason Swanson April, 2005 Abstract An extremely simplified version of poker is completely solved from a game theoretic standpoint. The actual properties of the optimal solution are
Nash Equilibria and. Related Observations in One-Stage Poker
Nash Equilibria and Related Observations in One-Stage Poker Zach Puller MMSS Thesis Advisor: Todd Sarver Northwestern University June 4, 2013 Contents 1 Abstract 2 2 Acknowledgements 3 3 Literature Review
ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015
ECON 459 Game Theory Lecture Notes Auctions Luca Anderlini Spring 2015 These notes have been used before. If you can still spot any errors or have any suggestions for improvement, please let me know. 1
AN ANALYSIS OF A WAR-LIKE CARD GAME. Introduction
AN ANALYSIS OF A WAR-LIKE CARD GAME BORIS ALEXEEV AND JACOB TSIMERMAN Abstract. In his book Mathematical Mind-Benders, Peter Winkler poses the following open problem, originally due to the first author:
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES Contents 1. Random variables and measurable functions 2. Cumulative distribution functions 3. Discrete
Creating a NL Texas Hold em Bot
Creating a NL Texas Hold em Bot Introduction Poker is an easy game to learn by very tough to master. One of the things that is hard to do is controlling emotions. Due to frustration, many have made the
Bonus Maths 2: Variable Bet Sizing in the Simplest Possible Game of Poker (JB)
Bonus Maths 2: Variable Bet Sizing in the Simplest Possible Game of Poker (JB) I recently decided to read Part Three of The Mathematics of Poker (TMOP) more carefully than I did the first time around.
6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games
6.254 : Game Theory with Engineering Applications Lecture 2: Strategic Form Games Asu Ozdaglar MIT February 4, 2009 1 Introduction Outline Decisions, utility maximization Strategic form games Best responses
ALMOST COMMON PRIORS 1. INTRODUCTION
ALMOST COMMON PRIORS ZIV HELLMAN ABSTRACT. What happens when priors are not common? We introduce a measure for how far a type space is from having a common prior, which we term prior distance. If a type
1 Error in Euler s Method
1 Error in Euler s Method Experience with Euler s 1 method raises some interesting questions about numerical approximations for the solutions of differential equations. 1. What determines the amount of
TOPIC 4: DERIVATIVES
TOPIC 4: DERIVATIVES 1. The derivative of a function. Differentiation rules 1.1. The slope of a curve. The slope of a curve at a point P is a measure of the steepness of the curve. If Q is a point on the
Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents
Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents William H. Sandholm January 6, 22 O.. Imitative protocols, mean dynamics, and equilibrium selection In this section, we consider
ECO 199 B GAMES OF STRATEGY Spring Term 2004 PROBLEM SET 4 B DRAFT ANSWER KEY 100-3 90-99 21 80-89 14 70-79 4 0-69 11
The distribution of grades was as follows. ECO 199 B GAMES OF STRATEGY Spring Term 2004 PROBLEM SET 4 B DRAFT ANSWER KEY Range Numbers 100-3 90-99 21 80-89 14 70-79 4 0-69 11 Question 1: 30 points Games
Game Theory and Algorithms Lecture 10: Extensive Games: Critiques and Extensions
Game Theory and Algorithms Lecture 0: Extensive Games: Critiques and Extensions March 3, 0 Summary: We discuss a game called the centipede game, a simple extensive game where the prediction made by backwards
Bayesian Tutorial (Sheet Updated 20 March)
Bayesian Tutorial (Sheet Updated 20 March) Practice Questions (for discussing in Class) Week starting 21 March 2016 1. What is the probability that the total of two dice will be greater than 8, given that
Math 120 Final Exam Practice Problems, Form: A
Math 120 Final Exam Practice Problems, Form: A Name: While every attempt was made to be complete in the types of problems given below, we make no guarantees about the completeness of the problems. Specifically,
Chapter 7. Sealed-bid Auctions
Chapter 7 Sealed-bid Auctions An auction is a procedure used for selling and buying items by offering them up for bid. Auctions are often used to sell objects that have a variable price (for example oil)
Section 1.3 P 1 = 1 2. = 1 4 2 8. P n = 1 P 3 = Continuing in this fashion, it should seem reasonable that, for any n = 1, 2, 3,..., = 1 2 4.
Difference Equations to Differential Equations Section. The Sum of a Sequence This section considers the problem of adding together the terms of a sequence. Of course, this is a problem only if more than
A New Interpretation of Information Rate
A New Interpretation of Information Rate reproduced with permission of AT&T By J. L. Kelly, jr. (Manuscript received March 2, 956) If the input symbols to a communication channel represent the outcomes
Poker with a Three Card Deck 1
Poker with a Three Card Deck We start with a three card deck containing one Ace, one King and one Queen. Alice and Bob are each dealt one card at random. There is a pot of $ P (and we assume P 0). Alice
6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation
6.207/14.15: Networks Lecture 15: Repeated Games and Cooperation Daron Acemoglu and Asu Ozdaglar MIT November 2, 2009 1 Introduction Outline The problem of cooperation Finitely-repeated prisoner s dilemma
IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem
IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem Time on my hands: Coin tosses. Problem Formulation: Suppose that I have
Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS
Sensitivity Analysis 3 We have already been introduced to sensitivity analysis in Chapter via the geometry of a simple example. We saw that the values of the decision variables and those of the slack and
Gambling Systems and Multiplication-Invariant Measures
Gambling Systems and Multiplication-Invariant Measures by Jeffrey S. Rosenthal* and Peter O. Schwartz** (May 28, 997.. Introduction. This short paper describes a surprising connection between two previously
MTH6120 Further Topics in Mathematical Finance Lesson 2
MTH6120 Further Topics in Mathematical Finance Lesson 2 Contents 1.2.3 Non-constant interest rates....................... 15 1.3 Arbitrage and Black-Scholes Theory....................... 16 1.3.1 Informal
No: 10 04. Bilkent University. Monotonic Extension. Farhad Husseinov. Discussion Papers. Department of Economics
No: 10 04 Bilkent University Monotonic Extension Farhad Husseinov Discussion Papers Department of Economics The Discussion Papers of the Department of Economics are intended to make the initial results
Metric Spaces. Chapter 7. 7.1. Metrics
Chapter 7 Metric Spaces A metric space is a set X that has a notion of the distance d(x, y) between every pair of points x, y X. The purpose of this chapter is to introduce metric spaces and give some
1 if 1 x 0 1 if 0 x 1
Chapter 3 Continuity In this chapter we begin by defining the fundamental notion of continuity for real valued functions of a single real variable. When trying to decide whether a given function is or
Math 4310 Handout - Quotient Vector Spaces
Math 4310 Handout - Quotient Vector Spaces Dan Collins The textbook defines a subspace of a vector space in Chapter 4, but it avoids ever discussing the notion of a quotient space. This is understandable
Game theory and AI: a unified approach to poker games
Game theory and AI: a unified approach to poker games Thesis for graduation as Master of Artificial Intelligence University of Amsterdam Frans Oliehoek 2 September 2005 ii Abstract This thesis focuses
Chapter 4 Lecture Notes
Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,
Real Roots of Univariate Polynomials with Real Coefficients
Real Roots of Univariate Polynomials with Real Coefficients mostly written by Christina Hewitt March 22, 2012 1 Introduction Polynomial equations are used throughout mathematics. When solving polynomials
Laboratory work in AI: First steps in Poker Playing Agents and Opponent Modeling
Laboratory work in AI: First steps in Poker Playing Agents and Opponent Modeling Avram Golbert 01574669 [email protected] Abstract: While Artificial Intelligence research has shown great success in deterministic
Persuasion by Cheap Talk - Online Appendix
Persuasion by Cheap Talk - Online Appendix By ARCHISHMAN CHAKRABORTY AND RICK HARBAUGH Online appendix to Persuasion by Cheap Talk, American Economic Review Our results in the main text concern the case
Oligopoly: How do firms behave when there are only a few competitors? These firms produce all or most of their industry s output.
Topic 8 Chapter 13 Oligopoly and Monopolistic Competition Econ 203 Topic 8 page 1 Oligopoly: How do firms behave when there are only a few competitors? These firms produce all or most of their industry
Adaptive Online Gradient Descent
Adaptive Online Gradient Descent Peter L Bartlett Division of Computer Science Department of Statistics UC Berkeley Berkeley, CA 94709 bartlett@csberkeleyedu Elad Hazan IBM Almaden Research Center 650
Student Outcomes. Lesson Notes. Classwork. Discussion (10 minutes)
NYS COMMON CORE MATHEMATICS CURRICULUM Lesson 5 8 Student Outcomes Students know the definition of a number raised to a negative exponent. Students simplify and write equivalent expressions that contain
Microeconomic Theory Jamison / Kohlberg / Avery Problem Set 4 Solutions Spring 2012. (a) LEFT CENTER RIGHT TOP 8, 5 0, 0 6, 3 BOTTOM 0, 0 7, 6 6, 3
Microeconomic Theory Jamison / Kohlberg / Avery Problem Set 4 Solutions Spring 2012 1. Subgame Perfect Equilibrium and Dominance (a) LEFT CENTER RIGHT TOP 8, 5 0, 0 6, 3 BOTTOM 0, 0 7, 6 6, 3 Highlighting
MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.
MATH10212 Linear Algebra Textbook: D. Poole, Linear Algebra: A Modern Introduction. Thompson, 2006. ISBN 0-534-40596-7. Systems of Linear Equations Definition. An n-dimensional vector is a row or a column
i/io as compared to 4/10 for players 2 and 3. A "correct" way to play Certainly many interesting games are not solvable by this definition.
496 MATHEMATICS: D. GALE PROC. N. A. S. A THEORY OF N-PERSON GAMES WITH PERFECT INFORMA TION* By DAVID GALE BROWN UNIVBRSITY Communicated by J. von Neumann, April 17, 1953 1. Introduction.-The theory of
THE ENDGAME IN POKER. Chris Ferguson, Full Tilt Poker Tom Ferguson, Mathematics, UCLA
THE ENDGAME IN POKER Chris Ferguson, Full Tilt Poker Tom Ferguson, Mathematics, UCLA Abstract. The simple two-person poker model, known as Basic Endgame, may be described as follows. With a certain probability
arxiv:1112.0829v1 [math.pr] 5 Dec 2011
How Not to Win a Million Dollars: A Counterexample to a Conjecture of L. Breiman Thomas P. Hayes arxiv:1112.0829v1 [math.pr] 5 Dec 2011 Abstract Consider a gambling game in which we are allowed to repeatedly
REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.
REPEATED TRIALS Suppose you toss a fair coin one time. Let E be the event that the coin lands heads. We know from basic counting that p(e) = 1 since n(e) = 1 and 2 n(s) = 2. Now suppose we play a game
Duality of linear conic problems
Duality of linear conic problems Alexander Shapiro and Arkadi Nemirovski Abstract It is well known that the optimal values of a linear programming problem and its dual are equal to each other if at least
Math 115 Spring 2011 Written Homework 5 Solutions
. Evaluate each series. a) 4 7 0... 55 Math 5 Spring 0 Written Homework 5 Solutions Solution: We note that the associated sequence, 4, 7, 0,..., 55 appears to be an arithmetic sequence. If the sequence
Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets
Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets Nathaniel Hendren January, 2014 Abstract Both Akerlof (1970) and Rothschild and Stiglitz (1976) show that
4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
FIRST YEAR CALCULUS. Chapter 7 CONTINUITY. It is a parabola, and we can draw this parabola without lifting our pencil from the paper.
FIRST YEAR CALCULUS WWLCHENW L c WWWL W L Chen, 1982, 2008. 2006. This chapter originates from material used by the author at Imperial College, University of London, between 1981 and 1990. It It is is
LOOKING FOR A GOOD TIME TO BET
LOOKING FOR A GOOD TIME TO BET LAURENT SERLET Abstract. Suppose that the cards of a well shuffled deck of cards are turned up one after another. At any time-but once only- you may bet that the next card
Introduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
3. Reaction Diffusion Equations Consider the following ODE model for population growth
3. Reaction Diffusion Equations Consider the following ODE model for population growth u t a u t u t, u 0 u 0 where u t denotes the population size at time t, and a u plays the role of the population dependent
CPC/CPA Hybrid Bidding in a Second Price Auction
CPC/CPA Hybrid Bidding in a Second Price Auction Benjamin Edelman Hoan Soo Lee Working Paper 09-074 Copyright 2008 by Benjamin Edelman and Hoan Soo Lee Working papers are in draft form. This working paper
3. Mathematical Induction
3. MATHEMATICAL INDUCTION 83 3. Mathematical Induction 3.1. First Principle of Mathematical Induction. Let P (n) be a predicate with domain of discourse (over) the natural numbers N = {0, 1,,...}. If (1)
NIM with Cash. Abstract. loses. This game has been well studied. For example, it is known that for NIM(1, 2, 3; n)
NIM with Cash William Gasarch Univ. of MD at College Park John Purtilo Univ. of MD at College Park Abstract NIM(a 1,..., a k ; n) is a -player game where initially there are n stones on the board and the
The Relative Worst Order Ratio for On-Line Algorithms
The Relative Worst Order Ratio for On-Line Algorithms Joan Boyar 1 and Lene M. Favrholdt 2 1 Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark, [email protected]
8 Divisibility and prime numbers
8 Divisibility and prime numbers 8.1 Divisibility In this short section we extend the concept of a multiple from the natural numbers to the integers. We also summarize several other terms that express
Infinitely Repeated Games with Discounting Ù
Infinitely Repeated Games with Discounting Page 1 Infinitely Repeated Games with Discounting Ù Introduction 1 Discounting the future 2 Interpreting the discount factor 3 The average discounted payoff 4
The sum of digits of polynomial values in arithmetic progressions
The sum of digits of polynomial values in arithmetic progressions Thomas Stoll Institut de Mathématiques de Luminy, Université de la Méditerranée, 13288 Marseille Cedex 9, France E-mail: [email protected]
What is Linear Programming?
Chapter 1 What is Linear Programming? An optimization problem usually has three essential ingredients: a variable vector x consisting of a set of unknowns to be determined, an objective function of x to
1 Mathematical Models of Cost, Revenue and Profit
Section 1.: Mathematical Modeling Math 14 Business Mathematics II Minh Kha Goals: to understand what a mathematical model is, and some of its examples in business. Definition 0.1. Mathematical Modeling
Practical Guide to the Simplex Method of Linear Programming
Practical Guide to the Simplex Method of Linear Programming Marcel Oliver Revised: April, 0 The basic steps of the simplex algorithm Step : Write the linear programming problem in standard form Linear
Lecture V: Mixed Strategies
Lecture V: Mixed Strategies Markus M. Möbius February 26, 2008 Osborne, chapter 4 Gibbons, sections 1.3-1.3.A 1 The Advantage of Mixed Strategies Consider the following Rock-Paper-Scissors game: Note that
Game Theory for Humans. Matt Hawrilenko MIT: Poker Theory and Analytics
Game Theory for Humans Matt Hawrilenko MIT: Poker Theory and Analytics 1 Play Good Poker Read-Based Approach Game Theoretic Approach Why Game Theory? Computer Scientists Humans Audience Theory Practice
On Lexicographic (Dictionary) Preference
MICROECONOMICS LECTURE SUPPLEMENTS Hajime Miyazaki File Name: lexico95.usc/lexico99.dok DEPARTMENT OF ECONOMICS OHIO STATE UNIVERSITY Fall 993/994/995 [email protected] On Lexicographic (Dictionary) Preference
36 CHAPTER 1. LIMITS AND CONTINUITY. Figure 1.17: At which points is f not continuous?
36 CHAPTER 1. LIMITS AND CONTINUITY 1.3 Continuity Before Calculus became clearly de ned, continuity meant that one could draw the graph of a function without having to lift the pen and pencil. While this
An Introduction to Competitive Analysis for Online Optimization. Maurice Queyranne University of British Columbia, and IMA Visitor (Fall 2002)
An Introduction to Competitive Analysis for Online Optimization Maurice Queyranne University of British Columbia, and IMA Visitor (Fall 2002) IMA, December 11, 2002 Overview 1 Online Optimization sequences
Binomial lattice model for stock prices
Copyright c 2007 by Karl Sigman Binomial lattice model for stock prices Here we model the price of a stock in discrete time by a Markov chain of the recursive form S n+ S n Y n+, n 0, where the {Y i }
Nash and game theory
Nash and game theory Antonio Cabrales 1 I am asked to give my view on the contribution of John Nash to the development of game theory. Since I have received most of my early influence through textbooks,
Best-response play in partially observable card games
Best-response play in partially observable card games Frans Oliehoek Matthijs T. J. Spaan Nikos Vlassis Informatics Institute, Faculty of Science, University of Amsterdam, Kruislaan 43, 98 SJ Amsterdam,
Wald s Identity. by Jeffery Hein. Dartmouth College, Math 100
Wald s Identity by Jeffery Hein Dartmouth College, Math 100 1. Introduction Given random variables X 1, X 2, X 3,... with common finite mean and a stopping rule τ which may depend upon the given sequence,
Quotient Rings and Field Extensions
Chapter 5 Quotient Rings and Field Extensions In this chapter we describe a method for producing field extension of a given field. If F is a field, then a field extension is a field K that contains F.
Follow links for Class Use and other Permissions. For more information send email to: [email protected]
COPYRIGHT NOTICE: Ariel Rubinstein: Lecture Notes in Microeconomic Theory is published by Princeton University Press and copyrighted, c 2006, by Princeton University Press. All rights reserved. No part
Sums of Independent Random Variables
Chapter 7 Sums of Independent Random Variables 7.1 Sums of Discrete Random Variables In this chapter we turn to the important question of determining the distribution of a sum of independent random variables
A NOTE ON OFF-DIAGONAL SMALL ON-LINE RAMSEY NUMBERS FOR PATHS
A NOTE ON OFF-DIAGONAL SMALL ON-LINE RAMSEY NUMBERS FOR PATHS PAWE L PRA LAT Abstract. In this note we consider the on-line Ramsey numbers R(P n, P m ) for paths. Using a high performance computing clusters,
Continued Fractions and the Euclidean Algorithm
Continued Fractions and the Euclidean Algorithm Lecture notes prepared for MATH 326, Spring 997 Department of Mathematics and Statistics University at Albany William F Hammond Table of Contents Introduction
Inner Product Spaces
Math 571 Inner Product Spaces 1. Preliminaries An inner product space is a vector space V along with a function, called an inner product which associates each pair of vectors u, v with a scalar u, v, and
Constrained optimization.
ams/econ 11b supplementary notes ucsc Constrained optimization. c 2010, Yonatan Katznelson 1. Constraints In many of the optimization problems that arise in economics, there are restrictions on the values
Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products
Chapter 3 Cartesian Products and Relations The material in this chapter is the first real encounter with abstraction. Relations are very general thing they are a special type of subset. After introducing
Finitely Additive Dynamic Programming and Stochastic Games. Bill Sudderth University of Minnesota
Finitely Additive Dynamic Programming and Stochastic Games Bill Sudderth University of Minnesota 1 Discounted Dynamic Programming Five ingredients: S, A, r, q, β. S - state space A - set of actions q(
10 Evolutionarily Stable Strategies
10 Evolutionarily Stable Strategies There is but a step between the sublime and the ridiculous. Leo Tolstoy In 1973 the biologist John Maynard Smith and the mathematician G. R. Price wrote an article in
Bayesian Nash Equilibrium
. Bayesian Nash Equilibrium . In the final two weeks: Goals Understand what a game of incomplete information (Bayesian game) is Understand how to model static Bayesian games Be able to apply Bayes Nash
Goal Problems in Gambling and Game Theory. Bill Sudderth. School of Statistics University of Minnesota
Goal Problems in Gambling and Game Theory Bill Sudderth School of Statistics University of Minnesota 1 Three problems Maximizing the probability of reaching a goal. Maximizing the probability of reaching
Economics 1011a: Intermediate Microeconomics
Lecture 11: Choice Under Uncertainty Economics 1011a: Intermediate Microeconomics Lecture 11: Choice Under Uncertainty Tuesday, October 21, 2008 Last class we wrapped up consumption over time. Today we
Basic Proof Techniques
Basic Proof Techniques David Ferry [email protected] September 13, 010 1 Four Fundamental Proof Techniques When one wishes to prove the statement P Q there are four fundamental approaches. This document
PUTNAM TRAINING POLYNOMIALS. Exercises 1. Find a polynomial with integral coefficients whose zeros include 2 + 5.
PUTNAM TRAINING POLYNOMIALS (Last updated: November 17, 2015) Remark. This is a list of exercises on polynomials. Miguel A. Lerma Exercises 1. Find a polynomial with integral coefficients whose zeros include
The positive minimum degree game on sparse graphs
The positive minimum degree game on sparse graphs József Balogh Department of Mathematical Sciences University of Illinois, USA [email protected] András Pluhár Department of Computer Science University
A Little Set Theory (Never Hurt Anybody)
A Little Set Theory (Never Hurt Anybody) Matthew Saltzman Department of Mathematical Sciences Clemson University Draft: August 21, 2013 1 Introduction The fundamental ideas of set theory and the algebra
FACTORING POLYNOMIALS IN THE RING OF FORMAL POWER SERIES OVER Z
FACTORING POLYNOMIALS IN THE RING OF FORMAL POWER SERIES OVER Z DANIEL BIRMAJER, JUAN B GIL, AND MICHAEL WEINER Abstract We consider polynomials with integer coefficients and discuss their factorization
A characterization of trace zero symmetric nonnegative 5x5 matrices
A characterization of trace zero symmetric nonnegative 5x5 matrices Oren Spector June 1, 009 Abstract The problem of determining necessary and sufficient conditions for a set of real numbers to be the
Probability Generating Functions
page 39 Chapter 3 Probability Generating Functions 3 Preamble: Generating Functions Generating functions are widely used in mathematics, and play an important role in probability theory Consider a sequence
2. Information Economics
2. Information Economics In General Equilibrium Theory all agents had full information regarding any variable of interest (prices, commodities, state of nature, cost function, preferences, etc.) In many
How to Win Texas Hold em Poker
How to Win Texas Hold em Poker Richard Mealing Machine Learning and Optimisation Group School of Computer Science University of Manchester / 44 How to Play Texas Hold em Poker Deal private cards per player
