A Mathematical Perspective on Gambling



Similar documents
Hypothesis testing. Null and alternative hypotheses

I. Chi-squared Distributions

5: Introduction to Estimation

Properties of MLE: consistency, asymptotic normality. Fisher information.

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

PSYCHOLOGICAL STATISTICS

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

1. C. The formula for the confidence interval for a population mean is: x t, which was

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Sampling Distribution And Central Limit Theorem

Chapter 7 Methods of Finding Estimators

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

MARTINGALES AND A BASIC APPLICATION

A PROBABILISTIC VIEW ON THE ECONOMICS OF GAMBLING


A probabilistic proof of a binomial identity

Statistical inference: example 1. Inferential Statistics

Hypergeometric Distributions

Overview of some probability distributions.

Confidence Intervals for One Mean

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Determining the sample size

Chapter 7: Confidence Interval and Sample Size

Soving Recurrence Relations

Department of Computer Science, University of Otago

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

The Stable Marriage Problem

Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

Output Analysis (2, Chapters 10 &11 Law)

Chapter 14 Nonparametric Statistics

Section 11.3: The Integral Test

Estimating Probability Distributions by Observing Betting Practices

Math C067 Sampling Distributions

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

How To Solve The Homewor Problem Beautifully

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Infinite Sequences and Series

Normal Distribution.

3 Basic Definitions of Probability Theory

Measures of Spread and Boxplots Discrete Math, Section 9.4

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

1 Computing the Standard Deviation of Sample Means

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE The absolute value of the complex number z a bi is


Your organization has a Class B IP address of Before you implement subnetting, the Network ID and Host ID are divided as follows:

Practice Problems for Test 3

1 Correlation and Regression Analysis

Lesson 17 Pearson s Correlation Coefficient

LECTURE 13: Cross-validation

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

1. MATHEMATICAL INDUCTION

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Basic Elements of Arithmetic Sequences and Series

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

Lesson 15 ANOVA (analysis of variance)

Confidence intervals and hypothesis tests

Asymptotic Growth of Functions

5.3. Generalized Permutations and Combinations

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

One-sample test of proportions

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

THE TWO-VARIABLE LINEAR REGRESSION MODEL

CHAPTER 3 THE TIME VALUE OF MONEY

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

Central Limit Theorem and Its Applications to Baseball

Convexity, Inequalities, and Norms

Maximum Likelihood Estimators.

Theorems About Power Series

Sequences and Series

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized?

THE HEIGHT OF q-binary SEARCH TREES

INVESTMENT PERFORMANCE COUNCIL (IPC)

NATIONAL SENIOR CERTIFICATE GRADE 12

THE ABRACADABRA PROBLEM

BINOMIAL EXPANSIONS In this section. Some Examples. Obtaining the Coefficients

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

THE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

OMG! Excessive Texting Tied to Risky Teen Behaviors

CS103X: Discrete Structures Homework 4 Solutions

Cooley-Tukey. Tukey FFT Algorithms. FFT Algorithms. Cooley

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error

Transcription:

A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal distributios, the Cetral Limit Theorem, Bayesia aalysis, ad statistical hypothesis testig. These topics are applied to gamblig games ivolvig dice, cards, ad cois.. Itroductio. I 65, the well-kow gambler the Chevalier de Mere posed a set of problems cocerig gamblig odds to Blaise Pascal. These problems iitiated a series of letters betwee Pascal ad Pierre de Fermat, which established the basis of moder probability theory [, pp. 9 53]. The Chevalier de Mere was a astute gambler who observed a slight advatage i attemptig to roll a 6 i four tosses of a sigle die ad a slight disadvatage i attemptig to throw two 6s, a result referred to as a double-6, i twety-four tosses of two dice [, p. 89]. De Mere oticed that the proportio of the umber of tosses to the umber of possible outcomes is the same for both games :6 for the sigle-die game, ad :36 for the double-die game. Thus he cocluded that the odds of wiig each game should be equal. Yet from his experiece of playig these two games, he felt cofidet that there was a advatage to the sigle-die game ad a disadvatage to the double-die game. Thus he asked Pascal for a explaatio for this discrepacy. I their correspodece, Pascal ad Fermat laid the groudwork for the cocepts of sample space, probabilistic evets, expectatios, ad the biomial distributio. Sectio of this paper itroduces the ideas of sample spaces ad evets, ad applies these cocepts to de Mere s die games, poker, ad coi-tossig games. The work i this sectio cofirms de Mere s ituitio that the sigle-die game has a slight advatage p = 0.577 ad the double-die game has a slight disadvatage p = 0.9. Sectio 3 defies expectatios, ad fids the expected payoff for a sigle trial of the games discussed i Sectio. Sectio presets the biomial ad ormal distributios, ad explais how these distributios ca be used to calculate the odds of makig a profit i a large umber of trials of these games. Fially, Sectio 5 develops the cocepts of Bayesia aalysis ad statistical hypothesis testig, ad shows how these methods ca be used to calculate the likelihood that a give result came from a particular distributio.. Sample Spaces ad Evets. A sample space Ω is a set of all possible outcomes of a model of a experimet [, p. 6]. This set is fiest grai ; i other words, all distict experimetal outcomes are listed separately. I additio, the elemets i the set are mutually exclusive ; each trial of the experimet ca result i oly oe of the possible outcomes. The words outcome ad evet are commoly iterchaged; however, i probability theory these two words have distict meaigs. A evet is a subset of possible outcomes. As a example, a sigle roll of a die is a experimet with 3

MIT Udergraduate Joural of Mathematics six possible outcomes, correspodig to the faces that lad up. Rollig a eve umber is a evet that icludes the experimetal outcomes,, ad 6. The coditioal probability is the probability of Evet A give Evet B, ad is defied by P A B P A B = P B for P B > 0; see [, pp. 3 6]. Evet B is a coditioig evet, ad redefies the sample space for Evet A. This ew sample space icludes oly the outcomes i which B occurs. Evets A ad B are called idepedet if the occurrece of oe evet does ot affect the likelihood of the occurrece of the other [, p. 7]. More formally, Evets A ad B are idepedet if P A B = P A ad P B A = P B. The cocepts of coditioal probability ad idepedece are useful for Bayesia aalysis, discussed i Sectio 5. The idea of idepedece is also importat for the biomial distributio, discussed i Sectio. We ow cosider three examples of sample spaces ad probabilities. First we discuss die games. A sigle roll of a die has six possible outcomes, correspodig to the faces that lad up. If the die is fair, the the probability of ay particular face ladig up is /6. More formally, the sample space Ω is the set {,, 3,, 5, 6}, ad the probability is P X = i = /6. Games that ivolve rollig two dice have thirty-six possible outcomes sice the roll of each die has six possible outcomes. I these games, the dice are distiguishable; therefore, the outcomes i, j ad j, i are couted separately. More formally, the sample space Ω is the set of pairs i, j where i, j {,, 3,, 5, 6}, ad the probability is PX=i,Y=j =/36. The first game that the Chavalier de Mere discussed with Pascal ivolves rollig a die four times [, pp. 88 89]. If the die lads o a 6 at least oce i four trials, the the gambler wis. If the die ever lads o a 6 i the four trials, the the gambler loses. The four tosses are idepedet. Hece, the probability of ot gettig a 6 i four trials is 5/6, or 0.83. It follows that the probability of gettig at least oe 6 i four trials is equal to 0.83, or 0.577. The secod game discussed by Pascal ad de Mere ivolves simultaeously rollig two dice twety-four times [, pp. 88 89]. If both dice lad o 6, a outcome called a double-6, the the gambler wis the bet. If a double-6 does ot occur, the the gambler loses the bet. As i the first game, the trials are idepedet. Hece, the probability of ot gettig a double-6 i twety-four trials is 35/36, or 0.5086. It follows that the probability of gettig a double-6 i twety-four trials is equal to 0.5086, or 0.9. De Mere oticed the slight advatage of bettig o the first game ad the slight disadvatage of bettig o the secod game. To discer these small difffereces, de Mere must have played these games a large umber of times. A estimate for this umber is obtaied by usig statistical hypothesis testig i Sectio 5. As a secod example of sample spaces ad probabilities, we cosider card games. A stadard deck of cards cosists of fifty-two cards, divided ito four suits ad thirtee raks [, pp. 7 5]. I the simple case of drawig oe card from the deck, the sample space Ω is the fifty-two possible cards, ad the probability of gettig ay particular card is /5. Most card games ivolve hads of cards i which the order of the cards does ot matter. The umber of possible hads is the biomial coefficet x, where is the

A Mathematical Perspective o Gamblig 5 size of the deck of cards ad x is the size of a had. The biomial coefficiet x is defied by! = x x x! x!. The symbol x is read choose x. I a stadard poker game, each player receives a had of five cards [, p. 5 55]. The sample space Ω for poker is the set of all possible five-card hads, i which order is irrelevat. The size of this sample space is 5 5, or, 598, 960. Thus the probability of gettig a had with five particular cards is /, 598, 960, or 3.85 0 7. The probability of gettig a particular type of had is computed as follows: Cout the umber of ways this type of had ca occur, the divide this result by the total umber of hads. For example, a straight flush is a ru of five cosecutive cards i a particular suit. A straight flush ca occur i ay of the four suits, ad ca begi i te possible positios, through Jack. Thus, there are forty ways to obtai a staight flush. The probability of gettig a straight flush is, therefore, 0/, 598, 960, or 0.00005. As aother example, a full house cosists of three of a kid ad oe pair. There are 3, or 3, possible raks for the three of a kid ad 3, or, possible combiatios of the four cards of this rak. There are, or, remaiig raks for the pair ad, or 6, possible combiatios of the four cards of this rak. Thus, the probability of gettig a full house is 3 3 3, 7 5 =, 598, 960 = 0.00. 5 Table - lists possible poker hads, the umber of ways i which they ca occur, ad their probabilities of occurrig; it was take from [, p. 53]. Table - A table of poker hads ad their probabilities HAND NUMBER of WAYS PROBABILITY Straight flush 0 0.00005 Four of a kid 6 0.0000 Full House 3,7 0.00 Flush 5,08 0.00965 Straight 0,00 0.00395 Three of a kid 5,9 0.09 Two pair 3,55 0.07539 Oe pair,098,0 0.569 Worse tha oe pair,30,50 0.5077 TOTALS =, 598, 960.000000 5 5 As the last example of sample spaces ad probabilities, we discuss coi-tossig games. The sample space Ω for a sigle toss of a coi cosists of two outcomes: heads H, ad tails T. If the coi is fair, the the probability of heads ad the probability of tails are both /. The sample space for a game cosistig of five tosses of a coi is 5, or 3,

6 MIT Udergraduate Joural of Mathematics sequeces of the form X X X 3 X X 5, where each X i is H with probability /, or T with probability /. The tosses are idepedet. Therefore, the probability of gettig o heads i five tosses is / 5, or 0.035. Cosequetly, the probability of gettig at least oe head i five tosses of a fair coi is 0.035, or 0.96875. Cosider aother game that cosists of attemptig to toss exactly three heads i four trials. The sample space for this game is, or 6, sequeces of the form X X X 3 X, where each X i is H with probability /, or T with probability /. The evet of gettig exactly three heads correspods to the followig sequeces: HHHT, HHTH, HTHH, ad THHH. Thus, the probability of wiig this bet is /, or 0.5. Aother expressio for this probability is 3 /3 /, which is iterpreted as the umber of ways to choose three of the four positios for the heads multiplied by the probability of gettig three heads ad the probability of gettig oe tail. 3. Expectatios. For a iteger radom variable X, the expectatio EX is defied to be the sum of all possible experimetal values of the variable weighted by their probabilities [5, pp. 86 87]. The expectatio is writte as EX = P X =. The expectatio of a radom variable is its mea or expected value. Oe applicatio of the cocept of expectatio is to formalize the idea of the expected payoff of a game. For a iteger radom variable X with mea EX = µ, the variace σ X is defied by the formula, σ X = E[X µ ], see [3, p. 9]. The variace ca also be expressed as σ X = EX µ ; this equatio is proved i [3, p. 9]. The variace measures the spread of the distributio. The stadard deviatio σ X is the positive square root of the variace. This cocept is used i Sectios ad 5. We ow compute the expected payoffs of the games discussed i Sectio. First we fid the expected payoffs for die games. Suppose a gambler gais oe dollar for wiig a game, ad gives up oe dollar for losig. I de Mere s sigle-die game, the expected payoff is EX = $ 0.577 + $ 0.83 = $0.035, or approximately four cets. I his double-die game, the expected payoff is EX = $ 0.9 + $ 0.5086 = $0.07, or approximately a two-cet loss. Next we compute the expected payoff for casio poker. Suppose a gambler wages a oe-dollar bet o a five-card had. The gambler s payoff depeds o the type of had, with a rare had such as a straight flush resultig i a higher payoff tha a more commo had such as oe pair. Table 3- provides oe possible payoff scheme. Accordig to this scheme, the expected earigs for oe game are EX = $000 0.00005 + $500 0.0000 + $50 0.00 + $5 0.00965 + $0 0.0395 + $5 0.09 + $ 0.07539 + $0 0.569 + $ 0.5077 = $0.05568,

A Mathematical Perspective o Gamblig 7 or approximately a five-cet loss. Table 3- A table of poker hads ad their payoffs HAND PROBABILITY PAYOFF Straight flush 0.00005 $000 Four of a kid 0.0000 $500 Full House 0.00 $50 Flush 0.00965 $5 Straight 0.00395 $0 Three of a kid 0.09 $5 Two pair 0.07539 $ Oe pair 0.569 $0 Worse tha oe pair 0.5077 $ TOTAL.000000 Fially, we fid the expected payoffs for coi-tossig games. Suppose a gambler wis oe dollar for tossig heads ad loses oe dollar for tossig tails with a fair coi. His expected payoff is EX = $ 0.5 + $ 0.5 = 0. Suppose he bets oe dollar o gettig exactly three heads i four tosses. expected payoff is The his EX = $ 0.5 + $ 0.75 = $0.50, or a fifty-cet loss.. Repeated Trials. The previous sectio explored a gambler s expected earigs o a sigle trial of a game. A gambler may, however, wat to calculate his expected earigs if he plays the game may times. This sectio presets methods for calculatig the expected payoff ad estimatig the chaces of makig a profit o may repetitios of a game. Cosider the problem of calculatig the probability of wiig t trials out of, whe the probability of wiig each trial is set [, pp. 6]. The trials ca be expressed as idepedet, idetically distributed radom variables X,, X. Each X i has two outcomes:, which represets a wiig trial, ad occurs with probability p, ad 0, which represets a losig trial, ad occurs with probability p. The expectatio for each X i is EX i = p + 0 p = p. The radom variable S represets the umber of wiig trials i a series of trials, ad is give by S = X + X + + X.

8 MIT Udergraduate Joural of Mathematics Therefore, the expected umber of wiig trials is ES = EX + X + + X = EX + EX + + EX = EX i = p, sice the radom variables X i are idepedet ad idetically distributed. Furthermore, the probability of wiig exactly t trials out of ca be calculated by multiplyig the umber t of ways to choose t of the trials to be successes by the probability p t of wiig t games, ad the probability p t of losig t games. This is the biomial distributio, ad is defied by P S = t = t p t p t, for 0 t. As calculated above, the expectatio of S is Moreover, the variace of S is σ S ES = p. = p p. The probability of wiig at least t games out of is the sum of the probabilities of wiig t through games sice these evets are mutually exclusive. This probability is deoted by P S t = i=t i p i p i, for 0 t. As a example, cosider de Mere s sigle-die game, i which the probability of success o each trial is 0.577. I thirty trials, the expected umber of successes is ES 30 = 30 0.577 = 5.5, which rouds to 6. The probability of wiig exactly ietee trials out of thirty is 30 P S 30 = 9 = 0.577 9 0.83 = 0.0663. 9 Furthermore, the probability of wiig at least ietee trials out of thirty is P S 30 9 = 30 i=9 30 9 0.577 i 0.83 30 i = 0.388. The ormal, or Gaussia, distributio is importat because it describes may probabilistic pheomea [, pp. 07 ]. Cosider a ormal radom variable w with mea µ ad variace σ. The ormal distributio is defied by P w a = a σ π e w µ/σ / dw.

A Mathematical Perspective o Gamblig 9 A stadard ormal radom variable z has µ = 0 ad variace σ =. The stadard ormal distributio is defied by P z a = a π e z / dz = Φa. Most probability ad statistics textbooks provide tables for Φx; for example, see [3, pp. 676 77] ad [5, p. 77]. The Cetral Limit Theorem states that the probability distributio for a sum S of idepedet, idetically distributed radom variables X i approaches the ormal distributio as the umber of radom variables goes to ifiity [, pp. 5 9]. This result is remarkable as it does ot deped o the distributio of the radom variables. The Cetral Limit Theorem states that t ES lim P S t = Φ Hece, for fiite, but large values of, the followig approximatio holds: t ES P S t Φ. For example, i de Mere s sigle-die game, the radom variable S 30 represets the umber of wiig games out of thirty. The radom variable σ S z = S 30 ES 30 σ S30 is approximately stadard ormal. Therefore, the probability of wiig at least ietee trials out of thirty is σ S. P S 30 9 = P S 30 9 9 ES = P z σ S 9 30 0.577 Φ 30 0.577 0.83 = 0.8577 = 0.3. This probability provides a close estimate of the actual value 0.388, which was foud usig the biomial distributio. Most probability ad statistics textbooks provide graphs of the biomial ad ormal distributios for various values of ; for example, see [, p. 7] ad [3, p. 7]. The graphs idicate that as, the ormal distributio forms a good approximatio for the biomial distributio. 5. Bayesia Aalysis ad Statistical Hypothesis Testig. So far we have looked oly at the probabilities of observig particular outcomes ad the expected payoffs for various games with kow distributios. This last sectio focuses o estimatig the likelihood that a observed outcome came from a particular distributio. For example, a gambler tosses forty-oe heads i oe-hudred tosses of a coi, ad wishes to kow

30 MIT Udergraduate Joural of Mathematics if the coi is fair. Bayesia aalysis ad statistical hypothesis testig are two methods for determiig the likelihood that a outcome came from a particular distributio. Bayesia aalysis tests the likehlihood that a observed outcome came from a particular probability distributio by treatig ukow parameters of the distributio as radom variables [, pp. 50 57]. These ukow parameters have their ow probability distributios, which ofte are derived from the assumed probability distributio of the outcome. The derivatios use Bayes Theorem, which states that P A i B = P B A i P A i N i= P B A i P A i. For example, let θ be a ukow parameter that has two possible values, correspodig to the evets A = {θ = θ } ad A = {θ = θ }. The a priori probabilities for A ad A are P A ad P A. They do ot take the iformatio gaied from the occurrece of evet B ito accout. Evet B is a observed outcome, which has a assumed probability ditributio that depeds o the ukow parameter θ. Cosider the evet A = {θ = θ }. By Bayes Theorem, the probability of A give B is P A B = P B A P A P B A P A + P B A P A. Bayesia aalysis ca be applied to the example of the gambler who tossed forty-oe heads i oe-hudred coi tosses, ad wishes to kow if his coi is fair. Assume that the gambler has two cois, a fair coi p = / ad a biased coi p = /. Evet A correspods to his selectig the fair coi, ad Evet A correspods to his selectig the biased coi. He radomly selects the coi; therefore, the a priori probabilites, P A ad P A, are both equal to /. Evet B is the observed outcome that S 00 =. By Bayes Theorem, the probability that the gambler used the fair coi is P p = S 00 = = P S 00 = p = = 00 = 0.989. 59 P S 00 = p = P p = P p = + P S00 = p = 00 59 + 00 3 59 P p = Bayesia aalysis combies prior kowledge of the ukow parameter i this case, the a priori probabilities P p = / ad P p = / with experimetal evidece to form a better estimate of this parameter. I statistical hypothesis testig, a observed outcome is compared to a assumed model to determie the likelihood that the outcome came from the model [5, pp. 63 65]. If the outcome deviates sigificatly from the expected outcome of the assumed model, the it is ulikely that the result came from this model. Statistical hypothesis testig, therefore, ivolves determig what level of deviatio from the expected outcome is sigificat. Cosider a radom variable Y that has a probability distributio that relies o the parameter θ. This method tests a ull hypothesis H 0 : θ = θ 0 agaist a alterative

A Mathematical Perspective o Gamblig 3 hypothesis H, which may take the form θ < θ 0, θ > θ 0, or θ θ 0 ; see [3, pp. 336 39]. A sigificace level α is chose for the test, ad a critical value c that depeds o α is calculated. The critical regio cosists of the critical value, ad all values that are more extreme i the directio of the alterative hypothesis. If the observed outcome of Y falls withi the critical regio, the H 0 is rejected ad H is accepted. However, if the observed outcome of Y does ot fall withi the critical regio, the H 0 is accepted. The coi-tossig example ca be worked out usig statistical hypothesis testig. The hypothesis H 0 : p = / is tested agaist a alterative hypothesis H : p /. I the model where H 0 is true, the expectatio is ES 00 = 00 / = 50, ad the variace is σ S 00 = 00 / / = 5. By the Cetral Limit Theorem, the variable S 00 has a almost ormal distributio. Therefore, the variable z = S 00 50 5 has a approximately stadard ormal distributio. Thus, P.96 S 00 50.96 = 0.95, or P 0. S 00 59.8 = 0.95. 5 Sice the observed outcome, S 00 =, falls withi this iterval, H 0 is accepted at the α = 0.05 sigificace level. The sigificace level is the probability that the assumed model H 0 is erroeously rejected. This error is commoly referred to as a type I error. Likewise, a type II error is the evet that the assumed model H 0 is erroeously accepted. The probability of this type of error is deoted as β. Ideally, we wat α ad β to be as small as possible; however, decreasig both ivolves icreasig the sample size. For example, whe H 0 : p = / is tested agaist H : p = / i a coi-tossig model, H 0 will be rejected if the umber of heads S 00 is below some value c, kow as the critical value. I this case, α represets the probability that S 00 is less tha c whe p = /, ad β represets the probability that S 00 is greater tha c whe p = /. For istace, if α = 0.00 ad β = 0.00, this critical value c ad the sample size ca be foud by solvig the equatios for α ad β. Namely, α = 0.00 = P S c p = S = P c = P z z 0.00. Because z 0.00 = 3.090, it follows that Likewise, c β = 0.00 = P S c 3.090, ad c 3.090 p = = P Because z 0.00 = 3.090, it follows that c 3 S 3 3.090, ad c 3.090 +. 5 c 3 Combiig Expressios 5- ad 5- yields that 3.090 + 3.090 3 +. = P z z 0.00. 3 +. 5

3 MIT Udergraduate Joural of Mathematics Thus, is equal to 3.8, which is rouded to 33, ad c is equal to 8.6, which is rouded to 9. To reduce the probabilities of Type I ad Type II errors, α ad β, to 0.00, the umber of trials must be icreased to 33. As metioed i the Itroductio, the Chevalier de Mere oticed that his bet o the double-die game was less tha chace [, pp. 83 8]. Some of the techiques i this last sectio ca provide a estimate for the umber of trials de Mere must have played to legitimately claim that the probability of success was ideed less tha /. Assume that de Mere played the game a large umber of times, ad foud that he rolled a double-6 i 9% of the games. To determie whether he could legitimately accept that p < /, de Mere would eed to test the hypothesis H 0 : p = / agaist the hypothesis H : p < /. Let Y be the umber of times de Mere rolled a double-6 i trials. From the Cetral Limit Theorem, it follows that Y has a almost ormal distributio. Thus, the variable z = Y p p p = Y p p p has a approximate stadard ormal distributio. For a 0.05 sigificace level, it follows that If de Mere estimated that Y Cosequetly, Y P is 9 00, the 9 00 z 0.05 = 0.05 = α. z 0.05 =.65. = 6765. To legitimately accept the hypothesis H : p < / for the double-die game, de Mere must have played 6765 games. Refereces [] David, F. N., Games, Gods ad Gamblig, Hafer Publishig Compay, 96. [] Drake, A. W., Fudametals of Applied Probability Theory, McGraw-Hill, Ic., 988. [3] Hogg, R. V., ad Tais, E. A., Probability ad Statistical Iferece, Pretice-Hall, 997. [] Packel, E. W., The Mathematics of Games ad Gamblig, The Mathematical Associatio of America, 98. [5] Rota, G. C., with Baclawski, K., Billey, S., ad Sterlig, G., Itroductio to Probability Theory, Third Prelimiary Editio, Birkhauser Press, 995.