Central Limit Theorem and Its Applications to Baseball

Size: px
Start display at page:

Download "Central Limit Theorem and Its Applications to Baseball"

Transcription

1 Cetral Limit Theorem ad Its Applicatios to Baseball by Nicole Aderso A project submitted to the Departmet of Mathematical Scieces i coformity with the requiremets for Math 4301 (Hoours Semiar) Lakehead Uiversity Thuder Bay, Otario, Caada copyright c (2014) Nicole Aderso

2 Abstract This hoours project is o the Cetral Limit Theorem (CLT). The CLT is cosidered to be oe of the most powerful theorems i all of statistics ad probability. I probability theory, the CLT states that, give certai coditios, the sample mea of a sufficietly large umber or iterates of idepedet radom variables, each with a well-defied expected value ad well-defied variace, will be approximately ormally distributed. I this project, a brief historical review of the CLT is provided, some basic cocepts, two proofs of the CLT ad several properties are discussed. As a applicatio, we discuss how to use the CLT to study the samplig distributio of the sample mea ad hypothesis testig usig baseball statistics. i

3 Ackowledgemets I would like to thak my supervisor, Dr. Li, who helped me by sharig his kowledge ad may resources to help make this paper come to life. I would also like to thak Dr. Adam Va Tuyl for all of his help with Latex, ad support throughout this project. Thak you very much! ii

4 Cotets Abstract Ackowledgemets i ii Chapter 1. Itroductio 1 1. Historical Review of Cetral Limit Theorem 1 2. Cetral Limit Theorem i Practice 1 Chapter 2. Prelimiaries 3 1. Defiitios 3 2. Cetral Limit Theorem 7 Chapter 3. Proofs of Cetral Limit Theorem 8 1. Proof of Cetral Limit Theorem Usig Momet Geeratig Fuctios 8 2. Proof of Cetral Limit Theorem Usig Characteristic Fuctios 12 Chapter 4. Applicatios of the Cetral Limit Theorem i Baseball 14 Chapter 5. Summary 19 Chapter 6. Appedix 20 Bibliography 21 iii

5 CHAPTER 1 Itroductio 1. Historical Review of Cetral Limit Theorem The Cetral Limit Theorem, CLT for short, has bee aroud for over 275 years ad has may applicatios, especially i the world of probability theory. May mathematicias over the years have proved the CLT i may differet cases, therefore provided differet versios of the theorem. The origis of the Cetral Limit Theorem ca be traced to The Doctrie of Chaces by Abraham de Moivre i Abraham de Moivre s book provided techiques for solvig gamblig problems, ad i this book he provided a statemet of the theorem for Beroulli trails as well as gave a proof for p = 1. This was a very importat 2 discovery at the time which ispired may other mathematicias years later to look at de Moivre s previous work ad cotiue to prove it for other cases. [7] I 1812, Pierre Simo Laplace published his ow book titled Theorie Aalytique des Probabilities, i which he geeralized the theorem for p 1. He also gave a proof, although 2 ot a rigorous oe, for his fidig. It was ot util aroud did the Cetral Limit Theorem become more geeralized ad a complete proof was give by Aleksadr Lyapuov. A more geeral statemet of the Cetral Limit Theorem did appear i 1922 whe Lideberg gave the statemet, the sequece of radom variables eed ot be idetically distributed, istead the radom variables oly eed zero meas with idividual variaces small compared to their sum [3]. May other cotributios to the statemet of the theorem, as well as may differet ways to prove the theorem bega to surface aroud 1935, whe both Levy ad Feller published their ow idepedet papers regardig the Cetral Limit Theorem. The Cetral Limit Theorem has had, ad cotiues to have, a great impact i the world of mathematics. Not oly was the theorem used i probability theory, but it was also expaded ad ca be used i topology, aalysis ad may other fields i mathematics. 2. Cetral Limit Theorem i Practice The Cetral Limit Theorem is a powerful theorem i statistics that allows us to make assumptios about a populatio ad states that a ormal distributio will occur regardless of what the iitial distributio looks like for a sufficietly large sample size. May applicatios, such as hypothesis testig, cofidece itervals ad estimatio, use the Cetral Limit Theorem to make reasoable assumptios cocerig the populatio 1

6 Chapter 1. Itroductio 2 sice it is ofte difficult to make such assumptios whe it is ot ormally distributed ad the shape of the distributio is ukow. The goal of this project is to focus o the Cetral Limit Theorem ad its applicatios i statistics, as well as aswer the questios, Why is the Cetral Limit Theorem Importat?, How ca we prove the theorem? ad How ca we apply the Cetral Limit Theorem i baseball? Our paper is structured as follows. I Chapter 2 we will first give key defiitios that are importat i uderstadig the Cetral Limit Theorem. The we will give three differet statemets of the Cetral Limit Theorem. Chapter 3 will aswer the secod problem posed by provig the Cetral Limit Theorem. We will first give a proof usig momet geeratig fuctios, ad the we will give a proof usig characteristic fuctios. I Chapter 4 we will aswer the third problem ad show that the Cetral Limit Theorem ca be used to aswer the questio, Is there such thig as a home-field advatage i baseball? by usig a importat applicatio kow as hypothesis testig. Fially, Chapter 5 will summarize the results of the project ad discuss future applicatios.

7 CHAPTER 2 Prelimiaries This chapter will provide some basic defiitios, as well as some examples, to help uderstad the various compoets of the Cetral Limit Theorem. Sice the Cetral Limit Theorem has strog applicatios i probability ad statistics, oe must have a good uderstadig of some basic cocepts cocerig radom variables, probability distributio, mea ad variace, ad the like. 1. Defiitios There are may defiitios that must first be uderstood before we give the statemet of the Cetral Limit Theorem. The followig defiitios ca be foud i [12]. Defiitio 2.1. A populatio cosists of the etire collectio of observatios i which we are cocered. Defiitio 2.2. A experimet is a set of positive outcomes that ca be repeated. Defiitio 2.3. A sample is a subset of the populatio. Defiitio 2.4. A radom sample is a sample of size i which all observatios are take at radom ad assumes idepedece. Defiitio 2.5. A radom variable, deoted by X, is a fuctio that associates a real umber with every outcome of a experimet. We say X is a discrete radom variable if it ca assume at most a fiite or a coutably ifiite umber of possible values. A radom variable is cotiuous if it ca assume ay value i some iterval or itervals of real umbers ad the probability that it assumes ay specific value is 0. Example 2.6. Cosider if we wish to kow how well a baseball player performed this seaso by lookig at how ofte they got o base. Defie the radom variable X by X = { 1, if the hitter got o base, 0, if the hitter did ot get o base. This is a example of a radom variable with a Beroulli distributio. 3

8 Chapter 2. Prelimiaries 4 Defiitio 2.7. The probability distributio of a discrete radom variable X is a fuctio f that associates a probability with each possible value of x if it satisfies the followig three properties, 1. f(x) 0, 2. x f(x) = 1, 3. P (X = x) = f(x). where P (X = x) refers to the probability that the radom variable X is equal to a particular value, deoted by x. Defiitio 2.8. A probability desity fuctio for a cotiuous radom variable X, deoted f(x), is a fuctio such that 1. f(x) 0, for all x i R, 2. + f(x) dx = 1, 3. P (a < X < b) = b a f(x) dx for all a < b. Defiitio 2.9. Let X be a discrete radom variable with probability distributio fuctio f(x). The expected value or mea of X, deoted µ or E(X) is µ = E(X) = x f(x). Example We are iterested i fidig the expected umber of home rus that Jose Bautista will hit ext seaso based o his previous three seasos. To do this, we ca compute the expected value of home rus based o his last three seasos. Table 1. Jose Bautista s Yearly Home Rus Year Home Rus

9 Chapter 2. Prelimiaries 5 µ = E(X) = 43f(43) + 27f(27) + 28f(28) ( ) ( ) ( ) = = This tells us that based o the past three seasos, Jose Bautista is expected to hit approximately 33 home rus i the 2014 seaso. These statistics are take from [5]. Defiitio Let X be a radom variable with mea µ. The variace of X, deoted Var(x) or σ 2, is σ 2 = E[X E(X)] 2 = E(X 2 ) (E(X)) 2 = E(X 2 ) µ 2. Defiitio The stadard deviatio of a radom variable X, deoted σ, is the positive square root of the variace. Example Usig Alex Rodriguez s yearly triples from Table 2 below, compute the variace ad stadard deviatio. E(X 2 ) = X 2 = X 2 20 = = = 4.8 E(X) = X = X 20 = = = 1.5 σ 2 = E(X 2 ) E(X) 2 = 4.8 (1.5) 2 = 2.55 σ = 2.55 = These statistics are take from [5]. Defiitio A samplig distributio is the probability distributio of a statistic. Defiitio A cotiuous radom variable X is said to follow a Normal Distributio with mea µ ad variace σ 2 if it has a probability desity fuctio We write X N(µ, σ 2 ). f(x) = 1 2πσ e 1 2σ 2 (x µ)2 < x <. Example Cosider the battig averages of Major League Baseball Players i the 2013 baseball seaso.

10 Chapter 2. Prelimiaries 6 Table 2. Alex Rodriguez Stats Year AVG Triples Home Rus These statistics are take from [5]. Takig all of their battig averages, we ca see i the graph that the averages follow a bell curve, which is uique to ormal distributio. We see that the majority of players have a average betwee.250 ad.300, ad that few players have a average betwee.200 ad.225, ad.325 ad.350. This gives a perfect example of how ormal distributio

11 Chapter 2. Prelimiaries 7 ca help approximate eve discrete radom variables. Just by lookig at the graph we ca make some ifereces about the populatio. 2. Cetral Limit Theorem Over the years, may mathematicias have cotributed to the Cetral Limit Theorem ad its proof, ad therefore may differet statemets of the theorem are accepted. The first statemet of the theorem is widely kow as the de Moivre-Laplace Theorem, which was the very first statemet of the Cetral Limit Theorem. Theorem [3] Cosider a sequece of Beroulli trials with probability p of success, where 0 < p < 1. Let S deote the umber of successes i the first trials, 1. For ay a, b R {± } with a < b, ( lim P a S p b p(1 p) ) = 1 2π b e z 2 a 2 dz. Aother statemet of the Cetral Limit Theorem was give by Lyapuov which states: Theorem [8] Suppose X, 1, are idepedet radom variables with mea 0 0 for some δ > 2, the ad k=1 E( X k δ ) s δ S s distr N(0, 1), where S = X 1 + X X, s = k=1 E(X2 distr k ), 1 ad where represets covergece i distributio. Before givig the fial statemet of the Cetral Limit Theorem, we must defie what it meas for radom variables to be idepedet ad idetically distributed. Defiitio A sequece of radom variables is said to be idepedet ad idetically distributed if all radom variables are mutually idepedet, ad if each radom variable has the same probability distributio. We will ow give the fial statemet of the Cetral Limit Theorem, a special case of the Lideberg-Feller theorem. This statemet is the oe we will use throughout the rest of the paper. Theorem [8] Suppose X 1, X 2,, X are idepedet ad idetically distributed with mea µ ad variace σ 2 > 0. The, S µ σ 2 distr N(0, 1), where S = X 1 + X X, 1 ad distr represets covergece i distributio.

12 CHAPTER 3 Proofs of Cetral Limit Theorem There are may ways to prove the Cetral Limit Theorem. I this chapter we will provide two proofs of the Cetral Limit Theorem. The first proof uses momet geeratig fuctios, ad the secod uses characteristic fuctios. We will first prove the Cetral Limit Theorem usig momet geeratig fuctios. 1. Proof of Cetral Limit Theorem Usig Momet Geeratig Fuctios Before we give the proof of the Cetral Limit Theorem, it is importat to discuss some basic defiitios, properties ad remarks cocerig momet geeratig fuctios. First, we will give the defiitio of a momet geeratig fuctio as follows: Defiitio 3.1. The momet-geeratig fuctio (MGF) of a radom variable X is defied to be { M X (t) = E(e tx x ) = etx f(x), if X is discrete, + etx f(x)dx, if X is cotiuous. Momets ca also be foud by differetiatio. Theorem 3.2. Let X be a radom variable with momet-geeratig fuctio M X (t). We have where µ r = E(X r ). d r M X (t) dt r t=0 = µ r, Remark 3.3. µ r = E(X r ) describes the rth momet about the origi of the radom variable X. We ca see the that µ 1 = E(X) ad µ 2 = E(X 2 ) which therefore allows us to write the mea ad variace i terms of momets. Momet geeratig fuctios also have the followig properties. Theorem 3.4. M a+bx (t) = E(e t(a+bx) ) = e at M X (bt). Proof. M a+bx (t) = E[e t(a+bx) ] = E(e at ) E(e t(bx) ) = e at E(e (bt)x ) = e at M X (bt). Theorem 3.5. Let X ad Y be radom variables with momet-geeratig fuctios M X (t) ad M Y (t) respectively. The M X+Y (t) = M X (t) M Y (t). 8

13 Chapter 3. Proofs of Cetral Limit Theorem 9 Proof. M X+Y (t) = E(e t(x+y ) ) = E(e tx e ty ) = E(e tx ) E(e ty ) (by idepedece of radom variables) = M X (t) M Y (t). Corollary 3.6. Let X 1, X 2,..., X be radom variables, the M X1 +X X (t) = M X1 (t) M X2 (t) M X (t). The proof is early idetical to the proof of the previous theorem. To prove the Cetral Limit Theorem, it is ecessary to kow the momet geeratig fuctio of the ormal distributio: Lemma 3.7. The momet geeratig fuctio (MGF) of the ormal radom variable X with mea µ ad variace σ 2, (i.e., X N(µ, σ 2 )) is M X (t) = e µt+ σ2 t 2 2. Proof. First we will fid the MGF for the ormal distributio with mea 0 ad variace 1, i.e, N(0, 1). If Y N(0, 1), the M Y (t) = E(e ty ) = = + + e ty f(y)dx e ty ( 1 2π e 1 2 y2 )dy = 1 + e ty e 1 2 y2 dy 2π = 1 + e (ty 1 2 y2) dy 2π = 1 + e ( 1 2 t2 +[ 1 2 (y2 +2ty+t 2 )]) dy 2π = 1 + e 1 2 t2 e 1 2 (y2 2ty+t 2) dy 2π = e t2 e 1 2 (y t)2 dy. 2π But ote that by Defiitio 2.14, 1 2π + e 1 2 (y t)2 dy is just the probability distributio fuctio of ormal distributio. So

14 Chapter 3. Proofs of Cetral Limit Theorem 10 Now, if X N(µ, σ 2 ), ad by Theorem 3.3, M Y (t) = e 1 2 t2. M X (t) = M µ+σy (t) = e µt M Y (σt) = e µt e ( 1 2 σ2 t 2 ) = e (µt+ σ2 t 2 2 ). Before we begi the proof of the Cetral Limit Theorem, we must recall the followig remark from calculus: Lemma 3.8. e x = 1 + x + x2 2! + x3 3! + Now we are ready to prove the Cetral Limit Theorem. We will prove a special case of where M X (t) exists i a eighbourhood of 0. Proof. (of Theorem 2.20) Let Y i = X i µ σ for i = 1, 2, 3,... ad R = Y 1 +Y Y. So we have S µ σ 2 = Y 1 + Y Y = R. So S µ σ 2 = R = Z. Sice R is the sum of idepedet radom variables, we see that its momet geeratig fuctio is M R (t) = M Y1 (t)m Y2 (t) M Y (t) = [M Y (t)]

15 Chapter 3. Proofs of Cetral Limit Theorem 11 by Corollary 3.5. We ote that this is true because each Y i is idepedet ad idetically distributed. Now, ( ) ( ) ( M Z (t) = M R (t) = E e R (t) = E e (R)( t ) = M R Takig the atural logarithm of each side, But ote alog with usig Remark 3.7 that, lm Z (t) = lm Y ( t ). ( ) ( ) t M Y = E e t Y ( ) 1 where O stads for lim sup α We see that So we have, ( = E 1 + ty + ( t2 Y ) 2 2 ( ) = 1 + t2 E(Y 2 ) 1 + O ( ) = 1 + t O. ( O ) 1 α 1 α 3 2 <. The lm Z (t) = l (1 + t2 2 + O ( ( t 2 1 = 2 + O = t2 2 + O ( ). lm Z (t) = t2 2 + O ( ( 1 + O ( 1 )) ), M Z (t) e t2 2 as. 3 2 ) t = 3 2 )) )) (M Y ( )) t. Thus, Z N(0, 1), i.e, S µ σ 2 N(0, 1).

16 Chapter 3. Proofs of Cetral Limit Theorem Proof of Cetral Limit Theorem Usig Characteristic Fuctios Now we will prove the Cetral Limit Theorem aother way by lookig at characteristic fuctios. Momet geeratig fuctios do ot exist for all distributios. This is because some momets of the distributios are ot fiite. I these istaces, we look at aother geeral fuctio kow as the characteristic fuctio. is Defiitio 3.9. The characteristic fuctio of a cotiuous radom variable X C X (t) = E(e itx ) = + eitx f(x)dx, where t is a real valued fuctio, ad i = 1. C X (t) will always exist because e itx is a bouded fuctio, that is, e itx = 1 for all t, x R, ad so the itegral exists. The characteristic fuctio also has may similar properties to momet geeratig fuctios. To prove the cetral limit theorem usig characteristic fuctios, we eed to kow the characteristic fuctio of the ormal distributio. Lemma Let R, 1 be a sequece of radom variables. If, as, ) C R (t) = E (e irt e t2 2 for all t (, ), the R N(0, 1). We ca ow prove the Cetral Limit Theorem usig characteristics fuctios. Proof. (Of Theorem 2.20) Similar to the proof usig momet geeratig fuctios, let Y i = X i µ for i = 1, 2, 3... ad let R σ = Y 1 + Y Y so, S µ σ 2 = R = Z, where S = X 1 + X X. Note that R is the sum of idepedet radom variables, so we see that the characteristic fuctio of R is C Y (t) = C Y1 (t)c Y2 (t) C Y (t) = [C Y (t)] sice all Y i s are idepedet ad idetically distributed. Now,

17 Chapter 3. Proofs of Cetral Limit Theorem 13 C Z (t) = C R (t) = E[e i R t ] ( t = E[e i (R ) )] ( ) t = C R = [C Y ( t )]. Takig the atural logarithm of each side, lc Z (t) = lc Y ( t ). We ca ote from the previous proof with some modificatios that ( ) ( ) t C Y = 1 t2 + O 1. 2 The, Usig Remark 3.8, we see that 3 2 lc Z (t) = l(1 t2 + O( 1 )) lc Z (t) = t2 2 + O( ), So, as, lc Z (t) t2 2 ad Thus by Lemma 3.10, we coclude that C Z (t) e t2 2 as. Z = S µ σ 2 N(0, 1).

18 CHAPTER 4 Applicatios of the Cetral Limit Theorem i Baseball The Cetral Limit Theorem has may applicatios i probability theory ad statistics, but oe very iterestig applicatio is kow as hypothesis testig. This chapter will focus o the applicatio of hypothesis testig, ad i particular, aswer the followig questio: Problem 4.1. Is there such thig as a home-field advatage i Major League Baseball? Before we begi, there are a few defiitios that must be uderstood. Defiitio 4.2. A cojecture cocerig oe or more populatios is kow as a statistical hypothesis. Defiitio 4.3. A ull hypothesis is a hypothesis that we wish to test ad is deoted H 0. Defiitio 4.4. A alterative hypothesis represets the questio to be aswered i the hypothesis test ad is deoted by H 1. Remark 4.5. The ull hypothesis H 0 opposes the alterative hypothesis H 1. H 0 is commoly see as the complemet of H 1. Cocerig our problem, the ull hypothesis ad the alterative hypothesis are: H 0 : There is o home-field advatage, H 1 : There is a home-field advatage. Whe we do a hypothesis test, the goal is to determie if we will reject the ull hypothesis or if we fail to reject the ull hypothesis. If we reject H 0, we are i favour of H 1 because of sufficiet evidece i the data. If we fail to reject H 0, the we have isufficiet evidece i the data. Defiitio 4.6. A test statistic is a sample that is used to determie whether or ot a hypothesis is rejected or ot. Defiitio 4.7. A critical value is a cut off value that is compared to the test statistic to determie whether or ot the ull hypothesis is rejected. Defiitio 4.8. The level of sigificace of a test statistic is the probability that H 0 is rejected, although it is true. Defiitio 4.9. A z-score or z-value is a umber that idicates how may stadard deviatios a elemet is away from the mea. 14

19 Chapter 4. Applicatios of the Cetral Limit Theorem i Baseball 15 Defiitio A cofidece iterval is a iterval that cotais a estimated rage of values i which a ukow populatio parameter is likely to fall ito. Remark If the test statistic falls ito the iterval, the we fail to reject H 0, but if the test statistic is ot i the iterval, the we reject H 0. Defiitio A p-value is the lowest level of sigificace i which the test statistic is sigificat. Remark We reject H 0 if the p-value is very small, usually less tha Now to retur to our problem, is there such thig as a home-field advatage? How ca we test this otio? I the 2013 Major League Baseball seaso, there were 2431 games played, ad of those games, 1308 of them were wo at home. This idicates that approximately 53.81% of the games played were wo at home. We will let our observed value be this value, so ˆp = It seems as though there is such thig as a home-field advatage, but we must test this otio to be certai. To do this, we will test the hypothesis that there is o such thig as a home-field advatage, so our ull hypothesis will be H 0 : p = 0.50 That is, 50% of the Major League Baseball games are wo at home, hece, there is o home-field advatage. Our alterative hypothesis will be H 1 : p > If there is o home-field advatage, the we would expect our proportio to be 0.50, sice half of the games would be wo at home ad the other half o the road. Before we begi to compute if there is such thig as a home-field advatage we must first satisfy four coditios; idepedece assumptio, radom coditio, 10% coditio, ad the success/failure coditio. These coditios will assure that we ca test our hypothesis. Each game is idepedet of oe aother ad oe game does ot effect how aother game is played. Although i some cases whe a key batter or pitcher is ijured, the team may ot do as well i the immediate upcomig games, but roughly speakig, the games played are geerally idepedet of oe aother, ad so our idepedece coditio holds. Sice there have bee may games played over the years, each year havig roughly 2430 games, it ca be see that takig just oe year to observe the data will accout for

20 Chapter 4. Applicatios of the Cetral Limit Theorem i Baseball 16 our radomizatio coditio. Also, as stated above, we ca see that the 2431 games played i the 2013 seaso, are less tha 10% of the total games played over the years that Major League Baseball has bee aroud, so our 10% coditio also holds true, that is, the sample size is o more tha 10% of the populatio. Fially we must check that the umber of games multiplied by our proportio of 0.50, is larger tha 10. So we have p = 2431(0.50) = which is larger tha 10, so our success/failure coditio holds as well. Sice all of these coditios are met, we are ow able to use the Normal Distributio model to help us test our hypothesis. We will test our hypothesis usig two differet methods: the first by usig a cofidece iterval, ad the secod usig a p-value. First, we will test our hypothesis usig a cofidece iterval. For testig H 0 : p = 0.50 vs. H 1 : p > 0.50 at the 0.05 level of sigificace, we may costruct a right-sided 95% cofidece iterval for p. If our test statistic of p = 0.50 is i the iterval, the we fail to reject H 0 at the 0.05 level of sigificace. If p = 0.50 is ot i the iterval, we reject H 0. The right-sided 100(1 α)% cofidece iterval for p for a large sample is give by ˆp(1 ˆp) ˆp z α < p 1 where α is the level of sigificace. Sice = 2431, ˆp = , ad α = 0.05, we see from the Normal Distributio table i the Appedix that z 0.05 = So a right-sided 95% cofidece iterval for p is (0.5381)( ) < p ( ) < p < p 1. Sice 0.50 / (0.5215, 1], we reject H 0 : p = 0.50 i favour of H 1 : p > 0.50 at the 0.05 level of sigificace, that is, we have eough evidece to support that there is a home-field advatage, ad the home team wis more tha 50% of the games played at home. Now we will use the p-value approach to test our hypothesis. We must fid the z- value for testig our observed value. We use the followig equatio to do so;

21 Chapter 4. Applicatios of the Cetral Limit Theorem i Baseball 17 z = (ˆp po) poqo Now, with p = 0.50, ˆp = , ad = 2431, we have z = (ˆp po) poqo This results i a p-value < = = = 3.76 So we ca coclude, sice the p-value < is less tha 0.05, we reject H 0. That is, the data seems to support that the home field team wis more tha 50% of the time, ad hece there is such thig as a home-field advatage i Major League Baseball. We have show that takig all of the games played i the 2013 Major League Baseball seaso, that there is a home-field advatage, but is there a differece betwee the America League ad the Natioal League? Do both leagues have a home-field advatage? We will test this otio usig a 100(1 α)% cofidece iterval at the 0.01 level of sigificace. This will allow us to be 99% cofidet of our results. I the 2013 seaso, the Natioal League played 1211 games, ad wo 660 of those games at home. So this idicates that approximately 54.5% of the games were wo at home. As we calculated above, we will let the observed value be ˆp = ad we will test the same hypothesis, that is, H 0 : p = 0.50 vs. H 1 : p > 0.50 Sice = 1211, ˆp = ad α = 0.01, we ca see from the Normal Distributio table i the Appedix that z 0.01 = So a right-sided 99% cofidece iterval for p is (0.545)( ) < p ( ) < p < p 1. Sice 0.50 / (0.5117, 1], we reject H 0 : p = 0.50 i favour of H 1 : p > So we ca coclude that the Natioal League has a home-field advatage. Will the same be true for the America League? We will agai test the same hypothesis, usig a 99% cofidece iterval for the America League. I the 2013 seaso, the America League played slightly more games tha the Natioal League. They played 1220 games ad of those games, 648 of them were wo at home. So this idicates that approximately 53.11% of the games played were wo at home. Oce agai, let our observed value be ˆp = , ad testig the same hypothesis above, we

22 Chapter 4. Applicatios of the Cetral Limit Theorem i Baseball 18 see that a 99% cofidece iterval for p is (0.5311)( ) < p ( ) < p < p 1. Sice 0.50 (0.4978, 1], we fail to reject H 0 : p = That is, we do ot have eough evidece to support that there is a home-field advatage i the America League. We ca see that by testig these hypotheses for the Natioal League ad the America League, that we ca cofidetly state that there is a home-field advatage i the Natioal League, but we caot say the same thig for the America League based o the 2013 Major League Baseball seaso.

23 CHAPTER 5 Summary The Cetral Limit Theorem is very powerful i the world of mathematics ad as umerous applicatios i probability theory as well as statistics. I this paper, we have stated the Cetral Limit Theorem, proved the theorem two differet ways, oe usig momet geeratig fuctios ad aother usig characteristic fuctios, ad fially showed a applicatio of the Cetral Limit Theorem by usig hypothesis testig to aswer the questio, Is there such thig as a home-field advatage? We proved that we could express ormal distributio i terms of a momet geeratig fuctio, ad used this to prove the Cetral Limit Theorem, by showig that the momet geeratig fuctio coverges to the ormal distributio model. We the applied our results from the first proof usig momet geeratig fuctios to characteristic fuctios, otig that momet geeratig fuctios are ot always defied, ad oce agai arrived at the same coclusio ad provig the Cetral Limit Theorem. I our fial chapter, we successfully proved by takig statistics from the 2013 baseball seaso ad usig cofidece itervals, as well as a p-value, to show that there is ideed such thig as a home-field advatage i Major League Baseball. We also showed that we ca come to the same coclusio about the Natioal League, but we do ot have eough evidece to show that there is a home-field advatage i the America League. I the future, it may be iterestig to use my applicatio o other sports such as hockey, or football, although we must make sure that we have a sufficietly large sample size to have accurate results. Other applicatios of the Cetral Limit Theorem, as well as other properties such as covergece rates may also be iterestig areas of study for the future. 19

24 CHAPTER 6 Appedix 20

25 Bibliography [1] Albert, Jim. Teachig Statistics Usig Baseball. Washigto, DC: The Mathematical Associatio of America, [2] Characteristic Fuctios ad the Cetral Limit Theorem. Uiversity of Waterloo. Chapter 6. Web. dlmcleis/s901/chapt6.pdf. [3] Dubar, Steve R. The de Moivre-Laplace Cetral Limit Theorem. Topics i Probability Theory ad Stochastic Processes. 1, 7 [4] Emmauel Lesige. Heads or Tails: A Itroductio to Limit Theorems i Probability, Vol 28 of Studet Mathematical Library. America Mathematical Society, [5] ESPN.com ESPN Iteret Vetures. Web. 5, 6 [6] Filmus, Yuval. Two Proofs of the Cetral Limit Theorem. Ja/Feb Lecture. yuvalf/clt.pdf [7] Gristead, Charles M., ad J. Laurie Sell. Cetral Limit Theorem. Itroductio to Probability. Dartmouth College Web. aids/books articles/probability book/chapter9.pdf. 1 [8] Hildebrad, A.J. The Cetral Limit Theorem. Lecture. 7 [9] Itroductio to The Cetral Limit Theorem. The Theory of Iferece. NCSSM Statistics Leadership Istitute Notes. Web. Ist/PDFS/SEC 4 f.pdf [10] Krylov, N.V. A Udergraduate Lecture o The Cetral Limit Theorem. Lecture. krylov/clt1.pdf [11] Momet Geeratig Fuctios. Chapter 6. Web. [12] Walpole, Roald E, Raymod H. Myers, Sharo L. Myers, ad Keyig Ye. Probability & Statistics For Egieers & Scietists. Pretice Hall

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

Sampling Distribution And Central Limit Theorem

Sampling Distribution And Central Limit Theorem () Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,

More information

Math C067 Sampling Distributions

Math C067 Sampling Distributions Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

Practice Problems for Test 3

Practice Problems for Test 3 Practice Problems for Test 3 Note: these problems oly cover CIs ad hypothesis testig You are also resposible for kowig the samplig distributio of the sample meas, ad the Cetral Limit Theorem Review all

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

Section 11.3: The Integral Test

Section 11.3: The Integral Test Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the. Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

One-sample test of proportions

One-sample test of proportions Oe-sample test of proportios The Settig: Idividuals i some populatio ca be classified ito oe of two categories. You wat to make iferece about the proportio i each category, so you draw a sample. Examples:

More information

Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

4.3. The Integral and Comparison Tests

4.3. The Integral and Comparison Tests 4.3. THE INTEGRAL AND COMPARISON TESTS 9 4.3. The Itegral ad Compariso Tests 4.3.. The Itegral Test. Suppose f is a cotiuous, positive, decreasig fuctio o [, ), ad let a = f(). The the covergece or divergece

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

15.075 Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011 15.075 Exam 3 Istructor: Cythia Rudi TA: Dimitrios Bisias November 22, 2011 Gradig is based o demostratio of coceptual uderstadig, so you eed to show all of your work. Problem 1 A compay makes high-defiitio

More information

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series 8 Fourier Series Our aim is to show that uder reasoable assumptios a give -periodic fuctio f ca be represeted as coverget series f(x) = a + (a cos x + b si x). (8.) By defiitio, the covergece of the series

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Infinite Sequences and Series

Infinite Sequences and Series CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...

More information

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009) 18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error

STA 2023 Practice Questions Exam 2 Chapter 7- sec 9.2. Case parameter estimator standard error Estimate of standard error STA 2023 Practice Questios Exam 2 Chapter 7- sec 9.2 Formulas Give o the test: Case parameter estimator stadard error Estimate of stadard error Samplig Distributio oe mea x s t (-1) oe p ( 1 p) CI: prop.

More information

MARTINGALES AND A BASIC APPLICATION

MARTINGALES AND A BASIC APPLICATION MARTINGALES AND A BASIC APPLICATION TURNER SMITH Abstract. This paper will develop the measure-theoretic approach to probability i order to preset the defiitio of martigales. From there we will apply this

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

3 Basic Definitions of Probability Theory

3 Basic Definitions of Probability Theory 3 Basic Defiitios of Probability Theory 3defprob.tex: Feb 10, 2003 Classical probability Frequecy probability axiomatic probability Historical developemet: Classical Frequecy Axiomatic The Axiomatic defiitio

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006 Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam

More information

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test) No-Parametric ivariate Statistics: Wilcoxo-Ma-Whitey 2 Sample Test 1 Ma-Whitey 2 Sample Test (a.k.a. Wilcoxo Rak Sum Test) The (Wilcoxo-) Ma-Whitey (WMW) test is the o-parametric equivalet of a pooled

More information

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book) MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:

More information

How To Solve The Homewor Problem Beautifully

How To Solve The Homewor Problem Beautifully Egieerig 33 eautiful Homewor et 3 of 7 Kuszmar roblem.5.5 large departmet store sells sport shirts i three sizes small, medium, ad large, three patters plaid, prit, ad stripe, ad two sleeve legths log

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

A Recursive Formula for Moments of a Binomial Distribution

A Recursive Formula for Moments of a Binomial Distribution A Recursive Formula for Momets of a Biomial Distributio Árpád Béyi beyi@mathumassedu, Uiversity of Massachusetts, Amherst, MA 01003 ad Saverio M Maago smmaago@psavymil Naval Postgraduate School, Moterey,

More information

A Mathematical Perspective on Gambling

A Mathematical Perspective on Gambling A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal

More information

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1)

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1) BASIC STATISTICS. SAMPLES, RANDOM SAMPLING AND SAMPLE STATISTICS.. Radom Sample. The radom variables X,X 2,..., X are called a radom sample of size from the populatio f(x if X,X 2,..., X are mutually idepedet

More information

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY Physical ad Mathematical Scieces 2015, 1, p. 15 19 M a t h e m a t i c s AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM A. G. GULYAN Chair of Actuarial Mathematics

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Chapter 5: Inner Product Spaces

Chapter 5: Inner Product Spaces Chapter 5: Ier Product Spaces Chapter 5: Ier Product Spaces SECION A Itroductio to Ier Product Spaces By the ed of this sectio you will be able to uderstad what is meat by a ier product space give examples

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as: A Test of Normality Textbook Referece: Chapter. (eighth editio, pages 59 ; seveth editio, pages 6 6). The calculatio of p values for hypothesis testig typically is based o the assumptio that the populatio

More information

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? JÖRG JAHNEL 1. My Motivatio Some Sort of a Itroductio Last term I tought Topological Groups at the Göttige Georg August Uiversity. This

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is 0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values

More information

, a Wishart distribution with n -1 degrees of freedom and scale matrix.

, a Wishart distribution with n -1 degrees of freedom and scale matrix. UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that

More information

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. Powers of a matrix We begi with a propositio which illustrates the usefuless of the diagoalizatio. Recall that a square matrix A is diogaalizable if

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Theorems About Power Series

Theorems About Power Series Physics 6A Witer 20 Theorems About Power Series Cosider a power series, f(x) = a x, () where the a are real coefficiets ad x is a real variable. There exists a real o-egative umber R, called the radius

More information

5 Boolean Decision Trees (February 11)

5 Boolean Decision Trees (February 11) 5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Cofidece Itervals are a extesio of the cocept of Margi of Error which we met earlier i this course. Remember we saw: The sample proportio will differ from the populatio proportio by more

More information

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics

More information

THE HEIGHT OF q-binary SEARCH TREES

THE HEIGHT OF q-binary SEARCH TREES THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number. GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all

More information

The Stable Marriage Problem

The Stable Marriage Problem The Stable Marriage Problem William Hut Lae Departmet of Computer Sciece ad Electrical Egieerig, West Virgiia Uiversity, Morgatow, WV William.Hut@mail.wvu.edu 1 Itroductio Imagie you are a matchmaker,

More information

THE TWO-VARIABLE LINEAR REGRESSION MODEL

THE TWO-VARIABLE LINEAR REGRESSION MODEL THE TWO-VARIABLE LINEAR REGRESSION MODEL Herma J. Bieres Pesylvaia State Uiversity April 30, 202. Itroductio Suppose you are a ecoomics or busiess maor i a college close to the beach i the souther part

More information

Lecture 5: Span, linear independence, bases, and dimension

Lecture 5: Span, linear independence, bases, and dimension Lecture 5: Spa, liear idepedece, bases, ad dimesio Travis Schedler Thurs, Sep 23, 2010 (versio: 9/21 9:55 PM) 1 Motivatio Motivatio To uderstad what it meas that R has dimesio oe, R 2 dimesio 2, etc.;

More information

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations CS3A Hadout 3 Witer 00 February, 00 Solvig Recurrece Relatios Itroductio A wide variety of recurrece problems occur i models. Some of these recurrece relatios ca be solved usig iteratio or some other ad

More information

Hypergeometric Distributions

Hypergeometric Distributions 7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you

More information

OMG! Excessive Texting Tied to Risky Teen Behaviors

OMG! Excessive Texting Tied to Risky Teen Behaviors BUSIESS WEEK: EXECUTIVE EALT ovember 09, 2010 OMG! Excessive Textig Tied to Risky Tee Behaviors Kids who sed more tha 120 a day more likely to try drugs, alcohol ad sex, researchers fid TUESDAY, ov. 9

More information

Multi-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu

Multi-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu Multi-server Optimal Badwidth Moitorig for QoS based Multimedia Delivery Aup Basu, Iree Cheg ad Yizhe Yu Departmet of Computig Sciece U. of Alberta Architecture Applicatio Layer Request receptio -coectio

More information

3. Greatest Common Divisor - Least Common Multiple

3. Greatest Common Divisor - Least Common Multiple 3 Greatest Commo Divisor - Least Commo Multiple Defiitio 31: The greatest commo divisor of two atural umbers a ad b is the largest atural umber c which divides both a ad b We deote the greatest commo gcd

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

Unit 8: Inference for Proportions. Chapters 8 & 9 in IPS

Unit 8: Inference for Proportions. Chapters 8 & 9 in IPS Uit 8: Iferece for Proortios Chaters 8 & 9 i IPS Lecture Outlie Iferece for a Proortio (oe samle) Iferece for Two Proortios (two samles) Cotigecy Tables ad the χ test Iferece for Proortios IPS, Chater

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

Irreducible polynomials with consecutive zero coefficients

Irreducible polynomials with consecutive zero coefficients Irreducible polyomials with cosecutive zero coefficiets Theodoulos Garefalakis Departmet of Mathematics, Uiversity of Crete, 71409 Heraklio, Greece Abstract Let q be a prime power. We cosider the problem

More information

THE ABRACADABRA PROBLEM

THE ABRACADABRA PROBLEM THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected

More information

Confidence intervals and hypothesis tests

Confidence intervals and hypothesis tests Chapter 2 Cofidece itervals ad hypothesis tests This chapter focuses o how to draw coclusios about populatios from sample data. We ll start by lookig at biary data (e.g., pollig), ad lear how to estimate

More information

Quadrat Sampling in Population Ecology

Quadrat Sampling in Population Ecology Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may

More information

Chapter 5 O A Cojecture Of Erdíos Proceedigs NCUR VIII è1994è, Vol II, pp 794í798 Jeærey F Gold Departmet of Mathematics, Departmet of Physics Uiversity of Utah Do H Tucker Departmet of Mathematics Uiversity

More information