Monte Carlo Methods and Importance Sampling

Size: px
Start display at page:

Download "Monte Carlo Methods and Importance Sampling"

Transcription

1 Lecture Notes for Stat 578C c Eric C. Aderso Statistical Geetics 2 October 999 (subbi for E.A Thompso) Mote Carlo Methods ad Importace Samplig History ad defiitio: The term Mote Carlo was apparetly first used by Ulam ad vo Neuma as a Los Alamos code word for the stochastic simulatios they applied to buildig better atomic bombs. Their methods, ivolvig the laws of chace, were aptly amed after the iteratioal gamig destiatio; the moiker stuck ad soo after the War a wide rage of sticky problems yielded to the ew techiques. Despite the widespread use of the methods, ad umerous descriptios of them i articles ad moographs, it is virtually impossible to fid a succit defiitio of Mote Carlo method i the literature. Perhaps this is owig to the ituitive ature of the topic which spaws may defiitios by way of specific examples. Some authors prefer to use the term stochastic simulatio for almost everythig, reservig Mote Carlo oly for Mote Carlo Itegratio ad Mote Carlo Tests (cf. Ripley 987). Others seem less cocered about blurrig the distictio betwee simulatio studies ad Mote Carlo methods. Be that as it may, a suitable defiitio ca be good to have, if for othig other tha to avoid the awkwardess of tryig to defie the Mote Carlo method by appealig to a whole bevy of examples of it. Sice I am (so Elizabeth claims!) uduly iflueced by my advisor s ways of thikig, I like to defie Mote Carlo i the spirit of defiitios she has used before. I particular, I use: Defiitio: Mote Carlo is the art of approximatig a expectatio by the sample mea of a fuctio of simulated radom variables. We will fid that this defiitio is broad eough to cover everythig that has bee called Mote Carlo, ad yet makes clear its essece i very familiar terms: Mote Carlo is about ivokig laws of large umbers to approximate expectatios. While most Mote Carlo simulatios are doe by computer today, there were may applicatios of Mote Carlo methods usig coi-flippig, card-drawig, or eedle-tossig (rather tha computergeerated pseudo-radom umbers) as early as the tur of the cetury log before the ame Mote Carlo arose. I more mathematical terms: Cosider a (possibly multidimesioal) radom variable X havig probability mass fuctio or probability desity fuctio f X (x) which is greater tha zero o a set of values X. The the expected value of a fuctio g of X is E(g(X)) = g(x)f X (x) () if X is discrete, ad E(g(X)) = g(x)f X (x)dx (2) if X is cotiuous. Now, if we were to take a -sample of X s, (x,...,x ), ad we computed the mea of g(x) over the sample, the we would have the Mote Carlo estimate g (x) = g(x i ) This applies whe the simulated variables are idepedet of oe aother, ad might apply whe they are correlated with oe aother (for example if they are states visited by a ergodic Markov chai). For ow we will just deal with idepedet simulated radom variables, but all of this exteds to samples from Markov chais via the weak law of large umbers for the umber of passages through a recurret state i a ergodic Markov chai (see Feller 957). You will ecouter this later whe talkig about MCMC.

2 2 c Eric C. Aderso. You may duplicate this freely for pedagogical purposes. of E(g(X)). We could, alteratively, speak of the radom variable g (X) = g(x) which we call the Mote Carlo estimator of E(g(X)). If E(g(X)), exists, the the weak law of large umbers tells us that for ay arbitrarily small ɛ lim P ( g (X) E(g(X)) ɛ) =. This tells us that as gets large, the there is small probability that g (X) deviates much from E(g(X)). For our purposes, the strog law of large umbers says much the same thig the importat part beig that so log as is large eough, g (x) arisig from a Mote Carlo experimet shall be close to E(g(X)), as desired. Oe other thig to ote at this poit is that g (X) is ubiased for E(g(X)): ( ) E( g (X)) = E g(x i ) = E(g(X i )) = E(g(X)). Makig this useful: The precedig sectio comes to life ad becomes useful whe oe realizes that very may quatities of iterest may be cast as expectatios. Most importatly for applicatios i statistical geetics, it is possible to express all probabilities, itegrals, ad summatios as expectatios: Probabilities: Let Y be a radom variable. The probability that Y takes o some value i a set A ca be expressed as a expectatio usig the idicator fuctio: P (Y A) =E(I {A} (Y )) (3) where I {A} (Y ) is the idicator fuctio that takes the value whe Y A ad whe Y A. Itegrals: Cosider a problem ow which is completely determiistic itegratig a fuctio q(x) from a to b (as i high-school calculus). So we have b a q(x)dx. This ca be expressed as a expectatio with respect to a uiformly distributed, cotiuous radom variable U betwee a ad b. U has desity fuctio f U (u) =/(b a), so if we rewrite the itegral we get (b a) b a q(x) dx =(b a) b a b a q(x)f U (x)dx =(b a)e(q(u))...voila! Discrete Sums: The discrete versio of the above is just the sum of a fuctio q(x) over the coutably may values of x i a set A. If we have a radom variable W which takes values i A all with equal probability p (so that w A p = the the sum may be cast as the expectatio q(x) = q(x)p = E(q(W )). p p x A x A The immediate cosequece of this is that all probabilities, itegrals, ad summatios ca be approximated by the Mote Carlo method. A crucial thig to ote, however, is that there is o restrictio that says U or W above must have uiform distributios. This is just for easy illustratio of the poits above. We will explore this poit more while cosiderig importace samplig.

3 c Eric C. Aderso. Correctios, commets?= 3 Example I: Approximatig probabilities by Mote Carlo. Cosider a Wright-Fisher populatio of size N diploid idividuals i which X t couts the umbers of copies of a certai allelic type i the populatio at time t. At time zero, there are x copies of the allele. Give this, what is the probability that the allele will be lost from the populatio i t geeratios, i.e., P (X t = X = x )? This ca be computed exactly by multiplyig trasitio probability matrices together, or by employig the Baum (972) algorithm (which you will lear about later), but it ca also be approximated by Mote Carlo. It is simple to simulate geetic drift i a Wright-Fisher populatio; thus we ca easily simulate values for X t give X = x. The, P (X t = X = x )=E(I {} (X t ) X = x ) where I {} (X t ) takes the value whe X t = ad otherwise. Deotig the i th simulated value of X t by x (i) t our Mote Carlo estimate would be P (X t = X = x ) I {} (x (i) t ). Example II: Mote Carlo approximatios to distributios. A simple extesio of the above example is to approximate the whole probability distributio P (X t X = x ) by Mote Carlo. Cosider the histogram below: Mote Carlo Estimate of Probability Number of Copies of Allele It represets the results of simulatios i which =5,, x = 6, t = 4, the Wright-Fisher populatio size N = diploids, ad each rectagle represets a Mote Carlo approximatio to P (a X 4 <a+2 X = 6), a =, 2, 4,...,2. For each such probability, the approximatio follows from P (a X 4 <a+2 X = 6) = E(I {a X<a+2} (X 4 )) I {a X<a+2} (x (i) 4 ),a=, 2,...,2 Example III: A discrete sum over latet variables. I may applicatios i statistical geetics, the probability P (Y ) of a observed evet Y must be computed as the sum over very may latet variables X of the joit probability P (Y,X). I such a case, Y is typically fixed, i.e., we have observed Y = y so we are iterested i P (Y = y), but we ca t observe the values of the latet variables which may take values i the space X. Though it follows from the laws of probability that P (Y = y) = P (Y = y, X = x), quite ofte X is such a large space (cotais so may elemets) that it is impossible to compute the sum. Applicatio of the law of coditioal probability, however, gives P (Y = y) = P (Y = y, X = x) = P (Y = y X = x)p (X = x). (4)

4 4 c Eric C. Aderso. You may duplicate this freely for pedagogical purposes. The term followig the last equals sig is the sum over all x of a fuctio of x [amely, P (Y = y X = x)], weighted by the margial probabilities P (X = x). Clearly this is a expectatio, ad therefore may be approximated by Mote Carlo, givig us P (Y = y) P (Y = y X = x i ) where x i is the i th realizatio from the margial distributio of X. You will see this sort of thig may times agai i Stat 578C. OK, these examples have all bee preseted as if the applicatio of Mote Carlo to practically ay problem is a soporific ad trivial exercise. However, othig could be further from the truth! Though it is typically easy to formulate a quatity as a expectatio ad to propose a aive Mote Carlo estimator, it is quite aother thig to actually have the Mote Carlo estimator provide you with good estimates i a reasoable amout of computer time. For most problems, a umber of Mote Carlo estimators may be proposed, however some Mote Carlo estimators are clearly better tha others. Typically, a better Mote Carlo estimator has smaller variace (for the same amout of computatioal effort) tha its competitors. Thus we tur to matters of... Mote Carlo variace: Goig back to our origial otatio, we have the radom variable g (X), a Mote Carlo estimator of E(g(X)). Like all radom variables, we may compute its variace (if it exists) by the stadard formulas: ( ) Var( g (X))=Var g(x i ) = Var(g(X)) = [g(x) E(g(X))] 2 f X (x) (5) if X is discrete, ad Var( g (X))=Var ( ) g(x i ) = Var(g(X)) = [g(x) E(g(X))] 2 f X (x)dx (6) if X is cotiuous. From here o out, let us do everythig i terms of itegrals over cotiuous variables, but it all applies equally well to sums over discrete radom variables. There are umerous ways to reduce the variace of Mote Carlo estimators. Of these variace-reductio techiques, the oe called importace samplig is particularly useful. I fid that it is best itroduced by describig its atithesis which I call irrelevace samplig or barely relevat samplig, which we will tur to after a short digressio. Digressio : Estimatig Var( g (X)): Sice, typically E( g (X)) i (5) or (6) is ukow ad the sum or the itegral is ot feasibly computed (that is why we would be doig Mote Carlo i the first place) the formulas i (5) ad (6) are ot useful for estimatig the variace associated with your Mote Carlo estimate whe you are actually doig Mote Carlo. Istead, just like approximatig the variace from a sample à la our earliest statistics classes, we have a ubiased estimator for Var(g(X)): Var(g(X)) = (g(x i ) g (x)) 2 (7) (This is just the familiar s 2 from Statistics ). The ubiased estimate of the variace of g (X) is / of that: Var( g (X)) = Var(g(X)) = ( ) (g(x i ) g (x)) 2. (8)

5 c Eric C. Aderso. Correctios, commets?= 5 The form give i (8) is ot particularly satisfyig if oe does ot wat to wait util the ed of the simulatio (util is reached) to compute the variace. To this ed, the followig formulas are extremely useful: The mea ca be computed o the fly, recursively by: g + (x) = + ( g (x)+g(x + )). (9) If we also exped the effort to record the sum of the squares of the g(x) s, SS g the a simple calculatio gives Var( g (X)): Var( g (X)) = ( SS g = [g(x i)] 2, ) ( g (x)) 2. () Whe programmig i a laguage like C, usig zero-base subscriptig, I ofte cofuse myself tryig to implemet the above recursio ad formulas. So, primarily for my ow beefit, (though this may be a useful referece for you if you program i C or C ++ ) I iclude a code sippet for implemetig the above: // variable declaratios double i; // the idex for coutig umber of reps (declarig as double // avoids havig to cast it to a double all the time) double gx; // the values that will be realized double mea_gx; // the Mote Carlo average that we will accumulate (our Mote Carlo estimate) double ss_gx; // the sum of squares of gx double var_of_mea_gx; // the estimate of our mote carlo variace // variable iitializatios: mea_gx =.; ss_gx =.; // loop over the idex i. these are repetitios i the simulatio: for(i=.;i<;i+=.) { gx = "the value for gx realized o this repetio"; mea_gx += (gx - mea_gx)/(i+.); // the curret Mote Carlo estimate ss_gx += gx * gx; // the curret sum of squares if(i>) { var_of_mea_gx = (ss_gx - (i+.)*(mea_gx * mea_gx) ) / (i * (i+.) ); // above is the curret estimate of the variace of our Mote Carlo estimator } } Barely relevat samplig: Back to the task at had. To itroduce importace samplig we cosider its opposite. Imagie that we wat a Mote Carlo approximatio to g(x)dx for g(x) show i the figure below. Note that g(x) =forx< ad x>..75 g(x) desity of a Uiform(, ) r.v. desity of a Uiform(, 5) r.v If we have U Uiform(, ), the we ca cast the itegral as the expectatio with respect to U: g(x)dx = E(g(U)), so we may approximate it by Mote Carlo: g(u i). This would work reasoably well. The figure, however, suggests aother possibility. Oe could use W Uiform(, 5) givig g(x)dx =5 E(g(W )) ad hece the Mote Carlo estimator 5 g(w i). Obviously such a

6 6 c Eric C. Aderso. You may duplicate this freely for pedagogical purposes. course would make o sese at all because, o average, 8% of the realized w i s would tell you othig substatial about the itegral of g(x) sice g(x) = for <x<5. This would be barely relevat samplig, ad o oe i their right mid would willigly do it. It does make clear that oe s choice of distributio from which to draw their radom variables will affect the quality of their Mote Carlo estimator. Importace samplig: Importace samplig is choosig a good distributio from which to simulate oe s radom variables. It ivolves multiplyig the itegrad by (usually dressed up i a tricky fashio ) to yield a expectatio of a quatity that varies less tha the origial itegrad over the regio of itegratio. For example, let h(x) be a desity for the radom variable X which takes values oly i A so that h(x) x A h(x)dx =. The h(x) = ad g(x)dx = g(x) h(x) ( ) x A x A h(x) dx = g(x) g(x) x A h(x) h(x)dx = E h, () h(x) so log as h(x) for ay x A for which g(x), ad where E h deotes the expectatio with respect to the desity h. This gives a Mote Carlo estimator: g h (X) = g(x i ) h(x i ) where X i h(x). (2) Usig (6) ad the Cauchy-Schwarz iequality, it ca be show that Var( g (X)) h is miimized whe h(x) g(x) (see Rubistei 98, p. 23). If we restrict our attetio to what for most of our purposes is the truly relevat case, 2 that is, g(x) x A, the it is immediately apparet that the choice of the desity h(x) which miimizes Mote Carlo variace is proportioal to g(x), i.e., ifαh(x) = g(x) where α is some costat of proportioality, the clearly we have g(x)/h(x) =α {x : h(x) > } so E(g(X)/h(X)) = α ad hece the Mote Carlo variace would be zero by (6). Woderful! All we eed to do to have a Mote Carlo estimator with zero variace is use (2) ad make sure that our desity h is proportioal to the fuctio g. The absurdity of this wishful thikig is that the ability to simulate idepedet radom variables from h(x), or the ability to compute the desity h(x), itself, implies that the ormalizig costat of the distributio is computable, which i tur would imply that the origial itegral ivolvig g(x) is computable ad there would hece be o reaso to do Mote Carlo at all! Ultimately, however, it makes clear that a good importace samplig fuctio (as h is called) will be oe that is as close as possible to beig proportioal to g(x) a poit made by the followig cotrived example. Example V: A cotrived demostratio. Let us approximate by Mote Carlo the area uder a Normal(, ) desity curve from -5 to 5. This quatity will, of course, be extremely close to (ad we may as well call it ). This is cotrived because o oe would ever do this i practice... Noetheless, we will use a series of importace samplig fuctios: (a) a Uiform( 5, 5) desity, (b) a cauchy desity (a t radom variable o df) trucated at 5 ad 5, ad (c) a trucated t radom variable o 3 df. Figure o Page 7 shows each importace samplig fuctio as a dashed curve ext to the ormal curve. Below each of these is the histogram of 5, Mote Carlo estimates (usig =, ) of the area uder the ormal curve. As is clear from the progressio from (a) to (c), the Mote Carlo estimates are less variable whe the importace samplig fuctio is closer to the shape of the ormal desity. (Note: the importat feature is that the shape of the curves is closer. Obviously g(x) ad h(x) will, i geeral, ot be similar i height.) 2 This is typically the relevat case because we are iterested i o-egative quatities like probabilities.

7 c Eric C. Aderso. Correctios, commets?= # out of 5, 2 5 # out of 5, # out of 5, < Estimated Area > < Estimated Area >.5 5 < Estimated Area >.5 (a) Uiform (b) t Dist. (c) t 3 Dist. Figure : Three differet importace samplig fuctios (dotted lies) used to itegrate the stadard ormal desity (solid lie) from 5 to 5. Top paels are the desity curves ad bottom paels are histograms of 5, Mote Carlo estimates of the area (which is exactly ) usig =,. I summary, a good importace samplig fuctio h(x) should have the followig properties:. h(x) > wheever g(x) 2. h(x) should be close to beig proportioal to g(x) 3. it should be easy to simulate values from h(x) 4. it should be easy to compute the desity h(x) for ay value x that you might realize. Fulfillig this wish-list i high dimesioal space (where Mote Carlo techiques are most useful) is quite ofte a tall task, ad ca provide hours of etertaimet, ot to metio dissertatio chapters, etc. Note also that g(x) is ay arbitrary fuctio, so it certaily icludes the itegrad of a stadard expectatio. For example, with X f X we might be iterested i E(r(X)) for some fuctio r so we could use ( ) r(x)fx (x) r(x)fx (x) E(r(X)) = r(x)f X (x)dx = h(x) =E h h(x) h(x) ad the go searchig about for a suitable h(x) that is close to proportioal to r(x)f X (x). Example VI: Latet variables ad importace samplig Goig back to Example III with the discrete sum over latet variables X it is clear that the optimal importace samplig fuctio would be the coditioal distributio of X give Y, i.e., P (Y = y) = P (Y = y, X = x) = P (Y = y, X = x) P (X Y = y). P (X Y = y)

8 8 c Eric C. Aderso. You may duplicate this freely for pedagogical purposes. 2 # out of 5, < Estimated Area >.5 Figure 2: Histogram of 5, Mote Carlo estimates of the area uder a trucated t distributio with oe df usig the stadard ormal desity as the importace samplig fuctio. The true area is. Note the several very high values (>.5). Note that the right side is a coditioal expectatio of a fuctio of X. As before P (X Y )is ot computable. So oe must tur to fidig some other distributio, say P (X), that is close to P (X Y ) but which is more easily sampled from ad computed. A commo pitfall of importace samplig: As a fial word o importace samplig, it should be poited out that the tails of the distributios matter! While h(x) might be roughly the same shape as g(x), serious difficulties arise if h(x) gets small much faster tha g(x) out i the tails. I such a case, though it is improbable (by defiitio) that you will realize a value x i from the far tails of h(x), if you do, the your Mote Carlo estimator will take a jolt g(x i )/h(x i ) for such a improbable x i may be orders of magitude larger tha the typical values g(x)/h(x) that you see. As a example of this pheomeo, ivestigate Figure 2 which shows the histogram of 5, Mote Carlo estimates of the area betwee -5 ad 5 of a t desity (trucated at -5 ad 5 ad reormalized so the exact area is ). The importace samplig fuctio used for this was a stadard, uit ormal desity, which obviously gets small i the tails much faster tha a cauchy (see Figure (b)). Note i particular that about 5 of the 5, Mote Carlo estimates were greater tha.5! Further readig: A classic referece o Mote Carlo is Hammersley ad Hadscomb (964). They describe several other variace-reductio techiques that you might fid iterestig. Refereces Baum, L. E., 972 A iequality ad associated maximizatio techique i statistical estimatio for probabilistic fuctios of Markov processes. I O. Shisha (Ed.), Iequalities III: Proceedigs of the Third Symposium o Iequalities Held at the Uiversity of Califoria, Los Ageles, September 9, 969, pp. 8. New York: Academic Press. Feller, W., 957 A Itroductio to Probability Theory ad Its Applicatios, 2d Editio. New York: Joh Wiley & Sos. Hammersley, J. M. ad D. C. Hadscomb, 964 Mote Carlo Methods. Lodo: Methue & Co Ltd. Ripley, B. D., 987 Stochastic Simulatio. New York: Wiley & Sos. Rubistei, B. Y., 98 Simulatio ad the Mote Carlo Method. New York: Wiley & Sos.

Properties of MLE: consistency, asymptotic normality. Fisher information.

Properties of MLE: consistency, asymptotic normality. Fisher information. Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout

More information

I. Chi-squared Distributions

I. Chi-squared Distributions 1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.

More information

Overview of some probability distributions.

Overview of some probability distributions. Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability

More information

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008 I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces

More information

Hypothesis testing. Null and alternative hypotheses

Hypothesis testing. Null and alternative hypotheses Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate

More information

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value

More information

Incremental calculation of weighted mean and variance

Incremental calculation of weighted mean and variance Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically

More information

5: Introduction to Estimation

5: Introduction to Estimation 5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample

More information

Chapter 7 Methods of Finding Estimators

Chapter 7 Methods of Finding Estimators Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of

More information

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5

0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5 Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.

More information

1. C. The formula for the confidence interval for a population mean is: x t, which was

1. C. The formula for the confidence interval for a population mean is: x t, which was s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value

More information

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Case Study. Normal and t Distributions. Density Plot. Normal Distributions Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca

More information

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.

More information

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits

More information

Confidence Intervals for One Mean

Confidence Intervals for One Mean Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a

More information

Maximum Likelihood Estimators.

Maximum Likelihood Estimators. Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio

More information

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample

More information

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here). BEGINNING ALGEBRA Roots ad Radicals (revised summer, 00 Olso) Packet to Supplemet the Curret Textbook - Part Review of Square Roots & Irratioals (This portio ca be ay time before Part ad should mostly

More information

Section 11.3: The Integral Test

Section 11.3: The Integral Test Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult

More information

Modified Line Search Method for Global Optimization

Modified Line Search Method for Global Optimization Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o

More information

Parameter estimation for nonlinear models: Numerical approaches to solving the inverse problem. Lecture 11 04/01/2008. Sven Zenker

Parameter estimation for nonlinear models: Numerical approaches to solving the inverse problem. Lecture 11 04/01/2008. Sven Zenker Parameter estimatio for oliear models: Numerical approaches to solvig the iverse problem Lecture 11 04/01/2008 Sve Zeker Review: Trasformatio of radom variables Cosider probability distributio of a radom

More information

Normal Distribution.

Normal Distribution. Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued

More information

Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy

More information

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas: Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries

More information

Sequences and Series

Sequences and Series CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their

More information

A probabilistic proof of a binomial identity

A probabilistic proof of a binomial identity A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two

More information

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may

More information

Measures of Spread and Boxplots Discrete Math, Section 9.4

Measures of Spread and Boxplots Discrete Math, Section 9.4 Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,

More information

Lesson 17 Pearson s Correlation Coefficient

Lesson 17 Pearson s Correlation Coefficient Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig

More information

Sampling Distribution And Central Limit Theorem

Sampling Distribution And Central Limit Theorem () Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,

More information

Lecture 4: Cheeger s Inequality

Lecture 4: Cheeger s Inequality Spectral Graph Theory ad Applicatios WS 0/0 Lecture 4: Cheeger s Iequality Lecturer: Thomas Sauerwald & He Su Statemet of Cheeger s Iequality I this lecture we assume for simplicity that G is a d-regular

More information

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1)

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1) BASIC STATISTICS. SAMPLES, RANDOM SAMPLING AND SAMPLE STATISTICS.. Radom Sample. The radom variables X,X 2,..., X are called a radom sample of size from the populatio f(x if X,X 2,..., X are mutually idepedet

More information

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations CS3A Hadout 3 Witer 00 February, 00 Solvig Recurrece Relatios Itroductio A wide variety of recurrece problems occur i models. Some of these recurrece relatios ca be solved usig iteratio or some other ad

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should

More information

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable Week 3 Coditioal probabilities, Bayes formula, WEEK 3 page 1 Expected value of a radom variable We recall our discussio of 5 card poker hads. Example 13 : a) What is the probability of evet A that a 5

More information

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number. GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all

More information

LECTURE 13: Cross-validation

LECTURE 13: Cross-validation LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M

More information

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL. Auities Uder Radom Rates of Iterest II By Abraham Zas Techio I.I.T. Haifa ISRAEL ad Haifa Uiversity Haifa ISRAEL Departmet of Mathematics, Techio - Israel Istitute of Techology, 3000, Haifa, Israel I memory

More information

CS103X: Discrete Structures Homework 4 Solutions

CS103X: Discrete Structures Homework 4 Solutions CS103X: Discrete Structures Homewor 4 Solutios Due February 22, 2008 Exercise 1 10 poits. Silico Valley questios: a How may possible six-figure salaries i whole dollar amouts are there that cotai at least

More information

Determining the sample size

Determining the sample size Determiig the sample size Oe of the most commo questios ay statisticia gets asked is How large a sample size do I eed? Researchers are ofte surprised to fid out that the aswer depeds o a umber of factors

More information

Lesson 15 ANOVA (analysis of variance)

Lesson 15 ANOVA (analysis of variance) Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about

More information

Math C067 Sampling Distributions

Math C067 Sampling Distributions Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters

More information

Convexity, Inequalities, and Norms

Convexity, Inequalities, and Norms Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means) CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:

More information

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio

More information

Department of Computer Science, University of Otago

Department of Computer Science, University of Otago Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly

More information

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find 1.8 Approximatig Area uder a curve with rectagles 1.6 To fid the area uder a curve we approximate the area usig rectagles ad the use limits to fid 1.4 the area. Example 1 Suppose we wat to estimate 1.

More information

1 Computing the Standard Deviation of Sample Means

1 Computing the Standard Deviation of Sample Means Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.

More information

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps Swaps: Costat maturity swaps (CMS) ad costat maturity reasury (CM) swaps A Costat Maturity Swap (CMS) swap is a swap where oe of the legs pays (respectively receives) a swap rate of a fixed maturity, while

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote

More information

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the

More information

Chapter 7: Confidence Interval and Sample Size

Chapter 7: Confidence Interval and Sample Size Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum

More information

An Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function

An Efficient Polynomial Approximation of the Normal Distribution Function & Its Inverse Function A Efficiet Polyomial Approximatio of the Normal Distributio Fuctio & Its Iverse Fuctio Wisto A. Richards, 1 Robi Atoie, * 1 Asho Sahai, ad 3 M. Raghuadh Acharya 1 Departmet of Mathematics & Computer Sciece;

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis Ruig Time ( 3.) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

Quadrat Sampling in Population Ecology

Quadrat Sampling in Population Ecology Quadrat Samplig i Populatio Ecology Backgroud Estimatig the abudace of orgaisms. Ecology is ofte referred to as the "study of distributio ad abudace". This beig true, we would ofte like to kow how may

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006 Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam

More information

Asymptotic Growth of Functions

Asymptotic Growth of Functions CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll

More information

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments

Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please

More information

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series 8 Fourier Series Our aim is to show that uder reasoable assumptios a give -periodic fuctio f ca be represeted as coverget series f(x) = a + (a cos x + b si x). (8.) By defiitio, the covergece of the series

More information

Theorems About Power Series

Theorems About Power Series Physics 6A Witer 20 Theorems About Power Series Cosider a power series, f(x) = a x, () where the a are real coefficiets ad x is a real variable. There exists a real o-egative umber R, called the radius

More information

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio

More information

Basic Elements of Arithmetic Sequences and Series

Basic Elements of Arithmetic Sequences and Series MA40S PRE-CALCULUS UNIT G GEOMETRIC SEQUENCES CLASS NOTES (COMPLETED NO NEED TO COPY NOTES FROM OVERHEAD) Basic Elemets of Arithmetic Sequeces ad Series Objective: To establish basic elemets of arithmetic

More information

The Stable Marriage Problem

The Stable Marriage Problem The Stable Marriage Problem William Hut Lae Departmet of Computer Sciece ad Electrical Egieerig, West Virgiia Uiversity, Morgatow, WV William.Hut@mail.wvu.edu 1 Itroductio Imagie you are a matchmaker,

More information

THE HEIGHT OF q-binary SEARCH TREES

THE HEIGHT OF q-binary SEARCH TREES THE HEIGHT OF q-binary SEARCH TREES MICHAEL DRMOTA AND HELMUT PRODINGER Abstract. q biary search trees are obtaied from words, equipped with the geometric distributio istead of permutatios. The average

More information

Chapter 5: Inner Product Spaces

Chapter 5: Inner Product Spaces Chapter 5: Ier Product Spaces Chapter 5: Ier Product Spaces SECION A Itroductio to Ier Product Spaces By the ed of this sectio you will be able to uderstad what is meat by a ier product space give examples

More information

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as: A Test of Normality Textbook Referece: Chapter. (eighth editio, pages 59 ; seveth editio, pages 6 6). The calculatio of p values for hypothesis testig typically is based o the assumptio that the populatio

More information

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design

A Combined Continuous/Binary Genetic Algorithm for Microstrip Antenna Design A Combied Cotiuous/Biary Geetic Algorithm for Microstrip Atea Desig Rady L. Haupt The Pesylvaia State Uiversity Applied Research Laboratory P. O. Box 30 State College, PA 16804-0030 haupt@ieee.org Abstract:

More information

Unbiased Estimation. Topic 14. 14.1 Introduction

Unbiased Estimation. Topic 14. 14.1 Introduction Topic 4 Ubiased Estimatio 4. Itroductio I creatig a parameter estimator, a fudametal questio is whether or ot the estimator differs from the parameter i a systematic maer. Let s examie this by lookig a

More information

Universal coding for classes of sources

Universal coding for classes of sources Coexios module: m46228 Uiversal codig for classes of sources Dever Greee This work is produced by The Coexios Project ad licesed uder the Creative Commos Attributio Licese We have discussed several parametric

More information

1 Correlation and Regression Analysis

1 Correlation and Regression Analysis 1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio

More information

Research Method (I) --Knowledge on Sampling (Simple Random Sampling)

Research Method (I) --Knowledge on Sampling (Simple Random Sampling) Research Method (I) --Kowledge o Samplig (Simple Radom Samplig) 1. Itroductio to samplig 1.1 Defiitio of samplig Samplig ca be defied as selectig part of the elemets i a populatio. It results i the fact

More information

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval

More information

Lecture 2: Karger s Min Cut Algorithm

Lecture 2: Karger s Min Cut Algorithm priceto uiv. F 3 cos 5: Advaced Algorithm Desig Lecture : Karger s Mi Cut Algorithm Lecturer: Sajeev Arora Scribe:Sajeev Today s topic is simple but gorgeous: Karger s mi cut algorithm ad its extesio.

More information

Ekkehart Schlicht: Economic Surplus and Derived Demand

Ekkehart Schlicht: Economic Surplus and Derived Demand Ekkehart Schlicht: Ecoomic Surplus ad Derived Demad Muich Discussio Paper No. 2006-17 Departmet of Ecoomics Uiversity of Muich Volkswirtschaftliche Fakultät Ludwig-Maximilias-Uiversität Müche Olie at http://epub.ub.ui-mueche.de/940/

More information

Soving Recurrence Relations

Soving Recurrence Relations Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree

More information

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of

More information

Confidence intervals and hypothesis tests

Confidence intervals and hypothesis tests Chapter 2 Cofidece itervals ad hypothesis tests This chapter focuses o how to draw coclusios about populatios from sample data. We ll start by lookig at biary data (e.g., pollig), ad lear how to estimate

More information

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER? JÖRG JAHNEL 1. My Motivatio Some Sort of a Itroductio Last term I tought Topological Groups at the Göttige Georg August Uiversity. This

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics

More information

arxiv:1506.03481v1 [stat.me] 10 Jun 2015

arxiv:1506.03481v1 [stat.me] 10 Jun 2015 BEHAVIOUR OF ABC FOR BIG DATA By Wetao Li ad Paul Fearhead Lacaster Uiversity arxiv:1506.03481v1 [stat.me] 10 Ju 2015 May statistical applicatios ivolve models that it is difficult to evaluate the likelihood,

More information

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the

More information

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval Chapter 8 Tests of Statistical Hypotheses 8. Tests about Proportios HT - Iferece o Proportio Parameter: Populatio Proportio p (or π) (Percetage of people has o health isurace) x Statistic: Sample Proportio

More information

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN Aalyzig Logitudial Data from Complex Surveys Usig SUDAAN Darryl Creel Statistics ad Epidemiology, RTI Iteratioal, 312 Trotter Farm Drive, Rockville, MD, 20850 Abstract SUDAAN: Software for the Statistical

More information

Hypergeometric Distributions

Hypergeometric Distributions 7.4 Hypergeometric Distributios Whe choosig the startig lie-up for a game, a coach obviously has to choose a differet player for each positio. Similarly, whe a uio elects delegates for a covetio or you

More information

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean 1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.

More information

How Euler Did It. In a more modern treatment, Hardy and Wright [H+W] state this same theorem as. n n+ is perfect.

How Euler Did It. In a more modern treatment, Hardy and Wright [H+W] state this same theorem as. n n+ is perfect. Amicable umbers November 005 How Euler Did It by Ed Sadifer Six is a special umber. It is divisible by, ad 3, ad, i what at first looks like a strage coicidece, 6 = + + 3. The umber 8 shares this remarkable

More information

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)

Lecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009) 18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the

More information

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return EVALUATING ALTERNATIVE CAPITAL INVESTMENT PROGRAMS By Ke D. Duft, Extesio Ecoomist I the March 98 issue of this publicatio we reviewed the procedure by which a capital ivestmet project was assessed. The

More information

5 Boolean Decision Trees (February 11)

5 Boolean Decision Trees (February 11) 5 Boolea Decisio Trees (February 11) 5.1 Graph Coectivity Suppose we are give a udirected graph G, represeted as a boolea adjacecy matrix = (a ij ), where a ij = 1 if ad oly if vertices i ad j are coected

More information

A Recursive Formula for Moments of a Binomial Distribution

A Recursive Formula for Moments of a Binomial Distribution A Recursive Formula for Momets of a Biomial Distributio Árpád Béyi beyi@mathumassedu, Uiversity of Massachusetts, Amherst, MA 01003 ad Saverio M Maago smmaago@psavymil Naval Postgraduate School, Moterey,

More information

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships Biology 171L Eviromet ad Ecology Lab Lab : Descriptive Statistics, Presetig Data ad Graphig Relatioships Itroductio Log lists of data are ofte ot very useful for idetifyig geeral treds i the data or the

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Cofidece Itervals are a extesio of the cocept of Margi of Error which we met earlier i this course. Remember we saw: The sample proportio will differ from the populatio proportio by more

More information

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book) MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:

More information

Class Meeting # 16: The Fourier Transform on R n

Class Meeting # 16: The Fourier Transform on R n MATH 18.152 COUSE NOTES - CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,

More information

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation

hp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics

More information

Plug-in martingales for testing exchangeability on-line

Plug-in martingales for testing exchangeability on-line Plug-i martigales for testig exchageability o-lie Valetia Fedorova, Alex Gammerma, Ilia Nouretdiov, ad Vladimir Vovk Computer Learig Research Cetre Royal Holloway, Uiversity of Lodo, UK {valetia,ilia,alex,vovk}@cs.rhul.ac.uk

More information