7. Sample Covariance and Correlation
|
|
- Doris Stewart
- 7 years ago
- Views:
Transcription
1 1 of 8 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > Sample Covariace ad Correlatio The Bivariate Model Suppose agai that we have a basic radom experimet, ad that X ad Y are real-valued radom variables for the experimet. Equivaletly, (X, Y) is a radom vector takig values i R 2. Please recall the basic properties of the meas, (X) ad (Y), the variaces, var(x) ad var(y) ad the covariace. I particular, recall that the correlatio is We will also eed a higher order bivariate momet. Let cor(x, Y) = sd(x) sd(y) d(x, Y) = (((X (X)) (Y (Y))) 2 ) Now suppose that we ru the basic experimet times. This creates a compoud experimet with a sequece of idepedet radom vectors ((X 1, Y 1 ), (X 2, Y 2 ),..., (X, Y )) each with the same distributio as (X, Y). I statistical terms, this is a radom sample of size from the distributio of (X, Y). As usual, we will let X = (X 1, X 2,..., X ) deote the sequece of first coordiates; this is a radom sample of size from the distributio of X. Similarly, we will let Y = (Y 1, Y 2,..., X ) deote the sequece of secod coordiates; this is a radom sample of size from the distributio of Y. Recall that the sample meas ad sample variaces for X are defied as follows (ad of course aalogous defiitios hold for Y):. M(X) = 1 i =1 X i, W 2 (X) = 1 i =1 (X i (X)) 2, S 2 (X) = 1 i =1 (X i M(X)) 2 I this sectio, we will defie ad study statistics that are atural estimators of the distributio covariace ad correlatio. These statistics will be measures of the liear relatioship of the sample poits i the plae. As usual, the defiitios deped o what other parameters are kow ad ukow. A Special Sample Covariace Suppose first that the distributio meas (X) ad (Y) are kow. This is usually a urealistic assumptio, of course, but is still a good place to start because the aalysis is very simple ad the results we obtai will be useful below. A atural estimator of i this case is
2 2 of 8 7/16/2009 6:06 AM W(X, Y) = 1 i =1 (X i (X)) (Y i (Y)) 1. Show that W(X, Y) is the sample mea for a radom sample of size from the distributio of (X (X)) (Y (Y)). 2. Use the result of Exercise 1 to show that (W(X, Y)) = var(w(x, Y)) = 1 (d(x, Y) cov 2 (X, Y)) W(X, Y) as with probability 1. I particular, W(X, Y) is a ubiased ad cosistet estimator of. Properties The formula i the followig exercise is sometimes better tha the defiitio for computatioal purposes. 3. With X Y defied to be the sequece (X 1 Y 1, X 2 Y 2,..., X Y ), show that W(X, Y) = M(X Y) M(X) (Y) M(Y) (X) + (X) (Y) The properties established i the followig exercises are aalogies of properties for the distributio covariace 4. Show that W(X, X) = W 2 (X) 5. Show that W(X, Y) = W(Y, X) 6. Show that if a is a costat the W(a X, Y) = a W(X, Y) 7. Show that W(X + Y, Z) = W(X, Z) + W(Y, Z) The followig exercise gives a formula for the sample variace of a sum. The result exteds aturally to larger sums. 8. Show that W 2 (X + Y) = W 2 (X) + W 2 (Y) + 2 W(X, Y) The Stadard Sample Covariace Cosider ow the more realistic assumptio that the distributio meas (X) ad (Y) are ukow. A atural approach i this case is to average (X i M(X)) (Y i M(Y)) over i {1, 2,..., }. But rather tha dividig by i our average, we should divide by whatever costat gives a ubiased estimator of. 9. Iterpret the sig of (X i M(X)) (Y i M(Y)) geometrically, i terms of the scatterplot of poits ad its ceter.
3 3 of 8 7/16/2009 6:06 AM Derivatio. 10. Use the biliearity of the covariace operator to show that cov(m(x), M(Y)) = 11. Expad ad sum term by term to show that i =1 (X i M(X)) (Y i M(Y)) = i =1 X i Y i M(X) M(Y) 12. Use the result of Exercises 10 ad 11, ad basic properties of expected value, to show that ( i =1 (X i M(X)) (Y i M(Y))) = ( 1) Therefore, to have a ubiased estimator of, we should defie the sample covariace to be the radom variable = 1 i =1 (X i M(X)) (Y i M(Y)) As with the sample variace, whe the sample size is large, it makes little differece whether we divide by or 1. Properties The formula i the followig exercise is sometimes better tha the defiitio for computatioal purposes. 13. With X Y defied as i Exercise 3, show that = 1 i =1 X i Y i M(X) M(Y) = (M(X Y) M(X) M(Y)) Use the result of the previous exercise ad the strog law of large umbers to show that as with probability 1. The properties established i the followig exercises are aalogies of properties for the distributio covariace 15. Show that S(X, X) = S 2 (X) 16. Show that = S(Y, X) 17. Show that if a is a costat the S(a X, Y) = a 18. Show that S(X + Y, Z) = S(X, Z) + S(Y, Z) 19. Show that
4 4 of 8 7/16/2009 6:06 AM = (W(X, Y) (M(X) (X)) (M(Y) (Y))) 1 The followig exercise gives a formula for the sample variace of a sum. The result exteds aturally to larger sums. 20. Show that S 2 (X + Y) = S 2 (X) + S 2 (Y) + 2 Variace I this subsectio we will derive the followig formuala for the variace of the sample covariace. The derivatio was cotributed by Rajith Uikrisha, ad is similar to the derivatio of the variace of the sample variace. var() = d(x, Y) + var(x) var(y) ( 1 1 cov2 (X, Y) ) 21. Verify the followig result. Hit: Start with the expressio o the right. Expad the product (X i X j) (Y i Y j), ad take the sums term by term. 1 = 2 ( 1) i =1 j =1 (X i X j) (Y i Y j) It follows that var() is the sum of all of the pairwise covariaces of the terms i the expasio of Exercise Now, derive the formula for var() by showig that cov((x i X j) (Y i Y j), (X k X l ) (Y k Y l )) = 0 if i = j or k = l or i, j, k, l are distict. cov((x i X j) (Y i Y j), (X i X j) (Y i Y j)) = 2 d(x, Y) + 2 var(x) var(y) if i j, ad there are 2 ( 1) such terms i the sum of covariaces. cov((x i X j) (Y i Y j), (X k X j) (Y k Y j)) = d(x, Y) cov 2 (X, Y) if i, j, k are distict, ad there are 4 ( 1) ( 2) such terms i the sum of covariaces. 23. Show that var() > var(w(x, Y)). Does this seem reasoable? 24. Show that var() 0 as. Thus, the sample covariace is a cosistet estimator of the distributio covariace. Sample Correlatio By aalogy with the distributio correlatio, the sample correlatio is obtaied by dividig the sample covariace by the product of the sample stadard deviatios:
5 5 of 8 7/16/2009 6:06 AM R(X, Y) = S(X) S(Y) 25. Use the strog law of large umbers to show that R(X, Y) cor(x, Y) as with probability Click i the iteractive scatterplot to defie 20 poits ad try to come as close as possible to the followig coditios: sample meas 0, sample stadard deviatios 1, sample correlatio as follows: 0, 0.5, 0.5, 0.7, 0.7, 0.9, Click i the iteractive scatterplot to defie 20 poits ad try to come as close as possible to the followig coditios: X sample mea 1, Y sample mea 3, Xsample stadard deviatio 2, Y sample stadard deviatio 1, sample correlatio as follows: 0, 0.5, 0.5, 0.7, 0.7, 0.9, 0.9. The Best Liear Predictor The Distributio Versio Recall that i the sectio o (distributio) correlatio ad regressio, we showed that the best liear predictor of Y based o X, i the sese of miimizig mea square error, is the radom variable L(Y X) = (Y) + (X (X)) var(x) Moreover, the (miimum) value of the mea square error is ((Y L(Y X)) 2 ) = var(y) (1 cor(x, Y) 2 ) The distributio regressio lie is give by y = L(Y X = x) = (Y) + (x (X)) var(x) The S ample Versio Of course, i real applicatios, we are ulikely to kow the distributio parameters (X), (Y), var(x), ad. Thus, i this sectio, we are iterested i the problem of estimatig the best liear predictor of Y based o X from our radom sample ((X 1, Y 1 ), (X 2, Y 2 ),..., (X, Y )). Oe atural approach is to fid the lie y = A x + B that fits the sample poits best. This is a basic ad importat problem i may areas of mathematics, ot just statistics. The term best meas that we wat to fid the lie (that is, fid A ad B) that miimizes the average of the squared errors betwee the actual y values i our data ad the predicted y values: MSE(A, B) = 1 i =1 (Y i (A X i + B)) 2 Fidig A ad B that miimize M SE is a stadard problem i calculus.
6 6 of 8 7/16/2009 6:06 AM 28. Show that MSE is miimized for A(X, Y) = S 2, B(X, Y) = M(Y) (X) S 2 M(X) (X) ad thus the sample regressio lie is y = M(Y) + S 2 (x M(X)) (X) 29. Show that the miimum mea square error, usig the coefficiets i the previous exercise, is MSE(A(X, Y), B(X, Y)) = S 2 (Y) (1 R 2 (X, Y)) 30. Use the result of the previous exercise to show that 1 R(X, Y) 1 R(X, Y) = 1 if ad oly if the sample poits lie o a lie with egative slope. R(X, Y) = 1 if ad oly if the sample poits lie o a lie with positive slope. Thus, the sample correlatio measures the degree of liearity of the sample poits. The results i the previous exercise ca also be obtaied by otig that the sample correlatio is simply the correlatio of the empirical distributio. Of course, properties (a), (b), ad (c) are kow for the distributio correlatio. The fact that the results i Exercise 28 ad Exercise 29 are the sample aalogies of the correspodig distributio results is beautiful ad reassurig. Note that the sample regressio lie passes through (M(X), M(Y)), the ceter of the empirical distributio. Naturally, the coefficiets of the sample regressio lie ca be viewed as estimators of the respective coefficiets i the distributio regressio lie. 31. Assumig that the appropriate higher order momets are fiite, use the law of large umbers to show that, with probability 1, the coefficiets of the sample regressio lie coverge to the coefficiets of the distributio regressio lie: S 2 as (X) var(x) M(Y) S 2 M(X) (Y) (X) as (X) var(x) As with the distributio regressio lies, the choice of predictor ad respose variables is importat. 32. Show that the sample regressio lie for Y based o X ad the sample regressio lie for X based o Y are ot the same lie, except i the trivial case where the sample poits all lie o a lie. Recall that the costat B that miimizes MSE(B) = 1 i =1 (Y i B) 2
7 7 of 8 7/16/2009 6:06 AM is the sample mea M(Y), ad the miimum value of the mea square error is the sample variace S 2 (Y). Thus, the differece betwee this value of the mea square error ad the oe i Exercise 29, amely S 2 (Y) R 2 (X, Y) is the reductio i the variability of the Y data whe the liear term i X is added to the predictor. The fractioal reductio is R 2 (X, Y), ad hece this statistics is called the (sample) coefficiet of determiatio. Exercises S imulatio Exercises 33. Click i the iteractive scatterplot, i various places, ad watch how the regressio lie chages. 34. Click i the iteractive scatterplot to defie 20 poits. Try to geerate a scatterplot i which the mea of the x values is 0, the stadard deviatio of the x values is 1, ad i which the regressio lie has slope 1, itercept 1 slope 3, itercept 0 slope 2, itercept Click i the iteractive scatterplot to defie 20 poits with the followig properties: the mea of the x values is 1, the mea of the y values is 1, ad the regressio lie has slope 1 ad itercept 2. If you had a difficult time with the previous exercise, it's because the coditios imposed are impossible to satisfy! 36. Ru the bivariate uiform experimet 2000 times, with a update frequecy of 10, i each of the followig cases. Note the apparet covergece of the sample meas to the distributio meas, the sample stadard deviatios to the distributio stadard deviatios, the sample correlatio to the distributio correlatio, ad the sample regressio lie to distributio regressio lie. The uiform distributio o the square The uiform distributio o the triagle. The uiform distributio o the circle. 37. Ru the bivariate ormal experimet 2000 times, with a update frequecy of 10, i each of the followig cases. Note the apparet covergece of the sample meas to the distributio meas, the sample stadard deviatios to the distributio stadard deviatios, the sample correlatio to the distributio correlatio, ad the sample regressio lie to the distributio regressio lie. sd(x) = 1, sd(y) = 2, cor(x, Y) = 0.5 sd(x) = 1.5, sd(y) = 0.5, cor(x, Y) = 0.7 Data Aalysis Exercises
8 8 of 8 7/16/2009 6:06 AM 38. Compute the correlatio betwee petal legth ad petal width for the followig cases i Fisher's iris dat Commet o the differeces. d. All cases Setosa oly Vergiica oly Versicolor oly 39. Compute the correlatio betwee each pair of color cout variables i the M&M data 40. Cosider all cases i Fisher's iris dat Compute the least squares regressio lie with petal legth as the predictor variable ad petal width as the respose variable. Draw the scatterplot ad the regressio lie together. Predict the petal width of a iris with petal legth Cosider the Setosa cases oly i Fisher's iris dat Compute the least squares regressio lie with sepal legth as the predictor variable ad sepal width as the ukow variable. Draw the scatterplot ad regressio lie together. Predict the sepal width of a iris with sepal legth 45. Virtual Laboratories > 6. Radom Samples > Cotets Applets Data Sets Biographies Exteral Resources Key words Feedback
1 Correlation and Regression Analysis
1 Correlatio ad Regressio Aalysis I this sectio we will be ivestigatig the relatioship betwee two cotiuous variable, such as height ad weight, the cocetratio of a ijected drug ad heart rate, or the cosumptio
More informationProperties of MLE: consistency, asymptotic normality. Fisher information.
Lecture 3 Properties of MLE: cosistecy, asymptotic ormality. Fisher iformatio. I this sectio we will try to uderstad why MLEs are good. Let us recall two facts from probability that we be used ofte throughout
More informationChapter 7 Methods of Finding Estimators
Chapter 7 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 011 Chapter 7 Methods of Fidig Estimators Sectio 7.1 Itroductio Defiitio 7.1.1 A poit estimator is ay fuctio W( X) W( X1, X,, X ) of
More informationTHE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n
We will cosider the liear regressio model i matrix form. For simple liear regressio, meaig oe predictor, the model is i = + x i + ε i for i =,,,, This model icludes the assumptio that the ε i s are a sample
More informationHypothesis testing. Null and alternative hypotheses
Hypothesis testig Aother importat use of samplig distributios is to test hypotheses about populatio parameters, e.g. mea, proportio, regressio coefficiets, etc. For example, it is possible to stipulate
More information0.7 0.6 0.2 0 0 96 96.5 97 97.5 98 98.5 99 99.5 100 100.5 96.5 97 97.5 98 98.5 99 99.5 100 100.5
Sectio 13 Kolmogorov-Smirov test. Suppose that we have a i.i.d. sample X 1,..., X with some ukow distributio P ad we would like to test the hypothesis that P is equal to a particular distributio P 0, i.e.
More informationI. Chi-squared Distributions
1 M 358K Supplemet to Chapter 23: CHI-SQUARED DISTRIBUTIONS, T-DISTRIBUTIONS, AND DEGREES OF FREEDOM To uderstad t-distributios, we first eed to look at aother family of distributios, the chi-squared distributios.
More informationChapter 6: Variance, the law of large numbers and the Monte-Carlo method
Chapter 6: Variace, the law of large umbers ad the Mote-Carlo method Expected value, variace, ad Chebyshev iequality. If X is a radom variable recall that the expected value of X, E[X] is the average value
More information1. C. The formula for the confidence interval for a population mean is: x t, which was
s 1. C. The formula for the cofidece iterval for a populatio mea is: x t, which was based o the sample Mea. So, x is guarateed to be i the iterval you form.. D. Use the rule : p-value
More informationIn nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008
I ite Sequeces Dr. Philippe B. Laval Keesaw State Uiversity October 9, 2008 Abstract This had out is a itroductio to i ite sequeces. mai de itios ad presets some elemetary results. It gives the I ite Sequeces
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio 05/0/07 Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should
More informationOverview of some probability distributions.
Lecture Overview of some probability distributios. I this lecture we will review several commo distributios that will be used ofte throughtout the class. Each distributio is usually described by its probability
More informationNon-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
No-life isurace mathematics Nils F. Haavardsso, Uiversity of Oslo ad DNB Skadeforsikrig Mai issues so far Why does isurace work? How is risk premium defied ad why is it importat? How ca claim frequecy
More informationBASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1)
BASIC STATISTICS. SAMPLES, RANDOM SAMPLING AND SAMPLE STATISTICS.. Radom Sample. The radom variables X,X 2,..., X are called a radom sample of size from the populatio f(x if X,X 2,..., X are mutually idepedet
More informationSampling Distribution And Central Limit Theorem
() Samplig Distributio & Cetral Limit Samplig Distributio Ad Cetral Limit Samplig distributio of the sample mea If we sample a umber of samples (say k samples where k is very large umber) each of size,
More informationSoving Recurrence Relations
Sovig Recurrece Relatios Part 1. Homogeeous liear 2d degree relatios with costat coefficiets. Cosider the recurrece relatio ( ) T () + at ( 1) + bt ( 2) = 0 This is called a homogeeous liear 2d degree
More informationMeasures of Spread and Boxplots Discrete Math, Section 9.4
Measures of Spread ad Boxplots Discrete Math, Sectio 9.4 We start with a example: Example 1: Comparig Mea ad Media Compute the mea ad media of each data set: S 1 = {4, 6, 8, 10, 1, 14, 16} S = {4, 7, 9,
More informationMaximum Likelihood Estimators.
Lecture 2 Maximum Likelihood Estimators. Matlab example. As a motivatio, let us look at oe Matlab example. Let us geerate a radom sample of size 00 from beta distributio Beta(5, 2). We will lear the defiitio
More informationSection 11.3: The Integral Test
Sectio.3: The Itegral Test Most of the series we have looked at have either diverged or have coverged ad we have bee able to fid what they coverge to. I geeral however, the problem is much more difficult
More informationConfidence Intervals for One Mean
Chapter 420 Cofidece Itervals for Oe Mea Itroductio This routie calculates the sample size ecessary to achieve a specified distace from the mea to the cofidece limit(s) at a stated cofidece level for a
More informationPSYCHOLOGICAL STATISTICS
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION B Sc. Cousellig Psychology (0 Adm.) IV SEMESTER COMPLEMENTARY COURSE PSYCHOLOGICAL STATISTICS QUESTION BANK. Iferetial statistics is the brach of statistics
More informationMath C067 Sampling Distributions
Math C067 Samplig Distributios Sample Mea ad Sample Proportio Richard Beigel Some time betwee April 16, 2007 ad April 16, 2007 Examples of Samplig A pollster may try to estimate the proportio of voters
More information1 Computing the Standard Deviation of Sample Means
Computig the Stadard Deviatio of Sample Meas Quality cotrol charts are based o sample meas ot o idividual values withi a sample. A sample is a group of items, which are cosidered all together for our aalysis.
More informationUniversity of California, Los Angeles Department of Statistics. Distributions related to the normal distribution
Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Istructor: Nicolas Christou Three importat distributios: Distributios related to the ormal distributio Chi-square (χ ) distributio.
More informationSequences and Series
CHAPTER 9 Sequeces ad Series 9.. Covergece: Defiitio ad Examples Sequeces The purpose of this chapter is to itroduce a particular way of geeratig algorithms for fidig the values of fuctios defied by their
More informationConfidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.
Cofidece Itervals A cofidece iterval is a iterval whose purpose is to estimate a parameter (a umber that could, i theory, be calculated from the populatio, if measuremets were available for the whole populatio).
More informationZ-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown
Z-TEST / Z-STATISTIC: used to test hypotheses about µ whe the populatio stadard deviatio is kow ad populatio distributio is ormal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses about
More informationLecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem
Lecture 4: Cauchy sequeces, Bolzao-Weierstrass, ad the Squeeze theorem The purpose of this lecture is more modest tha the previous oes. It is to state certai coditios uder which we are guarateed that limits
More informationNow here is the important step
LINEST i Excel The Excel spreadsheet fuctio "liest" is a complete liear least squares curve fittig routie that produces ucertaity estimates for the fit values. There are two ways to access the "liest"
More informationMEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)
MEI Mathematics i Educatio ad Idustry MEI Structured Mathematics Module Summary Sheets Statistics (Versio B: referece to ew book) Topic : The Poisso Distributio Topic : The Normal Distributio Topic 3:
More informationGCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.
GCSE STATISTICS You should kow: 1) How to draw a frequecy diagram: e.g. NUMBER TALLY FREQUENCY 1 3 5 ) How to draw a bar chart, a pictogram, ad a pie chart. 3) How to use averages: a) Mea - add up all
More information, a Wishart distribution with n -1 degrees of freedom and scale matrix.
UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that
More information5: Introduction to Estimation
5: Itroductio to Estimatio Cotets Acroyms ad symbols... 1 Statistical iferece... Estimatig µ with cofidece... 3 Samplig distributio of the mea... 3 Cofidece Iterval for μ whe σ is kow before had... 4 Sample
More informationIncremental calculation of weighted mean and variance
Icremetal calculatio of weighted mea ad variace Toy Fich faf@cam.ac.uk dot@dotat.at Uiversity of Cambridge Computig Service February 009 Abstract I these otes I eplai how to derive formulae for umerically
More informationS. Tanny MAT 344 Spring 1999. be the minimum number of moves required.
S. Tay MAT 344 Sprig 999 Recurrece Relatios Tower of Haoi Let T be the miimum umber of moves required. T 0 = 0, T = 7 Iitial Coditios * T = T + $ T is a sequece (f. o itegers). Solve for T? * is a recurrece,
More informationTHE TWO-VARIABLE LINEAR REGRESSION MODEL
THE TWO-VARIABLE LINEAR REGRESSION MODEL Herma J. Bieres Pesylvaia State Uiversity April 30, 202. Itroductio Suppose you are a ecoomics or busiess maor i a college close to the beach i the souther part
More informationConvexity, Inequalities, and Norms
Covexity, Iequalities, ad Norms Covex Fuctios You are probably familiar with the otio of cocavity of fuctios. Give a twicedifferetiable fuctio ϕ: R R, We say that ϕ is covex (or cocave up) if ϕ (x) 0 for
More informationSAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx
SAMPLE QUESTIONS FOR FINAL EXAM REAL ANALYSIS I FALL 006 3 4 Fid the followig usig the defiitio of the Riema itegral: a 0 x + dx 3 Cosider the partitio P x 0 3, x 3 +, x 3 +,......, x 3 3 + 3 of the iterval
More informationLesson 17 Pearson s Correlation Coefficient
Outlie Measures of Relatioships Pearso s Correlatio Coefficiet (r) -types of data -scatter plots -measure of directio -measure of stregth Computatio -covariatio of X ad Y -uique variatio i X ad Y -measurig
More informationLecture 13. Lecturer: Jonathan Kelner Scribe: Jonathan Pines (2009)
18.409 A Algorithmist s Toolkit October 27, 2009 Lecture 13 Lecturer: Joatha Keler Scribe: Joatha Pies (2009) 1 Outlie Last time, we proved the Bru-Mikowski iequality for boxes. Today we ll go over the
More informationCS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations
CS3A Hadout 3 Witer 00 February, 00 Solvig Recurrece Relatios Itroductio A wide variety of recurrece problems occur i models. Some of these recurrece relatios ca be solved usig iteratio or some other ad
More informationChapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:
Chapter 7 - Samplig Distributios 1 Itroductio What is statistics? It cosist of three major areas: Data Collectio: samplig plas ad experimetal desigs Descriptive Statistics: umerical ad graphical summaries
More informationChapter 14 Nonparametric Statistics
Chapter 14 Noparametric Statistics A.K.A. distributio-free statistics! Does ot deped o the populatio fittig ay particular type of distributio (e.g, ormal). Sice these methods make fewer assumptios, they
More informationThe following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles
The followig eample will help us uderstad The Samplig Distributio of the Mea Review: The populatio is the etire collectio of all idividuals or objects of iterest The sample is the portio of the populatio
More informationCase Study. Normal and t Distributions. Density Plot. Normal Distributions
Case Study Normal ad t Distributios Bret Halo ad Bret Larget Departmet of Statistics Uiversity of Wiscosi Madiso October 11 13, 2011 Case Study Body temperature varies withi idividuals over time (it ca
More informationLECTURE 13: Cross-validation
LECTURE 3: Cross-validatio Resampli methods Cross Validatio Bootstrap Bias ad variace estimatio with the Bootstrap Three-way data partitioi Itroductio to Patter Aalysis Ricardo Gutierrez-Osua Texas A&M
More informationCHAPTER 7: Central Limit Theorem: CLT for Averages (Means)
CHAPTER 7: Cetral Limit Theorem: CLT for Averages (Meas) X = the umber obtaied whe rollig oe six sided die oce. If we roll a six sided die oce, the mea of the probability distributio is X P(X = x) Simulatio:
More informationNormal Distribution.
Normal Distributio www.icrf.l Normal distributio I probability theory, the ormal or Gaussia distributio, is a cotiuous probability distributio that is ofte used as a first approimatio to describe realvalued
More informationTrigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE 6.44. The absolute value of the complex number z a bi is
0_0605.qxd /5/05 0:45 AM Page 470 470 Chapter 6 Additioal Topics i Trigoometry 6.5 Trigoometric Form of a Complex Number What you should lear Plot complex umbers i the complex plae ad fid absolute values
More informationTheorems About Power Series
Physics 6A Witer 20 Theorems About Power Series Cosider a power series, f(x) = a x, () where the a are real coefficiets ad x is a real variable. There exists a real o-egative umber R, called the radius
More informationBINOMIAL EXPANSIONS 12.5. In this section. Some Examples. Obtaining the Coefficients
652 (12-26) Chapter 12 Sequeces ad Series 12.5 BINOMIAL EXPANSIONS I this sectio Some Examples Otaiig the Coefficiets The Biomial Theorem I Chapter 5 you leared how to square a iomial. I this sectio you
More informationChapter 7: Confidence Interval and Sample Size
Chapter 7: Cofidece Iterval ad Sample Size Learig Objectives Upo successful completio of Chapter 7, you will be able to: Fid the cofidece iterval for the mea, proportio, ad variace. Determie the miimum
More informationLecture 4: Cheeger s Inequality
Spectral Graph Theory ad Applicatios WS 0/0 Lecture 4: Cheeger s Iequality Lecturer: Thomas Sauerwald & He Su Statemet of Cheeger s Iequality I this lecture we assume for simplicity that G is a d-regular
More informationFIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix
FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. Powers of a matrix We begi with a propositio which illustrates the usefuless of the diagoalizatio. Recall that a square matrix A is diogaalizable if
More informationApproximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find
1.8 Approximatig Area uder a curve with rectagles 1.6 To fid the area uder a curve we approximate the area usig rectagles ad the use limits to fid 1.4 the area. Example 1 Suppose we wat to estimate 1.
More informationhp calculators HP 12C Statistics - average and standard deviation Average and standard deviation concepts HP12C average and standard deviation
HP 1C Statistics - average ad stadard deviatio Average ad stadard deviatio cocepts HP1C average ad stadard deviatio Practice calculatig averages ad stadard deviatios with oe or two variables HP 1C Statistics
More informationUniversal coding for classes of sources
Coexios module: m46228 Uiversal codig for classes of sources Dever Greee This work is produced by The Coexios Project ad licesed uder the Creative Commos Attributio Licese We have discussed several parametric
More informationOverview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals
Overview Estimatig the Value of a Parameter Usig Cofidece Itervals We apply the results about the sample mea the problem of estimatio Estimatio is the process of usig sample data estimate the value of
More informationA Recursive Formula for Moments of a Binomial Distribution
A Recursive Formula for Momets of a Biomial Distributio Árpád Béyi beyi@mathumassedu, Uiversity of Massachusetts, Amherst, MA 01003 ad Saverio M Maago smmaago@psavymil Naval Postgraduate School, Moterey,
More informationLecture 5: Span, linear independence, bases, and dimension
Lecture 5: Spa, liear idepedece, bases, ad dimesio Travis Schedler Thurs, Sep 23, 2010 (versio: 9/21 9:55 PM) 1 Motivatio Motivatio To uderstad what it meas that R has dimesio oe, R 2 dimesio 2, etc.;
More informationCS103X: Discrete Structures Homework 4 Solutions
CS103X: Discrete Structures Homewor 4 Solutios Due February 22, 2008 Exercise 1 10 poits. Silico Valley questios: a How may possible six-figure salaries i whole dollar amouts are there that cotai at least
More informationLesson 15 ANOVA (analysis of variance)
Outlie Variability -betwee group variability -withi group variability -total variability -F-ratio Computatio -sums of squares (betwee/withi/total -degrees of freedom (betwee/withi/total -mea square (betwee/withi
More informationOur aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series
8 Fourier Series Our aim is to show that uder reasoable assumptios a give -periodic fuctio f ca be represeted as coverget series f(x) = a + (a cos x + b si x). (8.) By defiitio, the covergece of the series
More informationwhere: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return
EVALUATING ALTERNATIVE CAPITAL INVESTMENT PROGRAMS By Ke D. Duft, Extesio Ecoomist I the March 98 issue of this publicatio we reviewed the procedure by which a capital ivestmet project was assessed. The
More informationInfinite Sequences and Series
CHAPTER 4 Ifiite Sequeces ad Series 4.1. Sequeces A sequece is a ifiite ordered list of umbers, for example the sequece of odd positive itegers: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29...
More informationA probabilistic proof of a binomial identity
A probabilistic proof of a biomial idetity Joatho Peterso Abstract We give a elemetary probabilistic proof of a biomial idetity. The proof is obtaied by computig the probability of a certai evet i two
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationUnbiased Estimation. Topic 14. 14.1 Introduction
Topic 4 Ubiased Estimatio 4. Itroductio I creatig a parameter estimator, a fudametal questio is whether or ot the estimator differs from the parameter i a systematic maer. Let s examie this by lookig a
More informationDepartment of Computer Science, University of Otago
Departmet of Computer Sciece, Uiversity of Otago Techical Report OUCS-2006-09 Permutatios Cotaiig May Patters Authors: M.H. Albert Departmet of Computer Sciece, Uiversity of Otago Micah Colema, Rya Fly
More informationA Mathematical Perspective on Gambling
A Mathematical Perspective o Gamblig Molly Maxwell Abstract. This paper presets some basic topics i probability ad statistics, icludig sample spaces, probabilistic evets, expectatios, the biomial ad ormal
More informationAnnuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.
Auities Uder Radom Rates of Iterest II By Abraham Zas Techio I.I.T. Haifa ISRAEL ad Haifa Uiversity Haifa ISRAEL Departmet of Mathematics, Techio - Israel Istitute of Techology, 3000, Haifa, Israel I memory
More informationPresent Values, Investment Returns and Discount Rates
Preset Values, Ivestmet Returs ad Discout Rates Dimitry Midli, ASA, MAAA, PhD Presidet CDI Advisors LLC dmidli@cdiadvisors.com May 2, 203 Copyright 20, CDI Advisors LLC The cocept of preset value lies
More informationDescriptive Statistics
Descriptive Statistics We leared to describe data sets graphically. We ca also describe a data set umerically. Measures of Locatio Defiitio The sample mea is the arithmetic average of values. We deote
More informationMulti-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu
Multi-server Optimal Badwidth Moitorig for QoS based Multimedia Delivery Aup Basu, Iree Cheg ad Yizhe Yu Departmet of Computig Sciece U. of Alberta Architecture Applicatio Layer Request receptio -coectio
More information1 The Gaussian channel
ECE 77 Lecture 0 The Gaussia chael Objective: I this lecture we will lear about commuicatio over a chael of practical iterest, i which the trasmitted sigal is subjected to additive white Gaussia oise.
More informationDefinition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean
1 Social Studies 201 October 13, 2004 Note: The examples i these otes may be differet tha used i class. However, the examples are similar ad the methods used are idetical to what was preseted i class.
More informationDiscrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13
EECS 70 Discrete Mathematics ad Probability Theory Sprig 2014 Aat Sahai Note 13 Itroductio At this poit, we have see eough examples that it is worth just takig stock of our model of probability ad may
More informationAsymptotic Growth of Functions
CMPS Itroductio to Aalysis of Algorithms Fall 3 Asymptotic Growth of Fuctios We itroduce several types of asymptotic otatio which are used to compare the performace ad efficiecy of algorithms As we ll
More informationParametric (theoretical) probability distributions. (Wilks, Ch. 4) Discrete distributions: (e.g., yes/no; above normal, normal, below normal)
6 Parametric (theoretical) probability distributios. (Wilks, Ch. 4) Note: parametric: assume a theoretical distributio (e.g., Gauss) No-parametric: o assumptio made about the distributio Advatages of assumig
More informationFinding the circle that best fits a set of points
Fidig the circle that best fits a set of poits L. MAISONOBE October 5 th 007 Cotets 1 Itroductio Solvig the problem.1 Priciples............................... Iitializatio.............................
More informationSECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES
SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,
More informationUC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006
Exam format UC Bereley Departmet of Electrical Egieerig ad Computer Sciece EE 6: Probablity ad Radom Processes Solutios 9 Sprig 006 The secod midterm will be held o Wedesday May 7; CHECK the fial exam
More informationClass Meeting # 16: The Fourier Transform on R n
MATH 18.152 COUSE NOTES - CLASS MEETING # 16 18.152 Itroductio to PDEs, Fall 2011 Professor: Jared Speck Class Meetig # 16: The Fourier Trasform o 1. Itroductio to the Fourier Trasform Earlier i the course,
More informationModified Line Search Method for Global Optimization
Modified Lie Search Method for Global Optimizatio Cria Grosa ad Ajith Abraham Ceter of Excellece for Quatifiable Quality of Service Norwegia Uiversity of Sciece ad Techology Trodheim, Norway {cria, ajith}@q2s.tu.o
More informationSolutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork
Solutios to Selected Problems I: Patter Classificatio by Duda, Hart, Stork Joh L. Weatherwax February 4, 008 Problem Solutios Chapter Bayesia Decisio Theory Problem radomized rules Part a: Let Rx be the
More informationarxiv:0908.3095v1 [math.st] 21 Aug 2009
The Aals of Statistics 2009, Vol. 37, No. 5A, 2202 2244 DOI: 10.1214/08-AOS640 c Istitute of Mathematical Statistics, 2009 arxiv:0908.3095v1 [math.st] 21 Aug 2009 ESTIMATING THE DEGREE OF ACTIVITY OF JUMPS
More informationSEQUENCES AND SERIES
Chapter 9 SEQUENCES AND SERIES Natural umbers are the product of huma spirit. DEDEKIND 9.1 Itroductio I mathematics, the word, sequece is used i much the same way as it is i ordiary Eglish. Whe we say
More informationAMS 2000 subject classification. Primary 62G08, 62G20; secondary 62G99
VARIABLE SELECTION IN NONPARAMETRIC ADDITIVE MODELS Jia Huag 1, Joel L. Horowitz 2 ad Fegrog Wei 3 1 Uiversity of Iowa, 2 Northwester Uiversity ad 3 Uiversity of West Georgia Abstract We cosider a oparametric
More informationPresent Value Factor To bring one dollar in the future back to present, one uses the Present Value Factor (PVF): Concept 9: Present Value
Cocept 9: Preset Value Is the value of a dollar received today the same as received a year from today? A dollar today is worth more tha a dollar tomorrow because of iflatio, opportuity cost, ad risk Brigig
More informationTHE ABRACADABRA PROBLEM
THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected
More informationCOMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S 2 CONTROL CHART FOR THE CHANGES IN A PROCESS
COMPARISON OF THE EFFICIENCY OF S-CONTROL CHART AND EWMA-S CONTROL CHART FOR THE CHANGES IN A PROCESS Supraee Lisawadi Departmet of Mathematics ad Statistics, Faculty of Sciece ad Techoology, Thammasat
More informationMathematical goals. Starting points. Materials required. Time needed
Level A1 of challege: C A1 Mathematical goals Startig poits Materials required Time eeded Iterpretig algebraic expressios To help learers to: traslate betwee words, symbols, tables, ad area represetatios
More informationCONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION
www.arpapress.com/volumes/vol8issue2/ijrras_8_2_04.pdf CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION Elsayed A. E. Habib Departmet of Statistics ad Mathematics, Faculty of Commerce, Beha
More informationBasic Elements of Arithmetic Sequences and Series
MA40S PRE-CALCULUS UNIT G GEOMETRIC SEQUENCES CLASS NOTES (COMPLETED NO NEED TO COPY NOTES FROM OVERHEAD) Basic Elemets of Arithmetic Sequeces ad Series Objective: To establish basic elemets of arithmetic
More informationHere are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.
This documet was writte ad copyrighted by Paul Dawkis. Use of this documet ad its olie versio is govered by the Terms ad Coditios of Use located at http://tutorial.math.lamar.edu/terms.asp. The olie versio
More informationarxiv:1506.03481v1 [stat.me] 10 Jun 2015
BEHAVIOUR OF ABC FOR BIG DATA By Wetao Li ad Paul Fearhead Lacaster Uiversity arxiv:1506.03481v1 [stat.me] 10 Ju 2015 May statistical applicatios ivolve models that it is difficult to evaluate the likelihood,
More informationCenter, Spread, and Shape in Inference: Claims, Caveats, and Insights
Ceter, Spread, ad Shape i Iferece: Claims, Caveats, ad Isights Dr. Nacy Pfeig (Uiversity of Pittsburgh) AMATYC November 2008 Prelimiary Activities 1. I would like to produce a iterval estimate for the
More informationSTATISTICAL METHODS FOR BUSINESS
STATISTICAL METHODS FOR BUSINESS UNIT 7: INFERENTIAL TOOLS. DISTRIBUTIONS ASSOCIATED WITH SAMPLING 7.1.- Distributios associated with the samplig process. 7.2.- Iferetial processes ad relevat distributios.
More informationThis document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.
SPC Formulas ad Tables 1 This documet cotais a collectio of formulas ad costats useful for SPC chart costructio. It assumes you are already familiar with SPC. Termiology Geerally, a bar draw over a symbol
More informationTO: Users of the ACTEX Review Seminar on DVD for SOA Exam MLC
TO: Users of the ACTEX Review Semiar o DVD for SOA Eam MLC FROM: Richard L. (Dick) Lodo, FSA Dear Studets, Thak you for purchasig the DVD recordig of the ACTEX Review Semiar for SOA Eam M, Life Cotigecies
More information