, a Wishart distribution with n -1 degrees of freedom and scale matrix.

Similar documents
n. We know that the sum of squares of p independent standard normal variables has a chi square distribution with p degrees of freedom.

I. Chi-squared Distributions

Output Analysis (2, Chapters 10 &11 Law)

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

University of California, Los Angeles Department of Statistics. Distributions related to the normal distribution

1 Computing the Standard Deviation of Sample Means

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Hypothesis testing. Null and alternative hypotheses


Inference on Proportion. Chapter 8 Tests of Statistical Hypotheses. Sampling Distribution of Sample Proportion. Confidence Interval

PSYCHOLOGICAL STATISTICS

One-sample test of proportions

Maximum Likelihood Estimators.

Confidence Intervals for One Mean

Lesson 15 ANOVA (analysis of variance)

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Properties of MLE: consistency, asymptotic normality. Fisher information.

A Test of Normality. 1 n S 2 3. n 1. Now introduce two new statistics. The sample skewness is defined as:

Measures of Spread and Boxplots Discrete Math, Section 9.4

Practice Problems for Test 3

Lesson 17 Pearson s Correlation Coefficient

1. C. The formula for the confidence interval for a population mean is: x t, which was

Overview of some probability distributions.

Math C067 Sampling Distributions

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

1 Correlation and Regression Analysis

Chapter 7 Methods of Finding Estimators

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

Case Study. Normal and t Distributions. Density Plot. Normal Distributions

Normal Distribution.

Confidence Intervals

A probabilistic proof of a binomial identity

Mann-Whitney U 2 Sample Test (a.k.a. Wilcoxon Rank Sum Test)

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 8

Confidence Intervals. CI for a population mean (σ is known and n > 30 or the variable is normally distributed in the.

Section 11.3: The Integral Test

Statistical inference: example 1. Inferential Statistics

Now here is the important step

MEI Structured Mathematics. Module Summary Sheets. Statistics 2 (Version B: reference to new book)

Soving Recurrence Relations

Solutions to Selected Problems In: Pattern Classification by Duda, Hart, Stork

Center, Spread, and Shape in Inference: Claims, Caveats, and Insights

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

Lecture 4: Cheeger s Inequality

5: Introduction to Estimation

Department of Computer Science, University of Otago

Chapter 7: Confidence Interval and Sample Size

Example: Probability ($1 million in S&P 500 Index will decline by more than 20% within a

Sampling Distribution And Central Limit Theorem

Chapter 14 Nonparametric Statistics

Analyzing Longitudinal Data from Complex Surveys Using SUDAAN

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

Determining the sample size

Exam 3. Instructor: Cynthia Rudin TA: Dimitrios Bisias. November 22, 2011

A Mathematical Perspective on Gambling

CHAPTER 11 Financial mathematics

Unit 8: Inference for Proportions. Chapters 8 & 9 in IPS

BASIC STATISTICS. f(x 1,x 2,..., x n )=f(x 1 )f(x 2 ) f(x n )= f(x i ) (1)

3. Greatest Common Divisor - Least Common Multiple

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

Modified Line Search Method for Global Optimization

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series

Convexity, Inequalities, and Norms

NATIONAL SENIOR CERTIFICATE GRADE 12

Heat (or Diffusion) equation in 1D*

Quadrat Sampling in Population Ecology

CHAPTER 3 DIGITAL CODING OF SIGNALS

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics

Confidence intervals and hypothesis tests

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

Incremental calculation of weighted mean and variance

Parametric (theoretical) probability distributions. (Wilks, Ch. 4) Discrete distributions: (e.g., yes/no; above normal, normal, below normal)

Hypergeometric Distributions

THE TWO-VARIABLE LINEAR REGRESSION MODEL

1. MATHEMATICAL INDUCTION

Function factorization using warped Gaussian processes

Multi-server Optimal Bandwidth Monitoring for QoS based Multimedia Delivery Anup Basu, Irene Cheng and Yinzhe Yu

THE HEIGHT OF q-binary SEARCH TREES

Overview. Learning Objectives. Point Estimate. Estimation. Estimating the Value of a Parameter Using Confidence Intervals

Chapter 5: Inner Product Spaces

1 The Gaussian channel

Biology 171L Environment and Ecology Lab Lab 2: Descriptive Statistics, Presenting Data and Graphing Relationships

Estimating Probability Distributions by Observing Betting Practices

This document contains a collection of formulas and constants useful for SPC chart construction. It assumes you are already familiar with SPC.

CONTROL CHART BASED ON A MULTIPLICATIVE-BINOMIAL DISTRIBUTION

Overview on S-Box Design Principles

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

A Recursive Formula for Moments of a Binomial Distribution

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Transcription:

UMEÅ UNIVERSITET Matematisk-statistiska istitutioe Multivariat dataaalys D MSTD79 PA TENTAMEN 004-0-9 LÖSNINGSFÖRSLAG TILL TENTAMEN I MATEMATISK STATISTIK Multivariat dataaalys D, 5 poäg.. Assume that x, x,..., x are iid observatios from a p-dimesioal ormal distributio with mea µ ad variace matrix. Let x ad S deote the sample mea ad sample variace matrix respectively. Derive the distributios of the followig statistics: a) x x is a liear combiatio of p-variate ormal distributed radom variables, ad has hece itself a p-variate ormal distributio. E( x ) = E( x i ) = µ = µ. V( x ) = V ( xi ) V ( xi ) Σ Σ = = =. b) Ax + x, where A is a fixed p p-matrix Ax + x is a liear combiatio of p-variate ormal distributed radom variables, ad has hece itself a p- variate ormal distributio. E Ax + x ) = Aµ + µ = ( A I) µ. V Ax + x ) = AΣΣ + Σ. ( + ( c) ( x - µ) -- ( x - µ) x N( µ, Σ). Suppose rak( ) = p, the there exists a matrix B such that BB = Σ ad u = B ( x µ ) N(0, I). We kow that the sum of squares of p idepedet stadard ormal variables has a chi square distributio with p degrees of freedom. Hece, u u = ( x µ ) ( Σ) ( x µ ) = ( x µ ) Σ ( x µ ) χ ( p). Describe the distributios (without proof) of the followig statistics: d) (-)S ( ) S W p (, Σ), a Wishart distributio with - degrees of freedom ad scale matrix. e) ( x - µ) S -- ( x - µ) ( x - µ) S -- ( x - µ) T ( p), Hotelligs T -distributio.

. Assume we are classifyig objects ito two groups, based o bivariate observatios. The observatio (x, y) has the desity fuctio f (x, y) if it comes from group, ad desity fuctio f (x, y) if it comes from group. f (x, y) is a uiform desity o all (x, y) with 0 x 3 ad 0 y 3, ad zero otherwise. f (x, y) is a bivariate ormal desity with expectatio (, ) ad the idetity matrix as covariace matrix. a) Assume you have a ew object with observatio (/, /). Classify it ito either group or group, assumig equal misclassificatio costs ad equal priors. (( x ) + ( y ) ) f ( x, y) = / 9;0 x, y 3. f ( x, y) = e, < x, y <. π Choose the distributio with highest desity i (½, ½). f (½, ½) = /9 0,. f (½, ½) 0,395 The observatio will be allocated to populatio. b) Assumig ow the prior probability for beig i group is twice the prior probability for beig i group. Use this whe classifyig a object with observatio (, ). p = p p = /3, p = /3. Choose the distributio with the highest value o p i f i (, ). p f (, ) = (/3)(/9) 0,07407. p f (, ) = (/3)f (, ) 0,05305. The observatio will be allocated to populatio. 3. a) What is the idea behid pricipal compoet aalysis? Explai how the sample pricipal compoets ca be derived. The mai idea is to get a better uderstadig of the data. This is achieved by computig ew ucorrelated variables (compoets). The coefficiets (of legth oe) i the liear combiatios are chose so that the first compoet has maximum variace. The secod compoet should have maximum variace give that it is ucorrelated with the first ad so o. Hopefully a small umber of compoets cotais most of the total variace, so that the data ca be iterpreted i a (much) smaller dimesio tha the origial oe (p). The coefficiets for the sample compoets are give by the eigevectors of the sample covariace or correlatio matrices. The coefficiets for the first compoet is foud i the eigevector correspodig to the largest eigevalue of S (or R) ad so o. b) Write dow ad explai the basic model used i factor aalysis. I factor aalysis we assume that there is a umber of u observable factors, that liearly affect the observed variables. We assume that the variace matrix of the observed variables ca be writte as Σ = LL + Ψ, where L cotais the factor loadigs, that is L tells how the uderlyig uobservable factors affect the observed variables. Ψ cotais the variace i the observed variable that is ot explaied by the factors. c) Compare pricipal compoet aalysis ad factor aalysis. What similarities are there ad which are the major differeces? Both methods try to explai a data set i a lower dimesio tha the origial. Oe way of computig factor loadigs is to use the pricipal compoet method, i. e. usig eigevalues ad eigevectors of the covariace matrix as i PCA. The major differeces are: PCA is model free, it is just a rotatio of the data. I FA we assume a well defied model. If the model is ot valid the results of the aalysis might be spurious. The directio of the trasformatio differs. I PCA we trasform the origial variables ito ew pricipal compoets. I FA we assume that uobservable factors are trasformed ito our measurable variables.

4. A food scietist was studyig the effectiveess of phosphate salts i cojuctio with vacuum packagig o the preservatio of precooked groud turkey meat durig log storage periods. I particular, the researcher wated to compare five phosphate salt treatmets to determie the particular salt treatmet that would be most effective. The five phosphate salt treatmets cosisted of () a cotrol (o phosphate salt), () sodium tripolyphosphate (STP) at a 0.3% level, (3) STP at a 0.5% level, (4) sodium ascorbate moophosphate (SAsMP) at a 0.3% level, ad (5) SAsMP at a 0.5% level. Samples of cooked groud turkey meat were vacuum packaged usig oe of the five salt treatmets, froze ad stored for 50 days at approximately -4 C. Oe complete replicatio of each of the five salt treatmets was started o each of five differet days, creatig five blocked replicates for each set of treatmets. Some of the variables measured icluded cookig loss (CKG_LOSS), ph (PH), moistess after cookig (MOIST), fat cotet (FAT), hexaal cotet (HEX), bathopheathrolie-chelateable iro cotets (NONHEM), ad the cookig time it took to reach a optimal cookig temperature (CKG_TIME). A aalysis usig MINITAB gave the followig pritout. Factor Type Levels Values TREATMENT fixed 5 ; ; 3; 4; 5 BLOCK fixed 5 ; ; 3; 4; 5 Aalysis of Variace for CKG_LOSS TREATMENT 4 06,74 6,68 3,08 0,000 BLOCK 4 57,56 4,379 7,05 0,00 Error 6 3,640,040 Total 4 96,880 S =,489 R-Sq = 83,4% R-Sq(adj) = 75,3% Aalysis of Variace for PH TREATMENT 4 0,378 0,03445 0,70 0,605 BLOCK 4 0,3306 0,0876,67 0,05 Error 6 0,7906 0,0494 Total 4,5950 S = 0,93 R-Sq = 37,3% R-Sq(adj) = 5,84% Aalysis of Variace for MOIST TREATMENT 4 34,85 8,73 5,58 0,005 BLOCK 4 7,,805,6 0,366 Error 6 4,968,56 Total 4 67,040 S =,490 R-Sq = 6,76% R-Sq(adj) = 44,3% Aalysis of Variace for FAT TREATMENT 4,8494 0,464,60 0, BLOCK 4,6673 0,6668,3 0,0 Error 6 4,695 0,887 Total 4 9,36 S = 0,53738 R-Sq = 49,44% R-Sq(adj) = 4,6%

Aalysis of Variace for HEX TREATMENT 4,4876 0,69 7,00 0,000 BLOCK 4,3846 0,3456 9,45 0,000 Error 6 0,585 0,03658 Total 4 4,45530 S = 0,949 R-Sq = 86,86% R-Sq(adj) = 80,30% Aalysis of Variace for NONHEM TREATMENT 4 47,6344,9086 4,79 0,000 BLOCK 4 3,6384 3,4096 4,4 0,06 Error 6,886 0,805 Total 4 74,544 S = 0,89774 R-Sq = 8,63% R-Sq(adj) = 73,94% Aalysis of Variace for CKG_TIME TREATMENT 4 7,040,760 0,6 0,654 BLOCK 4 48,40,060 4,5 0,06 Error 6 45,360,835 Total 4 00,640 S =,68375 R-Sq = 54,93% R-Sq(adj) = 3,39% MANOVA for TREATMENT s = 4 m =,0 = 4,0 Test DF Criterio Statistic Approx F Num Deom P Wilks' 0,0487,96 8 37 0,00 Lawley-Hotellig 9,6083 5,95 8 34 0,000 Pillai's,83896,580 8 5 0,076 Roy's 7,5600 MANOVA for BLOCK s = 4 m =,0 = 4,0 Test DF Criterio Statistic Approx F Num Deom P Wilks' 0,004 3,444 8 37 0,000 Lawley-Hotellig 3,00 3,983 8 34 0,000 Pillai's,35756,666 8 5 0,00 Roy's 9,04354 a) What model has bee used? What assumptios have to be fulfilled for the model to be valid? A MANOVA-model: y = µ + α + β + ε, where α i measures the effect of treatmet i ad β j the effect of ij i j ij block j. The radom errors ε ij are assumed to be idepedet ad ormally distributed with mea vector zero ad idetical variace matrix. Sice we have oly oe replicate of each treatmet-block combiatio, we have to assume that there are o iteractios betwee treatmets ad blocks. b) What coclusios ca be draw from the output? From the MANOVA-part we see that whe usig Wilks Λ as test variable, we ca reject the hypotheses about o treatmet effect (p =0,00) as well as about o block effect (p). Whe lookig at the idividual ANOVA s

we see that there are sigificat treatmet effects for the variables Cookig loss, Moisture, Hexaal cotet ad bathopheathrolie-chelateable iro cotets. c) What further aalyses could be cosidered? We would be iterested i fidig what treatmets affect what variables. This could be foud by computig simultaeous cofidece itervals for the differeces betwee the meas of the differet treatmets. Aother approach is to compute caocical variates ad cofidece itervals for their meas.