Inference for Multivariate Means


 Shanon Geoffrey Harrington
 1 years ago
 Views:
Transcription
1 Inference for Multivariate Means Statistics 407, ISU Inference for the Population Mean Inference for the Population Mean Inference for the Population Mean his section focuses on the question: This section focuses on the question: Inference This section focuses for on the the question: Population Mean Whether a predetermined mean is a plausible value for the normal Whether population a predetermined mean, iven mean theisissample a plausible thatvalue we for for This section focuses on the question: have collected. the normal population mean, iven the sample that we we have collected. Whether a predetermined mean is a value for the normal population hypothesis mean, testin iven the sample lanuae, that we this have means collected. we want to test In hypothesis testin lanuae, this means we want to totest In hypothesis testin lanuae, this means we want to test H o : µ = µ H o : 0 vs H µ = µ 0 vs A : µ = µ H A : : 0 µ = = µ 00 here µ where where =(µ,µ µ =(µ =(µ 2,...,µ is true mean is the,µ,µ p 2 ),...,µ is the p ) true is is thepopulation the true true population mean population mean and mean and µ and 0 = µ µ 0 = hypothesized mean. 0 = µ 0,µ 02 (µ,...,µ (µ 0,µ 0,µ 02 0p,...,µ ) is the 02,...,µ 0p ) hypothesized is hypothesized mean. mean. 0p is the hypothesized mean. 2
2 Univariate Testin of a Mean Univariate of Recall, in the univariate situation, when we want to test Recall, in the univariate situation, when we want to test H o : µ = µ 0 vs H A : µ = µ 0 we we would calculate calculate the tstatistic, the tstatistic, as follows: as follows: Then we would compare t t with = X µ 0 s/ values n. from a tdistribution with (n) derees of freedom, and reject Ho only if t is larer than the critical values. Then we would compare t with values from a tdistribution with (n ) derees of freedom, and reject H o only if t is larer than the critical values. 2 3 Confidence Interval for a Mean ould also We could compute also compute a 00( a 00( α)% confidence interval interval for the for ould also compute a 00( α)% confidence interval fo opulation mean, µ, usin population mean, µ,, usin X ± t n (α/2) s n X ± t n (α/2) s n 3 4
3 Multivariate Testin of a Mean Multivariate Testin Testin of of a Mean a Mean Multivariate Testin of a Mean Mean The multivariate analo of the tstatistic is: Multivariate Testin of a Mean Multivariate Testin of a Mean he The multivariate multivariate analo analo of of the the tstatistic tstatistic is: is: The multivariate The multivariate analo analo of the tstatistic is: te analo The multivariate of the tstatistic analo T 2 of the tstatistic = of is: n( the X tstatistic µ 0 ) is: Sis: T 2 T= 2 n( = X n( T 2 X µ = µ n( X 0 ) S µ 0 n ) S ( X n ( µ X 0 ) 0 ) S n ( X µ 0 ) µ 0 ) T 2 = n( X µ 0 ) S It T 2 is = called n( X µ Hotellins 0 ) S n ( X µ 0 ) It is called It is called It ishotellins called Hotellins T 2 T 2 n ( X µ 0 ). t is called T 2.. T 2. It is called Hotellins T 2. n ( X µ 0 ) tellins follows an (Remember? In T follows 2 2. T 2 follows an (n )p F p,n p,α distribution. in Tfollows 2 follows an an an (n )p n p F p,n p,α distribution. (Remember that, in the univariate n p (n )p case, when t t n, then t 2 that, in n p F p,n p,α distribution. (Remember that, in F,n. Note in the he (n )p the univariate multivariate case, when t t analo, that the n, then t 2 F derees of freedom,n.). Note in the incorporate the n p F case, p,n p,α distribution. when t t (Remember t 2 n, then t 2 F that,,n F,n. Note in. Note in the in the multivariate dimensionality analo, that that of the the the data.) derees freedom incorporate the case, when t t n of, the then t 2 derees of offreedom incorporate the the F,n. Note in the dimensionality of of the the of data.) the data.) nalo, that the derees of freedom incorporate the 4 of the data.) T 2 follows an (n )p n p F p,n p,α distribution. (Remember that, in the univariate case, when t t n, then t 2 F,n. Note in the multivariate analo, that the derees of freedom incorporate the BRIEF ARTICLE 4 5 THE AUTHOR Multivariate Testin of a Mean Null hypothesis: H 0 : µ = µ 0 Alternative hypothesis: H A : µ = µ 0 Why aren t there onesided alternatives? 6
4 Example: Sweat data Example: Sweat data Perspiration Perspirationofof20 20 healthy females for 3 components: Sweat Sweat Rate, Perspiration Rate, Sodium Sodiumof Content Content 20 healthy and and females Potassium Potassium for 3 components: Content. Sweat Rate, Sodium Content and Potassium Content SweatRate SweatRate Na Na K K X = 45.6 X S n = S n S S n = n = We want to test if We want to test H o : µ = (4 50 0) vs H A : µ = (4 50 0) We want to test if H o : µ = ( ) vs H A : µ = (4 50 0) Calculate: T 2 = T 2 = = =
5 Takin α =0.0, (n )p Takin Usin =0.0, (n )p n p F n p p,n p(0.0) = p,n p(0.0) = 8.8. Then Then we then would we would reject H we would reject o at the atho the at the 0% 0% 0% level level level of of of sinificance. sinificance. Takin Takin Usin α =0.05, =0.05, (n )p (n )p n p F n p p,n p(0.05) p,n p(0.05) = = Then Then we then we would Ho we would would NOT NOT reject reject H o at at the theat 5% 5% the 5% level level level of of of sinificance. sinificance Confidence Reions and Simultaneous Confidence Reions and Simultaneous Comparisons of Comparisons Confidence Component Reions and of Component Simultaneous Means Means Comparisons Means of fidence Reions and Simultaneous Comparisons of Confidence Reions Component and Simultaneous Means Comparisons of Component Means A 00( α)% confidence reion for the mean of a pdimensional A 00( α)% normal distribution confidence is the reion ellipsoid for the determined mean of a pdimensional by all µ such that A 00( α)% confidence reion for the mean of a pdimensional normal distribution is the ellipsoid determined by all µ such that that normal distribution is the ellipsoid determined by all µ such that n( X µ) S n( X µ) n n ( p(n ) X µ) p(n ) X µ) n p F p,n p(α) p,n p(α) n( X n ( µ) p(n ) X S n µ) ( p(n ) X µ) n p n p F F p,n p(α) where X,...,X n are sample observations from a N p (µ, Σ) population.,...,x are sample observations from p (µ, Σ) pop where where X,...,X n are sample observations from a N p (µ, Σ) population. population. This equation defines an ellipse in pspace. This This equation defines an ellipse in in pspace. pspace. in pspace. α)% confidence reion for the mean of a pdimensional l distribution is the ellipsoid determined by all µ such that n( X µ) S X,...,X n are sample observations from a N p (µ, Σ) pop. quation defines an ellipse in pspace
6 Lenths of the major axes. Lenths of the major axes. The shape of the confidence ellipse is dictated by the variancecovariance matrix, S. The orientation is iven by the eienvectors of S, and the lenths of the major axes of the confidence The shape of the confidence ellipse is dictated by the variancecovariance matrix,. The is iven by the eienvectors, ellipse ellipse are are iven iven by by and the of the major axes of the confidence ellipse are iven by p(n ) λ i p(n ) λ n(n p) F i p,n p(α) where λ i is the i th eienvalue of S. where is the i'th eienvalue. Lenths of the major axes. The shape of the confidence ellipse is dictated by the varia covariance matrix, S. The orientation is iven by the eien tors of S, and the lenths of the major axes of the confide n(n p) F p,n p(α) where λ i is the i th eienvalue of S. Inference for the Populati 9 9 This section focuses on the question: Inference for the Population Mea Example: Sweat data This section Whether focuses on a predetermined the question: mean is a the normal population mean, iven t Multivariate Testin of a Mea have collected. Whether a predetermined mean is a plausibl thethe normal multivariate population 90% confidence analomean, of reion theiven tstatistic the is: samp have collected. In hypothesis testin lanuae, this mean x data points + sample T 2 mean, = n( X µ 0 ) S n ( X µ 0 ) In hypothesis testin hypothesized lanuae, H o : mean, µ this = µ 0 means vs we H A wa : µ confidence reion (3D) It is called Hotellins T 2. where µ =(µ,µ 2,...,µ p ) is the true popu Notice H that is the (µ 0,µ 02,...,µ o : µ = µ confidence 0p ) is 0 vs H the hypothesized A : µ = µ 0 T 2 follows an (n )p mea n p reion, F p,n p,α which distribution. (Re the univariate corresponds case, when to the t t where µ =(µ,µ result 2,...,µ for p the ) ishypothesis true n, then t test. population 2 F m multivariate analo, that derees of freedom (µ 0,µ 02,...,µ 0p ) is the hypothesized mean. dimensionality of the data.) 2
7 Relationship between Interval and Hypothesis Test Hypothesis testin is equivalent to examinin the hypothesized value in relation to the simultaneous confidence ellipse. If the hypothesized value lies the ellipse, then the null hypothesis is. 3 Simultaneous Confidence Intervals Bonferroni Method vs T 2 intervals method for computin simultaneous confidence intervals Scheffe s: (n )p X i ± n p F p,n p,α s i n called Scheffe s method. These intervals are the confilipse projected into the coordinate axes. 4
8 0% tinterval Simultaneous Confidence Intervals s i X i ± t n, α 2 n o contain the true mean with probability ( α). in multiple (p) confidence intervals the joint probey contain the true means needs to be at least hieve this each interval will need to have a hiher t it contains the true mean. One method for enhe Bonferroni correction: Bonferroni: X i ± t n, α 2p s i n Inference for the Populati This section focuses on the question: 5 Intervals vs Reion Whether a predetermined mean is a the normal population mean, iven t Multivariate Testin of a Mea have collected. The multivariate analo of the tstatistic is: In hypothesis 90% confidence testinreion lanuae, this mean x data points + sample T 2 mean, = n( X µ 0 ) S n ( X µ 0 ) hypothesized H o : mean, µ = µ 0 vs H A : µ confidence reion (3D) It is called Hotellins T where µ =(µ Bonferroni confidence 2.,µ 2,...,µ p ) is the true popu interval, reion T 2 follows (µ 0,µ 02 an,...,µ (n )p 0p ) is the hypothesized mea n p F p,n p,α distribution. (Re the univariate Ellipse case, extends whenoutside t t n of box,, then in t 2 F multivariatesome analo, directions. that Hypothesized derees of freedom dimensionality mean of is the inside data.) box always. 6
9 Paired Samples! the second value from the first and treat as a one sample problem.! Example: Municipal wastewater treatment plants are required by law to monitor their dischares into rivers and streams on a reular basis. Concern about the reliability of data from one of these selfmonitorin prorams led to a study in which samples of effluent were divided and sent to two laboratories were divided and sent to two laboratories for testin. One half of each sample was sent to the Wisconsin state lab and the other half sent to a private commercial lab routinely used in the monitorin proram. Measurements were made on biochemical oxyen demand (BOD) and suspended solids (SS) were obtained, on n= samples.! Are the analyses producin equivalent results? 7 Data Sample BOD SS BOD SS BOD SS
10 Plots BOD SS Lab D 2= SS SS Lab Differences x SS S d =! Plot vs! Are the values for BOD the same from labs & 2?! Are the values for SS the same from labs & 2?! Plot in multivariate plots. +! Is there a systematic shift 222 from x one rep to another ! Plot BOD, alon BOD with sample mean and hypothesized mean.! How similar are the 99.3 sample mean 88.3 and hyp mean? SS Differences Lab BOD T = [ ] S d = 88.3Two 48.6 Sample Hotellins T 2 o Sample Two Sample Hotellins T 2 BOD F Two Sample Hotellins T 2 For two 2,9(0.05) samples Two = collected 9.47 Sample independently Reject Hotellins H o T. 2 from multivariate normal populations, For two N(µ samples 2 Two Sample Hotellins T, Σ),N(µ collected 2, Σ), independently the test statistic from multivariate for H 0 : n For two For samples two mal collected populations, collected independently 9.36 = from 3.6 multivariate normal normal populations, µ N(µ, Σ),N(µ , Σ),, the test statistic for for H 0 : µ = For µ 2 two vs Hsamples A : µ collected = µ 2 is: N(µ, Σ),N(µ 2, Σ), the test statistic for H independently from multivariate normal populations, = µ 2 vs H A : µ N(µ µ = µ 2 vs H A : µ, Σ),N(µ = µ 2 is: 2, Σ), the test statistic for H 0 : vs = µ 2 is: is ected independently from multivaria µ = µ 2 vs H A : µ = µ 2 is: µ Reject, Σ),N(µ T 2 =( X H o. 2 X, ) Σ), (( + the T test statistic f 2 T µ 2 is: 2 =( X X 2 ) (( =( X )S pooled ) T 2 =( X + )S X 2 ) (( n X + n 2 ) (( + )S( pooled X ) X ( 2 X ) X 2 ) n )S pooled ) n 2 ( X X X 22 )) n n n 2 n 2 2 is distributed is as distributed (n +n 2 2)p as is distributed is distributed as (n as n +n (n +n 2 2)p 2 p F (n +n 2 2)p 2)p n n +n +n 2 p 2 p F p,n F np,n +n +n 2 p 2 p,α F p,n +n and 2 p,α and p,n +n +n 2 p,α 2 p,α and and S S pooled S pooled = = n n +n n +n 2 2 S n pooled = n S pooled n n n +n 2 2 S 2. S +n 2 2 S = n + n n +n2 2 2 S n +n 2 2. S + n 2 n 2. +n 2 2 S 2. X 2 ) (( + )S pooled ) 22 ( X X 22 2 ) n n = 22
11 THE AUTHOR BRIEFn i ARTICLE # K! wij yij yi yij yi ni ni j Testin 3 " 83 " " > = for 8 6 if Equality of VarianceCovariance S i = % & K THE 0 if n AUTHOR i $ ' Test for Homoeneity of Variancecovariances H 0 : µ = µ 2 Bartlett s test for homoeneity of variances, for univariate data is: Covariance Matrix H A : µ = µ 2 H 0 : µ which To test eneralizes = µ % 2 the hypothesis to " S " > H 0 : Σ = Σ 2 =... = Σ aainst the alternative S = &K! that they n i 6 0 are n not 5 i if n i= all equal, SAS PROC DISCRIM uses a LRT test 'K 0H A : µ = µ 2 if n$ Λ = i= S i (n i )/2 H 0 : Σ = Σ 2 =... = Σ S pooled (n p)/2 M Statistic (extends Bartlett s univariate test.) No exact distribution, but for lare M % & ' χ 2 data can be used to obtain pvalues. Also Box s M test, has no exact distribution. K0n" 5lo S " n =! i " 6lo S i if S > 0, there s R code for this. BUT, both tend to be K i= sensitive, SYSMIS to!! if S $ Comparin several Multivariate Means  MANOVA Comparin Comparin several several Multivariate Means  MANOVA Comparin several Multivariate Means  MANOVA Data: Data:! Data: samples samples samples of of sizeof of size nsize,...n n n,...n,...n ; p variables. ; ; p variables. ; p variables. Population:! Population: populations; populations; different different means means (µ, Population: populations;, different means,...,µ (µ (µ,...,µ,...,µ ), same ), same ), same covariance covariance (Σ), (Σ), multivariate multivariate normal. normal. covariance (Σ), multivariate normal.! Arises Arises Arises from from () from desined () () desined desined experimental experimental experimental studies, studies, studies, where where where there there are are there are multiple responses measured for one or more treatments, (2) in Arises multiple multiple from responses () responses desined measured measured experimental for one foror one more studies, treatments, more treatments, where (2) there in (2) are in classification problems, where we want to classify a new observation multiple classification classification into responses problems, a class, based problems, measured where we on two where or more for want we oneto measured want orclassify more to a variables. classify treatments, newaobser observation into ainto class, (2) in classification vation problems, a class, based based on where twonor we two more want measured more to classify measured variables. a variables. new observation into a class, based on two or more measured variables
12 Recall Univariate ANOVA Recall Univariate ANOVA We have random samples X l,x l2,...,x lnl from N(µ l, σ 2 ), l =,...,. We are interested to know if the population means of Recall Recall Univariate ANOVA the roups are different, that is, if µ = µ 2 =...= µ. e We have The have We random problem have random issamples parametrized XX l,x l2 l, l,x into l2,...,x the followin lnl from model N(µ formulation: l=, We..., We are. We are interested are to toknow if if ifthe population population means of means the of of l, σ 2 ), ), l l =,...,.,...,. e the roups roups are are different, that that is, if is, is, if if µ = µ 2 =...=. µ. The problem is Xparametrized lj = µ + into the τ l followin + emodel lj formulation: he The problem problem is parametrized is parametrized into into the the followin model formulation: formulaon: with with the the constraint l= n lτ l = 0,, which leadsto tothethe null hypothesis null hypothesis notation: notation: X lj = µ + τ l lj X lj = µ + τ l + e lj H with the constraint o : τ = τ 2 =...= τ =0 l= n lτ l = 0, which leads to the null hy constraint notation: ithpothesis l= n 29 lτ l = 0, which leads to the null hyothesis notation: H o : τ = τ 2 =...= τ =0 H o : τ = τ 2 =...= τ =0 The model is fit to the data by splittin each observation into The model is fit to the data by splittin each each observation into into components: components: X lj = X + ( X l X) + (X lj X l ), j =,...,n l ; l =,..., X lj = X + ( X l X) + (X lj X l ), j =,...,n l ; l =,..., and computin the sums of squares for each component: and computin the sums of squares for each component: and computin the sums of squares for each component: n l (X lj X) 2 = n l ( X l X) 2 n l n l (X lj X l ) (X 2 l= j= lj X) 2 = n l= l ( X l X) 2 n l + (X l= j= lj X l ) 2 l= j= l= l= j=
13 ANOVA Table ANOVA Table SourceTreatment (Between) SS SS Tr df Treatment (Between) SS Tr Error (Within) SS Res l= n Error (Within) SS Res l Total SS CorTot l= n l Total SS CorTot l= n l Then Then we we test test the hypothesisusin H o : τ = τ 2 =... = τ = 0 vs H A : not all equal 0, usin: F = ANOVA Table Source SS df Then we test the hypothesis H o : τ = τ 2 =... = τ = 0 vs H A : not all equal 0, usin: F = SS Tr /( ) SS Res /( l= n l ) F, n l (α) SS Tr /( ) SS Res /( l= n l ) F, n l (α) Multivariate ANOVA Multivariate ANOVA The correspondin model formulation is The correspondin model formulation is X lj = µ + τ l + e lj And the correspondin data decomposition where X lj, e l j are n l p matrices, µ, τ l are pd vectors, ivin the correspondin data representation: X lj = X + ( X l X) + (X lj X l ), j =,...,n l ; l =,..., 32 26
14 H A : µ = µ 2 H 0 : Σ = Σ 2 =... = Σ The correspondin sums of squares χ The correspondin sums of squares 2 is iven by: is iven by: n l l= j= n l (X lj X)(X lj X) = j= n l (X lj X l )(X lj X l ) l= l= (X lj X)(X lj X) = l= j= n l ( X l X)( X l X) + l= = B + W n l ( X l X)( X l X) + n l l= j= (X lj X l )(X lj MANOVA Table Source SS df Treatment (Between) B Error (Within) W l= n l Total B + W l= n l 33 Testin the hypothesis, H o : τ =...= τ l 27 = 0 he bi complication Testinhere the hypothesis, Hthat 0 : µ = we µ are H o comparin : τ 2 =...= τmatrices l = 0 now nstead The bi Testin of sinle complication the numbers. here is hypothesis, The thatmatrices we are comparin H o : correspond matrices τ =...= toτ variance now l = 0 llipses instead The pd of sinle bi space. complication numbers. So there The H A : isµ are matrices that = µ 2 numerous we are correspond comparin test statistics. tomatrices variance now The ellipses most instead pd popular of space. sinleis So numbers. Wilks there are The lambda: numerous matrices test correspond statistics. to variance The ost popular The bi complication is Wilks lambda: here H 0 : is Σ that = Σwe 2 =... are = comparin Σ matrices now most instead popular ellipses in of sinle is pd Wilks space. numbers. lambda: So there are numerous test statistics. The The matrices correspond to variance ellipses most popular is Wilks lambda: χ in pd space. So there are numerous 2 test statistics. The most popular is Wilks' lambda: Λ = W W n l Λ = W (X lj X)(X lj B + W B X) = n + W l ( X l X)( X l X) + l= j= Λ = W l= B + W e bi complication here is that we are comparin matrices now tead of sinle numbers. The matrices correspond to variance ipses in pd space. So there are numerous test statistics. The st popular is Wilks lambda: f ΛIf is Λ If too is too is too small small then then reject H. If o. If l= j= n l n= l = n is n lare, is lare, (n (n If Λ is too W p + )/2) (p ln W W small then reject H o is Λ distributed = W. If (p + )/2) ln is distributed as asχ B+W 2 χ as as χ 2 B+W 2 n l = n is lare, (n (n (p + )/2) ln W W (α). p( ) For p( ) p( ) smaller n, (p + )/2) ln B+W is distributed as χ n, B+W 2 (α). For. For smaller n, p( ) some combinations of p, the test statistic has an F distribution. ome combinations n, some combinations of p, of of p, the p, the test the test statistic has an has F distribution. an F distribution. some combinations of p, B the + W test statistic has an F distribu Λ is too small then reject H o. If n l = n is lare, (n + )/2) ln Testin the hypothesis, THE AUTHOR H o : τ =...= τ l = 0 Testin the hypothesis, H o : τ =...= τ l = 0 The bi complication here is that we are comparin matrice instead Testin of sinle the numbers. hypothesis The matrices correspond to var ellipses in pd space. So there are numerous test statistics. If Λ is too small then reject H o. If n l = n is lare, (n W B+W BRIEF ARTICLE n l (X lj X l )(X lj X l ) (α). For smal is distributed as χ 2 28 (α). For smaller n p( ) 34
15 Note that we can also express Λ as Note that we can also express Note that Note we that can we can also also express Λ Λ as as as Λ min(p, ) = Λmin(p, ) min(p, ) = i= +λ i i= +λ i Λ = +λ i i= where where λ i ; i λ=,...,min(p, ) are the eienvalues of W i ; i ) are the eienvalues of of WB. B. values of W B.. Also note, Also note, bein bein based based on on determinants of the of the matrices matrices that that e Wilks matrices lambda Wilks Also that note, lambda isbein essentially isbased essentially on comparin determinants comparin volumes of volumes the matrices of the of the incorrespond ellipses. lambda ellipses. Another is essentially Another way of calculatin of calculatin volumes of volumes the correspondin of ellipses of ellipses is the is the that correspond Wilks f the products products ellipses. of the Another oflenths way lenths of the calculatin ofleadin the leadin volumes axes axes of ellipses (eienvalues). is the of ellipses is the products of the lenths of the leadin axes (eienvalues). envalues). where λ i ; i =,...,min(p, ) are the eienvalues of W B. Also note, bein based on determinants of the matrices that Wilks lambda is essentially comparin volumes of the correspondin ellipses. Another way of calculatin volumes of ellipses is the products of the lenths of the leadin axes (eienvalues) Other Test Statistics Other Test Statistics Other Test Statistics Other Test Statistics Roy s! Roy's larest eienvalue of ofbw BW Roy s larest eienvalue of BW Lawley! Lawley and and Hotellins: T trace(bw ) ) Lawley and Hotellins: T = trace(bw ) Pillai s trace: trace(b(b + W) ) Pillai s! Pillai s Pillai's trace: trace: V V = trace(b(b + W) W) ) )
16 Example: Dominican Republic Student Testin 82 Students, enrolled in one of Technoloy (23), Architecture (38), Medical Technoloy (2) prorams. The students were iven 4 tests: Aptitude, Math, Lanuae, General Knowlede. Is there is a BRIEF difference ARTICLE in the averae test scores for each student type? THE AUTHOR H 0 : µ = µ 2 H A : µ = µ 2 H 0 : Σ = Σ 2 =... = Σ χ 2 3 n l (X lj X)(X lj X) = l= j= l= n Summary l (X lj Statistics X l )(X lj X l ) l= j= n l ( X l X)( X l X) + n=82, n =23, n 2 =38, n 3 =2, p=4, =3 2 (n (p + )/2) ln W THE AUTHOR B+W THE AUTHOR X = X = X 2 = X = THE AUTHOR S = S = S = S 2 = S 2 = S 2 = S 3 = S 3 = Can we consider that S 3 = the population variancecovariances miht be equal? 32
17 Descriptive raphics Can we consider that the population variances miht be equal? Are the population means different? Apt Math Lan GenKnow 5 count Tech Arch MedTech StudType Tech Arch MedTech value Descriptive raphics Apt Corr: Corr: 0.52 Corr: 0.39 Can we consider that the population variancecovariances miht be equal? Are the means different? Lan Math Corr: 0.43 Corr: Corr: GenKnow Apt Math Lan GenKnow 34
18 Descriptive raphics Can we consider that the population variancecovariances miht be equal? Are the means different? 35 Test OR B = W =
19 S 2 = S 3 = Test µ = µ 2 = µ 3 τ = τ 2 = τ 3 = 0 H o : µ = µ 2 = µ 3 H o : τ = τ 2 = τ 3 = S 3 = OR B = THE AUTHOR W = S = > summary(manova(cbind(apt, Math, Lan, GenKnow)~StudType, S 2 = domrep), test="wilks") Df Wilks approx 67.5 F num 28.3 Df den 0.57 Df 68.2 Pr(>F) StudType e07 *** Residuals 79 S 3 = H o : µ = µ 2 = µ 3 H o : τ = τ 2 = τ 3 = 0 B = Which variables? Analysis of variance on each variable W = Df Sum Sq Mean Sq F value Pr(>F) Apt e07 *** Math ** Lan e05 *** GenKnow * All variables have sinificant difference between means! In order of importance: 38
20 B = W = Which roups? Df Sum Sq Mean Sq F value Pr(>F) e07 *** h ** e05 *** Know * BRIEF ARTICLE 3 BRIEF ARTICLE Tukey s multiple comparisons Apt diff lwr lwr upr padj ArchTech MedTechTech MedTechArch Lan diff lwr upr p adj ArchTech MedTechTech MedTechArch Apt: diff lwr upr p adj ArchTech Math: MedTechTech Lan: MedTechArch GenKnow: BRIEF ARTICLE diff lwr upr p adj ArchTech MedTechTech MedTechArch diff lwr upr p adj Math diff lwr upr adj ArchTech ArchTech MedTechTech MedTechArch GenKnow diff lwr lwr upr p adj ArchTech MedTechTech MedTechArch diff lwr upr p adj ArchTech MedTechTech MedTechArch Summary In eneral, usin a multivariate test, such as, is preferable to multiple univariate tests, such as. This controls for the correlation structure in the data. Its possible to et nonsinificant results in the univariate tests, but sinificant results in the multivariate tests. That is, the univariate tests may not detect the differences, in the presence of correlation between the responses. 40
21 This work is licensed under the Creative Commons Attribution Noncommercial 3.0 United States License. To view a copy of this license, visit us/ or send a letter to Creative Commons, 7 Second Street, Suite 300, San Francisco, California, 9405, USA. 4
Multivariate ANOVA. Lecture 5 July 26, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2. Lecture #57/26/2011 Slide 1 of 47
Multivariate ANOVA Lecture 5 July 26, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #57/26/2011 Slide 1 of 47 Today s Lecture Univariate ANOVA Example Overview Today s
More informationMANOVA. Lecture 7. October 5, 2005 Multivariate Analysis. Lecture #710/5/2005 Slide 1 of 55
Lecture 7 October 5, 2005 Multivariate Analysis Lecture #710/5/2005 Slide 1 of 55 Today s Lecture» Today s Lecture TwoWay TwoWay Example Oneway. Twoway. Note: Homework #5 due Thursday 10/13 at 5pm.
More informationWhat is the difference between a regular ANOVA and MANOVA? Iris setosa Iris versicolor Iris virginica
MANOVA Introduction. MANOVA stands for Multivariate Analysis of Variance. What is the difference between a regular ANOVA and MANOVA? In the case of ANOVA we have one measurement (and however many factors,
More informationPaired Comparisons and Repeated Measures
Paired Comparisons and Repeated Measures Hotelling s Tsquared tests for a single mean vector have useful applications for studies with paired comparisons or repeated measurements. 1. Paired comparison
More informationStatistical Consulting Topics. MANOVA: Multivariate ANOVA
Statistical Consulting Topics MANOVA: Multivariate ANOVA Predictors are still factors, but we have more than one continuousvariable response on each experimental unit. For example, y i = (y i1, y i2 ).
More informationTwoway MANOVA. X ikr = µ + α i + τ k + γ ik + e ikr,
Twoway MANOVA We now consider designs with two factors levels and factor 2 has b levels Factor 1 has g If X ikr is the p 1 vector of measurements on the rth unit in the ith level of factor 1 and the kth
More informationMultivariate Analysis of Variance (MANOVA)
Multivariate Analysis of Variance (MANOVA) Example: (Spector, 987) describes a study of two drugs on human heart rate There are 24 subjects enrolled in the study which are assigned at random to one of
More informationMultivariate Analysis of Variance (MANOVA): I. Theory
Gregory Carey, 1998 MANOVA: I  1 Multivariate Analysis of Variance (MANOVA): I. Theory Introduction The purpose of a t test is to assess the likelihood that the means for two groups are sampled from the
More informationOneWay Multivariate Analysis of Variance: MANOVA
OneWay Multivariate Analysis of Variance: MANOVA Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Null Hypothesis
More informationMultivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #47/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
More informationBiostatistics. ANOVA  Analysis of Variance. Burkhardt Seifert & Alois Tschopp. Biostatistics Unit University of Zurich
Biostatistics ANOVA  Analysis of Variance Burkhardt Seifert & Alois Tschopp Biostatistics Unit University of Zurich Master of Science in Medical Biology 1 ANOVA = Analysis of variance Analysis of variance
More informationThreeway ANOVA. Threeway ANOVA. Threeway ANOVA. Repeated Measures Designs. Threeway ANOVA. General Guidelines for Dealing with a 3way ANOVA
Threeway ANOVA Divide and conquer General Guidelines for Dealing with a 3way ANOVA ABC is significant: Do not interpret the main effects or the way interactions. Divide the 3way analysis into way
More informationHypothesis Testing for Three or More Means. Sometimes we wish to compare the means of three or more independent populations rather than just two
Hypothesis Testing for Three or More Means Sometimes we wish to compare the means of three or more independent populations rather than just two The twosample t test can be extended to accommodate this
More informationPlanned Contrasts and Post Hoc Tests in MANOVA Made Easy ChiiDean Joey Lin, SDSU, San Diego, CA
Planned Contrasts and Post Hoc Tests in MANOVA Made Easy ChiiDean Joey Lin, SDSU, San Diego, CA ABSTRACT Multivariate analysis of variance (MANOVA) is a multivariate version of analysis of variance (ANOVA).
More informationINTERPRETING THE REPEATEDMEASURES ANOVA
INTERPRETING THE REPEATEDMEASURES ANOVA USING THE SPSS GENERAL LINEAR MODEL PROGRAM RM ANOVA In this scenario (based on a RM ANOVA example from Leech, Barrett, and Morgan, 2005) each of 12 participants
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANACHAMPAIGN
: Part 2 Profile Analsysis and 2 way Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANACHAMPAIGN
More informationAn example ANOVA situation. 1Way ANOVA. Some notation for ANOVA. Are these differences significant? Example (Treating Blisters)
An example ANOVA situation Example (Treating Blisters) 1Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Subjects: 25 patients with blisters Treatments: Treatment A, Treatment
More informationExample: Multivariate Analysis of Variance
1 of 36 Example: Multivariate Analysis of Variance Multivariate analyses of variance (MANOVA) differs from univariate analyses of variance (ANOVA) in the number of dependent variables utilized. The major
More informationNotes for STA 437/1005 Methods for Multivariate Data
Notes for STA 437/1005 Methods for Multivariate Data Radford M. Neal, 26 November 2010 Random Vectors Notation: Let X be a random vector with p elements, so that X = [X 1,..., X p ], where denotes transpose.
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two Means
Lesson : Comparison of Population Means Part c: Comparison of Two Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationData and Regression Analysis. Lecturer: Prof. Duane S. Boning. Rev 10
Data and Regression Analysis Lecturer: Prof. Duane S. Boning Rev 10 1 Agenda 1. Comparison of Treatments (One Variable) Analysis of Variance (ANOVA) 2. Multivariate Analysis of Variance Model forms 3.
More informationMultivariate Analysis of Variance (MANOVA)
Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction
More informationBusiness Statistics. Lecture 8: More Hypothesis Testing
Business Statistics Lecture 8: More Hypothesis Testing 1 Goals for this Lecture Review of ttests Additional hypothesis tests Twosample tests Paired tests 2 The Basic Idea of Hypothesis Testing Start
More informationProfile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:
Profile Analysis Introduction Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: ) Comparing the same dependent variables
More informationHow to calculate an ANOVA table
How to calculate an ANOVA table Calculations by Hand We look at the following example: Let us say we measure the height of some plants under the effect of different fertilizers. Treatment Measures Mean
More informationInferential Statistics. Probability. From Samples to Populations. Katie RommelEsham Education 504
Inferential Statistics Katie RommelEsham Education 504 Probability Probability is the scientific way of stating the degree of confidence we have in predicting something Tossing coins and rolling dice
More informationLecture  32 Regression Modelling Using SPSS
Applied Multivariate Statistical Modelling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture  32 Regression Modelling Using SPSS (Refer
More informationANSWERS TO EXERCISES AND REVIEW QUESTIONS
ANSWERS TO EXERCISES AND REVIEW QUESTIONS PART FIVE: STATISTICAL TECHNIQUES TO COMPARE GROUPS Before attempting these questions read through the introduction to Part Five and Chapters 1621 of the SPSS
More informationInferences About Differences Between Means Edpsy 580
Inferences About Differences Between Means Edpsy 580 Carolyn J. Anderson Department of Educational Psychology University of Illinois at UrbanaChampaign Inferences About Differences Between Means Slide
More informationONEWAY REPEATED MEASURES ANOVA DANIEL BODUSZEK
ONEWAY REPEATED MEASURES ANOVA DANIEL BODUSZEK d.boduszek@interia.eu www.danielboduszek.com Presentation Outline Introduction Assumptions SPSS procedure Presenting results Introduction Oneway repeated
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 OneWay ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationNotes on Applied Linear Regression
Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 4448935 email:
More informationMultivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine
2  Manova 4.3.05 25 Multivariate Analysis of Variance What Multivariate Analysis of Variance is The general purpose of multivariate analysis of variance (MANOVA) is to determine whether multiple levels
More informationMultiple Analysis of Variance (MANOVA) Kate Tweedy and Alberta Lunardelli
Multiple Analysis of Variance (MANOVA) Kate Tweedy and Alberta Lunardelli Generally speaking, multivariate analysis of variance (MANOVA) is an extension of ANOV However, rather than measuring the effect
More information multiple linear regression, when having 1. one continuous response variable 2. Two or more continuous explanatory variables
Completely Randomized Experimental Design Simplified as Completely Randomized Design or CRD in which replicated treatments are assigned randomly to EUs (pots, plots, individuals, containers, chambers,
More informationAEROSOL STATISTICS LOGNORMAL DISTRIBUTIONS AND dn/dlogd p
Concentration (particle/cm3) [e5] Concentration (particle/cm3) [e5] AEROSOL STATISTICS LOGNORMAL DISTRIBUTIONS AND dn/dlod p APPLICATION NOTE PR001 Lonormal Distributions Standard statistics based on
More informationChapter 10: ANOVA: The Analysis of Variance
Chapter 10: ANOVA: The Analysis of Variance Dr. Sharabati Purdue University April 10, 2014 Dr. Sharabati (Purdue University) ANOVA Spring 2014 1 / 26 The Analysis of Variance (ANOVA) When data are sampled
More informationBST 708 T. Mark Beasley SplitPlot ANOVA handout
BST 708 T. Mark Beasley SplitPlot ANOVA handout In the SplitPlot ANOVA, three factors represent separate sources of variance. Two interactions also present independent sources of variation. Suppose a
More informationLatent Profile Analysis
Lecture 14 April 4, 2006 Clustering and Classification Lecture #144/4/2006 Slide 1 of 30 Today s Lecture (LPA). Today s Lecture LPA as a specific case of a Finite Mixture Model. How to do LPA. Lecture
More informationAnalysis of Variance  ANOVA 30/09/09
Analysis of Variance  ANOVA Eleisa Heron 30/09/09 Introduction Analysis of variance (ANOVA) is a method for testing the hypothesis that there is no difference between two or more population means (usually
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationMultivariate Analysis of Variance (MANOVA) Pekka Malo 30E00500 Quantitative Empirical Research Spring 2016
Multivariate Analysis of Variance () Pekka Malo 30E00500 Quantitative Empirical Research Spring 2016 Multivariate Analysis of Variance Multivariate Analysis of Variance () ~ a dependence technique that
More informationUnivariate and Multivariate Statistical Distributions
Univariate and Multivariate Statistical Distributions Multivariate Methods in Education ERSH 8350 Lecture #3 August 31, 2011 ERSH 8350: Lecture 3 Today s Lecture Univariate and multivariate statistics
More informationMultivariate Analysis of Variance (MANOVA)
Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various
More informationMODEL AND ANALYSIS FOR RANDOMIZED COMPLETE BLOCK DESIGNS
MODEL AND ANALYSIS FOR RANDOMIZED COMPLETE BLOCK DESIGNS The randomized complete block design (RCBD) v treatments (They could be treatment combinations.) b blocks of v units, chosen so that units within
More informationChapter 11: Two Variable Regression Analysis
Department of Mathematics Izmir University of Economics Week 1415 20142015 In this chapter, we will focus on linear models and extend our analysis to relationships between variables, the definitions
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means Oneway ANOVA To test the null hypothesis that several population means are equal,
More informationCorrelation and Regression
Correlation and Regression Fathers and daughters heights Fathers heights mean = 67.7 SD = 2.8 55 60 65 70 75 height (inches) Daughters heights mean = 63.8 SD = 2.7 55 60 65 70 75 height (inches) Reference:
More informationAn example of an APAstyle writeup for. the Repeated Measures Analysis of Variance and Multivariate Analysis. of Variance lab example
Repeated Measures 1 Running head: REPEATED MEASURES ANOVA AND MANOVA An example of an APAstyle writeup for the Repeated Measures Analysis of Variance and Multivariate Analysis of Variance lab example
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationSuggested solution for exam in MSA830: Statistical Analysis and Experimental Design October 2009
Petter Mostad Matematisk Statistik Chalmers Suggested solution for exam in MSA830: Statistical Analysis and Experimental Design October 2009 1. (a) To use a ttest, one must assume that both groups of
More informationMultivariate Statistical Inference and Applications
Multivariate Statistical Inference and Applications ALVIN C. RENCHER Department of Statistics Brigham Young University A WileyInterscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim
More informationAssessing Normality The Univariate Case
Assessing Normality The Univariate Case In general, most multivariate methods will depend on the distribution of X or on distances of the form. n( X µ) S 1 ( X µ) Large sample theory tells us that if the
More informationANOVA must be modified to take correlated errors into account when multiple measurements are made for each subject.
Chapter 14 WithinSubjects Designs ANOVA must be modified to take correlated errors into account when multiple measurements are made for each subject. 14.1 Overview of withinsubjects designs Any categorical
More informationMultiple Analysis of Variance (MANOVA) or Multiple Analysis of Covariance (MANCOVA). Kelly Davis
Multiple Analysis of Variance (MANOVA) or Multiple Analysis of Covariance (MANCOVA). Kelly Davis Goals, assumptions, data required. Goals MANOVA can be an appropriate statistical technique when an ANOVAlike
More informationAMS577. Repeated Measures ANOVA: The Univariate and the Multivariate Analysis Approaches
AMS577. Repeated Meaure ANOVA: The Univariate and the Multivariate Analyi Approache 1. Oneway Repeated Meaure ANOVA Oneway (onefactor) repeatedmeaure ANOVA i an extenion of the matchedpair ttet to
More information4.4. Further Analysis within ANOVA
4.4. Further Analysis within ANOVA 1) Estimation of the effects Fixed effects model: α i = µ i µ is estimated by a i = ( x i x) if H 0 : µ 1 = µ 2 = = µ k is rejected. Random effects model: If H 0 : σa
More informationSummary of Chapter 8 (Sections 8.3, )
Summary of Chapter 8 (Sections 8.3, 8.58.7) 8.3. Tests of the Equality of Two Normal Distributions (1) Hypothesis Test for Equality of Two Means Let X N(µ X, σx 2 ) and Y N(µ Y, σy 2 ). Assume that X
More informationOneway Classifications of Data and Completely. Randomized Design. Concepts for Completely
Oneway Classifications of Data and Completely Randomized Designs A oneway classification of data refers to data sets that are grouped according to one criterion. It can result from designed experiments,
More informationStatistical Modeling Using SAS
Statistical Modeling Using SAS Xiangming Fang Department of Biostatistics East Carolina University SAS Code Workshop Series 2012 Xiangming Fang (Department of Biostatistics) Statistical Modeling Using
More informationANOVA. ANalysis Of VAriance ANOVA. Dr. Tom Ilvento FREC 408. Elements of a Designed Experiment. Elements of a Designed Experiment
ANalysis Of VAriance Dr. Tom Ilvento FREC 408 provides a strategy to compare two or more population means associated with various treatments It is used when we have A dependent variable (AKA Response Variable)
More informationDISCRIMINANT FUNCTION ANALYSIS (DA)
DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant
More informationMultiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
More informationInferential Statistics
Inferential Statistics Sampling and the normal distribution Zscores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are
More informationCustom Hypothesis Tests Index table removed to save space...
Univari Analysis of Variance ( Simple Main Effect) BetweenSubjects Factors 3 Value Label N 36 Gradu 36 Alone Gradu Tests of BetweenSubjects Effects Corrected Model Intercept * Total Corrected Total Type
More informationGLM I An Introduction to Generalized Linear Models
GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial
More informationKSTAT MINIMANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINIMANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
More informationChapter 19 main topics. Sociology 360 Statistics for Sociologists I Chapter 19 TwoSample Problems. Chapter 19 homework assignment
Sociology 360 Statistics for Sociologists I Chapter 19 TwoSample Problems Chapter 19 main topics Twosample t procedures Robustness of twosample t procedures Details of the t approximation Avoid the
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More informationBackground to ANOVA. OneWay Analysis of Variance: ANOVA. The OneWay ANOVA Summary Table. Hypothesis Test in OneWay ANOVA
Background to ANOVA OneWay Analysis of Variance: ANOVA Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Recall from
More informationDidacticiel  Études de cas
1 Topic Linear Discriminant Analysis Data Mining Tools Comparison (Tanagra, R, SAS and SPSS). Linear discriminant analysis is a popular method in domains of statistics, machine learning and pattern recognition.
More informationLab Four: Initial statistical tests: ttests, nonparametric tests, simple ANOVA
Lab Four: Initial statistical tests: ttests, nonparametric tests, simple ANOVA Lab Objectives After today s lab you should be able to: 1. Use PROC FORMAT to divide a variable into categories for use
More informationRonald H. Heck and Lynn N. Tabata 1 EDEP 606 (S2014): Multivariate Methods rev. February 2014 The University of Hawai i at Mānoa
Ronald H. Heck and Lynn N. Tabata 1 EDEP 606 (S2014): Multivariate Methods rev. February 2014 The University of Hawai i at Mānoa Examining a Multivariate Outcome Examining a Multivariate Outcome 1 A logical
More informationA KERNEL MAXIMUM UNCERTAINTY DISCRIMINANT ANALYSIS AND ITS APPLICATION TO FACE RECOGNITION
A KERNEL MAXIMUM UNCERTAINTY DISCRIMINANT ANALYSIS AND ITS APPLICATION TO FACE RECOGNITION Carlos Eduardo Thomaz Department of Electrical Enineerin, Centro Universitario da FEI, FEI, Sao Paulo, Brazil
More informationANOVA. February 12, 2015
ANOVA February 12, 2015 1 ANOVA models Last time, we discussed the use of categorical variables in multivariate regression. Often, these are encoded as indicator columns in the design matrix. In [1]: %%R
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models  part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK2800 Kgs. Lyngby
More informationData Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression
Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction
More informationAppendix A: Background
12 Appendix A: Background The purpose of this Appendix is to review background material on the normal distribution and its relatives, and an outline of the basics of estimation and hypothesis testing as
More informationExample: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation:  Feature vector X,  qualitative response Y, taking values in C
More informationRegression. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.
Class: Date: Regression Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Given the least squares regression line y8 = 5 2x: a. the relationship between
More informationChapter 23 Inferences About Means
Chapter 23 Inferences About Means Chapter 23  Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300minute
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationOneWay Analysis of Variance
OneWay Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We
More information1 Basic ANOVA concepts
Math 143 ANOVA 1 Analysis of Variance (ANOVA) Recall, when we wanted to compare two population means, we used the 2sample t procedures. Now let s expand this to compare k 3 population means. As with the
More informationRESULTS: STATISTICAL INFERENCE
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE A FEW TERMS A FEW TERMS SAMPLES AND POPULATIONS Inferential statistics are necessary because The results of a given study are based on data obtained
More informationWater Quality and Environmental Treatment Facilities
Geum Soo Kim, Youn Jae Chan, David S. Kelleher1 Paper presented April 2009 and at the Teachin APPAMKDI Methods International (Seoul, June Seminar 1113, on 2009) Environmental Policy direct, twostae
More informationOneWay Analysis of Variance: A Guide to Testing Differences Between Multiple Groups
OneWay Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationMath 62 Statistics Sample Exam Questions
Math 62 Statistics Sample Exam Questions 1. (10) Explain the difference between the distribution of a population and the sampling distribution of a statistic, such as the mean, of a sample randomly selected
More informationLecture 23 Multiple Comparisons & Contrasts
Lecture 23 Multiple Comparisons & Contrasts STAT 512 Spring 2011 Background Reading KNNL: 17.317.7 231 Topic Overview Linear Combinations and Contrasts Pairwise Comparisons and Multiple Testing Adjustments
More informationStatistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl
Dept of Information Science j.nerbonne@rug.nl October 1, 2010 Course outline 1 Oneway ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic
More informationAnalysis of Variance: General Concepts
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Analysis of Variance: General Concepts This chapter is designed to present the most basic ideas in analysis of variance
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationTwosample hypothesis testing, II 9.07 3/16/2004
Twosample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For twosample tests of the difference in mean, things get a little confusing, here,
More informationIntroduction to ANOVA
Introduction to ANOVA Lamb Weight Gain Example from Text The following table contains fictitious data on the weight gain of lambs on three different diets over a 2 week period. Individual Value Plot of
More informationMultivariate analysis of variance
21 Multivariate analysis of variance In previous chapters, we explored the use of analysis of variance to compare groups on a single dependent variable. In many research situations, however, we are interested
More informationMultivariate analyses
14 Multivariate analyses Learning objectives By the end of this chapter you should be able to: Recognise when it is appropriate to use multivariate analyses (MANOVA) and which test to use (traditional
More informationANOVA ANOVA. TwoWay ANOVA. OneWay ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups
ANOVA ANOVA Analysis of Variance Chapter 6 A procedure for comparing more than two groups independent variable: smoking status nonsmoking one pack a day > two packs a day dependent variable: number of
More information