ANOVA Notes Page 1. Analysis of Variance for a One-Way Classification of Data



Similar documents
Simple Linear Regression

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

The simple linear Regression Model

ADAPTATION OF SHAPIRO-WILK TEST TO THE CASE OF KNOWN MEAN

CHAPTER 2. Time Value of Money 6-1

SHAPIRO-WILK TEST FOR NORMALITY WITH KNOWN MEAN

Average Price Ratios

1. The Time Value of Money

Statistical Pattern Recognition (CE-725) Department of Computer Engineering Sharif University of Technology

How To Value An Annuity

Abraham Zaks. Technion I.I.T. Haifa ISRAEL. and. University of Haifa, Haifa ISRAEL. Abstract

ECONOMIC CHOICE OF OPTIMUM FEEDER CABLE CONSIDERING RISK ANALYSIS. University of Brasilia (UnB) and The Brazilian Regulatory Agency (ANEEL), Brazil

Chapter = 3000 ( ( 1 ) Present Value of an Annuity. Section 4 Present Value of an Annuity; Amortization

Curve Fitting and Solution of Equation

MDM 4U PRACTICE EXAMINATION

Credibility Premium Calculation in Motor Third-Party Liability Insurance

CH. V ME256 STATICS Center of Gravity, Centroid, and Moment of Inertia CENTER OF GRAVITY AND CENTROID

Chapter 3. AMORTIZATION OF LOAN. SINKING FUNDS R =

Numerical Methods with MS Excel

6.7 Network analysis Introduction. References - Network analysis. Topological analysis

of the relationship between time and the value of money.

IDENTIFICATION OF THE DYNAMICS OF THE GOOGLE S RANKING ALGORITHM. A. Khaki Sedigh, Mehdi Roudaki

n. We know that the sum of squares of p independent standard normal variables has a chi square distribution with p degrees of freedom.

10.5 Future Value and Present Value of a General Annuity Due

AP Statistics 2006 Free-Response Questions Form B

Load and Resistance Factor Design (LRFD)

APPENDIX III THE ENVELOPE PROPERTY

Basic statistics formulas

A Study of Unrelated Parallel-Machine Scheduling with Deteriorating Maintenance Activities to Minimize the Total Completion Time

T = 1/freq, T = 2/freq, T = i/freq, T = n (number of cash flows = freq n) are :

CSSE463: Image Recognition Day 27

FINANCIAL MATHEMATICS 12 MARCH 2014

Classic Problems at a Glance using the TVM Solver

Reinsurance and the distribution of term insurance claims

Mathematics of Finance

Performance Attribution. Methodology Overview

CHAPTER 13. Simple Linear Regression LEARNING OBJECTIVES. USING Sunflowers Apparel

Speeding up k-means Clustering by Bootstrap Averaging

Preprocess a planar map S. Given a query point p, report the face of S containing p. Goal: O(n)-size data structure that enables O(log n) query time.

The analysis of annuities relies on the formula for geometric sums: r k = rn+1 1 r 1. (2.1) k=0

Banking (Early Repayment of Housing Loans) Order,

Chapter Eight. f : R R

The Gompertz-Makeham distribution. Fredrik Norström. Supervisor: Yuri Belyaev

Report 52 Fixed Maturity EUR Industrial Bond Funds

Conversion of Non-Linear Strength Envelopes into Generalized Hoek-Brown Envelopes

10/19/2011. Financial Mathematics. Lecture 24 Annuities. Ana NoraEvans 403 Kerchof

How To Make A Supply Chain System Work

Fast, Secure Encryption for Indexing in a Column-Oriented DBMS

Settlement Prediction by Spatial-temporal Random Process

The Time Value of Money

Regression Analysis. 1. Introduction

Session 4: Descriptive statistics and exporting Stata results

Compressive Sensing over Strongly Connected Digraph and Its Application in Traffic Monitoring

A Parallel Transmission Remote Backup System

Taylor & Francis, Ltd. is collaborating with JSTOR to digitize, preserve and extend access to The Journal of Experimental Education.

Constrained Cubic Spline Interpolation for Chemical Engineering Applications

Statistical Techniques for Sampling and Monitoring Natural Resources

Common p-belief: The General Case

Lesson 15 ANOVA (analysis of variance)

Sequences and Series

On formula to compute primes and the n th prime

Lecture 7. Norms and Condition Numbers

Load Balancing Control for Parallel Systems

ISyE 512 Chapter 7. Control Charts for Attributes. Instructor: Prof. Kaibo Liu. Department of Industrial and Systems Engineering UW-Madison

ANALYTICAL MODEL FOR TCP FILE TRANSFERS OVER UMTS. Janne Peisa Ericsson Research Jorvas, Finland. Michael Meyer Ericsson Research, Germany

Bayesian Network Representation

Dynamic Two-phase Truncated Rayleigh Model for Release Date Prediction of Software

Data Analysis Toolkit #10: Simple linear regression Page 1

Numerical Comparisons of Quality Control Charts for Variables

Relaxation Methods for Iterative Solution to Linear Systems of Equations

Preparation of Calibration Curves

A New Bayesian Network Method for Computing Bottom Event's Structural Importance Degree using Jointree

Statistical Intrusion Detector with Instance-Based Learning

RUSSIAN ROULETTE AND PARTICLE SPLITTING

α 2 α 1 β 1 ANTISYMMETRIC WAVEFUNCTIONS: SLATER DETERMINANTS (08/24/14)

A COMPARATIVE STUDY BETWEEN POLYCLASS AND MULTICLASS LANGUAGE MODELS

M. Salahi, F. Mehrdoust, F. Piri. CVaR Robust Mean-CVaR Portfolio Optimization

SPATIAL INTERPOLATION TECHNIQUES (1)

Measuring the Quality of Credit Scoring Models

Questions? Ask Prof. Herz, General Classification of adsorption

Proceedings of the 2010 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds.

I. Chi-squared Distributions

Now here is the important step

Security Analysis of RAPP: An RFID Authentication Protocol based on Permutation

Lesson 17 Pearson s Correlation Coefficient

ANNEX 77 FINANCE MANAGEMENT. (Working material) Chief Actuary Prof. Gaida Pettere BTA INSURANCE COMPANY SE

Efficient Traceback of DoS Attacks using Small Worlds in MANET

Optimal replacement and overhaul decisions with imperfect maintenance and warranty contracts

Integrating Production Scheduling and Maintenance: Practical Implications

Transcription:

ANOVA Notes Page Aalss of Varace for a Oe-Wa Classfcato of Data Cosder a sgle factor or treatmet doe at levels (e, there are,, 3, dfferet varatos o the prescrbed treatmet) Wth a gve treatmet level there are measuremets or scores he subscrpt or dex s betwee ad ad labels the dfferet factor levels or treatmet varatos he j th score the th level s desgated as j, s the the treatmet factor dex; j labels the score the wth the th treatmet: j Note: It s ot requred ad t s tpcall ot the case that ou have a equal umber of measuremets from each treatmet group he ull hpothess asserts that o treatmet populato dffers from a other hus, the populatos of scores of the treatmet levels should have the same mea ad varace For theoretcal coveece, we wll assume the scores are ormall dstrbuted wth a commo varace wth each treatmet level, but the tests whch follow are farl robust for departures from ormalt Null Hpothess: H 0: μ = μ = μ3 = μ hs s equvalet to sag that the dfferet treatmets are reall just dfferet samples all tae from a sgle populato hus, all of the varato see the measured scores s due to sample varatos, e, radomess he total umber of scores s + + 3 + = I R Johso s text, ths s desgated as upper case N, however, to be cosstet wth our earler otato that lower case s assocated wth a sample ad upper case wth a populato, these otes wll use = () = he average of all scores s ofte called the grad mea ad s gve b = + + + + + + + + + + +,,,3,,,,3,,,,3, For otatoal coveece, ths s wrtte as a double summato j, = = () If the ull hpothess s true ad all of the treatmets gve the same mea result, the should be a far estmate of ever score Formall, we ca wrte the followg ( ) ( ) = + + (3) j, j, Al Lehe Madso Area echcal College 4/9/00

ANOVA Notes Page Here s the mea of treatmet varato j, = (4) I equato (3), ca be looed upo as a correcto term to the grad mea to tae accout of how treatmet group dffers from the grad mea, whle j, s a addtoal correcto term to tae care of varatos wth treatmet group If the ull hpothess s true, both of these terms should be of the same magtude ad both reflect radom devatos from the grad mea From equato (4) wth each treatmet the sum of the devatos about the treatmet mea vashes ( j, ) = 0 (5) j, Furthermore, ( ) = = =, j = 0, = = = = where the zero s a cosequece of equato () ( ) = = 0 (6) = = So, the sum of the devatos of the treatmet meas about the grad mea vashes hs meas that the sample meas have degrees of freedom sce ol of them ca be arbtrarl specfed gve the value of the grad mea Cosder the total varato of, the sum of squared devatos of each score about the grad mea he devato of a score s gve b the followg ( ) ( ) = + (7) j, j, So the sum of squared devatos or total varato s ( ) ( ) ( )( ) ( ) j, = j, + j, + = = (8) Now from equato (5), ( )(, j ) = ( ) (, j ) = 0 = = So the total varato s made up of two cotrbutos SS = (, j ) = (, j ) + ( ) = SSE+ SSr = = (9) Al Lehe Madso Area echcal College 4/9/00

ANOVA Notes Page 3 SSr = = () = ( ) (0) SSE = ( ) j, he term SSr s the reatmet sum of squares ad represets that part of the total varato that s due to the dffereces betwee the treatmets (factor levels) hs varato has degrees of freedom he secod term, SSE, s the resdual (left over) varato due to dffereces scores wth the dfferet treatmet groups he th treatmet level has degrees of freedom due to the costrat mposed b equato (5) he total degrees of freedom of the resdual or Error sum of squared devatos s gve b ( ) = = () = = he sum of the degrees of freedom of the treatmet sum of squares ad the error sum of squares s + = hs s the degrees of freedom of the total varato SS hs s sce the grad mea uses up a degree of freedom R Johso s text uses the otato SS for SS ad SS(r) for SS r If the ull hpothess s deed true, the the mea squares of treatmet ad error both estmate the same commo populato varace of each treatmet populato he mea squares are computed as a varato dvded b assocated degrees of freedom ( ) SS he reatmet or Betwee Mea Square: MS r r = = = (3) he Error or Wth Mea Square: ( ) j, SSE = MSE = = (4) o test the ull hpothess that all treatmet levels have the same mea, we compute the observed Fsher F score MS F = r (5) MSE For a gve level of sgfcace,α, ths value s compared agast a crtcal score calculated from a F dstrbuto used to compare two varaces obtaed from samplg varaces from two ormall dstrbuted populatos he umerator degrees of freedom s ad the deomator degrees of freedom s If F > Fα (, ), the ull hpothess s rejected, whle f F < Fα (, ) we fal to reject H 0 o facltate the actual calculatos, we defe the followg termedate varables Al Lehe Madso Area echcal College 4/9/00

ANOVA Notes Page 4 =, j (6) A = (8) = B ( ) =, j (7) = (9) = B = B (0) = From equatos () ad (4), = () = () B Also the th sample varace s gve b s = (3) Now, ( ) SSr = + = + = = = = = = Smlarl,, so SSr = A (4) (,, ), j j j j, j j = = = = = = = =, so SSE = + = + = SSE = B A (5) Fall, as a chec, the total varato must be the sum of treatmet ad error varatos SS = (, j ) =, j, j + = B = = = = A + ( B A) = SSr + SSE From equatos (3) ad (4), the mea squares are gve b the followg Al Lehe Madso Area echcal College 4/9/00

ANOVA Notes Page 5 he reatmet or Betwee Mea Square: MSr A SS = r = (6) he Error or Wth Mea Square: SSE B A MSE = = (7) So a scheme to calculate a oe-wa classfcato aalss of varace s to la out the data colums as a spreadsheet, wth each colum represetg a dfferet treatmet or factor level Leave room betwee treatmets for a secod colum whch s the square of the frst colum hus, each treatmet s assocated wth a par of colums Sum the colum of scores to obta ad sum the colum of squares of scores to obta B he for each treatmet, calculate the B sample mea,, ad the sample varace s = he sum over the treatmet groups to obta, B, A, ad Note: he table below should ot be terpreted as mplg that the umber of rows each colum s the same I geeral, t s ot true that = = 3 = reatmet reatmet reatmet 3 reatmet,,,, 3, 3,,,,,,, 3, 3,,,,3,3,3,3 3,3 3,3,3,3, 3,,,, 3, 3, 3 -------------------------------------- ----------- B B 3 B 3 B 3 s s For each par of colums:, Summg over the colums: j At ths pot we ca costruct a ANOVA table B 3 = ( ) s =, j A = B = B = = = s = = Source Sum of Squares Degrees of Freedom Mea Square reatmet SS r = A / - MS r = SS r /( ) Error SSE = B - A - MSE = SSE /( ) otal SS = B / - SS s =, Al Lehe Madso Area echcal College 4/9/00

ANOVA Notes Page 6 MS Compute F = r ad compare ts value agast Fα ( ν =, ν = ) for a stated level of MSE sgfcace,α If F < Fα (, ), we fal to reject H 0: μ = μ = μ3 = μ If F > Fα (, ), we reject H0 ad coclude that at least two of the populato meas are dfferet o determe whch treatmet meas are dfferet more testg eeds to be doe Some procedures whch are used are Scheffe s test ad ue s test However, the method we wll use ( ) s multple t testg, also ow as the Boferro procedure here are = dstct pars of treatmets whch ca be pced from the dfferet treatmet groups For each such par a observed t - score s calculated If ths observed score s larger absolute value tha a crtcal t score, the the ull hpothess that asserts the equalt of the two populato meas assocated wth the two treatmets s rejected Specfcall, let ad m stad for the treatmet group dces of the two treatmets beg examed he, obs (, ) m t m = (8) sp + m s the b ow famlar observed t - score for comparg two depedet sample meas he absolute value s used sce ths s a two-sded test for a dfferece betwee two populatos Sce we have alread assumed that the treatmet populatos have measuremets whch are ormall dstrbuted wth a commo varace, we use the pooled varace whch best estmates ths commo wth treatmet group varace, amel MSE B A s p = MSE = (9) Hece, our observed t score s computed from obs (, ) m t m = (30) MSE + m Now, oe would expect that ths t obs (, m) would be compared agast a crtcal t score tc = t α /, where α s the level of sgfcace However, we watα to be the maxmum ( ) probablt of a pe I error for the etre sequece of m = = comparsos If each comparso were made at a level of sgfcace of α, the the probablt of a pe I error somewhere the m comparsos made would be ( α ) m hs probablt s larger tha α ad cosderabl larger f m s large o have a actual level of sgfcace of α, we wll therefore use tc = t α /( m ), so that the probablt of a pe I error the m comparsos becomes Al Lehe Madso Area echcal College 4/9/00

ANOVA Notes Page 7 m α α m Fall, whe we fd t c, we eed to ow the degrees of freedom Sce we are usg MSE for s p, the degrees of freedom sν = hus, summar we have tc = t α /( m) wth degrees of freedom (3) Cosder the followg example Suppose there are fve dfferet treatmets, e, = 5, ad suppose we wsh to wor at a level of sgfcace of 0%,,e, α = 00 he the umber of 5 54 ( ) par wse comparsos s m = = = 0 Suppose that the total umber of measuremets over the fve treatmets s 6, e, = 6 hus, from equato (3), tc = tα /( m) = t0/0 = t005 wth 6 5 = degrees of freedom Hece, t c = 83 Now to facltate the 0 dfferet comparsos, t helps to summarze the results tabular form Each ope cell the followg table represets oe of the 0 possble comparsos reatmet 3 4 t obs (, ) 3 t obs (,3) t obs (,3) 4 t obs (, 4) t obs (,4) t obs (3,4) 5 t obs (,5) t obs (,5) t obs (3,5) t obs (4,5) I these cells the computed value of t obs for the two treatmets beg compared s dsplaed Wheever ths value of t obs exceeds 83 oe ca coclude at a 0% level of sgfcace that the two populato meas are dfferet Al Lehe Madso Area echcal College 4/9/00