Chapter 4. Blocking. 4.1 Types of block
|
|
|
- James Lyons
- 10 years ago
- Views:
Transcription
1 Chapter Blocking. Types of block If the plots are not all reasonably similar, we should group them into blocks in such a way that plots within each block are alike. There are three main types of block... atural discrete divisions These divisions in the experimental units are already present. If the experimental units are new-born animals then litters make natural blocks. In an experiment on people or animals, the two sexes make obvious blocks. In testing tags on cows ears, the ears are the experimental units and cows are the blocks. In an industrial process, a block could be a batch of chemical or ore used for part of the process. Sometimes there is more than one type of natural discrete block. If the experimental units are half-leaves of tobacco plants then whole leaves make one sort of block while the plants make another. In a consumer experiment, such as Example.9, testers and weeks are both natural blocks. In an experiment in the laboratory, technicians, benches and days may all be blocks. If an experiment is carried out on plots that had previously been used for another experiments then you should consider whether to deem the previous treatments to be blocks. This is because the previous treatments may have left some residue that may affect the responses in the new experiment. This type of block is particularly important in experiments on trees, which may have to be used for different experiments year after year. Example. (Irrigated rice) Rice is usually grown on irrigated land. Figure. shows plots in a rice paddy to be used for an experiment. Irrigation channels 5
2 5 Chapter. Blocking water Figure.: Irrigation channels in the rice experiment branch off the main irrigation channel, each one watering a long strip of plots. These strips, or irrigation groupings, should be considered as blocks... Continuous gradients If an experiment is spread out in time or space then there will probably be continuous underlying trends but no natural boundaries. In such cases the plots can be divided into blocks of plots which are contiguous in time or space. To some extent the positioning of the block boundaries is arbitrary. In an experiment on people or animals, age, weight and state of health are continuous variables which are often suitable for determining blocks. To be in the same block two people do not have to have exactly the same weight, but weight ranges can be chosen so that blocks have a suitable size. Similarly, severity of disease can be used to block patients in a clinical trial. Example. (Laboratory measurement of samples) Consider the technician measuring soil samples in Question.. His experimental units follow one another in
3 .. Types of block 5 time. As time goes on, he may get more accurate, or he may get tired. Outside factors, such as temperature or humidity, may change. Dividing up the experimental units into three or four blocks of consecutive plots should remove these unnecessary sources of variation from the conclusions of the experiment. Example. (Field trial) The plots in a agricultural field trial may cover quite a large area, encompassing changes in fertility. Sometimes it is possible to form natural blocks by marking out a stony area, a shady area and so on. More often it is simply assumed that plots close to each other are more likely to repond similarly than plots far apart, so small compact areas are chosen as blocks. In Example., the distance from the main irrigation channel gives a continuous source of variability that should also be used for blocking, but now there is some freedom to choose how large a distance each block should cover... Choice of blocking for trial management Some aspects of trial management force differences between the plots. As far as possible, these differences should match (some of) the block boundaries. In a clinical trial patients may have to be divided into groups to be attended to by different doctors or nurses. These groups should be blocks. In a laboratory experiment, technicians may be thought of as natural blocks if their times and places of work are already fixed. However, if technicians can be allocated to tasks as part of the management of the experiment, then it may be possible to adjust their work so that, for example, the number of samples analysed by one person in one session is equal to the number of treatments. There are many experiments where one or more treatment factors can be applied only to large areas: see Example.5. These large areas form a sort of block. Example. revisited (Field trial) In the developed world, most agricultural operations are by tractor. Typically a tractor is driven as far as possible in a straight line before being turned round. This suggests that blocks in field trials should be long thin areas corresponding to a few passes of the tractor... How and when to block If possible, (i) blocks should all have the same size; (ii) blocks should be big enough to allow each treatment to occur at least once in each block. atural discrete blocks should always be used once they have been recognized. If possible, choose plots and blocks to satisfy (i).
4 5 Chapter. Blocking Example. (Piglets) If the experimental units are piglets then litters are natural blocks. Litters are not all of the same size, typically being in the range 8, depending on the breed. It would be sensible to use only some fixed number, say 9, of piglets from each litter. Then you need an objective rule for which pigets to choose from the larger litters, such the heaviest piglets. Alternatively, if larger blocks are needed, start with more sows than necessary and use only those litters large enough to give, say, piglets. atural blocks have an upper limit on their size, so it may be impossible to satisfy (ii). In the cows ears example, blocks have size no matter how many treatments there are. Blocks should always be used for management. Then all trial operations sowing, harvesting, interim applications of treatments, measuring are done blockby-block, in case of interruptions, improvements in technique, replacement of staff, etc. This ensures that any extra variation caused by changing conditions is already accounted for by the blocking. Management blocks can usually be chosen to satisfy both (i) and (ii). To eliminate the effects of a continuous trend, blocks can also be chosen to satisfy both (i) and (ii). Usually such blocking is helpful, but it may be better not to use this sort of block if doing so would make the number of residual degrees of freedom very small: see Example.6. As noted in Example., the requirements of blocking for trial management may conflict with those of blocking to remove a continuous trend. You may have to decide which is more important for the experiment at hand. We have also noted examples where more than one sort of block is needed. This point will be developed further in Chapters 6 and 8.. Orthogonal block designs For the rest of this chapter we suppose that Ω consists of b blocks of equal size k. We thus have a block factor B which is defined by B(ω) = the block containing ω. The block subspace V B consist of those vectors in V which take a constant value on each block. For j =,..., b, let v j be the vector whose entry on plot ω is equal to { if ω is in block j; otherwise. Then v j v j = k, while v j v l = if j l. Therefore { v j : j =,...,b } is an orthogonal basis for V B, and dimv B = b. ow u = b j= v j V B, so V V B. Just as we defined W T, we put W B = {v V B : v is orthogonal to V } = V B V.
5 .. Construction and randomization 55 Definition A block design is orthogonal if the spaces W T and W B are orthogonal to each other. Theorem. Let s i j be the number of times that treatment i occurs in block j, for i =,..., t and j =,..., b. Then the block design is orthogonal if and only if s i j = r i /b for i =,..., t and j =,..., b. Proof First note that s i j = u i v j. Since W T is orthogonal to V, W T W B if and only if W T V B, which happens if and only if ( ) t a i u i v j = for j =,..., b i= whenever i a i r i = ; that is, so i a i s i j = whenever i a i r i =. If s i j = r i /b for each i then i a i s i j = i (a i r i )/b, which is zero whenever i a i r i =. This is true for all j, so W T W B. Conversely, suppose that W T W B. Fix i different from, and put a = /r, a i = /r i and a l = if l / {,i}. Then l a l r l = and l a l s l j = s j /r s i j /r j so s j = s i j r r i for all j. This is true for all i, including i =, so counting the plots in block j gives k = t i= s i j = s j r t i= Therefore s j = r /b and hence s i j = r i /b for all i. r i = s j r = s j r bk. Definition A complete-block design has blocks of size t, with each treatment occurring once in each block. Corollary. Complete-block designs are orthogonal. We consider only orthogonal block designs for the remainder of this chapter.. Construction and randomization Construct and randomize an orthogonal block design as follows. (i) Apply treatment i to r i /b plots in block, for i =,..., t, and randomize, just as for a completely randomized design. (ii) Repeat for each block, using a fresh randomization each time, independent of the preceding randomizations.
6 56 Chapter. Blocking X X 7 7 X 7 X 6 9 X X Table.: Stream of random digits, used to randomize the design in Example.5 Judge Tasting Wine Judge Tasting Wine Judge 7 Tasting Wine Judge Tasting Wine Judge 5 Tasting Wine Judge 8 Tasting Wine Judge Tasting Wine Judge 6 Tasting Wine Table.: Randomized plan in Example.5 Example.5 (Wine tasting) Four wines are tasted and evaluated by each of eight judges. A plot is one tasting by one judge; judges are blocks. So there are eight blocks and plots. Plots within each judge are identified by order of tasting. The systematic design is the same for each judge. Judge j Tasting Wine To randomize this design we need eight independent random permutations of four objects. Here we use the method described at the end of Section., using a stream of random digits and taking as many as are needed for each successive block. The random digits are shown in the top row of Table. and the randomized plan in Table..
7 . Models for block designs.. Models for block designs 57 Recall that Y ω = τ T (ω) + ω, where ω is the effect of plot ω. There are two common models for how the blocks affect ω. In the first model, the blocks affect the expectation but not the covariance. Thus E( ω ) = ζ B(ω), where ζ B(ω) is an unknown constant depending on the block B(ω) containing ω. However, the covariance still has its simplest form; that is { cov( α, β ) = σ if α = β otherwise. This is called the fixed-effects model. In the second model the blocks make no contribution to the expectation, so that E( ω ) =. However, the covariance between the responses on plots α and β depends on whether α = β, α and β are different but in the same block, or α and β are in different blocks. Thus σ if α = β cov( α, β ) = ρ σ if α β but B(α) = B(β) ρ σ if B(α) B(β). Of course, ρ and ρ. Usually we expect that ρ > ρ, because plots in the same block should respond in a more alike manner than plots in different blocks. This is called the random-effects model. Let J B be the matrix whose (α,β)-entry is equal to Then, in the random-effects model, { if B(α) = B(β) otherwise. Cov(Y) = σ I + ρ σ (J B I) + ρ σ (J J B ) = σ [( ρ )I + (ρ ρ )J B + ρ J]. Some natural discrete classifications with a small number of possibilities (such as sex) are best considered as fixed. For example, -year-old human males might always be heavier than -year-old human females and we might want to find out how much heavier. Most other classifications are just a nuisance and are best thought of as random. For example, plots at the top end of the field may do better than plots at the bottom end in wet years and worse in dry years, but, on the whole, plots at the top end will tend to perform more similarly to each other than to plots at the bottom end.
8 58 Chapter. Blocking.5 Analysis: fixed effects The expectation part of the fixed-effects model is that E(Y ω ) = τ T (ω) + ζ B(ω). (.) In vector terms, this is E(Y) = τ + ζ, where τ V T and ζ V B. Equation (.) shows that τ = τ +τ T, where τ = τu V and τ T = τ τu W T. Similarly, ζ = ζ + ζ B, where ζ = ζu V and ζ B = ζ ζu W B. Thus E(Y) = (τ + ζ ) + τ T + ζ B with τ + ζ in V, τ in W T and ζ B in W B. ow, τ and ζ are both multiples of the all- vector u and so they cannot be distinguished, either in the model (.) or from the data. This can be seen in another way. We could replace τ i by (τ i + c) for some constant c, for all i, and replace ζ j by (ζ j c), for all j, without changing Equation (.). This implies that neither τ nor ζ can be estimated. However, we can estimate treatment contrasts, and we can estimate block contrasts. The definition of the sum of two vector subspaces gives Thus Equation (.) can be rewritten as V T +V B = {v + w : v V T, w V B }. E(Y) V T +V B. Suppose that x W T. Then x V T +V B. Applying Theorem.5 with V T +V B in place of V T shows that x Y is the best linear unbiased estimator of x (τ +ζ +τ T + ζ B ). ow, x τ = x ζ = because x V, and x ζ B = because x W B : that is why we restrict attention to orthogonal designs throughout this chapter. Therefore x (τ + ζ + τ T + ζ B ) = x τ T, whose best linear unbiased estimator is x Y with variance x σ. Similarly, if z W B then z Y is the best linear unbiased estimator of z ζ B, with variance z σ. Likewise, we have P WT (τ + ζ ) = because τ + ζ is orthogonal to W T. Similarly, P WT (ζ B ) = because ζ B is in W B, which is orthogonal to W T. ow Theorem. shows that E(P WT (Y)) = P WT (EY) = P WT (τ + ζ + τ T + ζ B ) = τ T and that E(P WB (Y)) = ζ B. Put W E = (V T +V B ). This is going to be the residual subspace: the reason for the notation E will be explained in Chapter. Then V is the following direct sum of orthogonal subspaces: V = V W T W B W E.
9 .5. Analysis: fixed effects 59 We have constructed both W T and W B to be orthogonal to V. The subspaces W T and W B are orthogonal to each other because we have assumed that the design is orthogonal. Finally, we have constructed W E to be orthogonal to the previous three subspaces because V T +V B = V W T W B. Just as in Section., the orthogonal decomposition of V leads to (orthogonal) decompositions of the dimension, expectation, data and sum of squares, as follows: V = V W T W B W E dimension = bk = + (t ) + (b ) + [b(k ) (t )] expectation E(Y) = (τ + ζ ) + τ T + ζ B + data y = ȳu + y T + y B + residual sum of squares y ω = ω Ω sum + SS(treatments) + SS(blocks) + SS(residual) where y T = y B = t i= SS(treatments) = SS(blocks) = mean T =i u i ȳu, b mean B= j v j ȳu, j= t sum T =i sum i= r i, b sum B= j sum j= k, SS(residual) = sum of squares of the residuals = y ω SS(mean) SS(treatments) SS(blocks), ω Ω and sum B= j and mean B= j are the total and mean respectively of the values of y ω for ω in block j. Hence we obtain the anova table shown in Table.. Of course, this is really two anova tables in one. The theoretical anova table, which tells us what to do, can omit the columns for mean square and variance ratio, but must show the column for EMS, which shows us which variance ratios to calculate. The anova table given by the actual data, in which the formulae are replaced by their values, does not need to show the EMS column, but may well include a final column headed F-probability, which gives the probability of obtaining a variance ratio at least as big as the one obtained in the table, under the null hypothesis of zero effect for that line, and assuming normality.
10 6 Chapter. Blocking Use the variance ratio MS(treatments) MS(residual) to test for treatment differences, and the variance ratio MS(blocks) MS(residual) to test for block differences (if you are interested in them). Both tests are one-sided..6 Analysis: random effects Put C = Cov(Y). Then we have C = σ [( ρ )I + (ρ ρ )J B + ρ J]. If plot ω is in block j then the ω-row of J B is just v j. Hence if x is any vector in V then the ω-entry in J B x is equal to v j x. In particular, if x = u then v j x = k for all j and so J B u = ku. Since Iu = u and Ju = u, we see that Cu = σ [( ρ ) + k(ρ ρ ) + ρ ]u, so that u is an eigenvector of C with eigenvalue ξ, where ξ = σ [( ρ ) + k(ρ ρ ) + ρ ]. If x V B then x = j λ j v j for some scalars λ,..., λ b ; hence v j x = kλ j and so J B x = kx. Hence if x W B = V B V then Cx = σ [( ρ ) + k(ρ ρ )]x, and so x is an eigenvalue of C with eigenvalue ξ, where ξ = σ [( ρ ) + k(ρ ρ )]. Finally, if x VB V then J B x = and Jx = so Cx = ξ x, where ξ = σ ( ρ ). Thus the eigenspaces of C (the strata) are V, W B and VB, with dimensions, b and b and eigenvalues ξ, ξ and ξ respectively. Usually we expect that ξ > ξ. Theorem. shows that the appropriate anova table is that shown in Table.. The arithmetic calculations are identical to those for the fixed-effects model. Assess treament differences just as before. For the effects of blocks, do a two-sided test using MS(blocks) MS(residual). If MS(blocks) >> MS(residual) then the choice of blocks was good: do it similarly next time. If MS(blocks) << MS(residual) then
11 .6. Analysis: random effects 6 source V mean sum of squares sum degrees of freedom mean square EMS variance ratio SS(mean) τ + ζ + σ MS(mean) MS(residual) sum WB blocks B= j j k sum b SS(blocks) b ζ B b + MS(blocks) σ MS(residual) sum WT treatments T =i i ri sum t SS(treatments) t τt t + MS(treatments) σ MS(residual) residual..... by subtraction..... SS(residual) df(residual) Total ω y ω σ Table.: Anova table for blocks and unstructured treatments under the fixed-effects model
12 6 Chapter. Blocking stratum source df EMS VR V mean mean τ + ξ W B blocks blocks b ξ V B plots treatments t τ T t + ξ MS(treatments) MS(residual) residual b(k ) (t ) ξ Total Table.: Anova table for blocks and unstructured treatments under the randomeffects model either ξ < ξ because plots within a block compete (for example, if all plots in a chamber in a greenhouse share a single system of circulating liquid nutrients) or ξ < ξ and there is a better way of blocking or trial management has not been by block or the scientist is fiddling the data (and is not expecting you to notice very low values of the variance ratio)..7 Why use blocks? If we should use blocks and do not, what happens? If the blocks contribute fixed effects then ζ B is almost certainly not zero. If the treatments are not allocated orthogonally to blocks then ζ B will not be orthogonal to W T. The estimator of τ T is P WT Y, whose expectation is τ T +P WT ζ B. Thus treatment estimators are biased. It is most likely that ζ B is also not orthogonal to V T, so the estimator of σ will also be biased. In fact, Theorem.(ii) shows that the expectation of this estimator will be P V T ζ B / ( t) + σ, so that the variance will be overestimated. If the blocks contribute random effects then treatment estimators are unbiased but their variances are larger than they need be: on average, ξ will be replaced by (b )ξ + ( b)ξ.
13 .8. Loss of power with blocking 6 If we do use blocks in the design but forget to include them in the analysis, what happens? ow the treatment estimators are unbiased, but in both models our estimates of their variances are too high, so we may fail to detect genuine treatment differences. For fixed effects, the expectation of the estimator of σ is equal to ζ B /( t) + σ ; for random effects, the expectation of the estimator of ξ is equal to (b )ξ + ( b t + )ξ. t.8 Loss of power with blocking The following example, which is taken from a case where the manufacturer tried to sue the statisticians for using blocks, shows the only circumstances where blocking may be a disadvantage: there are no natural block boundaries, there are a small number of residual degrees of freedom, and the purpose of the experiment is (arguably) hypothesis testing rather than estimation. Example.6 (Pasture grass) A new additive is claimed to vastly improve the quality of pasture grass. Are farmers wasting their money in buying it? There are two treatments: the new additive, and nothing. Plots must be large enough for several sheep to graze freely. Hence the replication cannot be large: replication is chosen. Should the design be completely randomized or in three randomized complete blocks? Put τ = response to nothing τ = response to new additive. ( ) τ + τ The null hypothesis is H : τ = τ = τ =. ow, τ T = τ τ = τ τ τ τ = τ τ on nothing on new additive Using the model for the completely randomized design from Section., but writing ξ in place of ξ (to avoid confusion with the next model), we obtain the
14 6 Chapter. Blocking following anova table. stratum source df EMS mean mean 6 τ + ξ plots treatments 6 (τ τ ) + ξ residual ξ Total 6 ow we consider the complete-block design. There are no natural block boundaries, so the random-effects model is appropriate, and we obtain the following anova table. stratum source df EMS mean mean 6 τ + ξ blocks blocks ξ plots treatments (τ τ ) + ξ residual ξ Total 6 The completely randomized design mixes up the 5 degrees of freedom orthogonal to V, so 5ξ = ξ + ξ so ξ > ξ if ξ > ξ. The variance of the estimator of τ τ is ξ in the completely randomized design ξ in the complete-block design so the complete-block design gives smaller variance and so is better for estimation. For hypothesis testing, we consider the one-sided alternative that τ is bigger then τ. To test at the 5% significance level we need the.95 point of the t- distribution, which is.9 on degrees of freedom and. on degrees of freedom. To have 9% power of detecting that τ > τ, we also need the.9 points of these distributions, which are.886 and.5 respectively. The argument in
15 .8. Loss of power with blocking 65 Section. shows that to have probability at least.9 of detecting that τ > τ when doing a one-sided test at the 5% significance level we need τ τ > (. +.5) ξ in the completely randomized design τ τ > ( ) ξ in the complete-block design. Thus the block design is better.86 ξ <.665 ξ ξ >.7ξ ξ >.8ξ. Typically we have ξ.5ξ for such a trial, so smaller differences can be detected by the unblocked design. A scientist who is more interested in proving that the new additive is better (than in accurately estimating how much better) might complain if the experiment is conducted in blocks rather than in a completely randomized design. Questions for Discussion. The plan in Figure. is the field layout of an experiment conducted in 95 at Rothamsted Experimental Station (an agricultural research station founded in 8). Each plot had a notice on it showing the block number and the plot number. These are the top two numbers given in each plot in the plan. The purpose of the experiment was to compare various types of fumigant, in single and double doses, for their ability to control eelworms in the soil where oats were being grown. A control treatment (i.e. no fumigant) was included. In the plan, each plot shows, in order below the plot number, the level of a factor called Fumigant, then the dose, then the type of chemical. In the spring, gm of soil were sampled from each plot, and the number of eelworm cysts in each sample counted and recorded. The oats were sown, fumigated, grown and harvested. After harvest the plots were sampled again in the same way, and the number of cysts recorded. The variable logcount was calculated as logcount = log(number of eelworm cysts at harvest) log(number of eelworm cysts in spring before treatment), where the logarithms are to base e. This variable is shown at the bottom of each plot in the plan. (i) How many treatments were there? (ii) How were the plots divided into blocks?
16 66 Chapter. Blocking (iii) After sampling soil from plot of block III, which plot should the scientist sample next? (iv) Devise a better way of numbering the plots. (v) Why do you think logarithms were used to present the data in the form logcount?. Ignoring the factorial structure of the treatments, calculate the analysis-of-variance table and the table of means for the data logcount in the eelworm experiment.. Redo Question. under the assumption that the professor has pills of Wakey- Wakey and of izzaway. There is only one observation room, so only one pill can be tested per day. Your plan should show which student should take which pill on which day. What information should you give the professor about the plan?
17 .8. Loss of power with blocking 67 K.77 I I I I I I 5 K M S I I I I I I 8 9 M S K II II II II S K.69 II II II II 6 7 S M II II II II M K..7 III III III III K M.88 III III III III 6 7 S S III III III III M.86 IV IV IV S.68 M.79.8 K.57 IV IV IV 5 K.79 7 M IV IV IV IV IV IV.99 Figure.: Field layout for the experiment in Question. S.7
Notes from Design of Experiments
Notes from Design of Experiments Cambridge Part III Mathematical Tripos 2012-2013 Lecturer: Rosemary Bailey Vivak Patel March 21, 2013 1 Contents 1 Overview 4 1.1 Stages of a Statistically Designed Experiment...........
Review Jeopardy. Blue vs. Orange. Review Jeopardy
Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Orthogonal Diagonalization of Symmetric Matrices
MATH10212 Linear Algebra Brief lecture notes 57 Gram Schmidt Process enables us to find an orthogonal basis of a subspace. Let u 1,..., u k be a basis of a subspace V of R n. We begin the process of finding
Recall that two vectors in are perpendicular or orthogonal provided that their dot
Orthogonal Complements and Projections Recall that two vectors in are perpendicular or orthogonal provided that their dot product vanishes That is, if and only if Example 1 The vectors in are orthogonal
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
by the matrix A results in a vector which is a reflection of the given
Eigenvalues & Eigenvectors Example Suppose Then So, geometrically, multiplying a vector in by the matrix A results in a vector which is a reflection of the given vector about the y-axis We observe that
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
Randomized Block Analysis of Variance
Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of
Similarity and Diagonalization. Similar Matrices
MATH022 Linear Algebra Brief lecture notes 48 Similarity and Diagonalization Similar Matrices Let A and B be n n matrices. We say that A is similar to B if there is an invertible n n matrix P such that
CHAPTER 13. Experimental Design and Analysis of Variance
CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
α = u v. In other words, Orthogonal Projection
Orthogonal Projection Given any nonzero vector v, it is possible to decompose an arbitrary vector u into a component that points in the direction of v and one that points in a direction orthogonal to v
Chapter 13. Fractional Factorials. 13.1 Fractional replicates
244 Chapter 13 Fractional Factorials 13.1 Fractional replicates A factorial design is a fractional replicate if not all possible combinations of the treatment factors occur. A fractional replicate can
Multivariate Analysis of Variance (MANOVA): I. Theory
Gregory Carey, 1998 MANOVA: I - 1 Multivariate Analysis of Variance (MANOVA): I. Theory Introduction The purpose of a t test is to assess the likelihood that the means for two groups are sampled from the
Introduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
1 Theory: The General Linear Model
QMIN GLM Theory - 1.1 1 Theory: The General Linear Model 1.1 Introduction Before digital computers, statistics textbooks spoke of three procedures regression, the analysis of variance (ANOVA), and the
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
Experimental Designs (revisited)
Introduction to ANOVA Copyright 2000, 2011, J. Toby Mordkoff Probably, the best way to start thinking about ANOVA is in terms of factors with levels. (I say this because this is how they are described
LINEAR ALGEBRA W W L CHEN
LINEAR ALGEBRA W W L CHEN c W W L Chen, 1997, 2008 This chapter is available free to all individuals, on understanding that it is not to be used for financial gain, and may be downloaded and/or photocopied,
Dimensionality Reduction: Principal Components Analysis
Dimensionality Reduction: Principal Components Analysis In data mining one often encounters situations where there are a large number of variables in the database. In such situations it is very likely
MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.
MATH10212 Linear Algebra Textbook: D. Poole, Linear Algebra: A Modern Introduction. Thompson, 2006. ISBN 0-534-40596-7. Systems of Linear Equations Definition. An n-dimensional vector is a row or a column
Simple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
Orthogonal Projections
Orthogonal Projections and Reflections (with exercises) by D. Klain Version.. Corrections and comments are welcome! Orthogonal Projections Let X,..., X k be a family of linearly independent (column) vectors
These axioms must hold for all vectors ū, v, and w in V and all scalars c and d.
DEFINITION: A vector space is a nonempty set V of objects, called vectors, on which are defined two operations, called addition and multiplication by scalars (real numbers), subject to the following axioms
Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
8 Square matrices continued: Determinants
8 Square matrices continued: Determinants 8. Introduction Determinants give us important information about square matrices, and, as we ll soon see, are essential for the computation of eigenvalues. You
A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment
Quadratic forms Cochran s theorem, degrees of freedom, and all that
Quadratic forms Cochran s theorem, degrees of freedom, and all that Dr. Frank Wood Frank Wood, [email protected] Linear Regression Models Lecture 1, Slide 1 Why We Care Cochran s theorem tells us
Using Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
CONTROLLABILITY. Chapter 2. 2.1 Reachable Set and Controllability. Suppose we have a linear system described by the state equation
Chapter 2 CONTROLLABILITY 2 Reachable Set and Controllability Suppose we have a linear system described by the state equation ẋ Ax + Bu (2) x() x Consider the following problem For a given vector x in
Multivariate Analysis of Variance (MANOVA)
Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various
Eigenvalues, Eigenvectors, Matrix Factoring, and Principal Components
Eigenvalues, Eigenvectors, Matrix Factoring, and Principal Components The eigenvalues and eigenvectors of a square matrix play a key role in some important operations in statistics. In particular, they
1 VECTOR SPACES AND SUBSPACES
1 VECTOR SPACES AND SUBSPACES What is a vector? Many are familiar with the concept of a vector as: Something which has magnitude and direction. an ordered pair or triple. a description for quantities such
Factor analysis. Angela Montanari
Factor analysis Angela Montanari 1 Introduction Factor analysis is a statistical model that allows to explain the correlations between a large number of observed correlated variables through a small number
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
CURVE FITTING LEAST SQUARES APPROXIMATION
CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Notes on Factoring. MA 206 Kurt Bryan
The General Approach Notes on Factoring MA 26 Kurt Bryan Suppose I hand you n, a 2 digit integer and tell you that n is composite, with smallest prime factor around 5 digits. Finding a nontrivial factor
Chapter 6. Orthogonality
6.3 Orthogonal Matrices 1 Chapter 6. Orthogonality 6.3 Orthogonal Matrices Definition 6.4. An n n matrix A is orthogonal if A T A = I. Note. We will see that the columns of an orthogonal matrix must be
Analysis of Variance. MINITAB User s Guide 2 3-1
3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced
Multivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
MULTIPLE REGRESSION WITH CATEGORICAL DATA
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting
3. INNER PRODUCT SPACES
. INNER PRODUCT SPACES.. Definition So far we have studied abstract vector spaces. These are a generalisation of the geometric spaces R and R. But these have more structure than just that of a vector space.
1 Sets and Set Notation.
LINEAR ALGEBRA MATH 27.6 SPRING 23 (COHEN) LECTURE NOTES Sets and Set Notation. Definition (Naive Definition of a Set). A set is any collection of objects, called the elements of that set. We will most
4.3 Least Squares Approximations
18 Chapter. Orthogonality.3 Least Squares Approximations It often happens that Ax D b has no solution. The usual reason is: too many equations. The matrix has more rows than columns. There are more equations
MATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors. Jordan canonical form (continued).
MATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors Jordan canonical form (continued) Jordan canonical form A Jordan block is a square matrix of the form λ 1 0 0 0 0 λ 1 0 0 0 0 λ 0 0 J = 0
E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F
Random and Mixed Effects Models (Ch. 10) Random effects models are very useful when the observations are sampled in a highly structured way. The basic idea is that the error associated with any linear,
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency
Chapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?
Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure? Harvey Motulsky [email protected] This is the first case in what I expect will be a series of case studies. While I mention
LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
Recall the basic property of the transpose (for any A): v A t Aw = v w, v, w R n.
ORTHOGONAL MATRICES Informally, an orthogonal n n matrix is the n-dimensional analogue of the rotation matrices R θ in R 2. When does a linear transformation of R 3 (or R n ) deserve to be called a rotation?
Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
( ) which must be a vector
MATH 37 Linear Transformations from Rn to Rm Dr. Neal, WKU Let T : R n R m be a function which maps vectors from R n to R m. Then T is called a linear transformation if the following two properties are
Linear Codes. Chapter 3. 3.1 Basics
Chapter 3 Linear Codes In order to define codes that we can encode and decode efficiently, we add more structure to the codespace. We shall be mainly interested in linear codes. A linear code of length
MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)
MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL by Michael L. Orlov Chemistry Department, Oregon State University (1996) INTRODUCTION In modern science, regression analysis is a necessary part
Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013
Notes on Orthogonal and Symmetric Matrices MENU, Winter 201 These notes summarize the main properties and uses of orthogonal and symmetric matrices. We covered quite a bit of material regarding these topics,
Inner Product Spaces and Orthogonality
Inner Product Spaces and Orthogonality week 3-4 Fall 2006 Dot product of R n The inner product or dot product of R n is a function, defined by u, v a b + a 2 b 2 + + a n b n for u a, a 2,, a n T, v b,
Using Excel for Statistics Tips and Warnings
Using Excel for Statistics Tips and Warnings November 2000 University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 1.1 Data Entry and
2DI36 Statistics. 2DI36 Part II (Chapter 7 of MR)
2DI36 Statistics 2DI36 Part II (Chapter 7 of MR) What Have we Done so Far? Last time we introduced the concept of a dataset and seen how we can represent it in various ways But, how did this dataset came
Nonlinear Iterative Partial Least Squares Method
Numerical Methods for Determining Principal Component Analysis Abstract Factors Béchu, S., Richard-Plouet, M., Fernandez, V., Walton, J., and Fairley, N. (2016) Developments in numerical treatments for
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
1 Introduction to Matrices
1 Introduction to Matrices In this section, important definitions and results from matrix algebra that are useful in regression analysis are introduced. While all statements below regarding the columns
Linear Models for Continuous Data
Chapter 2 Linear Models for Continuous Data The starting point in our exploration of statistical models in social research will be the classical linear model. Stops along the way include multiple linear
How To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
Subspaces of R n LECTURE 7. 1. Subspaces
LECTURE 7 Subspaces of R n Subspaces Definition 7 A subset W of R n is said to be closed under vector addition if for all u, v W, u + v is also in W If rv is in W for all vectors v W and all scalars r
Cryptography and Network Security Department of Computer Science and Engineering Indian Institute of Technology Kharagpur
Cryptography and Network Security Department of Computer Science and Engineering Indian Institute of Technology Kharagpur Module No. # 01 Lecture No. # 05 Classic Cryptosystems (Refer Slide Time: 00:42)
4: SINGLE-PERIOD MARKET MODELS
4: SINGLE-PERIOD MARKET MODELS Ben Goldys and Marek Rutkowski School of Mathematics and Statistics University of Sydney Semester 2, 2015 B. Goldys and M. Rutkowski (USydney) Slides 4: Single-Period Market
Section 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
Mathematics Course 111: Algebra I Part IV: Vector Spaces
Mathematics Course 111: Algebra I Part IV: Vector Spaces D. R. Wilkins Academic Year 1996-7 9 Vector Spaces A vector space over some field K is an algebraic structure consisting of a set V on which are
Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade
Statistics Quiz Correlation and Regression -- ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements
How to calculate an ANOVA table
How to calculate an ANOVA table Calculations by Hand We look at the following example: Let us say we measure the height of some plants under the effect of different fertilizers. Treatment Measures Mean
Testing for Lack of Fit
Chapter 6 Testing for Lack of Fit How can we tell if a model fits the data? If the model is correct then ˆσ 2 should be an unbiased estimate of σ 2. If we have a model which is not complex enough to fit
Chapter 7 Section 7.1: Inference for the Mean of a Population
Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Summation Algebra. x i
2 Summation Algebra In the next 3 chapters, we deal with the very basic results in summation algebra, descriptive statistics, and matrix algebra that are prerequisites for the study of SEM theory. You
Solutions to Math 51 First Exam January 29, 2015
Solutions to Math 5 First Exam January 29, 25. ( points) (a) Complete the following sentence: A set of vectors {v,..., v k } is defined to be linearly dependent if (2 points) there exist c,... c k R, not
Chapter 20. Vector Spaces and Bases
Chapter 20. Vector Spaces and Bases In this course, we have proceeded step-by-step through low-dimensional Linear Algebra. We have looked at lines, planes, hyperplanes, and have seen that there is no limit
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In
9.2 Summation Notation
9. Summation Notation 66 9. Summation Notation In the previous section, we introduced sequences and now we shall present notation and theorems concerning the sum of terms of a sequence. We begin with a
If A is divided by B the result is 2/3. If B is divided by C the result is 4/7. What is the result if A is divided by C?
Problem 3 If A is divided by B the result is 2/3. If B is divided by C the result is 4/7. What is the result if A is divided by C? Suggested Questions to ask students about Problem 3 The key to this question
Inner product. Definition of inner product
Math 20F Linear Algebra Lecture 25 1 Inner product Review: Definition of inner product. Slide 1 Norm and distance. Orthogonal vectors. Orthogonal complement. Orthogonal basis. Definition of inner product
Other forms of ANOVA
Other forms of ANOVA Pierre Legendre, Université de Montréal August 009 1 - Introduction: different forms of analysis of variance 1. One-way or single classification ANOVA (previous lecture) Equal or unequal
1 Another method of estimation: least squares
1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
Machine Learning and Pattern Recognition Logistic Regression
Machine Learning and Pattern Recognition Logistic Regression Course Lecturer:Amos J Storkey Institute for Adaptive and Neural Computation School of Informatics University of Edinburgh Crichton Street,
QUALITY ENGINEERING PROGRAM
QUALITY ENGINEERING PROGRAM Production engineering deals with the practical engineering problems that occur in manufacturing planning, manufacturing processes and in the integration of the facilities and
Statistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
CS 147: Computer Systems Performance Analysis
CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding
6 EXTENDING ALGEBRA. 6.0 Introduction. 6.1 The cubic equation. Objectives
6 EXTENDING ALGEBRA Chapter 6 Extending Algebra Objectives After studying this chapter you should understand techniques whereby equations of cubic degree and higher can be solved; be able to factorise
Inner Product Spaces
Math 571 Inner Product Spaces 1. Preliminaries An inner product space is a vector space V along with a function, called an inner product which associates each pair of vectors u, v with a scalar u, v, and
Stat 5303 (Oehlert): Tukey One Degree of Freedom 1
Stat 5303 (Oehlert): Tukey One Degree of Freedom 1 > catch
STRUTS: Statistical Rules of Thumb. Seattle, WA
STRUTS: Statistical Rules of Thumb Gerald van Belle Departments of Environmental Health and Biostatistics University ofwashington Seattle, WA 98195-4691 Steven P. Millard Probability, Statistics and Information
The Open University s repository of research publications and other research outputs
Open Research Online The Open University s repository of research publications and other research outputs The degree-diameter problem for circulant graphs of degree 8 and 9 Journal Article How to cite:
Linear Algebra Notes
Linear Algebra Notes Chapter 19 KERNEL AND IMAGE OF A MATRIX Take an n m matrix a 11 a 12 a 1m a 21 a 22 a 2m a n1 a n2 a nm and think of it as a function A : R m R n The kernel of A is defined as Note
Numerical Analysis Lecture Notes
Numerical Analysis Lecture Notes Peter J. Olver 5. Inner Products and Norms The norm of a vector is a measure of its size. Besides the familiar Euclidean norm based on the dot product, there are a number
MATH 551 - APPLIED MATRIX THEORY
MATH 55 - APPLIED MATRIX THEORY FINAL TEST: SAMPLE with SOLUTIONS (25 points NAME: PROBLEM (3 points A web of 5 pages is described by a directed graph whose matrix is given by A Do the following ( points
