Chapter 2 Simple Comparative Experiments Solutions

Size: px
Start display at page:

Download "Chapter 2 Simple Comparative Experiments Solutions"

Transcription

1 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter Simple Comparative Experiments Solutions - The breaking strength of a fiber is required to be at least 5 psi. Past experience has indicated that the standard deviation of breaking strength is = 3 psi. A random sample of four specimens is tested. The results are y =45, y =53, y 3 =5 and y 4 =47. (a) State the hypotheses that you think should be tested in this experiment. H : = 5 H : > 5 (b) Test these hypotheses using =.5. What are your conclusions? n = 4, = 3, y = /4 ( ) = y o zo n 4 Since z.5 =.645, do not reject. (c) Find the P-value for the test in part (b). From the z-table: P (d) Construct a 95 percent confidence interval on the mean breaking strength. The 95% confidence interval is y z y z n n The viscosity of a liquid detergent is supposed to average 8 centistokes at 5C. A random sample of 6 batches of detergent is collected, and the average viscosity is 8. Suppose we know that the standard deviation of viscosity is = 5 centistokes. (a) State the hypotheses that should be tested. H : = 8 H : 8 (b) Test these hypotheses using =.5. What are your conclusions? -

2 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY y o 8 8 zo n 6 4 Since z / = z.5 =.96, do not reject. (c) What is the P-value for the test? P (. 74). 549 (d) Find a 95 percent confidence interval on the mean. The 95% confidence interval is y z y z n n The diameters of steel shafts produced by a certain manufacturing process should have a mean diameter of.55 inches. The diameter is known to have a standard deviation of =. inch. A random sample of shafts has an average diameter of.545 inches. (a) Set up the appropriate hypotheses on the mean. H : =.55 H :.55 (b) Test these hypotheses using =.5. What are your conclusions? n =, =., y =.545 Since z.5 =.96, reject H. y o zo 5.8. n (c) Find the P-value for this test. P=.6547x -56 (d) Construct a 95 percent confidence interval on the mean shaft diameter. The 95% confidence interval is y z y z n n A normally distributed random variable has an unknown mean and a known variance = 9. Find the sample size required to construct a 95 percent confidence interval on the mean, that has total width of.. -

3 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Since y N(,9), a 95% two-sided confidence interval on is yz yz n n 3 y (. 96) y (. 96) n 3 n If the total interval is to have width., then the half-interval is.5. Since z / = z.5 =.96,. 963 n n n The shelf life of a carbonated beverage is of interest. Ten bottles are randomly selected and tested, and the following results are obtained: Days (a) We would like to demonstrate that the mean shelf life exceeds days. Set up appropriate hypotheses for investigating this claim. H : = H : > (b) Test these hypotheses using =.. What are your conclusions? y = 3 s = [ (8-3) + (4-3) + (4-3) + (6-3) + (5-3) + (38-3) + (63-3) + (59-3) + (34-3) + ( 39-3) ] / ( - ) s = 3438 / 9 = 38 s t o y o 3 s n since t.,9 =.8; do not reject H Minitab Output T-Test of the Mean Test of mu =. vs mu >. Variable N Mean StDev SE Mean T P Shelf Life T Confidence Intervals -3

4 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Variable N Mean StDev SE Mean 99. % CI Shelf Life (.9, 5.9) (c) Find the P-value for the test in part (b). P=.54 (d) Construct a 99 percent confidence interval on the mean shelf life. s s The 95% confidence interval is y t y t, n, n n n Consider the shelf life data in Problem -5. Can shelf life be described or modeled adequately by a normal distribution? What effect would violation of this assumption have on the test procedure you used in solving Problem -5? A normal probability plot, obtained from Minitab, is shown. There is no reason to doubt the adequacy of the normality assumption. If shelf life is not normally distributed, then the impact of this on the t-test in problem -5 is not too serious unless the departure from normality is severe. Normal Probability Plot for Shelf Life ML Estimates Percent ML Estimates Mean 3 StDev Goodness of Fit AD* Data The time to repair an electronic instrument is a normally distributed random variable measured in hours. The repair time for 6 such instruments chosen at random are as follows: Hours

5 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) You wish to know if the mean repair time exceeds 5 hours. Set up appropriate hypotheses for investigating this issue. H : = 5 H : > 5 (b) Test the hypotheses you formulated in part (a). What are your conclusions? Use =.5. y = 47.5 s =46 / (6 - ) = s y o to.67 s n 6 since t.5,5 =.753; do not reject H Minitab Output T-Test of the Mean Test of mu = 5. vs mu > 5. Variable N Mean StDev SE Mean T P Hours T Confidence Intervals Variable N Mean StDev SE Mean 95. % CI Hours ( 88.9, 94.) (c) Find the P-value for this test. P=.6 (d) Construct a 95 percent confidence interval on mean repair time. The 95% confidence interval is y t s n y t, n, n s n Reconsider the repair time data in Problem -7. Can repair time, in your opinion, be adequately modeled by a normal distribution? The normal probability plot below does not reveal any serious problem with the normality assumption. -5

6 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot for Hours ML Estimates Percent ML Estimates Mean 4.5 StDev Goodness of Fit AD* Data Two machines are used for filling plastic bottles with a net volume of 6. ounces. The filling processes can be assumed to be normal, with standard deviation of =.5 and =.8. The quality engineering department suspects that both machines fill to the same net volume, whether or not this volume is 6. ounces. An experiment is performed by taking a random sample from the output of each machine. Machine Machine (a) State the hypotheses that should be tested in this experiment. H : = H : (b) Test these hypotheses using =.5. What are your conclusions? y n y n z o y y n n z.5 =.96; do not reject (c) What is the P-value for the test? P=.77 (d) Find a 95 percent confidence interval on the difference in the mean fill volume for the two machines. -6

7 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The 95% confidence interval is (6.56.5) (9.6) y y z y y z n n n n.5.8 ( ) (9.6) Two types of plastic are suitable for use by an electronic calculator manufacturer. The breaking strength of this plastic is important. It is known that = =. psi. From random samples of n = and n = we obtain y = 6.5 and y = 55.. The company will not adopt plastic unless its breaking strength exceeds that of plastic by at least psi. Based on the sample information, should they use plastic? In answering this questions, set up and test appropriate hypotheses using =.. Construct a 99 percent confidence interval on the true mean difference in breaking strength. H : - = H : - > y 6. 5 n y n 55. z o y y n n z. =.5; do not reject The 99 percent confidence interval is ( ) (.575) y y z y y z n n n n 6.4 ( ) (.575) The following are the burning times of chemical flares of two different formulations. The design engineers are interested in both the means and variance of the burning times. Type Type (a) Test the hypotheses that the two variances are equal. Use =.5. -7

8 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY F 599.,, 43. F H: H: F ,, S S S S Do not reject. F ,,. (b) Using the results of (a), test the hypotheses that the mean burning times are equal. Use =.5. What is the P-value for this test? ( n ) S ( n ) S S p n n 8 S p 93. y y t S p n n t 58.,. Do not reject. From the computer output, t=.5; do not reject. Also from the computer output P=.96 Minitab Output Two Sample T-Test and Confidence Interval Two sample T for Type vs Type N Mean StDev SE Mean Type Type % CI for mu Type - mu Type : ( -8.6, 9.) T-Test mu Type = mu Type (vs not =): T =.5 P =.96 DF = 8 Both use Pooled StDev = 9.3 (c) Discuss the role of the normality assumption in this problem. Check the assumption of normality for both types of flares. The assumption of normality is required in the theoretical development of the t-test. However, moderate departure from normality has little impact on the performance of the t-test. The normality assumption is more important for the test on the equality of the two variances. An indication of nonnormality would be of concern here. The normal probability plots shown below indicate that burning time for both formulations follow the normal distribution. -8

9 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot for Type ML Estimates Percent ML Estimates Mean 7.4 StDev Goodness of Fit AD* Data 8 9 Normal Probability Plot for Type ML Estimates Percent ML Estimates Mean 7. StDev Goodness of Fit AD* Data An article in Solid State Technology, "Orthogonal Design of Process Optimization and Its Application to Plasma Etching" by G.Z. Yin and D.W. Jillie (May, 987) describes an experiment to determine the effect of C F 6 flow rate on the uniformity of the etch on a silicon wafer used in integrated circuit manufacturing. Data for two flow rates are as follows: C F 6 Uniformity Observation (SCCM) (a) Does the C F 6 flow rate affect average etch uniformity? Use =.5. No, C F 6 flow rate does not affect average etch uniformity. Minitab Output Two Sample T-Test and Confidence Interval -9

10 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Two sample T for Uniformity Flow Rat N Mean StDev SE Mean % CI for mu (5) - mu (): ( -.63,.4) T-Test mu (5) = mu () (vs not =): T = -.35 P =. DF = Both use Pooled StDev =.79 (b) What is the P-value for the test in part (a)? From the computer printout, P=. (c) Does the C F 6 flow rate affect the wafer-to-wafer variability in etch uniformity? Use =.5. H: H: F555.,, F Do not reject; C F 6 flow rate does not affect wafer-to-wafer variability. (d) Draw box plots to assist in the interpretation of the data from this experiment. The box plots shown below indicate that there is little difference in uniformity at the two gas flow rates. Any observed difference is not statistically significant. See the t-test in part (a). 5 Uniformity Flow Rate -3 A new filtering device is installed in a chemical unit. Before its installation, a random sample yielded the following information about the percentage of impurity: y =.5, S =.7, and n = 8. After installation, a random sample yielded y =., S = 94.73, n = 9. (a) Can you concluded that the two variances are equal? Use =.5. -

11 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY. 5, 7, S. 7 F S Do Not Reject. Assume that the variances are equal. H F : H :. 7 (b) Has the filtering device reduced the percentage of impurity significantly? Use =.5. H : H : ( n )S ( n )S ( 8 )(. 7 ) ( 9 )( ) S p n n 8 9 S p y y. 5. t. 479 S p n n 8 9 t. 5, Do not reject. There is no evidence to indicate that the new filtering device has affected the mean -4 Twenty observations on etch uniformity on silicon wafers are taken during a qualification experiment for a plasma etcher. The data are as follows: (a) Construct a 95 percent confidence interval estimate of. n S n S, n ( ), n (b) Test the hypothesis that =.. Use =.5. What are your conclusions? H 59.,. H : : SS ,

12 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Do not reject. There is no evidence to indicate that (c) Discuss the normality assumption and its role in this problem. The normality assumption is much more important when analyzing variances then when analyzing means. A moderate departure from normality could cause problems with both statistical tests and confidence intervals. Specifically, it will cause the reported significance levels to be incorrect. (d) Check normality by constructing a normal probability plot. What are your conclusions? The normal probability plot indicates that there is not any serious problem with the normality assumption. Normal Probability Plot for Uniformity ML Estimates Percent ML Estimates Mean 5.88 StDev Goodness of Fit AD* Data The diameter of a ball bearing was measured by inspectors, each using two different kinds of calipers. The results were: Inspector Caliper Caliper Difference Difference^ (a) Is there a significant difference between the means of the population of measurements represented by the two samples? Use =.5. -

13 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY H : H : or equivalently H H : : d d Minitab Output Paired T-Test and Confidence Interval Paired T for Caliper - Caliper N Mean StDev SE Mean Caliper Caliper Difference % CI for mean difference: (-.4,.54) T-Test of mean difference = (vs not = ): T-Value =.43 P-Value =.674 (b) Find the P-value for the test in part (a). P=.674 (c) Construct a 95 percent confidence interval on the difference in the mean diameter measurements for the two types of calipers. Sd Sd d t, D d t n, n n n...5. d d -6 An article in the Journal of Strain Analysis (vol.8, no., 983) compares several procedures for predicting the shear strength for steel plate girders. Data for nine girders in the form of the ratio of predicted to observed load for two of these procedures, the Karlsruhe and Lehigh methods, are as follows: Girder Karlsruhe Method Lehigh Method Difference Difference^ S/ S/ S3/ S4/ S5/ S/ S/ S/ S/ Sum = Average =.74 (a) Is there any evidence to support a claim that there is a difference in mean performance between the two methods? Use =.5. H : H : or equivalently H H : : d d n d di n i -3

14 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY s d n n di d i.85 (.465) i n i 9.35 n 9 Minitab Output Paired T-Test and Confidence Interval Paired T for Karlsruhe - Lehigh N Mean StDev SE Mean Karlsruh Lehigh Difference t, n t. 5, 9 95% CI for mean difference: (.7,.3777) T-Test of mean difference = (vs not = ): T-Value = 6.8 P-Value =. d.74 t 6.8 Sd.35 n 9. 36, reject the null hypothesis. (b) What is the P-value for the test in part (a)? P=. (c) Construct a 95 percent confidence interval for the difference in mean predicted to observed load. S d d t,n n d d d S d d t,n n (d) Investigate the normality assumption for both samples. Normal Probability Plot.999 Probability Av erage:.34 StDev :.463 N: Karlsruhe Anderson-Darling Normality Test A-Squared:.86 P-Value:.537-4

15 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot Probability Lehigh Av erage:.66 StDev : N: 9 Anderson-Darling Normality Test A-Squared:.77 P-Value:.8 (e) Investigate the normality assumption for the difference in ratios for the two methods. Normal Probability Plot Probability Difference Av erage: StDev :.3599 N: 9 Anderson-Darling Normality Test A-Squared:.38 P-Value:.464 (f) Discuss the role of the normality assumption in the paired t-test. As in any t-test, the assumption of normality is of only moderate importance. In the paired t-test, the assumption of normality applies to the distribution of the differences. That is, the individual sample measurements do not have to be normally distributed, only their difference. -7 The deflection temperature under load for two different formulations of ABS plastic pipe is being studied. Two samples of observations each are prepared using each formulation, and the deflection temperatures (in F) are reported below: Formulation Formulation

16 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Construct normal probability plots for both samples. Do these plots support assumptions of normality and equal variance for both samples? Normal Probability Plot.999 Probability Av erage:.5 StDev :.757 N: Form 5 5 Anderson-Darling Normality Test A-Squared:.45 P-Value:.7 Normal Probability Plot Probability Av erage: StDev : N: Form 95 5 Anderson-Darling Normality Test A-Squared:.443 P-Value:.36 (b) Do the data support the claim that the mean deflection temperature under load for formulation exceeds that of formulation? Use =.5. Minitab Output Two Sample T-Test and Confidence Interval Two sample T for Form vs Form N Mean StDev SE Mean Form.5..9 Form % CI for mu Form - mu Form : ( -., 5.9) T-Test mu Form = mu Form (vs >): T =.8 P =.4 DF = Both use Pooled StDev =. (c) What is the P-value for the test in part (a)? P =.4-6

17 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -8 Refer to the data in problem -7. Do the data support a claim that the mean deflection temperature under load for formulation exceeds that of formulation by at least 3 F? Yes, formulation exceeds formulation by at least 3 F. Minitab Output Two-Sample T-Test and CI: Form, Form Two-sample T for Form vs Form N Mean StDev SE Mean Form.5..9 Form r Difference = mu Form - mu Form Estimate for difference: % lower bound for difference:.36 T-Test of difference = 3 (vs >): T-Value =.8 P-Value =.47 DF = Both use Pooled StDev =. -9 In semiconductor manufacturing, wet chemical etching is often used to remove silicon from the backs of wafers prior to metalization. The etch rate is an important characteristic of this process. Two different etching solutionsare being evaluated. Eight randomly selected wafers have been etched in each solution and the observed etch rates (in mils/min) are shown below: Solution Solution (a) Do the data indicate that the claim that both solutions have the same mean etch rate is valid? Use =.5 and assume equal variances. See the Minitab output below. Minitab Output Two Sample T-Test and Confidence Interval Two sample T for Solution vs Solution N Mean StDev SE Mean Solution Solution % CI for mu Solution - mu Solution: ( -.83, -.43) T-Test mu Solution = mu Solution (vs not =):T = -.38 P =.3 DF = 4 Both use Pooled StDev =.368 (b) Find a 95% confidence interval on the difference in mean etch rate. From the Minitab output, -.83 to.43. (c) Use normal probability plots to investigate the adequacy of the assumptions of normality and equal variances. -7

18 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot Probability Av erage: 9.95 StDev :.4659 N: Solution.5 Anderson-Darling Normality Test A-Squared:. P-Value:.743 Normal Probability Plot.999 Probability Av erage:.365 StDev :.369 N: Solution Anderson-Darling Normality Test A-Squared:.58 P-Value:.99 Both the normality and equality of variance assumptions are valid. - Two popular pain medications are being compared on the basis of the speed of absorption by the body. Specifically, tablet is claimed to be absorbed twice as fast as tablet. Assume that and are known. Develop a test statistic for H : = H : 4 y y ~ N,, assuming that the data is normally distributed. n n y y The test statistic is: zo, reject if zo z 4 n n - Suppose we are testing H : = H : -8

19 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY where and are known. Our sampling resources are constrained such that n + n = N. How should we allocate the N observations between the two populations to obtain the most powerful test? The most powerful test is attained by the n and n that maximize z o for given y Thus, we chose n and n to y y max z, subject to n + n = N. o n n This is equivalent to min L, subject to n n n n N n + n = N. Now dl, implies that n / n = /. dn n N n y. Thus n and n are assigned proportionally to the ratio of the standard deviations. This has intuitive appeal, as it allocates more observations to the population with the greatest variability. - Develop Equation -46 for a ( - ) percent confidence interval for the variance of a normal distribution. SS ~. Thus, SS n P, n, n. Therefore, SS SS, P, n, n so SS SS, is the ( - )% confidence interval on., n, n -3 Develop Equation -5 for a ( - ) percent confidence interval for the ratio / and are the variances of two normal distributions., where S S ~ F n, n S PF, n, n F or S, n, n S S P F, n, n F S, n, n S -4 Develop an equation for finding a ( - ) percent confidence interval on the difference in the means of two normal distributions where. Apply your equation to the portland cement experiment data, and find a 95% confidence interval. -9

20 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY y y S n S n ~ t, S S S S t y y t,, n n n n S S S S y y t y y t,, n n n n where S S n n S S n n n n Using the data from Table - n y S n y S This agrees with the result in Table where Construct a data set for which the paired t-test statistic is very large, but for which the usual twosample or pooled t-test statistic is small. In general, describe how you created the data. Does this give you any insight regarding how the paired t-test works? A B delta

21 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Minitab Output Paired T-Test and Confidence Interval Paired T for A - B N Mean StDev SE Mean A B Difference % CI for mean difference: (-.74, -.65) T-Test of mean difference = (vs not = ): T-Value = -7.7 P-Value =. Two Sample T-Test and Confidence Interval Two sample T for A vs B N Mean StDev SE Mean A B % CI for mu A - mu B: ( -.4, 8.6) T-Test mu A = mu B (vs not =): T = -. P =.85 DF = 8 Both use Pooled StDev =. These two sets of data were created by making the observation for A and B moderately different within each pair (or block), but making the observations between pairs very different. The fact that the difference between pairs is large makes the pooled estimate of the standard deviation large and the two-sample t-test statistic small. Therefore the fairly small difference between the means of the two treatments that is present when they are applied to the same experimental unit cannot be detected. Generally, if the blocks are very different, then this will occur. Blocking eliminates the variabiliy associated with the nuisance variable that they represent. -

22 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 3 Experiments with a Single Factor: The Analysis of Variance Solutions 3- The tensile strength of portland cement is being studied. Four different mixing techniques can be used economically. The following data have been collected: Mixing Technique Tensile Strength (lb/in ) (a) Test the hypothesis that mixing techniques affect the strength of the cement. Use =.5. Design Expert Output Response: Tensile Strengthin lb/in^ ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 4.897E E significant A 4.897E E Residual.539E Lack of Fit. Pure Error.539E Cor Total 6.436E+5 5 The Model F-value of.73 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs <. 3 vs The F-value is.73 with a corresponding P-value of.5. Mixing technique has an effect. (b) Construct a graphical display as described in Section to compare the mean tensile strengths for the four mixing techniques. What are your conclusions? S y i. MS n E

23 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Scaled t Distribution (4) (3) () () Tensile Strength Based on examination of the plot, we would conclude that and 3 are the same; that 4 differs from and 3, that differs from and 3, and that and 4 are different. (c) Use the Fisher LSD method with =.5 to make comparisons between pairs of means. LSD t LSD t,n a. 5, 64 LSD. 79 MS n E ( ) Treatment vs. Treatment 4 = = 49. > Treatment vs. Treatment 3 = =.5 > Treatment vs. Treatment = = 85.5 > Treatment vs. Treatment 4 = = > Treatment vs. Treatment 3 = = 37.5 < Treatment 3 vs. Treatment 4 = = 67.5 > The Fisher LSD method is also presented in the Design-Expert computer output above. The results agree with graphical method for this experiment. (d) Construct a normal probability plot of the residuals. What conclusion would you draw about the validity of the normality assumption? There is nothing unusual about the normal probability plot of residuals. 3-

24 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals 99 Norm al % probability Residual (e) Plot the residuals versus the predicted tensile strength. Comment on the plot. There is nothing unusual about this plot. Residuals vs. Predicted Residuals Predicted (f) Prepare a scatter plot of the results to aid the interpretation of the results of this experiment. Design-Expert automatically generates the scatter plot. The plot below also shows the sample average for each treatment and the 95 percent confidence interval on the treatment mean. 3-3

25 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 33 One Factor Plot Tensile Strength Technique 3- Rework part (b) of Problem 3- using the Duncan's multiple range test. Does this make any difference in your conclusions? R R R 3 4 S y i. r r r MS n, S E y i. 3, S y i. 4, S yi. Treatment vs. Treatment 4 = = 49. > (R 4 ) Treatment vs. Treatment 3 = =.5 > 8.9 (R 3 ) Treatment vs. Treatment = = 85.5 > (R ) Treatment vs. Treatment 4 = = > 8.9 (R 3 ) Treatment vs. Treatment 3 = = 37.5 < (R ) Treatment 3 vs. Treatment 4 = = 67.5 > (R ) Treatment 3 and Treatment are not different. All other pairs of means differ. This is the same result obtained from the Fisher LSD method and the graphical method. (b) Rework part (b) of Problem 3- using Tukey s test with =.5. Do you get the same conclusions from Tukey s test that you did from the graphical procedure and/or Duncan s multiple range test? Minitab Output Tukey's pairwise comparisons Family error rate =.5 Individual error rate =.7 Critical value = 4. Intervals for (column level mean) - (row level mean)

26 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY No, the conclusions are not the same. The mean of Treatment 4 is different than the means of Treatments,, and 3. However, the mean of Treatment is not different from the means of Treatments and 3 according to the Tukey method, they were found to be different using the graphical method and Duncan s multiple range test. (c) Explain the difference between the Tukey and Duncan procedures. A single critical value is used for comparison with the Tukey procedure where a critical values are used with the Duncan procedure. Tukey s test has a type I error rate of for all pairwise comparisons where Duncan s test type I error rate varies based on the steps between the means. Tukey s test is more conservative and has less power than Duncan s test. 3-3 Reconsider the experiment in Problem 3-. Find a 95 percent confidence interval on the mean tensile strength of the portland cement produced by each of the four mixing techniques. Also find a 95 percent confidence interval on the difference in means for techniques and 3. Does this aid in interpreting the results of the experiment? MSE MSE yi. t i yi. t,n a,n a n n Treatment - Treatment 3: Treatment : Treatment : Treatment 3: Treatment 4: y y i. j. t,n a MS n E i j y y i. j. t,n a MS n 3-4 An experiment was run to determine whether four specific firing temperatures affect the density of a certain type of brick. The experiment led to the following data: Temperature Density E 3-5

27 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Does the firing temperature affect the density of the bricks? Use =.5. No, firing temperature does not affect the density of the bricks. Refer to the Design-Expert output below. Design Expert Output Response: Density ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A Residual Lack of Fit. Pure Error Cor Total.5 7 The "Model F-value" of. implies the model is not significant relative to the noise. There is a 5.69 % chance that a "Model F-value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs (b) Is it appropriate to compare the means using Duncan s multiple range test in this experiment? The analysis of variance tells us that there is no difference in the treatments. There is no need to proceed with Duncan s multiple range test to decide which mean is difference. (c) Analyze the residuals from this experiment. Are the analysis of variance assumptions satisfied? There is nothing unusual about the residual plots. 3-6

28 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals. Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted (d) Construct a graphical display of the treatments as described in Section Does this graph adequately summarize the results of the analysis of variance in part (b). Yes. Scaled t Distribution (5) (75,5,) M ean D ensity 3-5 Rework Part (d) of Problem 3-4 using the Fisher LSD method. What conclusions can you draw? Explain carefully how you modified the procedure to account for unequal sample sizes. When sample sizes are unequal, the appropriate formula for the LSD is LSD t,n a MS E ni n j Treatment vs. Treatment =.74.5 =.4 >.3 Treatment vs. Treatment 3 =.74.7 =. <.87 Treatment vs. Treatment 4 =.74.7 =.4 <.3 Treatment 3 vs. Treatment =.7.5 =. <.3 Treatment 4 vs. Treatment =.7.5 =. <.446 Treatment 3 vs. Treatment 4 =.7.7 =. <.3 3-7

29 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Treatment, temperature of, is different than Treatment, temperature of 5. All other pairwise comparisons do not identify differences. Notice something very interesting has happened here. The analysis of variance indicated that there were no differences between treatment means, yet the LSD procedure found a difference; in fact, the Design-Expert output indicates that the P-value if slightly less that.5. This illustrates a danger of using multiple comparison procedures without relying on the results from the analysis of variance. Because we could not reject the hypothesis of equal means using the analysis of variance, we should never have performed the Fisher LSD (or any other multiple comparison procedure, for that matter). If you ignore the analysis of variance results and run multiple comparisons, you will likely make type I errors. The LSD calculations utilized Equation 3-3, which accommodates different sample sizes. Equation 3-3 simplifies to Equation 3-33 for a balanced design experiment. 3-6 A manufacturer of television sets is interested in the effect of tube conductivity of four different types of coating for color picture tubes. The following conductivity data are obtained: Coating Type Conductivity (a) Is there a difference in conductivity due to coating type? Use =.5. Yes, there is a difference in means. Refer to the Design-Expert output below.. Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual Lack of Fit. Pure Error Cor Total The Model F-value of 4.3 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs (b) Estimate the overall mean and the treatment effects. 3-8

30 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ˆ 7 / ˆ y. y ˆ y. y ˆ 3 y3. y ˆ y y (c) Compute a 95 percent interval estimate of the mean of coating type 4. Compute a 99 percent interval estimate of the mean difference between coating types and Treatment 4: Treatment - Treatment 4: (d) Test all pairs of means using the Fisher LSD method with =.5. Refer to the Design-Expert output above. The Fisher LSD procedure is automatically included in the output. The means of Coating Type and Coating Type are not different. The means of Coating Type 3 and Coating Type 4 are not different. However, Coating Types and produce higher mean conductivity that does Coating Types 3 and 4. (e) Use the graphical method discussed in Section to compare the means. Which coating produces the highest conductivity? S y i. MS E Coating types and produce the highest conductivity. n 4 Scaled t Distribution (4) (3) ()() Conductivity 3-9

31 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (f) Assuming that coating type 4 is currently in use, what are your recommendations to the manufacturer? We wish to minimize conductivity. Since coatings 3 and 4 do not differ, and as they both produce the lowest mean values of conductivity, use either coating 3 or 4. As type 4 is currently being used, there is probably no need to change. 3-7 Reconsider the experiment in Problem 3-6. Analyze the residuals and draw conclusions about model adequacy. There is nothing unusual in the normal probability plot. A funnel shape is seen in the plot of residuals versus predicted conductivity indicating a possible non-constant variance. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. Coating Type Residuals Coating Type 3-8 An article in the ACI Materials Journal (Vol. 84, 987. pp. 3-6) describes several experiments investigating the rodding of concrete to remove entrapped air. A 3 x 6 cylinder was used, and the 3-

32 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY number of times this rod was used is the design variable. The resulting compressive strength of the concrete specimen is the response. The data are shown in the following table. Rodding Level Compressive Strength (a) Is there any difference in compressive strength due to the rodding level? Use =.5. There are no differences. Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A Residual Lack of Fit. Pure Error Cor Total The "Model F-value" of.87 implies the model is not significant relative to the noise. There is a.38 % chance that a "Model F-value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs (b) Find the P-value for the F statistic in part (a). From computer output, P=.38. (c) Analyze the residuals from this experiment. What conclusions can you draw about the underlying model assumptions? There is nothing unusual about the residual plots. 3-

33 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. Rodding Level Residuals Rodding Level (d) Construct a graphical display to compare the treatment means as describe in Section

34 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Scaled t Distribution (, 5) (5) () Mean Compressive Strength 3-9 An article in Environment International (Vol. 8, No. 4, 99) describes an experiment in which the amount of radon released in showers was investigated. Radon enriched water was used in the experiment and six different orifice diameters were tested in shower heads. The data from the experiment are shown in the following table. Orifice Dia. Radon Released (%) (a) Does the size of the orifice affect the mean percentage of radon released? Use =.5. Yes. There is at least one treatment mean that is different. Design Expert Output Response: Radon Released in % ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. Residual Lack of Fit. Pure Error Cor Total The Model F-value of 3.85 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) EstimatedStandard Mean Error

35 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs <. vs <. vs <. vs vs vs <. vs <. 3 vs vs <. 3 vs <. 4 vs vs vs (b) Find the P-value for the F statistic in part (a). P=3.6 x -8 (c) Analyze the residuals from this experiment. There is nothing unusual about the residuals. Normal plot of residuals Residuals vs. Predicted 4 99 Normal % probability Residuals Residual Predicted 3-4

36 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4 Residuals vs. Orifice Diameter.85 Residuals Orifice Diameter (d) Find a 95 percent confidence interval on the mean percent radon released when the orifice diameter is Treatment 5 (Orifice =.4): (e) Construct a graphical display to compare the treatment means as describe in Section What conclusions can you draw? Scaled t Distribution (6) (5) (4) (3) () () Conductivity Treatments 5 and 6 as a group differ from the other means;, 3, and 4 as a group differ from the other means, differs from the others. 3- The response time in milliseconds was determined for three different types of circuits that could be used in an automatic valve shutoff mechanism. The results are shown in the following table. 3-5

37 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Circuit Type Response Time (a) Test the hypothesis that the three circuit types have the same response time. Use =.. From the computer printout, F=6.8, so there is at least one circuit type that is different. Design Expert Output Response: Response Time in ms ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual Lack of Fit. Pure Error Cor Total The Model F-value of 6.8 implies the model is significant. There is only a.4% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs (b) Use Tukey s test to compare pairs of treatment means. Use =.. S y i. MS E n 5 q., 3, 5. 4 t vs. :.8-.=.4 > 9.66 vs. 3:.8-8.4=.4 < 9.66 vs. 3:.-8.4=3.8 > 9.66 and are different. and 3 are different. Notice that the results indicate that the mean of treatment differs from the means of both treatments and 3, and that the means for treatments and 3 are the same. Notice also that the Fisher LSD procedure (see the computer output) gives the same results. (c) Use the graphical procedure in Section to compare the treatment means. What conclusions can you draw? How do they compare with the conclusions from part (a). The scaled-t plot agrees with part (b). In this case, the large difference between the mean of treatment and the other two treatments is very obvious. 3-6

38 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Scaled t Distribution (3) () () Tensile Strength (d) Construct a set of orthogonal contrasts, assuming that at the outset of the experiment you suspected the response time of circuit type to be different from the other two. H 3 H 3 C y. y. y3. C SS F C C Type differs from the average of type and type 3. (e) If you were a design engineer and you wished to minimize the response time, which circuit type would you select? Either type or type 3 as they are not different from each other and have the lowest response time. (f) Analyze the residuals from this experiment. Are the basic analysis of variance assumptions satisfied? The normal probability plot has some points that do not lie along the line in the upper region. This may indicate potential outliers in the data. 3-7

39 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals 7.8 Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted Residuals vs. Circuit Type Residuals Circuit Type 3- The effective life of insulating fluids at an accelerated load of 35 kv is being studied. Test data have been obtained for four types of fluids. The results were as follows: Fluid Type Life (in h) at 35 kv Load (a) Is there any indication that the fluids differ? Use =.5. At =.5 there are no difference, but at since the P-value is just slightly above.5, there is probably a difference in means. Design Expert Output Response: Life in in h 3-8

40 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A Residual Lack of Fit. Pure Error Cor Total The Model F-value of 3.5 implies there is a 5.5% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs (b) Which fluid would you select, given that the objective is long life? Treatment 3. The Fisher LSD procedure in the computer output indicates that the fluid 3 is different from the others, and it s average life also exceeds the average lives of the other three fluids. (c) Analyze the residuals from this experiment. Are the basic analysis of variance assumptions satisfied? There is nothing unusual in the residual plots. Normal plot of residuals.95 Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted 3-9

41 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY.95 Residuals vs. Fluid Type.55 Residuals Fluid Type 3- Four different designs for a digital computer circuit are being studied in order to compare the amount of noise present. The following data have been obtained: Circuit Design Noise Observed (a) Is the amount of noise present the same for all four designs? Use =.5. No, at least one treatment mean is different. Design Expert Output Response: Noise ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. Residual Lack of Fit. Pure Error Cor Total The Model F-value of.78 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs <. 3-

42 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY vs vs <. vs vs vs (b) Analyze the residuals from this experiment. Are the basic analysis of variance assumptions satisfied? There is nothing unusual about the residual plots. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. Circuit Design Residuals Circuit Design (c) Which circuit design would you select for use? Low noise is best. From the Design Expert Output, the Fisher LSD procedure comparing the difference in means identifies Type as having lower noise than Types and 4. Although the LSD procedure comparing Types and 3 has a P-value greater than.5, it is less than.. Unless there are other reasons for choosing Type 3, Type would be selected. 3-

43 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3-3 Four chemists are asked to determine the percentage of methyl alcohol in a certain chemical compound. Each chemist makes three determinations, and the results are the following: Chemist Percentage of Methyl Alcohol (a) Do chemists differ significantly? Use =.5. There is no significant difference at the 5% level, but chemists differ significantly at the % level. Design Expert Output Response: Methyl Alcohol in % ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A Residual Lack of Fit. Pure Error Cor Total.9 The Model F-value of 3.5 implies there is a 8.3% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs (b) Analyze the residuals from this experiment. There is nothing unusual about the residual plots. 3-

44 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted.5 99 Normal % probability Residuals Residual Predicted Residuals vs. Chemist.5.85 Residuals Chemist (c) If chemist is a new employee, construct a meaningful set of orthogonal contrasts that might have been useful at the start of the experiment. Chemists Total C C C Contrast Totals:

45 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY SS SS SS C C C FC 6. 5* FC FC Only contrast is significant at 5%. 3-4 Three brands of batteries are under study. It is s suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are tested with the following results: Weeks of Life Brand Brand Brand (a) Are the lives of these brands of batteries different? Yes, at least one of the brands is different. Design Expert Output Response: Life in Weeks ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. Residual Lack of Fit. Pure Error Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs <. vs vs <. (b) Analyze the residuals from this experiment. There is nothing unusual about the residuals. 3-4

46 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals 7.6 Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted Residuals vs. Brand Residuals Brand (c) Construct a 95 percent interval estimate on the mean life of battery brand. Construct a 99 percent interval estimate on the mean difference between the lives of battery brands and 3. y i. t,n a MS 5. 6 Brand : Brand - Brand 3: y i. n E y j. t,n a MS n E 3-5

47 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (d) Which brand would you select for use? If the manufacturer will replace without charge any battery that fails in less than 85 weeks, what percentage would the company expect to replace? Chose brand 3 for longest life. Mean life of this brand in.4 weeks, and the variance of life is estimated by 5.6 (MSE). Assuming normality, then the probability of failure before 85 weeks is: That is, about 5 out of, batteries will fail before 85 week. 3-5 Four catalysts that may affect the concentration of one component in a three component liquid mixture are being investigated. The following concentrations are obtained: Catalyst (a) Do the four catalysts have the same effect on concentration? No, their means are different. Design Expert Output Response: Concentration ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual Lack of Fit. Pure Error Cor Total.4 5 The Model F-value of 9.9 implies the model is significant. There is only a.4% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs

48 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (b) Analyze the residuals from this experiment. There is nothing unusual about the residual plots. Normal plot of residuals.6667 Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted Residuals vs. Catalyst Residuals Catalyst (c) Construct a 99 percent confidence interval estimate of the mean response for catalyst. y i. t,n a MS. 88 Catalyst : n E 3-7

49 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3-6 An experiment was performed to investigate the effectiveness of five insulating materials. Four samples of each material were tested at an elevated voltage level to accelerate the time to failure. The failure times (in minutes) is shown below. Material Failure Time (minutes) (a) Do all five materials have the same effect on mean failure time? No, at least one material is different. Design Expert Output Response: Failure Time in Minutes ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.3E E significant A.3E E Residual 6.5E E+6 Lack of Fit. Pure Error 6.5E E+6 Cor Total.657E+89 The Model F-value of 6.9 implies the model is significant. There is only a.38% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs vs E vs vs vs (b) Plot the residuals versus the predicted response. Construct a normal probability plot of the residuals. What information do these plots convey? 3-8

50 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 437 Residuals vs. Predicted Normal plot of residuals 99 Residuals Normal % probability Predicted Residual The plot of residuals versus predicted has a strong outward-opening funnel shape, which indicates the variance of the original observations is not constant. The residuals plotted in the normal probability plot also imply that the normality assumption is not valid. A data transformation is recommended. (c) Based on your answer to part (b) conduct another analysis of the failure time data and draw appropriate conclusions. A natural log transformation was applied to the failure time data. The analysis identifies that there exists at least one difference in treatment means. Design Expert Output Response: Failure Time in Minutes Transform: Natural log Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. Residual Lack of Fit. Pure Error Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs <. 3-9

51 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY vs <. vs vs vs <. 4 vs <. There is nothing unusual about the residual plots when the natural log transformation is applied. Normal plot of residuals.6479 Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted Residuals vs. Material Residuals Material 3-7 A semiconductor manufacturer has developed three different methods for reducing particle counts on wafers. All three methods are tested on five wafers and the after-treatment particle counts obtained. The data are shown below. Method Count

52 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Do all methods have the same effect on mean particle count? No, at least one method has a different effect on mean particle count. Design Expert Output Response: Count ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual Lack of Fit. Pure Error Cor Total The Model F-value of 7.9 implies the model is significant. There is only a.64% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs (b) Plot the residuals versus the predicted response. Construct a normal probability plot of the residuals. Are there potential concerns about the validity of the assumptions? The plot of residuals versus predicted appears to be funnel shaped. This indicates the variance of the original observations is not constant. The residuals plotted in the normal probability plot do not fall along a straight line, which suggests that the normality assumption is not valid. A data transformation is recommended. 47 Residuals vs. Predicted Normal plot of residuals 99 Residuals Normal % probability Predicted Residual 3-3

53 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (c) Based on your answer to part (b) conduct another analysis of the particle count data and draw appropriate conclusions. For count data, a square root transformation is often very effective in resolving problems with inequality of variance. The analysis of variance for the transformed response is shown below. The difference between methods is much more apparent after applying the square root transformation. Design Expert Output Response: Count Transform: Square root Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual Lack of Fit. Pure Error Cor Total.86 4 The Model F-value of 9.84 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs Consider testing the equality of the means of two normal populations, where the variances are unknown but are assumed to be equal. The appropriate test procedure is the pooled t test. Show that the pooled t test is equivalent to the single factor analysis of variance. S p y. y. t ~ tn assuming n = n = n S p n n n n y j y. y j y. yij y. j j n i j n MS E for a= n yi. y.. Furthermore, y. y., which is exactly the same as SS Treatments in a one-way n n i classification with a=. Thus we have shown that SSTreatments t. In general, we know that tu F, u MS so that t ~ F, n. Thus the square of the test statistic from the pooled t-test is the same test statistic that results from a single-factor analysis of variance with a=. E 3-3

54 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Show that the variance of the linear combination a i c i y i. is a i n i c i. i i n j ij. a i i a i n j ij i a i i. i a i i. i y V c y V c y c V y c V, y ij V a i c i n i 3- In a fixed effects experiment, suppose that there are n observations for each of four treatments. Let 3 Q,,Q Q be single-degree-of-freedom components for the orthogonal contrasts. Prove that 3 Q Q Q SS Treatments Q SS y y C Q SS y y y C Q SS y y y y C C.. C... C.... n ) y y ( Q n ) y y y ( Q n ) y y y y ( Q n y y y Q Q Q i j i j. i. i and since 4 i i... j i j.. i y y y y, we have Treatments.. i i. i.. i. SS n y n y n y y Q Q Q for a=4. 3- Use Bartlett's test to determine if the assumption of equal variances is satisfied in Problem 3-4. Use =.5. Did you reach the same conclusion regarding the equality of variance by examining the residual plots? c q.36, where

55 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY S S S q N alog S n c 3a a a i p i log n N a i a i ni S i S p N a S p S p a c i 3 c i S q q N alog S n 5 3log log. log 4. 8 log. 8 i q p a i log q , c. 389 Cannot reject null hypothesis; conclude that the variance are equal. This agrees with the residual plots in Problem 3-6. S i i 3- Use the modified Levene test to determine if the assumption of equal variances is satisfied on Problem 3-4. Use =.5. Did you reach the same conclusion regarding the equality of variances by examining the residual plots? The absolute value of Battery Life brand median is: y ij yi Brand Brand Brand The analysis of variance indicates that there is not a difference between the different brands and therefore the assumption of equal variances is satisfired. Design Expert Output Response: Mod Levine ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] 3-34

56 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Sum of Mean F Source Squares DF Square Value Prob > F Model A Pure Error Cor Total Refer to Problem 3-. If we wish to detect a maximum difference in mean response times of milliseconds with a probability of at least.9, what sample size should be used? How would you obtain a preliminary estimate of? nd a, use MS E from Problem 3- to estimate n n. Letting. 5, P(accept) =., a Trial and Error yields: n P(accept) Choose n 6, therefore N 8 Notice that we have used an estimate of the variance obtained from the present experiment. This indicates that we probably didn t use a large enough sample (n was 5 in problem 3-) to satisfy the criteria specified in this problem. However, the sample size was adequate to detect differences in one of the circuit types. When we have no prior estimate of variability, sometimes we will generate sample sizes for a range of possible variances to see what effect this has on the size of the experiment. Often a knowledgeable expert will be able to bound the variability in the response, by statements such as the standard deviation is going to be at least or the standard deviation shouldn t be larger than. 3-4 Refer to Problem 3-4. (a) If we wish to detect a maximum difference in mean battery life of.5 percent with a probability of at least.9, what sample size should be used? Discuss how you would obtain a preliminary estimate of for answering this question. Use the MS E from Problem 3-4. nd n n a Letting 5., P(accept) =., a 3-35

57 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Trial and Error yields: n P(accept) Choose n 45, therefore N 35 See the discussion from the previous problem about the estimate of variance. (b) If the difference between brands is great enough so that the standard deviation of an observation is increased by 5 percent, what sample size should be used if we wish to detect this with a probability of at least.9? a N a P( accept ).. P n. 5. n n 565 Trial and Error yields: n P(accept) Choose n 5, therefore N Consider the experiment in Problem 3-6. If we wish to construct a 95 percent confidence interval on the difference in two mean battery lives that has an accuracy of weeks, how many batteries of each brand must be tested?.5 MS E 5. 6 width t. 5,N a MS n Trial and Error yields: n t width Choose n 3, therefore N 93 E 3-6 Suppose that four normal populations have means of =5, =6, 3 =5, and 4 =6. How many observations should be taken from each population so that the probability or rejecting the null hypothesis 3-36

58 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY of equal population means is at least.9? Assume that =.5 and that a reasonable estimate of the error variance is =5.,i,, 3, 4 i 4 5, 4 i 4 i i i i 4 5, , 4 5 n i a n n n 4 5 n, 5 3, 4., From the O.C. curves we can construct the following: n Therefore, select n=5 3-7 Refer to Problem 3-6. (a) How would your answer change if a reasonable estimate of the experimental error variance were = 36? n, 5 3, 4. n i n. 6944n a n, From the O.C. curves we can construct the following: n Therefore, select n=7 (b) How would your answer change if a reasonable estimate of the experimental error variance were = 49? n, 5 3, 4. n i n. 5n a n, From the O.C. curves we can construct the following: 3-37

59 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY n Therefore, select n=9 (c) Can you draw any conclusions about the sensitivity of your answer in the particular situation about how your estimate of affects the decision about sample size? As our estimate of variability increases the sample size must increase to ensure the same power of the test. (d) Can you make any recommendations about how we should use this general approach to choosing n in practice? When we have no prior estimate of variability, sometimes we will generate sample sizes for a range of possible variances to see what effect this has on the size of the experiment. Often a knowledgeable expert will be able to bound the variability in the response, by statements such as the standard deviation is going to be at least or the standard deviation shouldn t be larger than. 3-8 Refer to the aluminum smelting experiment described in Section 4-. Verify that ratio control methods do not affect average cell voltage. Construct a normal probability plot of residuals. Plot the residuals versus the predicted values. Is there an indication that any underlying assumptions are violated? Design Expert Output Response: Cell Average ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.746E E not significant A.746E E Residual E-3 Lack of Fit. Pure Error E-3 Cor Total.9 3 The "Model F-value" of. implies the model is not significant relative to the noise. There is a 89. % chance that a "Model F-value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs E vs The following residual plots are satisfactory. 3-38

60 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted.5 99 Normal % probability Residuals Residual Predicted Residuals vs. Algorithm.5.55 Residuals Algorithm 3-9 Refer to the aluminum smelting experiment in Section 3-8. Verify the analysis of variance for pot noise summarized in Table 3-3. Examine the usual residual plots and comment on the experimental validity. Design Expert Output Response: Cell StDev Transform: Natural log Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. Residual Lack of Fit. Pure Error Cor Total The Model F-value of.96 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. 3-39

61 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs <. vs vs <. vs vs <. The following residual plots identify the residuals to be normally distributed, randomly distributed through the range of prediction, and uniformly distributed across the different algorithms. This validates the assumptions for the experiment. Normal plot of residuals.5896 Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted Residuals vs. Algorithm Residuals Algorithm 3-4

62 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3-3 Four different feed rates were investigated in an experiment on a CNC machine producing a component part used in an aircraft auxiliary power unit. The manufacturing engineer in charge of the experiment knows that a critical part dimension of interest may be affected by the feed rate. However, prior experience has indicated that only dispersion effects are likely to be present. That is, changing the feed rate does not affect the average dimension, but it could affect dimensional variability. The engineer makes five production runs at each feed rate and obtains the standard deviation of the critical dimension (in -3 mm). The data are shown below. Assume that all runs were made in random order. Feed Rate Production Run (in/min) (a) Does feed rate have any effect on the standard deviation of this critical dimension? Because the residual plots were not acceptable for the non-transformed data, a square root transformation was applied to the standard deviations of the critical dimension. Based on the computer output below, the feed rate has an effect on the standard deviation of the critical dimension. Design Expert Output Response: Run StDev Transform: Square root Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual E-3 Lack of Fit. Pure Error E-3 Cor Total.7 9 The Model F-value of 7.5 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs 4.37E vs vs vs vs vs (b) Use the residuals from this experiment of investigate model adequacy. Are there any problems with experimental validity? The residual plots are satisfactory. 3-4

63 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted 99 Normal % probability Residuals Residual Predicted Residuals vs. Feed Rate Residuals Feed Rate 3-3 Consider the data shown in Problem 3-. (a) Write out the least squares normal equations for this problem, and solve them for and i, using the usual constraint 3 3 i ˆ i. Estimate. 5 ˆ 5ˆ 5ˆ 5ˆ 3 =7 5 ˆ 5ˆ =54 5 ˆ 5ˆ = 5 ˆ 5ˆ 3 =4 Imposing ˆ, therefore ˆ 3. 8, ˆ 3, ˆ 8 4, ˆ 5 4 i i

64 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ˆ ˆ (b) Solve the equations in (a) using the constraint ˆ 3. Are the estimators ˆ i and ˆ the same as you found in (a)? Why? Now estimate and compare your answer with that for (a). What statement can you make about estimating contrasts in the i? Imposing the constraint, ˆ 3 we get the following solution to the normal equations: ˆ 8. 4, ˆ. 4, ˆ 3. 8, and ˆ 3. These estimators are not the same as in part (a). However, ˆ ˆ , is the same as in part (a). The contrasts are estimable.. (c) Estimate, 3 and using the two solutions to the normal equations. Compare the results obtained in each case. Contrast Estimated from Part (a) Estimated from Part (b) Contrasts and are estimable, 3 is not estimable. 3-3 Apply the general regression significance test to the experiment in Example 3-. Show that the procedure yields the same results as the usual analysis of variance. From Table 3-3: y from Example 3-, we have: ˆ 5. 4 ˆ i 5 j R, SS E y ij ˆ 5. 4 ˆ ˆ. 36 ˆ , with 5 degrees of freedom. ˆ y 5 i.. 5 i ˆy i ) ,. 8 with 5 degrees of freedom. 5 j y ij R, with 5-5 degrees of freedom. This is identical to the SS E found in Example

65 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The reduced model: ˆy R.. R, with degree of freedom. R, R Note: R SSTreatment Finally, F from Example 3-. R t 4 SSE which is the same as computed in Example 3-., with 5-=4 degrees of freedom Use the Kruskal-Wallis test for the experiment in Problem 3-. Are the results comparable to those found by the usual analysis of variance? From Design Expert Output of Problem 3- Response: Life in in h ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A Residual Lack of Fit. Pure Error Cor Total H NN a i R n i. i 3N 4. 5, Accept the null hypothesis; the treatments are not different. This agrees with the analysis of variance Use the Kruskal-Wallis test for the experiment in Problem 3-. Compare conclusions obtained with those from the usual analysis of variance? From Design Expert Output of Problem 3- Response: Noise ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. Residual Lack of Fit. 3-44

66 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Pure Error Cor Total H NN a i R i. 3N ni. 5, Reject the null hypothesis because the treatments are different. This agrees with the analysis of variance Consider the experiment in Example 3-. Suppose that the largest observation on tensile strength is incorrectly recorded as 5. What effect does this have on the usual analysis of variance? What effect does is have on the Kruskal-Wallis test? The incorrect observation reduces the analysis of variance F from 4.76 to It does not change the value of the Kruskal-Wallis test. 3-45

67 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions 4- A chemist wishes to test the effect of four chemical agents on the strength of a particular type of cloth. Because there might be variability from one bolt to another, the chemist decides to use a randomized block design, with the bolts of cloth considered as blocks. She selects five bolts and applies all four chemicals in random order to each bolt. The resulting tensile strengths follow. Analyze the data from this experiment (use =.5) and draw appropriate conclusions. Bolt Chemical Design Expert Output Response: Strength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model not significant A Residual.8.8 Cor Total The "Model F-value" of.38 implies the model is not significant relative to the noise. There is a. % chance that a "Model F-value" this large could occur due to noise. Std. Dev..35 R-Squared.377 Mean 7.75 Adj R-Squared.58 C.V..88 Pred R-Squared PRESS 6.56 Adeq Precision.558 Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs There is no difference among the chemical types at =.5 level. 4- Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five-gallon milk containers. The analysis is done in a laboratory, and only three trials 4-

68 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY can be run on any day. Because days could represent a potential source of variability, the experimenter decides to use a randomized block design. Observations are taken for four days, and the data are shown here. Analyze the data from this experiment (use =.5) and draw conclusions. Days Solution Design Expert Output Response: Growth ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A Residual Cor Total 86.5 The Model F-value of 4.7 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..94 R-Squared.934 Mean 8.75 Adj R-Squared.985 C.V Pred R-Squared.755 PRESS 7.33 Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs There is a difference between the means of the three solutions. The Fisher LSD procedure indicates that solution 3 is significantly different than the other two. 4-3 Plot the mean tensile strengths observed for each chemical type in Problem 4- and compare them to a scaled t distribution. What conclusions would you draw from the display? 4-

69 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Scaled t Distribution () () (3,4) Mean Strength S y i. MS b E There is no obvious difference between the means. This is the same conclusion given by the analysis of variance. 4-4 Plot the average bacteria counts for each solution in Problem 4- and compare them to an appropriately scaled t distribution. What conclusions can you draw? Scaled t Distribution (3) () () Bacteria Growth S y i. MS b E There is no difference in mean bacteria growth between solutions and. However, solution 3 produces significantly lower mean bacteria growth. This is the same conclusion reached from the Fisher LSD procedure in Problem

70 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4-5 An article in the Fire Safety Journal ( The Effect of Nozzle Design on the Stability and Performance of Turbulent Water Jets, Vol. 4, August 98) describes an experiment in which a shape factor was determined for several different nozzle designs at six levels of efflux velocity. Interest focused on potential differences between nozzle designs, with velocity considered as a nuisance variable. The data are shown below: Jet Efflux Velocity (m/s) Nozzle Design (a) Does nozzle design affect the shape factor? Compare nozzles with a scatter plot and with an analysis of variance, using =.5. Design Expert Output Response: Shape ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A Residual E-3 Cor Total. 9 The Model F-value of 8.9 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..54 R-Squared.647 Mean.86 Adj R-Squared.5688 C.V. 6.3 Pred R-Squared.96 PRESS.3 Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs <. vs vs vs vs vs vs vs Nozzle design has a significant effect on shape factor. 4-4

71 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY.4 One Factor Plot.436 Shape Nozzle Design (b) Analyze the residual from this experiment. The plots shown below do not give any indication of serious problems. Thre is some indication of a mild outlier on the normal probability plot and on the plot of residualks versus the predicted velocity. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted 4-5

72 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY.333 Residuals vs. Nozzle Design Residuals Nozzle Design (c) Which nozzle designs are different with respect to shape factor? Draw a graph of average shape factor for each nozzle type and compare this to a scaled t distribution. Compare the conclusions that you draw from this plot to those from Duncan s multiple range test. S y i. MS b E R = r.5 (,) S yi. = (.95)(.85)=.6446 R 3 = r.5 (3,) S yi. = (3.)(.85)=.6774 R 4 = r.5 (4,) S yi. = (3.8)(.85)=.6949 R 5 = r.5 (5,) S yi. = (3.5)(.85)=.7 Mean Difference R vs >.7 different vs 3. >.6949 different vs.767 >.6774 different vs < vs 4.3 >.6949 different 5 vs >.6774 different 5 vs.4 <.6446 vs 4.9 >.6774 different vs < vs <

73 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Scaled t Distribution () (5) () (3) (4 ) Shape Factor 4-6 Consider the ratio control algorithm experiment described in Chapter 3, Section 3-8. The experiment was actually conducted as a randomized block design, where six time periods were selected as the blocks, and all four ratio control algorithms were tested in each time period. The average cell voltage and the standard deviation of voltage (shown in parentheses) for each cell as follows: Ratio Control Time Period Algorithms (.5) 4.86 (.4) 4.75 (.5) 4.95 (.6) 4.79 (.3) 4.88 (.5) 4.85 (.4) 4.9 (.) 4.79 (.3) 4.85 (.5) 4.75 (.3) 4.85 (.) (.9) 4.88 (.3) 4.9 (.) 4.75 (.5) 4.8 (.8) 4.9 (.) (.3) 4.77 (.4) 4.94 (.5) 4.86 (.5) 4.79 (.3) 4.76 (.) (a) Analyze the average cell voltage data. (Use =.5.) Does the choice of ratio control algorithm affect the cell voltage? Design Expert Output Response: Average ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block E-3 Model.746E E not significant A.746E E Residual E-3 Cor Total.9 3 The "Model F-value" of.9 implies the model is not significant relative to the noise. There is a 9.4 % chance that a "Model F-value" this large could occur due to noise. Std. Dev..69 R-Squared.366 Mean 4.84 Adj R-Squared -.56 C.V..43 Pred R-Squared PRESS.8 Adeq Precision.688 Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error

74 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs E vs The ratio control algorithm does not affect the mean cell voltage. (b) Perform an appropriate analysis of the standard deviation of voltage. (Recall that this is called pot noise. ) Does the choice of ratio control algorithm affect the pot noise? Design Expert Output Response: StDev Transform: Natural log Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A <. Residual Cor Total The Model F-value of 33.6 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..5 R-Squared.8693 Mean -3.4 Adj R-Squared.843 C.V Pred R-Squared.6654 PRESS.37 Adeq Precision.446 Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs <. vs vs <. vs vs <. A natural log transformatio was applied to the pot noise data. The ratio control algorithm does affect the pot noise. (c) Conduct any residual analyses that seem appropriate. 4-8

75 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. Algorithm Residuals Algorithm The normal probability plot shows slight deviations from normality; however, still acceptable. (d) Which ratio control algorithm would you select if your objective is to reduce both the average cell voltage and the pot noise? Since the ratio control algorithm has little effect on average cell voltage, select the algorithm that minimizes pot noise, that is algorithm #. 4-7 An aluminum master alloy manufacturer produces grain refiners in ingot form. This company produces the product in four furnaces. Each furnace is known to have its own unique operating characteristics, so any experiment run in the foundry that involves more than one furnace will consider furnace a nuisance variable. The process engineers suspect that stirring rate impacts the grain size of the product. Each furnace can be run at four different stirring rates. A randomized block design is run for a particular refiner and the resulting grain size data is shown below. Furnace 4-9

76 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Stirring Rate (a) Is there any evidence that stirring rate impacts grain size? Design Expert Output Response: Grain Size ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model not significant A Residual Cor Total The "Model F-value" of.85 implies the model is not significant relative to the noise. There is a % chance that a "Model F-value" this large could occur due to noise. Std. Dev..95 R-Squared.3 Mean 7.69 Adj R-Squared -.38 C.V Pred R-Squared -.46 PRESS 46.7 Adeq Precision 5.39 Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs The analysis of variance shown above indicates that there is no difference in mean grain size due to the different stirring rates. (b) Graph the residuals from this experiment on a normal probability plot. Interpret this plot. 4-

77 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals 99 Normal % probability Residual The plot indicates that normality assumption is valid. (c) Plot the residuals versus furnace and stirring rate. Does this plot convey any useful information? Residuals vs. Stirring Rate Residuals Stirring Rate The variance is consistent at different stirring rates. Not only does this validate the assumption of uniform variance, it also identifies that the different stirring rates do not affect variance. (d) What should the process engineers recommend concerning the choice of stirring rate and furnace for this particular grain refiner if small grain size is desirable? There really isn t any effect due to the stirring rate. 4-8 Analyze the data in Problem 4- using the general regression significance test. : : : ˆ 4ˆ ˆ 4ˆ ˆ 4ˆ ˆ 4ˆ 3ˆ ˆ ˆ 3ˆ ˆ ˆ 3ˆ ˆ ˆ 3ˆ =5 ˆ =9 ˆ = 4-

78 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3 : : : : 3 : 4 ˆ ˆ ˆ ˆ 3 3 ˆ ˆ ˆ 3 3 ˆ ˆ ˆ 4ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ =3 3ˆ =34 3ˆ =5 3ˆ =36 3ˆ = ˆ ˆ ˆ ˆ Applying the constraints ˆ ˆ, we obtain: i j 5 ˆ, ˆ 5, ˆ 78, ˆ 3, ˆ, ˆ, ˆ 3, ˆ R,, ij 68 y, SS y R,, Model Restricted to i : E ij : : : : 3 : 4 ˆ 3ˆ 3ˆ 3ˆ 3 3ˆ 4 =5 3 ˆ 3ˆ =34 3 ˆ 3ˆ =5 3 ˆ 3ˆ 3 =36 ˆ 3ˆ =5 3 4 Applying the constraint ˆ j, we obtain: 5 ˆ, ˆ /, ˆ, ˆ 3, ˆ Now: R, R , R,, R, SSTreatments Model Restricted to : j : : : : 3 ˆ 4ˆ 4ˆ 4ˆ 3 =5 4 ˆ 4ˆ =9 4 ˆ 4ˆ = ˆ 4ˆ =3 4 3 Applying the constraint ˆ i, we obtain: 5 ˆ, ˆ 5, ˆ 78, 9 ˆ 3 4-

79 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY R R , , R,, R, SSBlocks 4-9 Assuming that chemical types and bolts are fixed, estimate the model parameters i and j in Problem 4-. Using Equations 4-4, Applying the constraints, we obtain: ˆ, ˆ, ˆ, ˆ 3, ˆ 4, 35 ˆ, 65 ˆ, 75 ˆ 3, ˆ 4, ˆ Draw an operating characteristic curve for the design in Problem 4-. Does this test seem to be sensitive to small differences in treatment effects? Assuming that solution type is a fixed factor, we use the OC curve in appendix V. Calculate i b a 4 i using MS E to estimate. We have: a a b 3 6. If ˆ i MSE, then: 4. 5 and. 7 3 If ˆ i MS E, then: and. 55, etc. 3 This test is not very sensitive to small differences. 4- Suppose that the observation for chemical type and bolt 3 is missing in Problem 4-. Analyze the problem by estimating the missing value. Perform the exact analysis and compare the results. y 3 is missing. ŷ a b ' ' ' ay. by. 3 y Thus, y. =357.5, y.3 =3.5, and y.. =435.5 Source SS DF MS F Chemicals

80 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Bolts Error Total F.,3, =.66, Chemicals are not significant. 4- Two missing values in a randomized block. Suppose that in Problem 4- the observations for chemical type and bolt 3 and chemical type 4 and bolt 4 are missing. (a) Analyze the design by iteratively estimating the missing values as described in Section ŷ 3 ' ' 4y. 5y. 3 y.. and ' ŷ 44 ' ' ' 4y4. 4 y.. 5y. Data is coded y-7. As an initial guess, set y 3 equal to the average of the observations available for chemical. Thus, y3. 5. Then, ŷ ŷ ŷ ŷ ŷ ŷ 5 44 ŷ Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model not significant A Residual Cor Total (b) Differentiate SS E with respect to the two missing values, equate the results to zero, and solve for estimates of the missing values. Analyze the design using these two estimates of the missing values. SS SSE SSE From, we obtain: y y 3 44 E yij y 5 i. y 4. j SS E. 6y3. 6y y3 3. 7y44. y3 y44 y.. R 4-4

81 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY. ŷ. ŷ 3 3. ŷ. ŷ ŷ 5 45, ŷ These quantities are almost identical to those found in part (a). The analysis of variance using these new data does not differ substantially from part (a). (c) Derive general formulas for estimating two missing values when the observations are in different blocks. SS E y y y y y y y y y y y y i. iu k. kv.u iu.v kv.. iu iu ykv b a ab kv SSE SSE From, we obtain: y y 3 44 whose simultaneous solution is: ŷ ŷ iu kv y' i. ay' a i. ŷ ŷ iu kv a b ab ay' i. by'. j ab ( a )( b ) ay' k. by' ab ab y' a b ab y'.u b a b a b a b by'.u y'.. b a ay' k. by'.v y'.. a b.v.. ŷkv ab y'.. ŷiu ab ab y'.. aba b ay' k. by'.v y' a b ab.. (d) Derive general formulas for estimating two missing values when the observations are in the same block. Suppose that two observations y ij and y kj are missing, ik (same block j). SS E y SSE SSE From, we obtain y y 3 44 whose simultaneous solution is: y y y y y y y y y y i. ij k. kj. j ij kj.. ij ij ykj ŷ ŷ ij kj b ayi. by. j y.. ŷ a b a b kj ayk. by. j y.. ŷ ij a a b a b ab kj 4-5

82 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ŷ ij b ay by y a b ay by y ayi. by. j y.. k.. j.. a b ŷ kj ayk. by. j y.. a b b a ay by y a b a b i. 4. j i.... j An industrial engineer is conducting an experiment on eye focus time. He is interested in the effect of the distance of the object from the eye on the focus time. Four different distances are of interest. He has five subjects available for the experiment. Because there may be differences among individuals, he decides to conduct the experiment in a randomized block design. The data obtained follow. Analyze the data from this experiment (use =.5) and draw appropriate conclusions. Subject Distance (ft) Design Expert Output Response: Focus Time ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A Residual Cor Total The Model F-value of 8.6 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..3 R-Squared.689 Mean 4.85 Adj R-Squared.636 C.V. 3.8 Pred R-Squared.9 PRESS 4.5 Adeq Precision.43 Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error Mean Standard t for H Treatment Difference DF Error Coeff= Prob > t vs vs vs vs vs vs Distance has a statistically significant effect on mean focus time. 4-6

83 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4-4 The effect of five different ingredients (A, B, C, D, E) on reaction time of a chemical process is being studied. Each batch of new material is only large enough to permit five runs to be made. Furthermore, each runs requires approximately / hours, so only five runs can be made in one day. The experimenter decides to run the experiment as a Latin square so that day and batch effects can be systematically controlled. She obtains the data that follow. Analyze the data from this experiment (use =.5) and draw conclusions. Day Batch A=8 B=7 D= C=7 E=3 C= E= A=7 D=3 B=8 3 B=4 A=9 C= E= D=5 4 D=6 C=8 E=6 B=6 A= 5 E=4 D= B=3 A=8 C=8 Minitab Output General Linear Model Factor Type Levels Values Batch random Day random Catalyst fixed 5 A B C D E Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Catalyst Batch Day Error Total An industrial engineer is investigating the effect of four assembly methods (A, B, C, D) on the assembly time for a color television component. Four operators are selected for the study. Furthermore, the engineer knows that each assembly method produces such fatigue that the time required for the last assembly may be greater than the time required for the first, regardless of the method. That is, a trend develops in the required assembly time. To account for this source of variability, the engineer uses the Latin square design shown below. Analyze the data from this experiment ( =.5) draw appropriate conclusions. Order of Operator Assembly 3 4 C= D=4 A=7 B=8 B=7 C=8 D= A=8 3 A=5 B= C= D=9 4 D= A= B= C=4 Minitab Output General Linear Model Factor Type Levels Values Order random Operator random Method fixed 4 A B C D Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P 4-7

84 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Method Order Operator Error Total Suppose that in Problem 4-4 the observation from batch 3 on day 4 is missing. Estimate the missing value from Equation 4-4, and perform the analysis using this value. y 354 is missing. ŷ p y y y p p y i... j...k Minitab Output General Linear Model Factor Type Levels Values Batch random Day random Catalyst fixed 5 A B C D E Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Catalyst Batch Day Error Total Consider a p x p Latin square with rows ( i ), columns ( k ), and treatments ( j ) fixed. Obtain least squares estimates of the model parameters i, k, j. i p : p ˆ p ˆ p ˆ p ˆ y i p i j i j k p p j j k k p : pˆ pˆ p ˆ p ˆ y, i,,..., p j p : pˆ p ˆ pˆ p ˆ y, j,,..., p k i j i k p i i p j j p : pˆ p ˆ p ˆ p ˆ y k k i... j...k k..., k,,..., p There are 3p+ equations in 3p+ unknowns. The rank of the system is 3p-. Three side conditions are p necessary. The usual conditions imposed are: ˆ ˆ ˆ. The solution is then: i i p j j p k k y... ˆ y... p ˆ y y, i,,..., p i i

85 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ˆ y y, j,,..., p j. j.... ˆ y y, k,,..., p k i Derive the missing value formula (Equation 4-4) for the Latin square design.... SS E y y i... j. y..k y yijk p p p p Let y ijk be missing. Then SS E y ijk y y y y y y y y i.. p ijk SS E where R is all terms without y ijk.. From, we obtain: y. j. p ijk ijk..k p ijk... p ijk R y ijk p p py' i.. y'. j. y'..k p p y'..., or y ijk p y' i.. y'. j. y'..k p p y' Designs involving several Latin squares. [See Cochran and Cox (957), John (97).] The p x p Latin square contains only p observations for each treatment. To obtain more replications the experimenter may use several squares, say n. It is immaterial whether the squares used are the same are different. The appropriate model is y ijkh h i( h ) j k( h ) ( ) jh ijkh i,,..., p j,,..., p k,,..., p h,,...,n where y ijkh is the observation on treatment j in row i and column k of the hth square. Note that ih ( ) and k( h) are row and column effects in the hth square, and h is the effect of the hth square, and ( ) jh is the interaction between treatments and squares. (a) Set up the normal equations for this model, and solve for estimates of the model parameters. Assume that appropriate side conditions on the parameters are ˆ, ˆ, and ˆ j for each h, j j h h ˆ, ˆ for each h, and ˆ jh h i i h for each j. jh k k h 4-9

86 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ˆ y ˆ h y ˆ y j ˆ ˆ i( h ) k( h ) ^.... j.. jh...h y y y y i..h..kh y y. j.h...h y...h y. j.. y...h y... (b) Write down the analysis of variance table for this design. Source SS DF y.... j.. Treatments y np np y h Squares y... p np Treatment x Squares p-... n- y. j.h y... p np SS Treatments SS Squares (p-)(n-) yi..h Rows y... h n(p-) p np Columns y.. kh y... h p np n(p-) Error subtraction n(p-)(p-) y... Total y ijkh np np - 4- Discuss how the operating characteristics curves in the Appendix may be used with the Latin square design. For the fixed effects model use: p j j p, p p p For the random effects model use: p, p p p 4- Suppose that in Problem 4-4 the data taken on day 5 were incorrectly analyzed and had to be discarded. Develop an appropriate analysis for the remaining data. Two methods of analysis exist: () Use the general regression significance test, or () recognize that the design is a Youden square. The data can be analyzed as a balanced incomplete block design with a=b=5, r=k=4 and =3. Using either approach will yield the same analysis of variance. 4-

87 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Minitab Output General Linear Model Factor Type Levels Values Catalyst fixed 5 A B C D E Batch random Day random Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Catalyst Batch Day Error Total The yield of a chemical process was measured using five batches of raw material, five acid concentrations, five standing times, (A, B, C, D, E) and five catalyst concentrations (,,,, ). The Graeco-Latin square that follows was used. Analyze the data from this experiment (use =.5) and draw conclusions. Acid Concentration Batch A=6 B=6 C=9 D=6 E=3 B=8 C= D=8 E= A= 3 C= D= E=6 A=5 B=3 4 D=5 E=5 A= B=4 C=7 5 E= A=4 B=7 C=7 D=4 Factor Type Levels Values Time fixed 5 A B C D E Catalyst random 5 a b c d e Batch random Acid random General Linear Model Analysis of Variance for Yield, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Time Catalyst Batch Acid Error Total Suppose that in Problem 4-5 the engineer suspects that the workplaces used by the four operators may represent an additional source of variation. A fourth factor, workplace (,,, ) may be introduced and another experiment conducted, yielding the Graeco-Latin square that follows. Analyze the data from this experiment (use =.5) and draw conclusions. Order of Operator Assembly 3 4 C= B= D=4 A=8 B=8 C= A= D= 3 A=9 D= B=7 C=5 4-

88 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4 D=9 A=8 C=8 B=6 Minitab Output General Linear Model Factor Type Levels Values Method fixed 4 A B C D Order random Operator random Workplac random 4 a b c d Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Method Order Operator Workplac Error Total 5 5. However, there are only three degrees of freedom for error, so the test is not very sensitive. 4-4 Construct a 5 x 5 hypersquare for studying the effects of five factors. Exhibit the analysis of variance table for this design. Three 5 x 5 orthogonal Latin Squares are: ABCDE BCDEA CDEAB DEABC EABCD Let rows = factor, columns = factor, Latin letters = factor 3, Greek letters = factor 4 and numbers = factor 5. The analysis of variance table is: Source DF Rows 4 Columns 4 Latin Letters 4 Greek Letters 4 Numbers 4 Error 4 Total Consider the data in Problems 4-5 and 4-3. Suppressing the Greek letters in 4-3, analyze the data using the method developed in Problem 4-9. Square - Operator Batch 3 4 Row Total C= D=4 A=7 B=8 (39) B=7 C=8 D= A=8 (44) 3 A=5 B= C= D=9 (35) 4 D= A= B= C=4 (46) (3) (5) (4) (36) 64=y 4-

89 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Square - Operator Batch 3 4 Row Total C= B= D=4 A=8 (43) B=8 C= A= D= (4) 3 A=9 D= B=7 C=5 (4) 4 D=9 A=8 C=8 B=6 (4) (37) (4) (49) (4) 68=y Assembly Methods Totals A y... =65 B y... =68 C y.3.. =9 D y.4.. =9 Source SS DF MS F Assembly Methods Squares * A x S Assembly Order (Rows) Operators (columns) Error Total Significant at %. 4-6 Consider the randomized block design with one missing value in Table 4-7. Analyze this data by using the exact analysis of the missing value problem discussed in Section Compare your results to the approximate analysis of these data given in Table 4-8. : =7 3 4 =3 4 = 3 4 =- 4 =5 : 4 4 : : : : 4 : 4 3 : 3 4 : = =-3 =6 Applying the constraints ˆ i ˆ, we obtain: j 3 = ˆ, ˆ, ˆ, ˆ 3, ˆ 4, ˆ, ˆ, 36 4 ˆ 3, 36 ˆ

90 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY R With 7 degrees of freedom. 4 4 ˆ.. j y. j. i j,, ˆy ˆ i yi y ij, SS y R,, E which is identical to SS E obtained in the approximate analysis. In general, the SS E in the exact and approximate analyses will be the same. To test H o : the reduced model is yij j ij. The normal equations used are: i ij : 5 ˆ 4 ˆ 4ˆ 3ˆ 3 4ˆ =7 4 : 4 4 ˆ =-4 : 4 4ˆ =-3 3 : 3 3ˆ =6 3 4 : 4 4ˆ =8 4 Applying the constraint ˆ j, we obtain: 9 ˆ, 6 35 ˆ, 6 3 ˆ, 6 3 ˆ 3, ˆ 4. Now R, ˆy ˆ.. j y. j. 6 j 99 5 with 4 degrees of freedom. R, R,, R, SSTreatments with 7-4=3 degrees of freedom., R is used to test H o : i. The sum of squares for blocks is found from the reduced model used are: Model Restricted to : j yij. The normal equations i ij : =7 : 4 ˆ 4 =3 : 3 ˆ 3 = 3 : 4 ˆ 4 3 =- 4 : 4 ˆ 4 4 =5 Applying the constraint ˆ, we obtain: i ˆ, ˆ, ˆ, ˆ 3, ˆ 4 4-4

91 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY with 4 degrees of freedom. with 7-4=3 degrees of freedom. R R, ˆy 4.. ˆ i yi.. i 59 83, R,, R, SS Blocks Source DF SS(exact) SS(approximate) Tips Blocks Error Total Note that for the exact analysis, SS T SS SS SS. Tips Blocks E 4-7 An engineer is studying the mileage performance characteristics of five types of gasoline additives. In the road test he wishes to use cars as blocks; however, because of a time constraint, he must use an incomplete block design. He runs the balanced design with the five blocks that follow. Analyze the data from this experiment (use =.5) and draw conclusions. Car Additive There are several computer software packages that can analyze the incomplete block designs discussed in this chapter. The Minitab General Linear Model procedure is a widely available package with this capability. The output from this routine for Problem 4-7 follows. The adjusted sums of squares are the appropriate sums of squares to use for testing the difference between the means of the gasoline additives. Minitab Output General Linear Model Factor Type Levels Values Additive fixed Car random Analysis of Variance for Mileage, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Additive Car Error Total Construct a set of orthogonal contrasts for the data in Problem 4-7. Compute the sum of squares for each contrast. 4-5

92 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY One possible set of orthogonal contrasts is: H () : 4 5 : : 4 5 : The sums of squares and F-tests are: H () H (3) H (4) Brand -> Q i 33/4 /4-3/4-4/4-7/4 cq i i SS F () / () - -/ (3) - -3/ (4) /4.9.5 Contrasts () and () are significant at the % and 5% levels, respectively. 4-9 Seven different hardwood concentrations are being studied to determine their effect on the strength of the paper produced. However the pilot plant can only produce three runs each day. As days may differ, the analyst uses the balanced incomplete block design that follows. Analyze this experiment (use =.5) and draw conclusions. Hardwood Days Concentration (%) There are several computer software packages that can analyze the incomplete block designs discussed in this chapter. The Minitab General Linear Model procedure is a widely available package with this capability. The output from this routine for Problem 4-9 follows. The adjusted sums of squares are the appropriate sums of squares to use for testing the difference between the means of the hardwood concentrations. Minitab Output General Linear Model Factor Type Levels Values Concentr fixed Days random Analysis of Variance for Strength, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Concentr Days Error Total

93 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4-3 Analyze the data in Example 4-6 using the general regression significance test. : = =8 3 4 =4 3 =6 4 = : 3 3 : : : : 3 : 3 3 : 3 4 : = = =4 = Applying the constraints i j, we obtain: 87 /, 9/ 8, 7/ 8, 3 4/ 8, 4 / 8, 7/ 8, 3/ 8, 3 4 / 8, 4 / 8 with 7 degrees of freedom. y ij,. E ij 6356 SS y R(,, ) To test H o : i the reduced model is y ij j ij. The normal equations used are: : =87 : 3 3 = : 3 3 =7 3 : =4 4 : =8 Applying the constraint ˆ j, we obtain: 87 ˆ, R with 4 degrees of freedom. 7 ˆ, 6, ˆy ˆ, 6 3 ˆ 3, 6 4 ˆ.. j y. j,. j 633 ˆ

94 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY R, R,, R, SSTreatments with 7-4=3 degrees of freedom., R is used to test H o :. i The sum of squares for blocks is found from the reduced model used are: yij. The normal equations i ij Model Restricted to : j : =87 : 3 3 =8 : 3 3 =4 3 : =6 4 : = The sum of squares for blocks is found as in Example 4-6. We may use the method shown above to find an adjusted sum of squares for blocks from the reduced model, y. ij i ij 4-3 Prove that a k Q i i a is the adjusted sum of squares for treatments in a BIBD. We may use the general regression significance test to derive the computational formula for the adjusted treatment sum of squares. We will need the following: and the sum of squares we need is: R R kqi ˆ i, kqi kyi. a n y b i,, ˆy ˆ i yi. The normal equation for is, from equation (4-35), ij. j a b ˆ.. j y. j i j a, ˆ y.. ˆ i yi. ˆ j y. j b i j b j y. j k : kˆ a i n ˆ k ˆ ij i j y. j and from this we have: ky a ˆ. j j y. j ky. j ˆ y. jnijˆ i i 4-8

95 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY therefore, R, ˆy.. ˆ i yi. a b i j y k a y. j nij ˆ i. j kˆy. j i k k j y. k R(, ) a ˆ y a n y i i. ij. j i k a i i i i a kq Q k a Q i SS a Treatments( adjusted ) 4-3 An experimenter wishes to compare four treatments in blocks of two runs. Find a BIBD for this experiment with six blocks. Treatment Block Block Block 3 Block 4 Block 5 Block 6 X X X X X X 3 X X X 4 X X X Note that the design is formed by taking all combinations of the 4 treatments at a time. The parameters of the design are =, a=4, b=6, k=3, and r= 4-33 An experimenter wishes to compare eight treatments in blocks of four runs. Find a BIBD with 4 blocks and = 3. The design has parameters a=8, b=4, = 3, r= and k=4. It may be generated from a 3 factorial design confounded in two blocks of four observations each, with each main effect and interaction successively confounded (7 replications) forming the 4 blocks. The design is discussed by John (97, pg. ) and Cochran and Cox (957, pg. 473). The design follows: Blocks =(I) =a 3=b 4=ab 5=c 6=ac 7=bc 8=abc X X X X X X X X 3 X X X X 4 X X X X 5 X X X X 6 X X X X 7 X X X X 8 X X X X 9 X X X X X X X X X X X X X X X X 3 X X X X 4 X X X X 4-9

96 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4-34 Perform the interblock analysis for the design in Problem 4-7. The interblock analysis for Problem 4-7 uses ˆ. 77 and ˆ. 4. A summary of the interblock, intrablock and combined estimates is: Parameter Intrablock Interblock Combined Perform the interblock analysis for the design in Problem 4-9. The interblock analysis for problem MSBlocks( adj ) MSE b uses ˆ. 7 and 9.. A summary of ar 7 the interblock, intrablock, and combined estimates is give below Parameter Intrablock Interblock Combined Verify that a BIBD with the parameters a = 8, r = 8, k = 4, and b = 6 does not exist. These rk ( ) 83 ( ) 4 conditions imply that, which is not an integer, so a balanced design with these a 7 7 parameters cannot exist Show that the variance of the intra block estimators { i } is a k( a ). kqi Note that ˆ i, and Qi yi. a nij y k b j. j, and kq ky n y k i i. b j ij. j y i. b j n y ij. j y i. y i. contains r observations, and the quantity in the parenthesis is the sum of r(k-) observations, not including treatment i. Therefore, V kq k V Q rk rk i i 4-3

97 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY or r k Q r k k k k V i To find V ˆ i, note that: However, since a rk V k a ˆ V Q i, we have: i k a r k krk k a Furthermore, the i a k V ˆ i a V ˆ ˆ k a ˆ are not independent, this is required to show that i j 4-38 Extended incomplete block designs. Occasionally the block size obeys the relationship a < k < a. An extended incomplete block design consists of a single replicate or each treatment in each block along with an incomplete block design with k* = k-a. In the balanced case, the incomplete block design will have parameters k* = k-a, r* = r-b, and *. Write out the statistical analysis. (Hint: In the extended incomplete block design, we have = r-b+*.) As an example of an extended incomplete block design, suppose we have a=5 treatments, b=5 blocks and k=9. A design could be found by running all five treatments in each block, plus a block from the balanced incomplete block design with k* = k-a=9-5=4 and *=3. The design is: Block Complete Treatment Incomplete Treatment,,3,4,5,3,4,5,,3,4,5,,4,5 3,,3,4,5,3,4,5 4,,3,4,5,,3,4 5,,3,4,5,,3,5 Note that r=9, since the augmenting incomplete block design has r*=4, and r= r* + b = 4+5=9, and = rb+*=8-5+3=6. Since some treatments are repeated in each block it is possible to compute an error sum of squares between repeat observations. The difference between this and the residual sum of squares is due to interaction. The analysis of variance table is shown below: Source SS DF Treatments (adjusted) Qi k a a- y... j y Blocks b- k N Interaction Subtraction (a-)(b-) Error [SS between repeat observations] b(k-a) Total y.. yij N- N 4-3

98 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4-3

99 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 5 Introduction to Factorial Designs Solutions 5- The yield of a chemical process is being studied. The two most important variables are thought to be the pressure and the temperature. Three levels of each factor are selected, and a factorial experiment with two replicates is performed. The yield data follow: Pressure Temperature (a) Analyze the data and draw conclusions. Use =.5. Both pressure (A) and temperature (B) are significant, the interaction is not. Design Expert Output Response:Surface Finish ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B AB Residual Lack of Fit. Pure Error Cor Total.3 7 The Model F-value of 8. implies the model is significant. There is only a.6% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Values greater than. indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. (b) Prepare appropriate residual plots and comment on the model s adequacy. The residuals plot show no serious deviations from the assumptions. 5-

100 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Predicted Normal plot of residuals.5 99 Residuals E Normal % probability E Predicted Residual Residuals vs. Temperature Residuals vs. Pressure Residuals 4.636E-4 Residuals 4.636E Temperature Pressure (c) Under what conditions would you operate this process? 5-

101 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Yield 9. 8 In teraction G raph Tem perature X = A: Pressure Y = B: Temperature Design Points B 5 B 6 B Yield P ressure Pressure set at 5 and Temperature at the high level, 7 degrees C, give the highest yield. The standard analysis of variance treats all design factors as if they were qualitative. In this case, both factors are quantitative, so some further analysis can be performed. In Section 5-5, we show how response curves and surfaces can be fit to the data from a factorial experiment with at least one quantative factor. Since both factors in this problem are quantitative and have three levels, we can fit linear and quadratic effects of both temperature and pressure, exactly as in Example 5-5 in the text. The Design-Expert output, including the response surface plots, now follows. Design Expert Output Response:Surface Finish ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B A <. B AB Residual.7.4 Lack of Fit 7.639E E not significant Pure Error Cor Total.3 7 The Model F-value of 6.8 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, A, B are significant model terms. Values greater than. indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. Std. Dev.. R-Squared.878 Mean 9.4 Adj R-Squared.87 C.V..3 Pred R-Squared.6794 PRESS.4 Adeq Precision.968 Coefficient Standard 95% CI 95% CI 5-3

102 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Estimate DF Error Low High VIF Intercept A-Pressure B-Temperature E-4.5. A B AB E-3. Final Equation in Terms of Coded Factors: Yield = * A +.75 * B -.4 * A +.4 * B -.87 * A * B Final Equation in Terms of Actual Factors: Yield = * Pressure * Temperature -.848E-3 * Pressure E-3 * Temperature E-4 * Pressure * Temperature Yield 9.7 B: Temperature Yield A: Pressure A: Pres s ure 3. B: Temperature An engineer suspects that the surface finish of a metal part is influenced by the feed rate and the depth of cut. She selects three feed rates and four depths of cut. She then conducts a factorial experiment and obtains the following data: Depth of Cut (in) Feed Rate (in/min)

103 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Analyze the data and draw conclusions. Use =.5. The depth (A) and feed rate (B) are significant, as is the interaction (AB). Design Expert Output Response: Surface Finish ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. AB Residual Lack of Fit. Pure Error Cor Total The Model F-value of 8.49 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. (b) Prepare appropriate residual plots and comment on the model s adequacy. The residual plots shown indicate nothing unusual. 8 Residuals vs. Predicted Normal plot of residuals 99 Residuals Normal % probability Predicted Residual 5-5

104 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 8 Residuals vs. Feed Rate 8 Residuals vs. Depth of Cut Residuals Residuals Feed Rate Depth of Cut (c) Obtain point estimates of the mean surface finish at each feed rate. Feed Rate Average DESIGN-EXPERT Plot Surface Finish X = B: Feed Rate 4 O ne Factor P lot W a rn in g! F a c to r in v o lv e d in a n in te ra c tio n. Actual Factor A: Depth of Cut = Average.5 S urface Finish Feed R ate (d) Find P-values for the tests in part (a). The P-values are given in the computer output in part (a). 5-3 For the data in Problem 5-, compute a 95 percent interval estimate of the mean difference in response for feed rates of. and.5 in/min. 5-6

105 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY We wish to find a confidence interval on, where is the mean surface finish for. in/min and is the mean surface finish for.5 in/min. y y t MS E n y.. y.. t, abn ) n...., ab MS (8.7) ( ) (.64) Therefore, the 95% confidence interval for is n E 5-4 An article in Industrial Quality Control (956, pp. 5-8) describes an experiment to investigate the effect of the type of glass and the type of phosphor on the brightness of a television tube. The response variable is the current necessary (in microamps) to obtain a specified brightness level. The data are as follows: Glass Phosphor Type Type (a) Is there any indication that either factor influences brightness? Use =.5. Both factors, phosphor type (A) and Glass type (B) influence brightness. Design Expert Output Response: Current in microamps ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. AB Residual Lack of Fit. Pure Error Cor Total65. 7 The Model F-value of 58.8 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. (b) Do the two factors interact? Use =.5. There is no interaction effect. (c) Analyze the residuals from this experiment. 5-7

106 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The residual plot of residuals versus phosphor content indicates a very slight inequality of variance. It is not serious enough to be of concern, however. Residuals vs. Predicted Normal plot of residuals 5 99 Residuals Normal % probability Predicted Residual Residuals vs. Glass Type Residuals vs. Phosphor Type Residuals.5 Residuals Glass Type Phosphor Type 5-5 Johnson and Leone (Statistics and Experimental Design in Engineering and the Physical Sciences, Wiley 977) describe an experiment to investigate the warping of copper plates. The two factors studies were the temperature and the copper content of the plates. The response variable was a measure of the amount of warping. The data were as follows: Copper Content (%) Temperature ( C) , 6, 4, 8,7 75,9 8,3 7, 7,3 6, 8, 5,3 3,3 5,7 3, 3, 9,3 5-8

107 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Is there any indication that either factor affects the amount of warping? Is there any interaction between the factors? Use =.5. Both factors, copper content (A) and temperature (B) affect warping, the interaction does not. Design Expert Output Response: Warping ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B AB Residual Lack of Fit. Pure Error Cor Total The Model F-value of 9.5 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. (b) Analyze the residuals from this experiment. There is nothing unusual about the residual plots. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals E E Residual Predicted 5-9

108 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3.5 Residuals vs. Copper Content 3.5 Residuals vs. Temperature Residuals E Residuals E Copper Content Temperature (c) Plot the average warping at each level of copper content and compare them to an appropriately scaled t distribution. Describe the differences in the effects of the different levels of copper content on warping. If low warping is desirable, what level of copper content would you specify? Design Expert Output Factor Name Level Low Level High Level A Copper Content 4 4 B Temperature Average 5 5 Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Factor Name Level Low Level High Level A Copper Content 6 4 B Temperature Average 5 5 Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Factor Name Level Low Level High Level A Copper Content 8 4 B Temperature Average 5 5 Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Factor Name Level Low Level High Level A Copper Content 4 B Temperature Average 5 5 Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Use a copper content of 4 for the lowest warping. MSE S. 3 b 4 5-

109 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Scaled t Distribution Cu=4 Cu=6 Cu=8 Cu= Warping (d) Suppose that temperature cannot be easily controlled in the environment in which the copper plates are to be used. Does this change your answer for part (c)? Use a copper of content of 4. This is the same as for part (c). DESIGN-EXPERT Plot Warping X = A: Copper Content Y = B: Temperature Design Points In teraction G raph Tem perature B 5 B 75 B3 B4 5 W a rp in g C opper C ontent 5-6 The factors that influence the breaking strength of a synthetic fiber are being studied. Four production machines and three operators are chosen and a factorial experiment is run using fiber from the same production batch. The results are as follows: Machine Operator

110 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Analyze the data and draw conclusions. Use =.5. Only the Operator (A) effect is significant. Design Expert Output Response:Stength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B AB Residual Lack of Fit. Pure Error Cor Total The Model F-value of 5. implies the model is significant. There is only a.4% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms aresignificant. In this case A are significant model terms. (b) Prepare appropriate residual plots and comment on the model s adequacy. The residual plot of residuals versus predicted shows that variance increases very slightly with strength. There is no indication of a severe problem. Residuals vs. Predicted Normal plot of residuals.5 99 Residuals E Normal % probability E Predicted Residual 5-

111 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY.5 Residuals vs. Operator.5 3 Residuals E Operator 5-7 A mechanical engineer is studying the thrust force developed by a drill press. He suspects that the drilling speed and the feed rate of the material are the most important factors. He selects four feed rates and uses a high and low drill speed chosen to represent the extreme operating conditions. He obtains the following results. Analyze the data and draw conclusions. Use =.5. (A) Feed Rate (B) Drill Speed Design Expert Output Response: Force ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A <. B AB Residual. 8.6E-3 Lack of Fit. Pure Error. 8.6E-3 Cor Total.3 5 The Model F-value of 5.53 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. The factors speed and feed rate, as well as the interaction is important. 5-3

112 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Force In teraction G raph D rill S p e e d X = B: Feed Rate Y = A: Drill Speed Design Points A 5 A F o rc e Feed R ate The standard analysis of variance treats all design factors as if they were qualitative. In this case, both factors are quantitative, so some further analysis can be performed. In Section 5-5, we show how response curves and surfaces can be fit to the data from a factorial experiment with at least one quantative factor. Since both factors in this problem are quantitative and have three levels, we can fit linear and quadratic effects of both temperature and pressure, exactly as in Example 5-5 in the text. The Design-Expert output, including the response surface plots, now follows. Design Expert Output Response: Force ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B B AB.5E-3.5E Residual.77 7.E-3 Lack of Fit significant Pure Error. 8.6E-3 Cor Total.3 5 The Model F-value of 8.5 implies the model is significant. There is only a.7% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Values greater than. indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. Std. Dev..84 R-Squared.7455 Mean.77 Adj R-Squared.659 C.V. 3.3 Pred R-Squared.465 PRESS.6 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Drill Speed B-Feed Rate B AB

113 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Final Equation in Terms of Coded Factors: Force = * A +.47 * B +.3 * B -. * A * B Final Equation in Terms of Actual Factors: Force = E-3 * Drill Speed * Feed Rate * Feed Rate * Drill Speed * Feed Rate.6 Force B: Feed Rate Force A: Drill Speed B: Feed Rate A: Drill Speed An experiment is conducted to study the influence of operating temperature and three types of faceplate glass in the light output of an oscilloscope tube. The following data are collected: Temperature Glass Type

114 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Use =.5 in the analysis. Is there a significant interaction effect? Does glass type or temperature affect the response? What conclusions can you draw? Use the method discussed in the text to partition the temperature effect into its linear and quadratic components. Break the interaction down into appropriate components. Design Expert Output Response: Light Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.4E E <. significant A.59E <. B.97E E <. AB.96E <. Residual Lack of Fit. Pure Error Cor Total.48E+6 6 The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. Both factors, Glass Type (A) and Temperature (B) are significant, as well as the interaction (AB). For glass types and the response is fairly linear, for glass type 3, there is a quadratic effect. DESIGN-EXPERT Plot Light Output 4.4 In teraction G raph G lass Type X = B: Temperature Y = A: Glass Type Design Points A A A L ig h t O u tp u t Tem perature Design Expert Output Response: Light Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.4E E <. significant A.59E <. B.78E+6.78E <. B.96E+5.96E <. AB.6E+5.3E <. AB <. Pure Error

115 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Cor Total.48E+6 6 The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, B, AB, AB are significant model terms. Values greater than. indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. Std. Dev. 9. R-Squared.9973 Mean 94.9 Adj R-Squared.996 C.V..3 Pred R-Squared.9939 PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A[] A[] B-Temperature B A[]B A[]B A[]B A[]B Final Equation in Terms of Coded Factors: Light Output = * A[] -4. * A[] * B -78. * B +9. * A[]B * A[]B +7. * A[]B +76. * A[]B Final Equation in Terms of Actual Factors: Glass Type Light Output = * Temperature -.78 * Temperature Glass Type Light Output = * Temperature -.63 * Temperature Glass Type 3 Light Output = * Temperature * Temperature 5-9 Consider the data in Problem 5-. Use the method described in the text to compute the linear and quadratic effects of pressure. See the alternative analysis shown in Problem 5- part (c). 5-7

116 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 5- Use Duncan s multiple range test to determine which levels of the pressure factor are significantly different for the data in Problem 5-. y y y S MS E y j. an 3 r., r. 3, R R vs. 3 =.5 >.64 (R 3 ) vs. =.3 >.498 (R ) vs. 3 =.9 <.498 (R ) Therefore, differs from and An experiment was conducted to determine if either firing temperature or furnace position affects the baked density of a carbon anode. The data are shown below. Temperature ( C) Position Suppose we assume that no interaction exists. Write down the statistical model. Conduct the analysis of variance and test hypotheses on the main effects. What conclusions can be drawn? Comment on the model s adequacy. The model for the two-factor, no interaction model is yijk i j ijk. Both factors, furnace position (A) and temperature (B) are significant. The residual plots show nothing unusual. Design Expert Output Response: Density ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 9.55E E <. significant A B 9.453E E <. Residual Lack of Fit not significant Pure Error Cor Total 9.587E+5 7 The Model F-value of 78.4 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. 5-8

117 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Predicted Residuals vs. Position Residuals Residuals Predicted Position Residuals vs. Temperature Residuals Temperature 5- Derive the expected mean squares for a two-factor analysis of variance with one observation per cell, assuming that both factors are fixed. E MS E MS E MS AB A B a i a i Degrees of Freedom b a- b a b- j b j a b ij a b i j a b ab 5-9

118 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 5-3 Consider the following data from a two-factor factorial experiment. Analyze the data and draw conclusions. Perform a test for nonadditivity. Use =.5. Column Factor Row Factor Design Expert Output Response: data ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B Residual Cor Total The Model F-value of 5.36 implies the model is significant. There is only a.6% chance that a "Model F-Value" this large could occur due to noise. The row factor (A) is significant. The test for nonadditivity is as follows: SS SS SS SS N N N i j Error a SS b y y Re sidual y ij i.. j SS y.. SS abss N A SS A B SS B y.. ab Source of Sum of Degrees of Mean Variation Squares Freedom Square F Row Column Nonadditivity Error Total The shear strength of an adhesive is thought to be affected by the application pressure and temperature. A factorial experiment is performed in which both factors are assumed to be fixed. Analyze the data and draw conclusions. Perform a test for nonadditivity. Temperature ( F) 5-

119 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Pressure (lb/in) Design Expert Output Response: Strength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B Residual Cor Total 7.39 The "Model F-value" of.9 implies the model is not significant relative to the noise. There is a.4 % chance that a "Model F-value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B are significant model terms. Temperature (B) is a significant factor. a b y y i j SS SS SS SS N N N Error SS Re sidual y ij i.. j SS y.. SS abss SS SS y.. ab N A A B B Source of Sum of Degrees of Mean Variation Squares Freedom Square F Row Column Nonadditivity Error Total Consider the three-factor model yijk i j k ijk ij jk i,,...,a j,,...,b k,,...,c 5-

120 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Notice that there is only one replicate. Assuming the factors are fixed, write down the analysis of variance table, including the expected mean squares. What would you use as the experimental error in order to test hypotheses? Source Degrees of Freedom Expected Mean Square A a- a i bc a B b- C c- AB BC (a-)(b-) (b-)(c-) c a ac ab Error (AC + ABC) b(a-)(c-) Total abc- i b j b j c k c k a b ij a b i j b c jk b c j k 5-6 The percentage of hardwood concentration in raw pulp, the vat pressure, and the cooking time of the pulp are being investigated for their effects on the strength of paper. Three levels of hardwood concentration, three levels of pressure, and two cooking times are selected. A factorial experiment with two replicates is conducted, and the following data are obtained: Percentage Cooking Time 3. Hours Cooking Time 4. Hours of Hardwood Pressure Pressure Concentration (a) Analyze the data and draw conclusions. Use =.5. Design Expert Output Response: strength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. C <. AB AC

121 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY BC ABC Residual Lack of Fit. Pure Error Cor Total The Model F-value of 9.6 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AC are significant model terms. All three main effects, concentration (A), pressure (C) and time (B), as well as the concentration x pressure interaction (AC) are significant at the 5% level. The concentration x time (AB) and pressure x time interactions (BC) are significant at the % level. (b) Prepare appropriate residual plots and comment on the model s adequacy. Residuals vs. Pressure Residuals vs. Cooking Time Residuals Residuals Pressure Cooking Time Residuals vs. Predicted Residuals vs. Hardwood Residuals Residuals Predicted Hardwood There is nothing unusual about the residual plots. 5-3

122 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (c) Under what set of conditions would you run the process? Why? DESIGN-EXPERT Plot strength.9 In teraction G raph P ressure DESIGN-EXPERT Plot strength.9 In teraction G raph H a rd w o o d X = B: Cooking Time Y = C: Pressure C 4 C 5 C3 65 Actual Factor A: Hardwood = Average98.5 strength X = B: Cooking Time Y = A: Hardwood A A 4 A3 8 Actual Factor C: Pressure = Average strength C o o k in g T im e C ooking Tim e DESIGN-EXPERT Plot strength.9 In teraction G raph H a rd w o o d X = C: Pressure Y = A: Hardwood A A 4 A3 8 Actual Factor B: Cooking Time = Average strength P ressure For the highest strength, run the process with the percentage of hardwood at, the pressure at 65, and the time at 4 hours. The standard analysis of variance treats all design factors as if they were qualitative. In this case, all three factors are quantitative, so some further analysis can be performed. In Section 5-5, we show how response curves and surfaces can be fit to the data from a factorial experiment with at least one quantative factor. Since the factors in this problem are quantitative and two of them have three levels, we can fit linear and quadratic. The Design-Expert output, including the response surface plots, now follows. Design Expert Output Response: Strength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant 5-4

123 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A B C A C AB AC BC AB AC AC BC ABC Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of.85 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AC, A C, AC, BC are significant model terms. Values greater than. indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. Std. Dev..6 R-Squared.875 Mean 98.6 Adj R-Squared.8 C.V..3 Pred R-Squared.6657 PRESS.7 Adeq Precision 4.7 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Hardwood B-Cooking Time C-Pressure A C AB AC BC AB AC AC E BC ABC Final Equation in Terms of Coded Factors: Strength = * A +.78 * B +.9 * C +.4 * A +.79 * C -. * A * B -.46 * A * C +.8 * B * C +.46 * A * B +.73 * A * C +.57 * A * C -.55 * B * C +.5 * A * B * C Final Equation in Terms of Actual Factors: 5-5

124 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Strength = * Hardwood * Cooking Time -.4 * Pressure * Hardwood E-4 * Pressure * Hardwood * Cooking Time * Hardwood * Pressure * Cooking Time * Pressure +.78 * Hardwood * Cooking Time E-4 * Hardwood * Pressure +.43E-5 * Hardwood * Pressure -7.E-5 * Cooking Time * Pressure E-4 * Hardwood * Cooking Time * Pressure 65. Strength C: Pressure Strength A: Hardwood A: Hardwood C: Pres s ure Cooking Time: B = The quality control department of a fabric finishing plant is studying the effect of several factors on the dyeing of cotton-synthetic cloth used to manufacture men s shirts. Three operators, three cycle times, and two temperatures were selected, and three small specimens of cloth were dyed under each set of conditions. The finished cloth was compared to a standard, and a numerical score was assigned. The results follow. Analyze the data and draw conclusions. Comment on the model s adequacy. Temperature 3 35 Operator Operator Cycle Time

125 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY All three main effects, and the AB, AC, and ABC interactions are significant. There is nothing unusual about the residual plots. Design Expert Output Response: Score ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. C AB <. AC BC ABC Residual Lack of Fit. Pure Error Cor Total The Model F-value of.4 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB, AC, ABC are significant model terms. DESIGN-EXPERT Plot Score 3 9 In teraction G raph Tem perature DESIGN-EXPERT Plot Score 3 9 In teraction G raph O perator X = A: Cycle Time Y = C: Temperature C 3 C 35 Actual Factor B: Operator = Average S c o re X = A: Cycle Time Y = B: Operator 3 5 B B B3 3 Actual Factor C: Temperature = Average 3 S c o re C yc le T im e C ycle Tim e 5-7

126 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Operator Residuals vs. Cycle Time Residuals 4.636E-4 3 Residuals 4.636E Operator Cycle Time Residuals vs. Predicted Residuals vs. Temperature Residuals 4.636E-4 Residuals 4.636E Predicted Temperature 5-8 In Problem 5-, suppose that we wish to reject the null hypothesis with a high probability if the difference in the true mean yield at any two pressures is as great as.5. If a reasonable prior estimate of the standard deviation of yield is., how many replicates should be run? nad b n n n b abn 5 5 (3)(3)().4 replications will be enough to detect the given difference. 5-8

127 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 5-9 The yield of a chemical process is being studied. The two factors of interest are temperature and pressure. Three levels of each factor are selected; however, only 9 runs can be made in one day. The experimenter runs a complete replicate of the design on each day. The data are shown in the following table. Analyze the data assuming that the days are blocks. Day Day Pressure Pressure Temperature Low Medium High Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A B <. AB Residual Cor Total The Model F-value of 5.84 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Both main effects, temperature and pressure, are significant. 5- Consider the data in Problem 5-5. Analyze the data, assuming that replicates are blocks. Design Expert Output Response: Warping ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.8.8 Model <. significant A <. B AB Residual Cor Total The Model F-value of 9.96 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Both temperature and copper content are significant. This agrees with the analysis in Problem Consider the data in Problem 5-6. Analyze the data, assuming that replicates are blocks. 5-9

128 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design-Expert Output Response: Stength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.4.4 Model significant A B AB Residual Cor Total The Model F-value of 4.89 implies the model is significant. There is only a.7% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A are significant model terms. Only the operator factor (A) is significant. This agrees with the analysis in Problem An article in the Journal of Testing and Evaluation (Vol. 6, no., pp ) investigated the effects of cyclic loading and environmental conditions on fatigue crack growth at a constant MPa stress for a particular material. The data from this experiment are shown below (the response is crack growth rate). Environment Frequency Air H O Salt H O (a) Analyze the data from this experiment (use =.5). Design Expert Output Response: Crack Growth ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. AB <. Residual Lack of Fit. 5-3

129 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Pure Error Cor Total The Model F-value of 34. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. Both frequency and environment, as well as their interaction are significant. (b) Analyze the residuals. The residual plots indicate that there may be some problem with inequality of variance. This is particularly noticable on the plot of residuals versus predicted response and the plot of residuals versus frequency. Residuals vs. Predicted Normal plot of residuals.7 99 Residuals Normal % probability Predicted Residual Residuals vs. Environment Residuals vs. Frequency Residuals -.4 Residuals Environment Frequency (c) Repeat the analyses from parts (a) and (b) using ln(y) as the response. Comment on the results. 5-3

130 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Crack Growth Transform: Natural log Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. AB <. Residual E-3 Lack of Fit. Pure Error E-3 Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. Both frequency and environment, as well as their interaction are significant. The residual plots of the based on the transformed data look better. Residuals vs. Predicted Normal plot of residuals Residuals Normal % probability Predicted Residual 5-3

131 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Environment Residuals vs. Frequency Residuals.44 Residuals Environment Frequency 5-3 An article in the IEEE Transactions on Electron Devices (Nov. 986, pp. 754) describes a study on polysilicon doping. The experiment shown below is a variation of their study. The response variable is base current. Polysilicon Anneal Temperature ( C) Doping (ions) 9 95 x x (a) Is there evidence (with =.5) indicating that either polysilicon doping level or anneal temperature affect base current? Design Expert Output Response: Base Current ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. AB Residual Lack of Fit. Pure Error Cor Total 3.3 The Model F-value of 35.9 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Both factors, doping and anneal are significant. Their interaction is significant at the % level. 5-33

132 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (b) Prepare graphical displays to assist in interpretation of this experiment..5 Interaction Graph Doping Base Current A A Anneal (c) Analyze the residuals and comment on model adequacy. Residuals vs. Predicted Normal plot of residuals.3 99 Residuals E Normal % probability E Predicted Residual 5-34

133 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY.3 Residuals vs. Doping.3 Residuals vs. Anneal.6.6 Residuals E-6 Residuals E Doping Anneal There is a funnel shape in the plot of residuals versus predicted, indicating some inequality of variance. (d) Is the model y x x x x x supported by this experiment (x = doping level, x = temperature)? Estimate the parameters in this model and plot the response surface. Design Expert Output Response: Base Current ANOVA for Response Surface Reduced Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. B <. AB Residual Lack of Fit not significant Pure Error Cor Total 3.3 The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, B, AB are significant model terms. Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Doping B-Anneal B AB All of the coefficients in the assumed model are significant. The quadratic effect is easily observable in the response surface plot. 5-35

134 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY. Base Current 975. Anneal Base Current E+.5E+.5E+.75E+.E+ Doping.E+.5E+.5E+ Doping.75E+.E Anneal 5-36

135 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 6 The k Factorial Design Solutions 6- An engineer is interested in the effects of cutting speed (A), tool geometry (B), and cutting angle on the life (in hours) of a machine tool. Two levels of each factor are chosen, and three replicates of a 3 factorial design are run. The results follow: Treatment Replicate A B C Combination I II III () a b ab c ac bc abc (a) Estimate the factor effects. Which effects appear to be large? From the normal probability plot of effects below, factors B, C, and the AC interaction appear to be significant. DESIGN-EXPERT Plot Life Normal plot A: Cutting Speed B: Tool Geometry C: Cutting Angle Normal % probability A C B 5 AC Effect (b) Use the analysis of variance to confirm your conclusions for part (a). The analysis of variance confirms the significance of factors B, C, and the AC interaction. Design Expert Output Response: Life in hours 6-

136 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C AB AC BC ABC Pure Error Cor Total The Model F-value of 7.64 implies the model is significant. There is only a.4% chance that a "Model F-Value" this large could occur due to noise. The reduced model ANOVA is shown below. Factor A was included to maintain hierarchy. Design Expert Output Response: Life in hours ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. C AC Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of.54 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Effects B, C and AC are significant at %. (c) Write down a regression model for predicting tool life (in hours) based on the results of this experiment. y x x x x ijk A B C A x C Design Expert Output Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Cutting Speed B-Tool Geometry C-Cutting Angle AC Final Equation in Terms of Coded Factors: Life = * A * B +3.4 * C -4.4 * A * C Final Equation in Terms of Actual Factors: 6-

137 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Life = * Cutting Speed * Tool Geometry * Cutting Angle * Cutting Speed * Cutting Angle The equation in part (c) and in the given in the computer output form a hierarchial model, that is, if an interaction is included in the model, then all of the main effects referenced in the interaction are also included in the model. (d) Analyze the residuals. Are there any obvious problems? Normal plot of residuals Residuals vs. Predicted.5 99 Normal % probability Residuals Residual Predicted There is nothing unusual about the residual plots. (e) Based on the analysis of main effects and interaction plots, what levels of A, B, and C would you recommend using? Since B has a positive effect, set B at the high level to increase life. The AC interaction plot reveals that life would be maximized with C at the high level and A at the low level. 6-3

138 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Life 6 Interaction Graph Cutting Angle DESIGN-EXPERT Plot Life 6 One Factor Plot X = A: Cutting Speed Y = C: Cutting Angle 5.5 C- -. C+. Actual Factor B: Tool Geometry =. 4 Life X = B: Tool Geometry Actual Factors A: Cutting Speed =. C: Cutting Angle =. Life Cutting Speed Tool Geometry 6- Reconsider part (c) of Problem 6-. Use the regression model to generate response surface and contour plots of the tool life response. Interpret these plots. Do they provide insight regarding the desirable operating conditions for this process? The response surface plot and the contour plot in terms of factors A and C with B at the high level are shown below. They show the curvature due to the AC interaction. These plots make it easy to see the region of greatest tool life. DESIGN-EXPERT Plot Life X = A: Cutting Speed Y = C: Cutting Angle Life DESIGN-EXPERT Plot Life X = A: Cutting Speed Y = C: Cutting Angle Actual Factor B: Tool Geometry = Actual Factor B: Tool Geometry = Cutting Angle Life Cutting Speed Cutting Angle -.. Cutting Speed Find the standard error of the factor effects and approximate 95 percent confidence limits for the factor effects in Problem 6-. Do the results of this analysis agree with the conclusions from the analysis of variance? 6-4

139 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY SE( effect ) S 3. 7 k 3 n 3 Variable Effect CI A B * AB C * AC * BC ABC The 95% confidence intervals for factors B, C and AC do not contain zero. This agrees with the analysis of variance approach Plot the factor effects from Problem 6- on a graph relative to an appropriately scaled t distribution. Does this graphical display adequately identify the important factors? Compare the conclusions from this MSE 3. 7 plot with the results from the analysis of variance. S 3. 7 n 3 Scaled t Distribution AC C B -... Factor Effects This method identifies the same factors as the analysis of variance. 6-5 A router is used to cut locating notches on a printed circuit board. The vibration level at the surface of the board as it is cut is considered to be a major source of dimensional variation in the notches. Two factors are thought to influence vibration: bit size (A) and cutting speed (B). Two bit sizes (/6 and /8 inch) and two speeds (4 and 9 rpm) are selected, and four boards are cut at each set of conditions shown below. The response variable is vibration measured as a resultant vector of three accelerometers (x, y, and z) on each test circuit board. Treatment Replicate A B Combination I II III IV - - () a b

140 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY + + ab (a) Analyze the data from this experiment. Design Expert Output Response: Vibration ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. AB <. Residual Lack of Fit. Pure Error Cor Total The Model F-value of 9.36 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. (b) Construct a normal probability plot of the residuals, and plot the residuals versus the predicted vibration level. Interpret these plots. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted There is nothing unusual about the residual plots. (c) Draw the AB interaction plot. Interpret this plot. What levels of bit size and speed would you recommend for routine operation? To reduce the vibration, use the smaller bit. Once the small bit is specified, either speed will work equally well, because the slope of the curve relating vibration to speed for the small tip is approximately zero. The process is robust to speed changes if the small bit is used. 6-6

141 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Vibration 43.9 Interaction Graph Cutting Speed X = A: Bit Size Y = B: Cutting Speed Design Points 36.5 B- -. B+. Vibration Bit Size 6-6 Reconsider the experiment described in Problem 6-. Suppose that the experimenter only performed the eight trials from replicate I. In addition, he ran four center points and obtained the following response values: 36, 4, 43, 45. (a) Estimate the factor effects. Which effects are large? DESIGN-EXPERT Plot Life Normal plot A: Cutting Speed B: Tool Geometry C: Cutting Angle Normal % probability C B 5 AC Effect Effects B, C, and AC appear to be large. (b) Perform an analysis of variance, including a check for pure quadratic curvature. What are your conclusions? SS PureQuadratic Design Expert Output Response: Life in hours n F n C n y y F F n C C

142 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C AB AC BC ABC Curvature E not significant Pure Error Cor Total 94.9 The Model F-value of 9.77 implies the model is significant. There is only a 4.39% chance that a "Model F-Value" this large could occur due to noise. The "Curvature F-value" of. implies the curvature (as measured by difference between the average of the center points and the average of the factorial points) in the design space is not significant relative to the noise. There is a 96.7% chance that a "Curvature F-value" this large could occur due to noise. Design Expert Output Response: Life in hours ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C AC Residual Lack of Fit not significant Pure Error Cor Total 94.9 The Model F-value of 7.9 implies the model is significant. There is only a.98% chance that a "Model F-Value" this large could occur due to noise. Effects B, C and AC are significant at 5%. There is no effect of curvature. (c) Write down an appropriate model for predicting tool life, based on the results of this experiment. Does this model differ in any substantial way from the model in Problem 7-, part (c)? Design Expert Output Final Equation in Terms of Coded Factors: Life = * A * B * C * A * C (d) Analyze the residuals. 6-8

143 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Studentized Residuals Studentized Residuals Predicted (e) What conclusions would you draw about the appropriate operating conditions for this process? To maximize life run with B at the high level, A at the low level and C at the high level Cube Graph Life B B: Tool Geometry C+ C: Cutting Angle B C- A- A+ A: Cutting Speed 6-7 An experiment was performed to improve the yield of a chemical process. Four factors were selected, and two replicates of a completely randomized experiment were run. The results are shown in the following table: Treatment Replicate Replicate Treatment Replicate Replicate Combination I II Combination I II () 9 93 d a ad 7 76 b 8 85 bd

144 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ab 83 8 abd c cd 99 9 ac 8 8 acd bc 88 8 bcd abc 73 7 abcd 8 8 (a) Estimate the factor effects. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B Error C Error D Error AB Error AC Error AD Error BC Error BD Error CD Error ABC Error ABD Error ACD Error BCD Error ABCD (b) Prepare an analysis of variance table, and determine which factors are important in explaining yield. Design Expert Output Response: yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B C D AB AC AD BC BD CD ABC <. ABD ACD BCD ABCD Residual Lack of Fit. Pure Error Cor Total The Model F-value of 3. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, C, D, AB, AD, ABC, ABD, ABCD are significant model terms. 6-

145 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY F 6.,, 853., and F 56.,, 6. therefore, factors A and D and interactions AB, ABC, and ABD are significant at %. Factor C and interactions AD and ABCD are significant at.5%. (b) Write down a regression model for predicting yield, assuming that all four factors were varied over the range from - to + (in coded units). Model with hierarchy maintained: Design Expert Output Final Equation in Terms of Coded Factors: yield = * A -.66 * B -.34 * C +.97 * D +.3 * A * B +.34 * A * C -.9 * A * D -.8 * B * C -.94 * B * D +.84 * C * D -.59 * A * B * C +.34 * A * B * D -.47 * A * C * D -.47 * B * C * D +. * A * B * C * D Model without hierarchy terms: Design Expert Output Final Equation in Terms of Coded Factors: yield = * A -.34 * C +.97 * D +.3 * A * B -.9 * A * D -.59 * A * B * C +.34 * A * B * D +. * A * B * C * D Confirmation runs might be run to see if the simpler model without hierarchy is satisfactory. (d) Plot the residuals versus the predicted yield and on a normal probability scale. Does the residual analysis appear satisfactory? There appears to be one large residual both in the normal probability plot and in the plot of residuals versus predicted. 6-

146 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted (e) Two three-factor interactions, ABC and ABD, apparently have large effects. Draw a cube plot in the factors A, B, and C with the average yields shown at each corner. Repeat using the factors A, B, and D. Do these two plots aid in data interpretation? Where would you recommend that the process be run with respect to the four variables? Cube Graph yield Cube Graph yield B B B: B C+ B: B D+ C: C D: D B C- A- A+ A: A B D- A- A+ A: A Run the process at A low B low, C low and D high. 6-8 A bacteriologist is interested in the effects of two different culture media and two different times on the growth of a particular virus. She performs six replicates of a design, making the runs in random order. Analyze the bacterial growth data that follow and draw appropriate conclusions. Analyze the residuals and comment on the model s adequacy. Culture Medium Time 6-

147 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 5 6 hr hr Design Expert Output Response: Virus growth ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. AB Residual.7 5. Lack of Fit. Pure Error.7 5. Cor Total The Model F-value of 45. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B, AB are significant model terms. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted Growth rate is affected by factor B (Time) and the AB interaction (Culture medium and Time). There is some very slight indication of inequality of variance shown by the small decreasing funnel shape in the plot of residuals versus predicted. 6-3

148 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Virus growth 39 Interaction Graph Tim e X = A: Culture Medium Y = B: Time Design Points 34.5 B-. B+ 8. Virus growth Culture Medium 6-9 An industrial engineer employed by a beverage bottler is interested in the effects of two different typed of 3-ounce bottles on the time to deliver -bottle cases of the product. The two bottle types are glass and plastic. Two workers are used to perform a task consisting of moving 4 cases of the product 5 feet on a standard type of hand truck and stacking the cases in a display. Four replicates of a factorial design are performed, and the times observed are listed in the following table. Analyze the data and draw the appropriate conclusions. Analyze the residuals and comment on the model s adequacy. Worker Bottle Type Glass Plastic Design Expert Output Response: Times ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B AB Residual.49. Lack of Fit. Pure Error.49. Cor Total The Model F-value of 3.4 implies the model is significant. There is only a.4% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. There is some indication of non-constant variance in this experiment. 6-4

149 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted Residuals vs. Worker Residuals Worker 6- In problem 6-9, the engineer was also interested in potential fatigue differences resulting from the two types of bottles. As a measure of the amount of effort required, he measured the elevation of heart rate (pulse) induced by the task. The results follow. Analyze the data and draw conclusions. Analyze the residuals and comment on the model s adequacy. Worker Bottle Type Glass Plastic Design Expert Output 6-5

150 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Response: Pulse ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A <. B AB Residual Lack of Fit. Pure Error Cor Total The Model F-value of 6.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A are significant model terms. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. Worker Residuals Worker There is an indication that one worker exhibits greater variability than the other. 6-6

151 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 6- Calculate approximate 95 percent confidence limits for the factor effects in Problem 6-. Do the results of this analysis agree with the analysis of variance performed in Problem 6-? SE( effect ) S k n Variable Effect C.I. A (.96)= B (.96)= AB (.96)= The 95% confidence intervals for factors A does not contain zero. This agrees with the analysis of variance approach. 6- An article in the AT&T Technical Journal (March/April 986, Vol. 65, pp. 39-5) describes the application of two-level factorial designs to integrated circuit manufacturing. A basic processing step is to grow an epitaxial layer on polished silicon wafers. The wafers mounted on a susceptor are positioned inside a bell jar, and chemical vapors are introduced. The susceptor is rotated and heat is applied until the epitaxial layer is thick enough. An experiment was run using two factors: arsenic flow rate (A) and deposition time (B). Four replicates were run, and the epitaxial layer thickness was measured (in mm). The data are shown below: Replicate Factor Levels A B I II III IV Low (-) High (+) A 55% 59% B Short Long ( min) (5 min) (a) Estimate the factor effects. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B Error AB Error Lack Of Fit Error Pure Error (b) Conduct an analysis of variance. Which factors are important? Design Expert Output Response: Thickness ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B

152 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY AB Residual Lack of Fit. Pure Error Cor Total The "Model F-value" of.9 implies the model is not significant relative to the noise. There is a 4.5 % chance that a "Model F-value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case there are no significant model terms. (c) Write down a regression equation that could be used to predict epitaxial layer thickness over the region of arsenic flow rate and deposition time used in this experiment. Design Expert Output Final Equation in Terms of Coded Factors: Thickness = * A +.9 * B +.4 * A * B Final Equation in Terms of Actual Factors: Thickness = * Flow Rate * Dep Time +.85 * Flow Rate * Dep Time (d) Analyze the residuals. Are there any residuals that should cause concern? Observation # falls outside the groupings in the normal probability plot and the plot of residual versus predicted. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted (e) Discuss how you might deal with the potential outlier found in part (d). One approach would be to replace the observation with the average of the observations from that experimental cell. Another approach would be to identify if there was a recording issue in the original data. The first analysis below replaces the data point with the average of the other three. The second analysis assumes that the reading was incorrectly recorded and should have been

153 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Analysis with the run associated with standard order replaced with the average of the remaining three runs in the cell, 3.97: Design Expert Output Response: Thickness ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A 7.439E E B <. AB.76E-4.76E Pure Error..8 Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B are significant model terms. Final Equation in Terms of Coded Factors: Thickness = * A +.43 * B E-3 * A * B Final Equation in Terms of Actual Factors: Thickness = * Flow Rate * Dep Time E-4 * Flow Rate * Dep Time Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted A new outlier is present and should be investigated. Analysis with the run associated with standard order replaced with the value 4.65: 6-9

154 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Thickness ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. AB 3.969E E Pure Error.5. Cor Total The Model F-value of 45.8 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B are significant model terms. Final Equation in Terms of Coded Factors: Thickness = * A +.4 * B +.6 * A * B Final Equation in Terms of Actual Factors: Thickness = * Flow Rate -.35 * Dep Time +3.5E-3 * Flow Rate * Dep Time Normal plot of residuals Residuals vs. Predicted Normal % probability Studentized Residuals Studentized Residuals Predicted Another outlier is present and should be investigated. 6-3 Continuation of Problem 6-. Use the regression model in part (c) of Problem 6- to generate a response surface contour plot for epitaxial layer thickness. Suppose it is critically important to obtain 6-

155 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY layer thickness of 4.5 mm. What settings of arsenic flow rate and deposition time would you recommend? Arsenic flow rate may be set at any of the experimental levels, while the deposition time should be set at.4 minutes. DESIGN-EXPERT Plot Thickness X = A: Flow Rate Y = B: Dep Time Design Points 3.75 Thickness DESIGN-EXPERT Plot Thickness X = A: Flow Rate Y = B: Dep Time Design Points Interaction Graph Dep Time Dep Time B-. B+ 5. Thickness Flow Rate Flow Rate 6-4 Continuation of Problem 6-3. How would your answer to Problem 6-3 change if arsenic flow rate was more difficult to control in the process than the deposition time? Running the process at a high level of Deposition Time there is no change in thickness as flow rate changes. 6-5 A nickel-titanium alloy is used to make components for jet turbine aircraft engines. Cracking is a potentially serious problem in the final part, as it can lead to non-recoverable failure. A test is run at the parts producer to determine the effects of four factors on cracks. The four factors are pouring temperature (A), titanium content (B), heat treatment method (C), and the amount of grain refiner used (D). Two replicated of a 4 design are run, and the length of crack (in m) induced in a sample coupon subjected to a standard test is measured. The data are shown below: Treatment Replicate Replicate A B C D Combination I II () a b ab c ac bc abc d

156 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ad bd abd cd acd bcd abcd (a) Estimate the factor effects. Which factors appear to be large? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Model AB Model AC Error AD Error BC Error BD Error CD Model ABC Error ABD Error ACD Error BCD Error ABCD (b) Conduct an analysis of variance. Do any of the factors affect cracking? Use =.5. Design Expert Output Response: Crack Lengthin mm x ^- ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. C <. D <. AB <. AC <. AD BC BD CD ABC <. ABD ACD.96E-3.96E BCD ABCD.596E-3.596E Residual Lack of Fit. Pure Error Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, D, AB, AC, ABC are significant model terms. 6-

157 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (c) Write down a regression model that can be used to predict crack length as a function of the significant main effects and interactions you have identified in part (b). Design Expert Output Final Equation in Terms of Coded Factors: Crack Length= *A +.99 *B -.8 *C +.98 *D +.97 *A*B -. *A*C +.57 * A * B * C (d) Analyze the residuals from this experiment. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted There is nothing unusual about the residuals. (e) Is there an indication that any of the factors affect the variability in cracking? By calculating the range of the two readings in each cell, we can also evaluate the effects of the factors on variation. The following is the normal probability plot of effects: 6-3

158 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Range Normal plot A: Pour Temp B: Titanium Content C: Heat Treat Method D: Grain Refiner Normal % probability AB CD Effect It appears that the AB and CD interactions could be significant. The following is the ANOVA for the range data: Design Expert Output Response: Range ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant AB CD Residual Cor Total.45 5 The Model F-value of.46 implies the model is significant. There is only a.4% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case AB, CD are significant model terms. Final Equation in Terms of Coded Factors: Range = * A * B +. * C * D (f) What recommendations would you make regarding process operations? Use interaction and/or main effect plots to assist in drawing conclusions. From the interaction plots, choose A at the high level and B at the high level. In each of these plots, D can be at either level. From the main effects plot of C, choose C at the high level. Based on the range analysis, with C at the high level, D should be set at the low level. From the analysis of the crack length data: 6-4

159 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Crack Length 9.84 Interaction Graph B: Titanium Content DESIGN-EXPERT Plot Crack Length 9.84 Interaction Graph C: Heat Treat Method X = A: Pour Temp Y = B: Titanium Content X = A: Pour Temp Y = C: Heat Treat Method B- -. B+. Actual Factors C: Heat Treat Method = D: Grain Refiner =..96 Crack Length C - C Actual Factors B: Titanium Content =. D: Grain Refiner =..96 Crack Length A: Pou r Tem p A: Pou r Tem p DESIGN-EXPERT Plot Crack Length 9.84 X = D: Grain Refiner One Factor Plot DESIGN-EXPERT Plot Crack Length X = A: Pour Temp Y = B: Titanium Content Z = C: Heat Treat Method Cube Graph Crack Length Actual Factors A: Pour T emp =. B: Titanium Content =. C: Heat Treat Method = Crack Length Actual Factor D: Grain Refiner =. B: Titanium Content B C+ C: Heat Treat Metho B C- A- A+ A: Pour Tem p D: Grain Refiner From the analysis of the ranges: 6-5

160 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Range.66 Interaction Graph B: Titanium Content DESIGN-EXPERT Plot Range.66 Interaction Graph D: Grain Refiner X = A: Pour Temp Y = B: Titanium Content X = C: Heat Treat Method Y = D: Grain Refiner.55 B- -. B+. Actual Factors C: Heat Treat Method =. D: Grain Refiner =..384 Range.55 D- -. D+. Actual Factors A: Pour Temp =. B: Titanium Content =..384 Range A: Pou r Tem p C: Heat Treat Method 6-6 Continuation of Problem 6-5. One of the variables in the experiment described in Problem 6-5, heat treatment method (c), is a categorical variable. Assume that the remaining factors are continuous. (a) Write two regression models for predicting crack length, one for each level of the heat treatment method variable. What differences, if any, do you notice in these two equations? Design Expert Output Final Equation in Terms of Coded Factors Heat Treat Method - Crack Length = * Pour Temp * Titanium Content * Grain Refiner * Pour Temp * Titanium Content Heat Treat Method Crack Length = * Pour Temp * Titanium Content * Grain Refiner * Pour Temp * Titanium Content (b) Generate appropriate response surface contour plots for the two regression models in part (a). 6-6

161 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Crack Length X = A: Pour Temp Y = B: Titanium Content. Crack Length 8 DESIGN-EXPERT Plot Crack Length X = A: Pour Temp Y = B: Titanium Content. Crack Length Actual Factors C: Heat Treat Method = -.5 D: Grain Refiner =. B: Titanium Content. 4 6 Actual Factors C: Heat T reat M ethod =.5 D: Grain Refiner =. B: Titanium Content A: Pour Tem p A: Pour Temp (c) What set of conditions would you recommend for the factors A, B and D if you use heat treatment method C=+? High level of A, low level of B, and low level of D. (d) Repeat part (c) assuming that you wish to use heat treatment method C=-. Low level of A, low level of B, and low level of D. 6-7 An experimenter has run a single replicate of a 4 design. The following effect estimates have been calculated: A = AB = -5.3 ABC = -.8 B = AC =.69 ABD = -6.5 C = AD = 9.78 ACD =. D = BC =.78 BCD = BD = 4.74 ABCD = -6.5 CD =.7 (a) Construct a normal probability plot of these effects. The plot from Minitab follows. 6-7

162 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot for Value ML Estimates 99 ML Estimates Percent B AB A Mean -.57 StDev 3.96 Goodness of Fit AD* Data 5 (b) Identify a tentative model, based on the plot of the effects in part (a). ŷ Intercept x x 5. 66x A B A x B 6-8 An article in Solid State Technology ( Orthogonal Design for Process Optimization and Its Application in Plasma Etching, May 987, pp. 7-3) describes the application of factorial designs in developing a nitride etch process on a single-wafer plasma etcher. The process uses C F 6 as the reactant gas. Four factors are of interest: anode-cathode gap (A), pressure in the reactor chamber (B), C F 6 gas flow (C), and power applied to the cathode (D). The response variable of interest is the etch rate for silicon nitride. A single replicate of a 4 design in run, and the data are shown below: Actual Etch Run Run Rate Factor Levels Number Order A B C D (A/min) Low (-) High (+) A (cm) B (mtorr) C (SCCM) D (W)

163 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Estimate the factor effects. Construct a normal probability plot of the factor effects. Which effects appear large? DESIGN-EXPERT Plot Etch Rate Normal plot A: Gap B: Pressure C: Gas Flow D: Power Norm al % probability AD A D Effect (b) Conduct an analysis of variance to confirm your findings for part (a). Design Expert Output Response: Etch Rate in A/min ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 5.6E+5 3.7E <. significant A D 3.749E E <. AD <. Residual Cor Total 5.34E+55 The Model F-value of 97.9 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, D, AD are significant model terms. (c) What is the regression model relating etch rate to the significant process variables? Design Expert Output Final Equation in Terms of Coded Factors: Etch Rate = * A * D * A * D Final Equation in Terms of Actual Factors: Etch Rate =

164 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Gap * Power * Gap * Power (d) Analyze the residuals from this experiment. Comment on the model s adequacy. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted The residual versus predicted plot shows a slight football shape indicating very mild inequality of variance. (e) If not all the factors are important, project the 4 design into a k design with k<4 and conduct that analysis of variance. The analysis of variance table is the same as in part (b). Design Expert Output Response: Etch Rate in A/min ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 5.6E+5 3.7E <. significant A B 3.749E E <. AB <. Residual Lack of Fit. Pure Error Cor Total 5.34E+55 The Model F-value of 97.9 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. (f) Draw graphs to interpret any significant interactions. 6-3

165 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Etch Rate Interaction Graph Power X = A: Gap Y = B: Power Design Points B- 75. B+ 35. Etch Rate Gap (g) Plot the residuals versus the actual run order. What problems might be revealed by this plot? Residuals vs. Run Residuals Run Number The plot of residuals versus run order can reveal trends in the process over time, inequality of variance with time, and possibly indicate that there may be factors that were not included in the original experiment. 6-9 Continuation of Problem 6-8. Consider the regression model obtained in part (c) of Problem 6-8. (a) Construct contour plots of the etch rate using this model. 6-3

166 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot 35. Etch Rate X = A: Gap Y = D: Power Actual Factors B: Pressure = 5. C: Gas Flow = Etch Rate Power Gap (b) Suppose that it was necessary to operate this process at an etch rate of 8 Å/min. What settings of the process variables would you recommend? Run at the low level of anode-cathode gap (.8 cm) and at a cathode power level of about 86 watts. The curve is flatter (more robust) on the low end of the anode-cathode variable. 6- Consider the single replicate of the 4 design in Example 6-. Suppose we had arbitrarily decided to analyze the data assuming that all three- and four-factor interactions were negligible. Conduct this analysis and compare your results with those obtained in the example. Do you think that it is a good idea to arbitrarily assume interactions to be negligible even if they are relatively high-order ones? Design Expert Output Response: Etch Rate in A/min ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 5.E significant A B E C D 3.749E E <. AB AC AD BC BD E CD E Residual Cor Total 5.34E+55 The Model F-value of 5.58 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, D, AD are significant model terms. This analysis of variance identifies the same effects as the normal probability plot of effects approach used in Example 6-. In general, it is not a good idea to arbitrarily pool interactions. Use the normal 6-3

167 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY probability plot of effect estimates as a guide in the choice of which effects to tentatively include in the model. 6- An experiment was run in a semiconductor fabrication plant in an effort to increase yield. Five factors, each at two levels, were studied. The factors (and levels) were A = aperture setting (small, large), B = exposure time (% below nominal, % above nominal), C = development time (3 s, 45 s), D = mask dimension (small, large), and E = etch time (4.5 min, 5.5 min). The unreplicated 5 design shown below was run. () = 7 d = 8 e = 8 de = 6 a = 9 ad = ae = ade = b = 34 bd = 3 be = 35 bde = 3 ab = 55 abd = 5 abe = 5 abde = 53 c = 6 cd = 8 ce = 5 cde = 5 ac = acd = ace = acde = bc = 4 bcd = 44 bce = 45 bcde = 4 abc = 6 abcd = 6 abce = 65 abcde = 63 (a) Construct a normal probability plot of the effect estimates. Which effects appear to be large? DESIGN-EXPERT Plot Yield Normal plot A: Aperture B: Exposure Time C: Develop Tim e D: M ask Dim ension E: Etch Time Normal % probability C AB A B Effect (b) Conduct an analysis of variance to confirm your findings for part (a). Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. C <. AB <. Residual Cor Total

168 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. (c) Write down the regression model relating yield to the significant process variables. Design Expert Output Final Equation in Terms of Actual Factors: Aperture small Yield = * Exposure Time * Develop Time Aperture large Yield = * Exposure Time * Develop Time (d) Plot the residuals on normal probability paper. Is the plot satisfactory? Normal plot of residuals 99 Norm al % probability E There is nothing unusual about this plot. Residual (e) Plot the residuals versus the predicted yields and versus each of the five factors. Comment on the plots. 6-34

169 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Aperture Residuals vs. Exposure Time Residuals 3.557E-5 Residuals 3.557E Ape rture Expos ure Tim e Residuals vs. Develop Time Residuals vs. Mask Dimension Residuals 3.557E-5 Residuals 3.557E Develop Time Mas k Dim ens ion Residuals vs. Etch Time Residuals 3.557E Etch Time 6-35

170 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The plot of residual versus exposure time shows some very slight inequality of variance. There is no strong evidence of a potential problem. (f) Interpret any significant interactions. DESIGN-EXPERT Plot Yield 65 Interaction Graph Ape rture X = B: Exposure Time Y = A: Aperture 5.5 A small A large Actual Factors C: Develop Tim e = 37.5 D: Mask Dimension = Small 35.5 E: Etch Time = 5. Yield Exposure Time Factor A does not have as large an effect when B is at its low level as it does when B is at its high level. (g) What are your recommendations regarding process operating conditions? For the highest yield, run with B at the high level, A at the high level and C at the high level. (h) Project the 5 design in this problem into a k design in the important factors. Sketch the design and show the average and range of yields at each run. Does this sketch aid in interpreting the results of this experiment? DESIGN-EASE Analysis Actual Yield 4.5 R=5 6.5 R=5 E x p o s u r e T i m e B R=5 6. R=3 5.5 R=5 B C- A- R= R=3 A+ Aperture.75 R= D e v e l o p C+ T i m e 6-36

171 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY This cube plot aids in interpretation. The strong AB interaction and the large positive effect of C are clearly evident. 6- Continuation of Problem 6-. Suppose that the experimenter had run four runs at the center points in addition to the 3 trials in the original experiment. The yields obtained at the center point runs were 68, 74, 76, and 7. (a) Reanalyze the experiment, including a test for pure quadratic curvature. SS PureQuadratic n F n C n y y F F n C C Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. C <. AB <. Curvature <. significant Residual Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. (b) Discuss what your next step would be. Add axial points and fit a second-order model. 6-3 In a process development study on yield, four factors were studied, each at two levels: time (A), concentration (B), pressure (C), and temperature (D). A single replicate of a 4 design was run, and the resulting data are shown in the following table: Actual Run Run Yield Factor Levels Number Order A B C D (lbs) Low (-) High (+) A (h) B (%) C (psi) D (ºC)

172 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Construct a normal probability plot of the effect estimates. Which factors appear to have large effects? DESIGN-EXPERT Plot Yield Normal plot A: Time B: Concentration C: Pressure D: Temperature Normal % probability AC C AD D A Effect A, C, D and the AC and AD interactions. (b) Conduct an analysis of variance using the normal probability plot in part (a) for guidance in forming an error term. What are your conclusions? Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. C D AC <. AD <. Residual Cor Total The Model F-value of 33.9 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, C, D, AC, AD are significant model terms. (c) Write down a regression model relating yield to the important process variables. 6-38

173 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Final Equation in Terms of Coded Factors: Yield = *A +. *C +.63 *D -.3 *A*C +. *A*D Final Equation in Terms of Actual Factors: Yield = * Time * Pressure -.63 * Temperature -.85 * Time * Pressure +.64 * Time * Temperature (d) Analyze the residuals from this experiment. Does your analysis indicate any potential problems? Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted 6-39

174 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY.375 Residuals vs. Run.65 Residuals Run Number There is nothing unusual about the residual plots. (e) Can this design be collapsed into a 3 design with two replicates? If so, sketch the design with the average and range of yield shown at each point in the cube. Interpret the results. DESIGN-EASE Analysis Actual yield 8. R=. R= t e m p e r a t u r e D+.5 R=3 8.5 R=3 4.5 R= D C- A- R= R= A+ time 5. C+ R= p r e s s u r e 6-4 Continuation of Problem 6-3. Use the regression model in part (c) of Problem 6-3 to generate a response surface contour plot of yield. Discuss the practical purpose of this response surface plot. 6-4

175 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The response surface contour plot shows the adjustments in the process variables that lead to an increasing or decreasing response. It also displays the curvature of the response in the design region, possibly indicating where robust operating conditions can be found. Two response surface contour plots for this process are shown below. These were formed from the model written in terms of the original design variables. DESIGN-EXPERT Plot Yield X = A: Time Y = C: Pressure 8. Yield DESIGN-EXPERT Plot 5. Yield X = A: Time Y = D: Temperature Yield Actual Factors B: Concentration = D: Temperature = Actual Factors B: Concentration = C: Pressure = Pressure Temperature Tim e Tim e 6-5 The scrumptious brownie experiment. The author is an engineer by training and a firm believer in learning by doing. I have taught experimental design for many years to a wide variety of audiences and have always assigned the planning, conduct, and analysis of an actual experiment to the class participants. The participants seem to enjoy this practical experience and always learn a great deal from it. This problem uses the results of an experiment performed by Gretchen Krueger at Arizona State University. There are many different ways to bake brownies. The purpose of this experiment was to determine how the pan material, the brand of brownie mix, and the stirring method affect the scrumptiousness of brownies. The factor levels were Factor Low (-) High (+) A = pan material Glass Aluminum B = stirring method Spoon Mixer C = brand of mix Expensive Cheap The response variable was scrumptiousness, a subjective measure derived from a questionnaire given to the subjects who sampled each batch of brownies. (The questionnaire dealt with such issues as taste, appearance, consistency, aroma, and so forth.) An eight-person test panel sampled each batch and filled out the questionnaire. The design matrix and the response data are shown below: Brownie Test Panel Results Batch A B C

176 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Analyze the data from this experiment as if there were eight replicates of a 3 design. Comment on the results. Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C AB AC BC ABC Residual Lack of Fit. Pure Error Cor Total The Model F-value of. implies the model is significant. There is only a 4.75% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A are significant model terms. In this analysis, A, the pan material and B, the stirring method, appear to be significant. There are 56 degrees of freedom for the error, yet only eight batches of brownies were cooked, one for each recipe. (b) Is the analysis in part (a) the correct approach? There are only eight batches; do we really have eight replicates of a 3 factorial design? The different rankings by the taste-test panel are not replicates, but repeat observations by differenttesters on the same batch of brownies. It is not a good idea to use the analysis in part (a) because the estimate of error may not reflect the batch-to-batch variation. (c) Analyze the average and standard deviation of the scrumptiousness ratings. Comment on the results. Is this analysis more appropriate than the one in part (a)? Why or why not? 6-4

177 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Average Normal plot DESIGN-EXPERT Plot Stdev Normal plot A: Pan M aterial B: Stirring Method C: Mix Brand A A: Pan M aterial B: Stirring Method C: Mix Brand C Normal % probability B Normal % probability AC A Effect Effect Design Expert Output Response: Average ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B Residual Cor Total.65 7 The Model F-value of 76.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Design Expert Output Response: Stdev ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A C AC Residual.8 4. Cor Total The Model F-value of 9.77 implies the model is significant. There is only a.59% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case AC are significant model terms. Variables A and B affect the mean rank of the brownies. Note that the AC interaction affects the standard deviation of the ranks. This is an indication that both factors A and C have some effect on the variability in the ranks. It may also indicate that there is some inconsistency in the taste test panel members. For the analysis of both the average of the ranks and the standard deviation of the ranks, the mean square error is 6-43

178 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY now determined by pooling apparently unimportant effects. This is a more estimate of error than obtained assuming that all observations were replicates. 6-6 An experiment was conducted on a chemical process that produces a polymer. The four factors studied were temperature (A), catalyst concentration (B), time (C), and pressure (D). Two responses, molecular weight and viscosity, were observed. The design matrix and response data are shown below: Actual Run Run Molecular Factor Levels Number Order A B C D Weight Viscosity Low (-) High (+) A (ºC) B (%) C (min) D (psi) (a) Consider only the molecular weight response. Plot the effect estimates on a normal probability scale. What effects appear important? 6-44

179 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Molecular Wt Half Normal plot A: Temperature B: Catalyst Con. C: Tim e D: Pressure Half Normal % probability AB A C Effect A, C and the AB interaction. (b) Use an analysis of variance to confirm the results from part (a). Is there an indication of curvature? A,C and the AB interaction are significant. While the main effect of B is not significant, it could be included to preserve hierarchy in the model. There is no indication of quadratic curvature. Design Expert Output Response: Molecular Wt ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.89E <. significant A <. C.6E+5.6E <. AB <. Curvature not significant Residual Lack of Fit not significant Pure Error Cor Total 3.37E+59 The Model F-value of 73. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, C, AB are significant model terms. (c) Write down a regression model to predict molecular weight as a function of the important variables. Design Expert Output Final Equation in Terms of Coded Factors: Molecular Wt = * A +.63 * C +6. * A * B (d) Analyze the residuals and comment on model adequacy. 6-45

180 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted There are two residuals that appear to be large and should be investigated. (e) Repeat parts (a) - (d) using the viscosity response. DESIGN-EXPERT Plot Viscosity Normal plot A: Temperature B: Catalyst Con. C: Tim e D: Pressure Normal % probability B A Effect Design Expert Output Response: Viscosity ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. Curvature not significant Residual Lack of Fit not significant Pure Error Cor Total

181 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Final Equation in Terms of Coded Factors: Viscosity = * A * B Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted There is one large residual that should be investigated. 6-7 Continuation of Problem 6-6. Use the regression models for molecular weight and viscosity to answer the following questions. (a) Construct a response surface contour plot for molecular weight. In what direction would you adjust the process variables to increase molecular weight? Increase temperature, catalyst and time. 6-47

182 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Molecular Wt X = A: Temperature Y = B: Catalyst Con. Design Points Actual Factors C: T ime = 5. D: Pressure = 67.5 B: Catalys t Con Molecular W t A: Temperature (a) Construct a response surface contour plot for viscosity. In what direction would you adjust the process variables to decrease viscosity? DESIGN-EXPERT Plot V i sco si ty X = A: Temperature Y = B: Catalyst Con. Design Points Actual Factors C: T ime = 5. D: Pressure = 67.5 B: Catalys t Con Viscosity A: Temperature Decrease temperature and catalyst. (c) What operating conditions would you recommend if it was necessary to produce a product with a molecular weight between 4 and 5, and the lowest possible viscosity? 6-48

183 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Overlay Plot X = A: Temperature Y = B: Catalyst Con. 8. Overlay Plot Actual Factors C: T ime = 4.5 D: Pressure = B: Catalys t Con Molecular Wt: 5 Viscosity: A: Temperature Set the temperature between and 5, the catalyst between 4 and 5%, and the time at 4.5 minutes. The pressure was not significant and can be set at conditions that may improve other results of the process such as cost. 6-8 Consider the single replicate of the 4 design in Example 6-. Suppose that we ran five points at the center (,,,) and observed the following responses: 73, 75, 7, 69, and 76. Test for curvature in this experiment. Interpret the results. Design Expert Output Response: Filtration Rate ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. C D <. AC <. AD <. Curvature not significant Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 68. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, C, D, AC, AD are significant model terms. The "Curvature F-value" of.75 implies the curvature (as measured by difference between the average of the center points and the average of the factorial points) in the design space is not significant relative to the noise. There is a.66% chance that a "Curvature F-value" this large could occur due to noise. There is no indication of curvature. 6-49

184 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 6-9 A missing value in a k factorial. It is not unusual to find that one of the observations in a k design is missing due to faulty measuring equipment, a spoiled test, or some other reason. If the design is replicated n times (n>) some of the techniques discussed in Chapter 4 can be employed, including estimating the missing observations. However, for an unreplicated factorial (n-) some other method must be used. One logical approach is to estimate the missing value with a number that makes the highestorder interaction contrast zero. Apply this technique to the experiment in Example 6- assuming that run ab is missing. Compare the results with the results of Example 6-. Treatment Com bination R esponse R esponse * ABCD ABCD A B C D () a b ab missing m issing * - - c ac bc abc d ad - - bd abd cd acd bcd abcd Contrast(ABCD )= missing -54 = missing = 54 Substitute the value 54 for the missing run at ab. Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Model AB Model AC Model AD Model BC Model BD Model CD Model ABC Model ABD Model ACD Model BCD Model ABCD Lenth's ME.5676 Lenth's SME

185 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Filtration Rate Normal plot A: Temperature B: Pressure C: Concentration D: Stirring Rate Normal % probability AC C A AD D Effect 6-3 An engineer has performed an experiment to study the effect of four factors on the surface roughness of a machined part. The factors (and their levels) are A = tool angle ( degrees, 5 degrees), B = cutting fluid viscosity (3, 4), C = feed rate ( in/min, 5 in/min), and D = cutting fluid cooler used (no, yes). The data from this experiment (with the factors coded to the usual -, + levels) are shown below. Run A B C D Surface Roughness (a) Estimate the factor effects. Plot the effect estimates on a normal probability plot and select a tentative model. 6-5

186 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Surface Roughness Normal plot A: Tool Angle B: Viscosity C: Feed Rate D: Cutting Fluid Normal % probability B AB A C Effect (b) Fit the model identified in part (a) and analyze the residuals. Is there any indication of model inadequacy? Design Expert Output Response: Surface Roughness ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 6.46E-6 4.6E <. significant A 8.556E E <. B 3.8E-6 3.8E-6. <. C.3E-6.3E <. AB.44E-6.44E <. Residual.53E-7.393E-8 Cor Total 6.559E-6 5 The Model F-value of 4.97 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. 6-5

187 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability E-5 Residuals 5E E E-5 5E E Residual Predicted Residuals vs. Tool Angle E-5 Residuals 5E E Tool Angle The plot of residuals versus predicted shows a slight u-shaped appearance in the residuals, and the plot of residuals versus tool angle shows an outward-opening funnel. (c) Repeat the analysis from parts (a) and (b) using /y as the response variable. Is there and indication that the transformation has been useful? The plots of the residuals are more representative of a model that does not violate the constant variance assumption. 6-53

188 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot./(Surface Roughness) Normal plot A: Tool Angle B: Viscosity C: Feed Rate D: Cutting Fluid Normal % probability C AB A B Effect Design Expert Output Response: Surface RoughnessTransform: Inverse ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.59E <. significant A <. B <. C <. AB <. Residual Cor Total.63E+55 The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. DESIGN-EXPERT Plot./(Surface Roughness) Residuals vs. Predicted DESIGN-EXPERT Plot./(Surface Roughness) Residuals vs. Tool Angle Residuals Residuals Predicted Tool Angle 6-54

189 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (d) Fit a model in terms of the coded variables that can be used to predict the surface roughness. Convert this prediction equation into a model in the natural variables. Design Expert Output Final Equation in Terms of Coded Factors:./(Surface Roughness) = * A * B * C * A * B 6-3 Resistivity on a silicon wafer is influenced by several factors. The results of a 4 factorial experiment performed during a critical process step is shown below. Run A B C D Resistivity (a) Estimate the factor effects. Plot the effect estimates on a normal probability plot and select a tentative model. 6-55

190 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Re si stivi ty Normal plot A: A B: B C: C D: D Normal % probability B AB A Effect (b) Fit the model identified in part (a) and analyze the residuals. Is there any indication of model inadequacy? The normal probability plot of residuals is not satisfactory. The plots of residual versus predicted, residual versus factor A, and the residual versus factor B are funnel shaped indicating non-constant variance. Design Expert Output Response: Resistivity ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. AB <. Residual Cor Total The Model F-value of 48.8 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. 6-56

191 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. A Residuals vs. B Residuals.75 Residuals A B (c) Repeat the analysis from parts (a) and (b) using ln(y) as the response variable. Is there any indication that the transformation has been useful? 6-57

192 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot L n (Re si stivi ty) Normal plot A: A B: B C: C D: D Normal % probability B A Effect Design Expert Output Response: Resistivity Transform: Natural log Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. Residual.4 3. Cor Total.3 5 The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. The transformed data no longer indicates that the AB interaction is significant. A simpler model has resulted from the log transformation. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted 6-58

193 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. A Residuals vs. B Residuals Residuals A B The residual plots are much improved. (d) Fit a model in terms of the coded variables that can be used to predict the resistivity. Design Expert Output Final Equation in Terms of Coded Factors: Ln(Resistivity) = * A -.3 * B 6.3 Continuation of Problem 6-3. Suppose that the experiment had also run four center points along with the 6 runs in Problem 6-3. The resistivity measurements at the center points are: 8.5, 7.63, 8.95, Analyze the experiment again incorporating the center points. What conclusions can you draw now? DESIGN-EXPERT Plot Re si stivi ty Normal plot A: A B: B C: C D: D Norm al % probability B AB A Effect 6-59

194 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Resistivity ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. AB <. Curvature <. significant Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 9.35 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted Repeated analysis with the natural log transformation. 6-6

195 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot L n (Re si stivi ty) Normal plot A: A B: B C: C D: D Normal % probability B A Effect Design Expert Output Response: Resistivity Transform: Natural log Constant:. ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. Curvature <. significant Residual. 6. Lack of Fit not significant Pure Error Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. The "Curvature F-value" of 9.98 implies there is significant curvature (as measured by difference between the average of the center points and the average of the factorial points) in the design space. There is only a.% chance that a "Curvature F-value" this large could occur due to noise. The curvature test indicates that the model has significant pure quadratic curvature Often the fitted regression model from a k factorial design is used to make predictions at points of interest in the design space. (a) Find the variance of the predicted response ŷ at the point x, x,, x k in the design space. Hint: Remember that the x s are coded variables, and assume a k design with an equal number of replicates n at each design point so that the variance of a regression coefficient ˆ is covariance between any pair of regression coefficients is zero. Let s assume that the model can be written as follows: k n and that the 6-6

196 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY yˆ( x )= ˆ ˆ x ˆ x... ˆ pxp where x [ x, x,..., x k ] are the values of the original variables in the design at the point of interest where a prediction is required, and the variables in the model x, x,..., xp potentially include interaction terms among the original k variables. Now the variance of the predicted response is V[ yˆ ( x)] V( ˆ ˆ ˆ ˆ xx... pxp) V( ˆ ) V( ˆ x ) V( ˆ x )... V( ˆ x ) n p k xi i This result follows because the design is orthogonal and all model parameter estimates have the same variance. Remember that some of the x s involved in this equation are potentially interaction terms. (b) Use the result of part (a) to find an equation for a (-)% confidence interval on the true mean response at the point x, x,, x k in the design space. The confidence interval is /, dfe /, df E yˆ( x) t V[ yˆ( x)] y( x) yˆ( x) t V[ yˆ( x )] p p where df E is the number of degrees of freedom used to estimate and the estimate of has been used in computing the variance of the predicted value of the response at the point of interest Hierarchical Models. Several times we have utilized the hierarchy principal in selecting a model; that is, we have included non-significant terms in a model because they were factors involved in significant higher-order terms. Hierarchy is certainly not an absolute principle that must be followed in all cases. To illustrate, consider the model resulting from Problem 6-, which required that a nonsignificant main effect be included to achieve hierarchy. Using the data from Problem 6-: (a) Fit both the hierarchical model and the non-hierarchical model. Design Expert Output for Hierarchial Model Response: Life in hours ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. C AC Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of.54 implies the model is significant. There is only 6-6

197 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B, C, AC are significant model terms. Std. Dev. 5.5 R-Squared.753 Mean 4.83 Adj R-Squared.6674 C.V Pred R-Squared.566 PRESS 98.5 Adeq Precision.747 The "Pred R-Squared" of.566 is in reasonable agreement with the "Adj R-Squared" of A difference greater than. between the "Pred R-Squared" and the "Adj R-Squared" indicates a possible problem with your model and/or data. Design Expert Output for Non-Hierarchical Model Response: Life in hours ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant B <. C AC Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 7.57 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B, C, AC are significant model terms. The "Lack of Fit F-value" of.78 implies the Lack of Fit is not significant relative to the pure error. There is a 55.66% chance that a "Lack of Fit F-value" this large could occur due to noise. Non-significant lack of fit is good -- we want the model to fit. Std. Dev R-Squared.749 Mean 4.83 Adj R-Squared.6837 C.V. 3.5 Pred R-Squared.639 PRESS 89.9 Adeq Precision.3 The "Pred R-Squared" of.639 is in reasonable agreement with the "Adj R-Squared" of A difference greater than. between the "Pred R-Squared" and the "Adj R-Squared" indicates a possible problem with your model and/or data. (b) Calculate the PRESS statistic, the adjusted R and the mean square error for both models. The PRESS and R are in the Design Expert Output above. The PRESS is smaller for the nonhierarchical model than the hierarchical model suggesting that the non-hierarchical model is a better predictor. (c) Find a 95 percent confidence interval on the estimate of the mean response at a cube corner ( x = x = x 3 = ). Hint: Use the result of Problem Design Expert Output Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Life Life Life Life Life Life

198 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Life Life (d) Based on the analyses you have conducted, which model would you prefer? Notice that PRESS is smaller and the adjusted R is larger for the non-hierarchical model. This is an indication that strict adherence to the hierarchy principle isn t always necessary. Note also that the confidence interval is shorter for the non-hierarchical model. 6-64

199 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 7 Blocking and Confounding in the k Factorial Design Solutions 7- Consider the experiment described in Problem 6-. Analyze this experiment assuming that each replicate represents a block of a single production shift. Source of Sum of Degrees of Mean Variation Squares Freedom Square F Cutting Speed (A) < Tool Geometry (B) * Cutting Angle (C) * AB < AC * BC ABC < Blocks.58.9 Error Total Design Expert Output Response: Life in hours ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.58.9 Model significant A B C AC Residual Cor Total The Model F-value of.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B, C, AC are significant model terms. These results agree with the results from Problem 6-. Tool geometry, cutting angle and the cutting speed x cutting angle factors are significant at the 5% level. The Design Expert program also includes A, speed, in the model to preserve hierarchy. 7- Consider the experiment described in Problem 6-5. Analyze this experiment assuming that each one of the four replicates represents a block. Source of Sum of Degrees of Mean Variation Squares Freedom Square F Bit Size (A) * 7-

200 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Cutting Speed (B) * AB * Blocks Error Total These results agree with those from Problem 6-5. Bit size, cutting speed and their interaction are significant at the % level. Design Expert Output Response: Vibration ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A <. B <. AB <. Residual Cor Total The Model F-value of 79.6 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. 7-3 Consider the alloy cracking experiment described in Problem 6-5. Suppose that only 6 runs could be made on a single day, so each replicate was treated as a block. Analyze the experiment and draw conclusions. The analysis of variance for the full model is as follows: Design Expert Output Response: Crack Lengthin mm x ^- ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.6.6 Model <. significant A <. B <. C <. D <. AB <. AC <. AD BC BD CD ABC <. ABD ACD.96E-3.96E BCD ABCD.596E-3.596E Residual Cor Total The Model F-value of 445. implies the model is significant. There is only 7-

201 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, D, AB, AC, ABC are significant model terms. The analysis of variance for the reduced model based on the significant factors is shown below. The BC interaction was included to preserve hierarchy. Design Expert Output Response: Crack Lengthin mm x ^- ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.6.6 Model <. significant A <. B <. C <. D <. AB <. AC <. BC ABC <. Residual Cor Total The Model F-value of 56. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, D, AB, AC, ABC are significant model terms. Blocking does not change the results of Problem Consider the data from the first replicate of Problem 6-. Suppose that these observations could not all be run using the same bar stock. Set up a design to run these observations in two blocks of four observations each with ABC confounded. Analyze the data. Block Block () a ab b ac c bc abc From the normal probability plot of effects, B, C, and the AC interaction are significant. Factor A was included in the analysis of variance to preserve hierarchy. 7-3

202 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Life Normal plot A: Cutting Speed B: Tool Geometry C: Cutting Angle B Normal % probability AC C Effect Design Expert Output Response: Life in hours ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model not significant A B C AC Residual Cor Total The "Model F-value" of 7.3 implies the model is not significant relative to the noise. There is a.38 % chance that a "Model F-value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case there are no significant model terms. This design identifies the same significant factors as Problem Consider the data from the first replicate of Problem 6-7. Construct a design with two blocks of eight observations each with ABCD confounded. Analyze the data. Block Block () a ab b ac c bc d ad abc bd abd cd acd abcd bcd 7-4

203 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The significant effects are identified in the normal probability plot of effects below: DESIGN-EXPERT Plot yield Normal plot A: A B: B C: C D: D D Normal % probability A AD ABC ABD AB Effect AC, BC, and BD were included in the model to preserve hierarchy. Design Expert Output Response: yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A B C D AB AC AD BC BD ABC ABD Residual Cor Total The Model F-value of 9.64 implies the model is significant. There is only a 4.38% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, D, ABC, ABD are significant model terms. 7-6 Repeat Problem 7-5 assuming that four blocks are required. Confound ABD and ABC (and consequently CD) with blocks. Block Block Block 3 Block 4 () ac c a ab bc abc b acd d ad cd 7-5

204 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY bcd abd bd abcd DESIGN-EXPERT Plot yield Normal plot A: A B: B C: C D: D Normal % probability A D AB ABCD Effect Design Expert Output Response: yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A D AB ABCD Residual Cor Total The Model F-value of 3.37 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, D, AB are significant model terms. 7-7 Using the data from the 5 design in Problem 6-, construct and analyze a design in two blocks with ABCDE confounded with blocks. Block Block Block Block () ae a e ab be b abe ac ce c ace bc abce abc bce ad de d ade bd abde abd bde cd acde acd cde abcd bcde bcd abcde 7-6

205 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The normal probability plot of effects identifies factors A, B, C, and the AB interaction as being significant. This is confirmed with the analysis of variance. DESIGN-EXPERT Plot Yield Normal plot A: Aperture B: Exposure Time C: Develop Time D: M ask Dim ension E: Etch Time Normal % probability C AB A B Effect Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.8.8 Model <. significant A <. B <. C <. AB <. Residual Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. 7-8 Repeat Problem 7-7 assuming that four blocks are necessary. Suggest a reasonable confounding scheme. Use ABC, CDE, confounded with ABDE. The four blocks follow. Block Block Block 3 Block 4 () a ac c ab b bc abc acd cd d ad bcd abcd abd bd ace ce e ae bce abce abe be 7-7

206 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY de ade acde cde abde bde bcde abcde The normal probability plot of effects identifies the same significant effects as in Problem 7-7. DESIGN-EXPERT Plot Yield Normal plot A: Aperture B: Exposure Time C: Develop Time D: M ask Dim ension E: Etch Time Normal % probability C AB A B Effect Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A <. B <. C <. AB <. Residual Cor Total The Model F-value of 69.4 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. 7-9 Consider the data from the 5 design in Problem 6-. Suppose that it was necessary to run this design in four blocks with ACDE and BCD (and consequently ABE) confounded. Analyze the data from this design. Block Block Block 3 Block 4 () a b c ae e abe ace cd acd bcd d abc bc ac ab acde cde abcde ade 7-8

207 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY bce abce ce be abd bd ab abcd bde abde de bcde Even with four blocks, the same effects are identified as significant per the normal probability plot and analysis of variance below: DESIGN-EXPERT Plot Yield Normal plot A: Aperture B: Exposure Time C: Develop Time D: M ask Dim ension E: Etch Time Normal % probability C AB A B Effect Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A <. B <. C <. AB <. Residual Cor Total The Model F-value of 9.6 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, C, AB are significant model terms. 7- Design an experiment for confounding a 6 factorial in four blocks. Suggest an appropriate confounding scheme, different from the one shown in Table 7-8. We choose ABCE and ABDF. Which also confounds with CDEF Block Block Block 3 Block 4 a c ac () b abc bc ab 7-9

208 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY cd ad d acd abcd bd abd bcd ace e ae ce bce abe be abce de acde cde ade abde bcde abcde bde cf af f acf abcf bf abf bcf adf cdf acdf df bdf abcdf bcdf abdf ef acef cef aef abef bcef abcef bef acdef def adef cdef bcdef abdef bdef abcdef 7- Consider the 6 design in eight blocks of eight runs each with ABCD, ACE, and ABEF as the independent effects chosen to be confounded with blocks. Generate the design. Find the other effects confound with blocks. Block Block Block 3 Block 4 Block 5 Block 6 Block 7 Block 8 b abc a c ac () bc ab acd d bcd abd bd abcd ad cd ce ae abce be abe bce e ace abde bcde de acde cde ade abcde bde abcf bf cf af f acf abf bcf de acdf abdf bcdf abcdf bdf cdf adf aef cef def abcef bcef abef acef ef bcdef abdef acdef def adef cdef bdef abcdef The factors that are confounded with blocks are ABCD, ABEF, ACE, BDE, CDEF, BCF, and ADF. 7- Consider the design in two blocks with AB confounded. Prove algebraically that SS AB = SS Blocks. If AB is confounded, the two blocks are: SS SS Blocks Blocks ab a b ab ab a Block Block () a ab b () + ab a + b ab a b 4 b ab 7-

209 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY SS SS Blocks Blocks ab a b ab a b aab bab ab a b ab ab a b aab bab 4 ab a b SS AB 4 4 ab 7-3 Consider the data in Example 7-. Suppose that all the observations in block are increased by. Analyze the data that would result. Estimate the block effect. Can you explain its magnitude? Do blocks now appear to be an important factor? Are any other effect estimates impacted by the change you made in the data? Block Effect yblock yblock This is the block effect estimated in Example 7- plus the additional units that were added to each observation in block. All other effects are the same. Source of Sum of Degrees of Mean Variation Squares Freedom Square F A C D AC AD Blocks Error Total Design Expert Output Response: Filtration in gal/hr ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A <. C D AC <. AD <. Residual Cor Total The Model F-value of 53.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, C, D, AC, AD are significant model terms. 7-

210 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 7-4 Suppose that the data in Problem 7- we had confounded ABC in replicate I, AB in replicate II, and BC in replicate III. Construct the analysis of variance table. Replicate I Replicate II Replicate III (ABC Confounded) (AB Confounded) (BC Confounded) Block-> () a () a () b ab b ab b bc c ac c abc ac abc ab bc abc c bc a ac Source of Sum of Degrees of Mean Variation Squares Freedom Square F A < B C AB (reps and III) < AC BC (reps I and II) < ABC (reps II and III).6.6 < Blocks within replicates Replicates.58 Error Total Repeat Problem 7- assuming that ABC was confounded with blocks in each replicate. Replicate I, II, and III (ABC Confounded) Block-> () a ab b ac c bc abc Source of Sum of Degrees of Mean Variation Squares Freedom Square F A < B C AB < AC BC Blocks (or ABC) Replicates/Lack of Fit

211 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error Total Suppose that in Problem 7-7 ABCD was confounded in replicate I and ABC was confounded in replicate II. Perform the statistical analysis of variance. Source of Sum of Degrees of Mean Variation Squares Freedom Square F A B C D AB AC < AD BC < BD.8.8 < CD ABC ABD ACD < BCD < ABCD Replicates.8.8 Blocks Error Total Construct a 3 design with ABC confounded in the first two replicates and BC confounded in the third. Outline the analysis of variance and comment on the information obtained. Replicate I Replicate II Replicate III (ABC Confounded) (ABC Confounded) (BC Confounded) Block-> () a () a () b ab b ab b bc c ac c ac c abc ab bc abc bc abc a ac Source of Degrees of Variation Freedom A B C 7-3

212 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY AB AC BC ABC Replicates Blocks 3 Error Total 3 This design provides two-thirds information on BC and one-third information on ABC. 7-4

213 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 8 Two-Level Fractional Factorial Designs Solutions 8- Suppose that in the chemical process development experiment in Problem 6-7, it was only possible to run a one-half fraction of the 4 design. Construct the design and perform the statistical analysis, using the data from replicate. The required design is a 4- with I=ABCD. A B C D=ABC () ad bd ab cd ac bc abcd 8 Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Model AB Model AC Model AD Error BC Aliased Error BD Aliased Error CD Aliased Error ABC Aliased Error ABD Aliased Error ACD Aliased Error BCD Aliased Error ABCD Aliased Lenth's ME.5856 Lenth's SME

214 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Half Normal plot 99 Half Normal % probability D B AD AB A Effect The largest effect is A. The next largest effects are the AB and AD interactions. A plausible tentative model would be A, AB and AD, along with B and D to preserve hierarchy. Design Expert Output Response: yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B D AB AD Residual Cor Total The "Model F-value" of 4.87 implies the model is not significant relative to the noise. There is a 7.9 % chance that a "Model F-value" this large could occur due to noise. Std. Dev. 4. R-Squared.94 Mean 85. Adj R-Squared.7344 C.V Pred R-Squared -.43 PRESS 544. Adeq Precision 6.44 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A B-B D-D AB AD Final Equation in Terms of Coded Factors: yield = * A -.5 * B -.5 * D +3. * A * B -.5 * A * D 8-

215 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Final Equation in Terms of Actual Factors: yield = * A -.5 * B -.5 * D +3. * A * B -.5 * A * D The Design-Expert output indicates that we really only need the main effect of factor A. The updated analysis is shown below: Design Expert Output Response: yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A Residual Cor Total The Model F-value of.8 implies the model is significant. There is only a.67% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 5.6 R-Squared.649 Mean 85. Adj R-Squared.5833 C.V. 6.8 Pred R-Squared.365 PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A Final Equation in Terms of Coded Factors: yield = * A Final Equation in Terms of Actual Factors: yield = * A 8- Suppose that in Problem 6-5, only a one-half fraction of the 4 design could be run. Construct the design and perform the analysis, using the data from replicate I. The required design is a 4- with I=ABCD. A B C D=ABC () ad bd ab cd ac bc

216 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY abcd.85 Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Error B Model C Error D Error AB Model AC Error AD Error BC Aliased Error BD Aliased Error CD Aliased Error ABC Aliased Error ABD Aliased Error ACD Aliased Error BCD Aliased Error ABCD Aliased Lenth's ME.397 Lenth's SME.958 C, A and AC + BD are the largest three effects. Now because the main effects of A and C are large, the large effect estimate for the AC + BD alias chain probably indicates that the AC interaction is important. Half Normal plot 99 Half Normal % probability AC A C Effect Design Expert Output Response: Crack Lengthin mm x ^- ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A C AC Residual Cor Total.85 7 The Model F-value of 5.64 implies there is a 6.4% chance that a "Model F-Value" this large could occur due to noise. Std. Dev.. R-Squared

217 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Mean.55 Adj R-Squared.665 C.V. 3. Pred R-Squared.348 PRESS.65 Adeq Precision 5.7 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Pour Temp C-Heat Tr Mtd E-3. AC Final Equation in Terms of Coded Factors: Crack Length = * A -. * C -.5 * A * C Final Equation in Terms of Actual Factors: Crack Length = * Pour Temp -.75 * Heat Treat Method -.5 * Pour Temp * Heat Treat Method 8-3 Consider the plasma etch experiment described in Problem 6-8. Suppose that only a one-half fraction of the design could be run. Set up the design and analyze the data. Etch Rate Factor Levels A B C D=ABC (A/min) Low (-) High (+) A (cm) B (mtorr) C (SCCM) D (W) Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B Model C Model D Error AB Error AC Error AD -.79 Error BC Aliased Error BD Aliased Model CD Aliased Error ABC Aliased Error ABD Aliased Error ACD Aliased Error BCD Aliased Error ABCD Aliased Lenth's ME Lenth's SME

218 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Half Normal plot 99 Half Normal % probability D AB C Effect The large AB + CD alias chain is most likely the CD interaction. Design Expert Output Response: Etch Rate in A/min ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.79E <. significant C.688E+5.688E <. D CD Residual Cor Total.88E+5 7 The Model F-value of 7.5 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev.. R-Squared.9936 Mean 756. Adj R-Squared.9888 C.V..8 Pred R-Squared.9744 PRESS 788. Adeq Precision 3.56 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept C-Gas Flow D-Power CD Final Equation in Terms of Coded Factors: Etch Rate = * C * D * C * D Final Equation in Terms of Actual Factors: Etch Rate = * Gas Flow * Power * Gas Flow * Power 8-6

219 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 8-4 Problem 6-describes a process improvement study in the manufacturing process of an integrated circuit. Suppose that only eight runs could be made in this process. Set up an appropriate 5- design and find the alias structure. Use the appropriate observations from Problem 6- as the observations in this design and estimate the factor effects. What conclusions can you draw? I = ABD = ACE = BCDE A (ABD) =BD A (ACE) =CE A (BCDE) =ABCDE A=BD=CE=ABCDE B (ABD) =AD B (ACE) =ABCE B (BCDE) =CDE B=AD=ABCE=CDE C (ABD) =ABCD C (ACE) =AE C (BCDE) =BDE C=ABCD=AE=BDE D (ABD) =AB D (ACE) =ACDE D (BCDE) =BCE D=AB=ACDE=BCE E (ABD) =ABDE E (ACE) =AC E (BCDE) =BCD E=ABDE=AC=BCD BC (ABD) =ACD BC (ACE) =ABE BC (BCDE) =DE BC=ACD=ABE=DE BE (ABD) =ADE BE (ACE) =ABC BE (BCDE) =CD BE=ADE=ABC=CD A B C D=AB E=AC de a be abd cd ace bc abcde 63 Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Error E Error BC Error BE Lenth's ME 8.3 Lenth's SME Half Normal plot 99 Half Normal % probability D C A B Effect 8-7

220 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The main A, B, C, and D are large. However, recall that you are really estimating A+BD+CE, B+AD, C+DE and D+AD. There are other possible interpretations of the experiment because of the aliasing. Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C D Residual Cor Total The Model F-value of implies the model is significant. There is only a.7% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..73 R-Squared.99 Mean 3.38 Adj R-Squared.986 C.V Pred R-Squared.9439 PRESS 59. Adeq Precision 5.59 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Aperture B-Exposure Time C-Develop Time D-Mask Dimension Final Equation in Terms of Coded Factors: Yield = * A * B * C * D Final Equation in Terms of Actual Factors: Aperture small Mask Dimension Small Yield = * Exposure Time * Develop Time Aperture large Mask Dimension Small Yield = * Exposure Time * Develop Time Aperture small Mask Dimension Large Yield = * Exposure Time * Develop Time Aperture large Mask Dimension Large Yield =

221 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Exposure Time * Develop Time 8-5 Continuation of Problem 8-4. Suppose you have made the eight runs in the 5- design in Problem 8-4. What additional runs would be required to identify the factor effects that are of interest? What are the alias relationships in the combined design? We could fold over the original design by changing the signs on the generators D = AB and E = AC to produce the following new experiment. A B C D=-AB E=-AC () ade bd abe ce acd bcde abc 6 A (-ABD) =-BD A (-ACE) =-CE A (BCDE) =ABCDE A=-BD=-CE=ABCDE B (-ABD) =-AD B (-ACE) =-ABCE B (BCDE) =CDE B=-AD=-ABCE=CDE C (-ABD) =-ABCD C (-ACE) =-AE C (BCDE) =BDE C=-ABCD=-AE=BDE D (-ABD) =-AB D (-ACE) =-ACDE D (BCDE) =BCE D=-AB=-ACDE=BCE E (-ABD) =-ABDE E (-ACE) =-AC E (BCDE) =BCD E=-ABDE=-AC=BCD BC (-ABD) =-ACD BC (-ACE) =-ABE BC (BCDE) =DE BC=-ACD=-ABE=DE BE (-ABD) =-ADE BE (-ACE) =-ABC BE (BCDE) =CD BE=-ADE=-ABC=CD Assuming all three factor and higher interactions to be negligible, all main effects can be separated from their two-factor interaction aliases in the combined design. 8-6 R.D. Snee ( Experimenting with a Large Number of Variables, in Experiments in Industry: Design, Analysis and Interpretation of Results, by R.D. Snee, L.B. Hare, and J.B. Trout, Editors, ASQC, 985) describes an experiment in which a 5- design with I=ABCDE was used to investigate the effects of five factors on the color of a chemical product. The factors are A = solvent/reactant, B = catalyst/reactant, C = temperature, D = reactant purity, and E = reactant ph. The results obtained were as follows: e = -.63 d = 6.79 a =.5 ade = 5.47 b = -.68 bde = 3.45 abe =.66 abd = 5.68 c =.6 cde = 5. ace =. acd = 4.38 bce = -.9 bcd = 4.3 abc =.93 abcde = 4.5 (a) Prepare a normal probability plot of the effects. Which effects seem active? Factors A, B, D, and the AB, AD interactions appear to be active. 8-9

222 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Color Normal plot A: Solvent/Reactant B: Catalyst/Reactant C: Temperature D: Reactant Purity E: Reactant ph Normal % probability B AD A AB D Effect Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Error C Model D Error E Model AB Error AC Model AD Error AE Error BC Error BD Error BE Error CD Error CE Error DE Lenth's ME Lenth's SME Design Expert Output Response: Color ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B D <. AB AD Residual Cor Total The Model F-value of 4.53 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..93 R-Squared.946 Mean.7 Adj R-Squared.8869 C.V Pred R-Squared.87 PRESS.4 Adeq Precision Coefficient Standard 95% CI 95% CI 8-

223 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Estimate DF Error Low High VIF Intercept A-Solvent/Reactant B-Catalyst/Reactant D-Reactant Purity AB AD Final Equation in Terms of Coded Factors: Color = * A -.67 * B +. * D +.64 * A * B -.68 * A * D Final Equation in Terms of Actual Factors: Color = * Solvent/Reactant -.67 * Catalyst/Reactant +. * Reactant Purity * Solvent/Reactant * Catalyst/Reactant * Solvent/Reactant * Reactant Purity (b) Calculate the residuals. Construct a normal probability plot of the residuals and plot the residuals versus the fitted values. Comment on the plots. Design Expert Output Diagnostics Case Statistics Standard Actual Predicted Student Cook's Outlier Run Order Value Value Residual Leverage Residual Distance t Order

224 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted The residual plots are satisfactory. (c) If any factors are negligible, collapse the 5- design into a full factorial in the active factors. Comment on the resulting design, and interpret the results. The design becomes two replicates of a 3 in the factors A, B and D. When re-analyzing the data in three factors, D becomes labeled as C. Design Expert Output Response: Color ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C <. AB AC BC ABC Residual Lack of Fit. Pure Error Cor Total The Model F-value of 4.89 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Std. Dev.. R-Squared.987 Mean.7 Adj R-Squared.8663 C.V Pred R-Squared.748 PRESS 3.7 Adeq Precision.736 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Solvent/Reactant B-Catalyst/Reactant C-Reactant Purity AB AC

225 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY BC ABC Final Equation in Terms of Coded Factors: Color = * A -.67 * B +. * C +.64 * A * B -.68 * A * C +. * B * C -. * A * B * C Final Equation in Terms of Actual Factors: Color = * Solvent/Reactant -.67 * Catalyst/Reactant +. * Reactant Purity * Solvent/Reactant * Catalyst/Reactant * Solvent/Reactant * Reactant Purity +.5 * Catalyst/Reactant * Reactant Purity -. * Solvent/Reactant * Catalyst/Reactant * Reactant Purity 8-7 An article by J.J. Pignatiello, Jr. And J.S. Ramberg in the Journal of Quality Technology, (Vol. 7, 985, pp. 98-6) describes the use of a replicated fractional factorial to investigate the effects of five factors on the free height of leaf springs used in an automotive application. The factors are A = furnace temperature, B = heating time, C = transfer time, D = hold down time, and E = quench oil temperature. The data are shown below: A B C D E Free Height (a) Write out the alias structure for this design. What is the resolution of this design? I=ABCD, Resolution IV A (ABCD)= BCD B (ABCD)= ACD C (ABCD)= ABD D (ABCD)= ABC E (ABCD)= ABCDE AB (ABCD)= CD AC (ABCD)= BD AD (ABCD)= BC 8-3

226 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY AE (ABCD)= BCDE BE (ABCD)= ACDE CE (ABCD)= ABDE DE (ABCD)= ABCE (b) Analyze the data. What factors influence the mean free height? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Model E Model AB Model AC.5.875E Model AD Model AE Error BC Aliased Error BD Aliased Model BE Error CD Aliased Model CE Model DE Error Pure Error Lenth's ME.8857 Lenth's SME Normal plot 99 BE A 95 Normal % probability E B Effect Design Expert Output Response:Free Height ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B E <. BE Residual Lack of Fit not significant 8-4

227 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Pure Error Cor Total The Model F-value of 3.74 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..4 R-Squared.6883 Mean 7.63 Adj R-Squared.6593 C.V..9 Pred R-Squared.66 PRESS. Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Furnace Temp B-Heating Time E-Quench Temp BE Final Equation in Terms of Coded Factors: Free Height = * A -.8 * B -. * E +.76 * B * E Final Equation in Terms of Actual Factors: Free Height = * Furnace Temp * Heating Time * Quench Temp * Heating Time * Quench Temp (c) Calculate the range and standard deviation of the free height for each run. Is there any indication that any of these factors affects variability in the free height? Design Expert Output (Range) Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Error D Model E Error AB Error AC Error AD Error AE E Model BC Aliased Error BD Aliased Model BE Error CD Aliased Model CE Error DE Error ABC Aliased Error ABD Aliased Error ABE Error ACD Aliased Error ACE Error ADE Error BCD Aliased Model BCE Aliased Error BDE Aliased Error CDE Aliased Lenth's ME

228 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Lenth's SME.6494 Interaction ADE is aliased with BCE. Although the plot below identifies ADE, BCE was included in the analysis. DESIGN-EXPERT Plot Range Half Normal plot A: Furnace Temp B: Heating Time C: Transfer Time D: Hold Tim e E: Quench Temp Half Normal % probability E C A B AD E CE Effect Design Expert Output (Range) Response: Range ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C.756E-3.756E E 7.56E E BC 5.56E E BE.56E-3.56E CE BCE Residual E-3 Cor Total.3 5 The Model F-value of 5.7 implies the model is significant. There is only a.67% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..78 R-Squared.8668 Mean. Adj R-Squared.746 C.V Pred R-Squared.343 PRESS. Adeq Precision 7.66 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Furn Temp B-Heat Time C-Transfer Time E-Qnch Temp E BC BE 8.5E CE BCE Final Equation in Terms of Coded Factors: 8-6

229 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Range = * A -.63 * B +.3 * C E-3 * E +.8 * B * C +8.5E-3 * B * E -.68 * C * E +.69 * B * C * E Final Equation in Terms of Actual Factors: Range = * Furnace Temp * Heating Time +.35 * Transfer Time E-3 * Quench Temp +.85 * Heating Time * Transfer Time +8.5E-3 * Heating Time * Quench Temp * Transfer Time * Quench Temp * Heating Time * Transfer Time * Quench Temp Design Expert Output (StDev) Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Error D Model E Error AB Error AC Error AD Error AE E Model BC Aliased Error BD Aliased Model BE Error CD Aliased Model CE Error DE E Error ABC Aliased Error ABD Aliased Error ABE Error ACD Aliased Error ACE Error ADE Aliased Error BCD Aliased Model BCE Error BDE Aliased Error CDE Aliased Lenth's ME Lenth's SME.66 Interaction ADE is aliased with BCE. Although the plot below identifies ADE, BCE was included in the analysis. 8-7

230 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot StDev Half Normal plot A: Furnace Temp B: Heating Time C: Transfer Time D: Hold Tim e E: Quench Temp Half Normal % probability C BE A B CE AD E Effect Design Expert Output (StDev) Response: StDev ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C 4.466E E E.87E-4.87E BC.453E-3.453E BE 3.74E E CE BCE Residual. 7.58E-3 Cor Total.93 5 The Model F-value of 6.8 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..39 R-Squared.8863 Mean. Adj R-Squared.7565 C.V Pred R-Squared.46 PRESS.55 Adeq Precision 7.86 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept. 9.78E A-Furnace Temp E E B-Heating Time E C-Transfer Time 5.83E E E-Quench Temp -3.4E E BC 9.53E E BE 4.383E E CE E BCE E Final Equation in Terms of Coded Factors: StDev = * A -.36 * B +5.83E-3 * C 8-8

231 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -3.4E-3 * E +9.53E-3 * B * C E-3 * B * E -.36 * C * E +.38 * B * C * E Final Equation in Terms of Actual Factors: StDev = * Furnace Temp * Heating Time E-3 * Transfer Time -3.47E-3 * Quench Temp E-3 * Heating Time * Transfer Time E-3 * Heating Time * Quench Temp * Transfer Time * Quench Temp * Heating Time * Transfer Time * Quench Temp (d) Analyze the residuals from this experiment, and comment on your findings. The residual plot follows. All plots are satisfactory. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted 8-9

232 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Furnace Temp Residuals vs. Heating Time Residuals Residuals Furnace Temp Heating Time Residuals vs. Transfer Time Residuals vs. Hold Time Residuals Residuals Transfer Tim e Hold Time Residuals vs. Quench Temp Residuals Quench Temp 8-

233 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (e) Is this the best possible design for five factors in 6 runs? Specifically, can you find a fractional design for five factors in 6 runs with a higher resolution than this one? This was not the best design. A resolution V design is possible by setting the generator equal to the highest order interaction, ABCDE. 8-8 An article in Industrial and Engineering Chemistry ( More on Planning Experiments to Increase Research Efficiency, 97, pp. 6-65) uses a 5- design to investigate the effect of A = condensation, B = amount of material, C = solvent volume, D = condensation time, and E = amount of material on yield. The results obtained are as follows: e = 3. ad = 6.9 cd = 3.8 bde = 6.8 ab = 5.5 bc = 6. ace = 3.4 abcde = 8. (a) Verify that the design generators used were I = ACE and I = BDE. A B C D=BE E=AC e ad bde ab cd ace bc abcde (b) Write down the complete defining relation and the aliases for this design. I=BDE=ACE=ABCD. A (BDE) =ABDE A (ACE) =CE A (ABCD) =BCD A=ABDE=CE=BCD B (BDE) =DE B (ACE) =ABCE B (ABCD) =ACD B=DE=ABCE=ACD C (BDE) =BCDE C (ACE) =AE C (ABCD) =ABD C=BCDE=AE=ABD D (BDE) =BE D (ACE) =ACDE D (ABCD) =ABC D=BE=ACDE=ABC E (BDE) =BD E (ACE) =AC E (ABCD) =ABCDE E=BD=AC=ABCDE AB (BDE) =ADE AB (ACE) =BCE AB (ABCD) =CD AB=ADE=BCE=CD AD (BDE) =ABE AD (ACE) =CDE AD (ABCD) =BC AD=ABE=CDE=BC (c) Estimate the main effects. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Model E (d) Prepare an analysis of variance table. Verify that the AB and AD interactions are available to use as error. The analysis of variance table is shown below. Part (b) shows that AB and AD are aliased with other factors. If all two-factor and three factor interactions are negligible, then AB and AD could be pooled as an estimate of error. 8-

234 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B C D E Residual Cor Total The "Model F-value" of 3. implies the model is not significant relative to the noise. There is a 5.37 % chance that a "Model F-value" this large could occur due to noise. Std. Dev..3 R-Squared.8895 Mean 9.4 Adj R-Squared.634 C.V..57 Pred R-Squared PRESS 58.6 Adeq Precision 5.44 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Condensation B-Material C-Solvent D-Time E-Material Final Equation in Terms of Coded Factors: Yield = * A -.59 * B +.4 * C -.34 * D +.4 * E Final Equation in Terms of Actual Factors: Yield = * Condensation * Material * Solvent * Time * Material (e) Plot the residuals versus the fitted values. Also construct a normal probability plot of the residuals. Comment on the results. The residual plots are satisfactory. 8-

235 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Predicted Normal plot of residuals Residuals E Normal % probability E Predicted Residual 8-9 Consider the leaf spring experiment in Problem 8-7. Suppose that factor E (quench oil temperature) is very difficult to control during manufacturing. Where would you set factors A, B, C and D to reduce variability in the free height as much as possible regardless of the quench oil temperature used? DESIGN-EXPERT Plot Free Height 8.8 Interaction Graph Heating Time X = E: Quench Temp Y = B: Heating Time 7.95 B- -. B+. Actual Factors A: Furnace Temp =. C: Transfer Time = D: Hold Time =. Free Height Quench Tem p Run the process with A at the high level, B at the low level, C at the low level and D at either level (the low level of D may give a faster process). 8- Construct a 7- design by choosing two four-factor interactions as the independent generators. Write down the complete alias structure for this design. Outline the analysis of variance table. What is the resolution of this design? I=CDEF=ABCG=ABDEFG, Resolution IV A B C D E F=CDE G=ABC 8-3

236 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY () ag bg ab cfg acf bcf abcfg df adfg bdfg abdf cdg acd bcd abcdg ef aefg befg abef ceg ace bce abceg de adeg bdeg abde cdefg acdef bcdef abcdefg Alias Structure A (CDEF)= ACDEF A(ABCG)= BCG A (ABDEFG)= BDEFG A=ACDEF=BCG=BDEFG B (CDEF)= BCDEF B(ABCG)= ACG B (ABDEFG)= ADEFG B=BCDEF=ACG=ADEFG C (CDEF)= DEF C(ABCG)= ABG C (ABDEFG)= ABCDEFG C=DEF=ABG=ABCDEFG D (CDEF)= CEF D(ABCG)= ABCDG D (ABDEFG)= ABEFG D=CEF=ABCDG=ABEFG E (CDEF)= CDF E(ABCG)= ABCEG E (ABDEFG)= ABDFG E=CDF=ABCEG=ABDFG F (CDEF)= CDE F(ABCG)= ABCFG F (ABDEFG)= ABDEG F=CDE=ABCFG=ABDEG G (CDEF)= CDEFG G(ABCG)= ABC G (ABDEFG)= ABDEF G=CDEFG=ABC=ABDEF AB (CDEF)= ABCDEF AB(ABCG)= CG AB (ABDEFG)= DEFG AB=ABCDEF=CG=DEFG AC (CDEF)= ADEF AC(ABCG)= BG AC (ABDEFG)= BCDEFG AC=ADEF=BG=BCDEFG AD (CDEF)= ACEF AD(ABCG)= BCDG AD (ABDEFG)= BEFG AD=ACEF=BCDG=BEFG AE (CDEF)= ACDF AE(ABCG)= BCEG AE (ABDEFG)= BDFG AE=ACDF=BCEG=BDFG AF (CDEF)= ACDE AF(ABCG)= BCFG AF (ABDEFG)= BDEG AF=ACDE=BCFG=BDEG AG (CDEF)= ACDEFG AG(ABCG)= BC AG (ABDEFG)= BDEF AG=ACDEFG=BC=BDEF BD (CDEF)= BCEF BD(ABCG)= ACDG BD (ABDEFG)= AEFG BD=BCEF=ACDG=AEFG BE (CDEF)= BCDF BE(ABCG)= ACEG BE (ABDEFG)= ADFG BE=BCDF=ACEG=ADFG BF (CDEF)= BCDE BF(ABCG)= ACFG BF (ABDEFG)= ADEG BF=BCDE=ACFG=ADEG CD (CDEF)= EF CD(ABCG)= ABDG CD (ABDEFG)= ABCEFG CD=EF=ABDG=ABCEFG CE (CDEF)= DF CE(ABCG)= ABEG CE (ABDEFG)= ABCDFG CE=DF=ABEG=ABCDFG CF (CDEF)= DE CF(ABCG)= ABFG CF (ABDEFG)= ABCDEG CF=DE=ABFG=ABCDEG DG (CDEF)= CEFG DG(ABCG)= ABCD DG (ABDEFG)= ABEF DG=CEFG=ABCD=ABEF EG (CDEF)= CDFG EG(ABCG)= ABCE EG (ABDEFG)= ABDF EG=CDFG=ABCE=ABDF FG (CDEF)= CDEG FG(ABCG)= ABCF FG (ABDEFG)= ABDE FG=CDEG=ABCF=ABDE Analysis of Variance Table Source Degrees of Freedom A B C D E F G AB=CG 8-4

237 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY AC=BG AD AE AF AG=BC BD BE CD=EF CE=DF CF=DE DG EG FG Error 9 Total 3 8- Consider the 5 design in Problem 6-. Suppose that only a one-half fraction could be run. Furthermore, two days were required to take the 6 observations, and it was necessary to confound the 5- design in two blocks. Construct the design and analyze the data. A B C D E=ABCD Data Blocks = AB Block e a b abe c ace bce abc d ade bde abd cde acd bcd abcde 63 + Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Error D Error E Error AB Aliased Error AC Error AD Error AE Error BC Error BD Error BE Error CD Error CE Error DE Lenth's ME.4663 Lenth's SME 5.57 The AB interaction in the above table is aliased with the three-factor interaction BCD, and is also confounded with blocks. 8-5

238 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Half Normal plot 99 Half Normal % probability A C B Effect Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model <. significant A <. B <. C <. Residual Cor Total The Model F-value of 63.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..7 R-Squared.994 Mean 3.44 Adj R-Squared.996 C.V Pred R-Squared.9878 PRESS 67.4 Adeq Precision 58. Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept Block 3.56 Block A-Aperture B-Exposure Time C-Develop Time Final Equation in Terms of Coded Factors: Yield = * A +6.8 * B +5.3 * C Final Equation in Terms of Actual Factors: Aperture small Yield = * Exposure Time 8-6

239 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Develop Time Aperture large Yield = * Exposure Time * Develop Time 8- Analyze the data in Problem 6-3 as if it came from a IV design with I = ABCD. Project the design into a full factorial in the subset of the original four factors that appear to be significant. 4 Run Yield Factor Levels Number A B C D=ABC (lbs) Low (-) High (+) () A (h) ad 5 B (%) bd 3 C (psi) ab 6 D (ºC) cd ac bc abcd 3 Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Error B Model C Model D Error AB Model AC Model AD Lenth's ME.74 Lenth's SME Half Normal plot 99 Half Normal % probability C A AD D AC Effect Design Expert Output Response: Yield in lbs ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] 8-7

240 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Sum of Mean F Source Squares DF Square Value Prob > F Model significant A C D AC AD Residual.5.6 Cor Total The Model F-value of 48.5 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..79 R-Squared.998 Mean 7.88 Adj R-Squared.974 C.V. 4.4 Pred R-Squared.869 PRESS. Adeq Precision 7.89 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time C-Pressure D-Temperature AC AD Final Equation in Terms of Coded Factors: Yield = * A +.37 * C +.3 * D -.3 * A * C +.3 * A * D Final Equation in Terms of Actual Factors: Yield = * Time * Pressure -.7 * Temperature -.85 * Time * Pressure +.68 * Time * Temperature 8-3 Repeat Problem 8- using I = -ABCD. Does use of the alternate fraction change your interpretation of the data? Run Yield Factor Levels Number A B C D=ABC (lbs) Low (-) High (+) d A (h) a 8 B (%) b 3 C (psi) abd 4 D (ºC) c acd bcd abc 5 Design Expert Output Term Effect SumSqr % Contribtn Model Intercept 8-8

241 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Model A Error B Model C Model D Error AB Model AC Model AD Lenth's ME.744 Lenth's SME 3.44 Half Normal plot 99 Half Normal % probability C D AD AC A Effect Design Expert Output Response: Yield in lbs ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A C D AC AD Residual.5. Cor Total The Model F-value of 3.58 implies the model is significant. There is only a 4.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..6 R-Squared.9833 Mean 6.88 Adj R-Squared.946 C.V. 6.9 Pred R-Squared.733 PRESS 36. Adeq Precision 4.45 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time C-Pressure D-Temperature AC AD Final Equation in Terms of Coded Factors: Yield = 8-9

242 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * A +.63 * C +.3 * D -.3 * A * C +.88 * A * D Final Equation in Terms of Actual Factors: Yield = * Time +.4 * Pressure -.56 * Temperature -.85 * Time * Pressure +.6 * Time * Temperature Project the IV design in Example 8- into two replicates of a design in the factors A and B. Analyze the data and draw conclusions. Design Expert Output Response: Filtration Rate ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B E AB E Residual Lack of Fit. Pure Error Cor Total The "Model F-value" of.4 implies the model is not significant relative to the noise. There is a 75.3 % chance that a "Model F-value" this large could occur due to noise. Std. Dev. 4. R-Squared.37 Mean 7.75 Adj R-Squared C.V. 34. Pred R-Squared -.53 PRESS 937. Adeq Precision.98 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Temperature B-Pressure AB Final Equation in Terms of Coded Factors: Filtration Rate = * A +.75 * B -.5 * A * B Final Equation in Terms of Actual Factors: Filtration Rate = * Temperature +.75 * Pressure -.5 * Temperature * Pressure 8-3

243 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Construct a III design. Determine the effects that may be estimated if a second fraction of this design is run with all signs reversed. A B C D=AB E=AC F=BC def af be abd cd ace bcf abcdef Principal Fraction l A =A+BD+CE l B =B+AD+CF l C =C+AE+BF l D =D+AB+EF l E =E+AC+DF l F =F+BC+DE l BE =BE+CD+AF Second Fraction l* A =A-BD-CE l* B =B-AD-CF l* C =C-AE-BF l* D =D-AB-EF l* E =E-AC-DF l* F =F-BC-DE l* BE =BE+CD+AF By combining the two fractions we can estimate the following: ( l i +l* I )/ ( l i -l* I )/ A BD+CE B AD+CF C AE+BF D AB+EF E AC+DF F BC+DE BE+CD+AF Consider the III design in Problem 8-5. Determine the effects that may be estimated if a second fraction of this design is run with the signs for factor A reversed. Principal Fraction l A =A+BD+CE l B =B+AD+CF l C =C+AE+BF l D =D+AB+EF l E =E+AC+DF l F =F+BC+DE l BE =BE+CD+AF Second Fraction l* A =-A+BD+CE l* B =B-AD+CF l* C =C-AE+BF l* D =D-AB+EF l* E =E-AC+DF l* F =F+BC+DE l* BE =BE+CD-AF By combining the two fractions we can estimate the following: ( l i -l* I )/ ( l i +l* I )/ A BD+CE AD B+CF 8-3

244 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY AE AB AC AF C+BF D+EF E+DF F+BC+DE 8-7 Fold over the design is a III design in Table 8-9 to produce a eight-factor design. Verify that the resulting IV design. Is this a minimal design? H A B C D=AB E=AC F=BC G=ABC Original Design Second Set of Runs w/ all Signs Switched After folding the original design over, we add a new factor H, and we have a design with generators 8 4 D=ABH, E=ACH, F=BCH, and G=ABC. This is a IV design. It is a minimal design, since it contains k=(8)=6 runs. 8-8 Fold over a design. Compare this 5 III design to produce a six-factor design. Verify that the resulting design is a 6 IV design to the in Table IV F A B C D=AB E=BC Original Design Second Set of Runs w/ all Signs Switched If we relabel the factors from left to right as A, B, C, D, E, F, then this design becomes IV with generators I=ABDF and I=BCEF. It is not a minimal design, since k=(6)= runs, and the design contains 6 runs

245 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 8-9 An industrial engineer is conducting an experiment using a Monte Carlo simulation model of an inventory system. The independent variables in her model are the order quantity (A), the reorder point (B), the setup cost (C), the backorder cost (D), and the carrying cost rate (E). The response variable is 5 average annual cost. To conserve computer time, she decides to investigate these factors using a III design with I = ABD and I = BCE. The results she obtains are de = 95, ae = 34, b = 58, abd = 9, cd = 9, ac = 87, bce = 55, and abcde = 85. (a) Verify that the treatment combinations given are correct. Estimate the effects, assuming three-factor and higher interactions are negligible. A B C D=AB E=BC de ae b abd cd ac bce abcde Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Error C Error D Error E Error AC Error AE Lenth's ME Lenth's SME DESIGN-EXPERT Plot Avg Annual Cost Normal plot Half Normal plot A: Order Quantity B: Re -o rd er P o in t C: Setup Cost D: Backorder Cost E: Ca rryin g Co st Normal % probability B A Half Normal % probability B A Effect Effect (b) Suppose that a second fraction is added to the first, for example ade = 36, e = 93, ab = 87, bd = 53, acd = 39, c = 99, abce - 9, and bcde = 5. How was this second fraction obtained? Add this data to the original fraction, and estimate the effects. 8-33

246 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY This second fraction is formed by reversing the signs of factor A. A B C D=AB E=BC ade e ab bd acd c abce bcde Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Error C Error D Error E Error AB Error AC Error AD Error AE Error BD Error CD Error DE Error ACD Error ADE Lenth's ME 5.88 Lenth's SME DESIGN-EXPERT Plot Avg Annual Cost Normal plot Half Normal plot A: Order Quantity B: Re -o rd er P o in t C: Setup Cost D: Backorder Cost E: Ca rryin g Co st Normal % probability A B Half Normal % probability A B Effect Effect (c) Suppose that the fraction abc = 89, ce = 96, bcd = 54, acde = 35, abe = 93, bde = 5, ad = 37, and () = 98 was run. How was this fraction obtained? Add this data to the original fraction and estimate the effects. This second fraction is formed by reversing the signs of all factors. A B C D=AB E=BC abc 8-34

247 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY bcd acde ce abe bde ad () Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Error C Error D Error E Error AB Error AC Error AD Error AE Error BC Error BD Error BE Error ABC Error ABE Lenth's ME Lenth's SME DESIGN-EXPERT Plot Avg Annual Cost Normal plot Half Normal plot A: Order Quantity B: Re -o rd er P o in t C: Setup Cost D: Backorder Cost E: Ca rryin g Co st Normal % probability A B Half Normal % probability A B Effect Effect 8- Construct a 5 design. Show how the design may be run in two blocks of eight observations each. Are any main effects or two-factor interactions confounded with blocks? A B C D E=ABCD Blocks = AB Block e a b abe c ace

248 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY bce abc d ade bde abd cde acd bcd abcde + Blocks are confounded with AB and CDE. 8- Construct a 7 design. Show how the design may be run in four blocks of eight observations each. Are any main effects or two-factor interactions confounded with blocks? A B C D E F=CDE G=ABC Block=ACE Block=BFG Block assignment () ag bg ab cfg acf bcf abcfg df adfg bdfg abdf cdg acd bcd abcdg ef aefg befg abef ceg ace bce abceg de adeg bdeg abde cdefg acdef bcdef abcdefg Blocks are confounded with ACE, BFG, and ABCEFG. 8- Irregular fractions of the k [John (97)]. Consider a 4 design. We must estimate the four main effects and the six two-factor interactions, but the full 4 factorial cannot be run. The largest possible block contains runs. These runs can be obtained from the four one-quarter fractions defined by I = AB = ACD = BCD by omitting the principal fraction. Show how the remaining three 4- fractions can be combined to estimate the required effects, assuming that three-factor and higher interactions are negligible. This design could be thought of as a three-quarter fraction. The four 4- fractions are as follows: 8-36

249 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY () I=+AB=+ACD=+BCD Runs: c,d,ab,abcd () I=+AB=-ACD=-BCD Runs: (), cd, abc, abd (3) I=-AB=+ACD=-BCD Runs: a, bc, bd, acd (4) I=-AB=-ACD=+BCD Runs: b, ac, ad, bcd If we do not run the principal fraction (), then we can combine the remaining 3 fractions to from 3 onehalf fractions of the 4 as follows: Fraction : () + (3) implies I=-BCD. This fraction estimates: A, AB, AC, and AD Fraction : () + (4) implies I=-ACD. This fraction estimates: B, BC, BD, and AB Fraction 3: (3) + (4) implies I=-AB. This fraction estimates: C, D, and CD In estimating these effects we assume that all three-factor and higher interactions are negligible. Note that AB is estimated in two of the one-half fractions: and. We would average these quantities and obtain a single estimate of AB. John (97, pp. 6-63) discusses this design and shows that the estimates obtained above are also the least squares estimates. John also derives the variances and covariances of these estimators. 8-3 Carbon anodes used in a smelting process are baked in a ring furnace. An experiment is run in the furnace to determine which factors influence the weight of packing material that is stuck to the anodes after baking. Six variables are of interest, each at two levels: A = pitch/fines ratio (.45,.55); B = packing material type (, ); C = packing material temperature (ambient, 35 C); D = flue location (inside, outside); E = pit temperature (ambient, 95 C); and F = delay time before packing (zero, 4 hours). A 6-3 design is run, and three replicates are obtained at each of the design points. The weight of packing material stuck to the anodes is measured in grams. The data in run order are as follows: abd = (984, 86, 936); abcdef = (75, 976, 457); be = (7,, 89); af = (474, 64, 54); def = (3, 56, 93); cd = (765, 75, 8); ace = (338, 54, 94); and bcf = (35, 99, 53). We wish to minimize the amount stuck packing material. (a) Verify that the eight runs correspond to a 6 3 III design. What is the alias structure? A B C D=AB E=AC F=BC def af be abd cd ace bcf abcdef I=ABD=ACE=BCF=BCDE=ACDF=ABEF=DEF, Resolution III A=BD=CE=CDF=BEF B=AD=CF=CDE=AEF C=AE=BF=BDE=ADF 8-37

250 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY D=AB=EF=BCE=ACF E=AC=DF=BCD=ABF F=BC=DE=ACD=ABE CD=BE=AF=ABC=ADE=BDF=CEF (b) Use the average weight as a response. What factors appear to be influential? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Error B Error C Model D Model E Model F Error BC Lenth's ME Lenth's SME Half Normal plot 99 Half Normal % probability E A F D Effect Factors A, D, E and F (and their aliases) are apparently important. (c) Use the range of the weights as a response. What factors appear to be influential? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B Model C Error D Model E Model F Model AF Lenth's ME Lenth's SME Factors C, E, F and the AF interaction (and their aliases) appear to be large. 8-38

251 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Half Normal plot 99 Half Normal % probability C AF E F Effect (d) What recommendations would you make to the process engineers? It is not known exactly what to do here, since A, D, E and F are large effects, and because the design is resolution III, the main effects are aliased with two-factor interactions. Note, for example, that D is aliased with EF and the main effect could really be a EF interaction. If the main effects are really important, then setting all factors at the low level would minimize the amount of material stuck to the anodes. It would be necessary to run additional experiments to confirm these findings. 8-4 A 6-run experiment was performed in a semiconductor manufacturing plant to study the effects of six factors on the curvature or camber of the substrate devices produced. The six variables and their levels are shown below: Lamination Lamination Lamination Firing Firing Firing Temperature Time Pressure Temperature Cycle Time Dew Point Run (c) (s) (tn) (c) (h) (c) Each run was replicated four times, and a camber measurement was taken on the substrate. The data are shown below: Camber for Replicate (in/in) Total Mean Standard Run 3 4 ( -4 in/in) ( -4 in/in) Deviation

252 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) What type of design did the experimenters use? The 6 IV, a 6-run design. (b) What are the alias relationships in this design? The defining relation is I=ABCE=ACDF=BDEF A (ABCE)= BCE A (ACDF)= CDF A (BDEF)= ABCDEF A=BCE=CDF=ABDEF B (ABCE)= ACE B (ACDF)= ABCDF B (BDEF)= DEF B=ACE=ABCDF=DEF C (ABCE)= ABE C (ACDF)= ADF C (BDEF)= BCDEF C=ABE=ADF=BCDEF D (ABCE)= ABCDE D (ACDF)= ACF D (BDEF)= BEF D=ABCDE=ACF=BEF E (ABCE)= ABC E (ACDF)= ACDEF E (BDEF)= BDF E=ABC=ABDEF=BDF F (ABCE)= ABCEF F (ACDF)= ACD F (BDEF)= BDE F=ABCEF=ACD=BDE AB (ABCE)= CE AB (ACDF)= BCDF AB (BDEF)= ADEF AB=CE=BCDF=ADEF AC (ABCE)= BE AC (ACDF)= DF AC (BDEF)= ABCDEF AC=BE=DF=ABCDEF AD (ABCE)= BCDE AD (ACDF)= CF AD (BDEF)= ABEF AD=BCDE=CF=ABEF AE (ABCE)= BC AE (ACDF)= CDEF AE (BDEF)= ABDF AE=BC=CDEF=ABDF AF (ABCE)= BCEF AF (ACDF)= CD AF (BDEF)= ABDE AF=BCEF=CD=ABDE BD (ABCE)= ACDE BD (ACDF)= ABCF BD (BDEF)= EF BD=ACDE=ABCF=EF BF (ABCE)= ACEF BF (ACDF)= ABCD BF (BDEF)= DE BF=ACEF=ABCD=DE (c) Do any of the process variables affect average camber? Yes, per the analysis below, variables A, C, D, and F affect average camber. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Error B Model C Error D Model E Model F Error AB Error AC Error AD Error AE Error AF Error BC Aliased Error BD Error BE Aliased Error BF Error CD Aliased Error CE Aliased Error CF Aliased Error DE Aliased Error DF Aliased Error EF Aliased Error ABC Aliased Error ABD

253 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error ABE Aliased Error ABF Lenth's ME Lenth's SME 46.4 DESIGN-EXPERT Plot Cam ber Avg Normal plot A: Lam Temp B: Lam Time C: Lam Pres D: Fire Temp E: Fire Time F: Fire DP Norm al % probability F E A C Effect Design Expert Output Response: Camber Avg in in/in ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A C E F Residual Cor Total The Model F-value of.39 implies the model is significant. There is only a.7% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 3.4 R-Squared.855 Mean 7. Adj R-Squared.7348 C.V. 3.3 Pred R-Squared.5886 PRESS Adeq Precision.478 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Lam Temp C-Lam Pres E-Fire Time F-Fire DP Final Equation in Terms of Coded Factors: Camber Avg = * A +8. * C -7.3 * E * F Final Equation in Terms of Actual Factors: 8-4

254 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Camber Avg = * Lam Temp +.65 * Lam Pres * Fire Time * Fire DP (d) Do any of the process variables affect the variability in camber measurements? Yes, A, B, F, and AF interaction affect the variability in camber measurements. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Error C Error D Error E Model F Error AB Error AC Error AD Error AE Model AF Error BC Aliased Error BD Error BE Aliased Error BF Error CD Aliased Error CE Aliased Error CF Aliased Error DE Aliased Error DF Aliased Error EF Aliased Error ABC Aliased Error ABD Error ABE Aliased Error ABF Lenth's ME Lenth's SME 36.6 DESIGN-EXPERT Plot Cam ber StDev Normal plot A: Lam Temp B: Lam Time C: Lam Pres D: Fire Temp E: Fire Time F: Fire DP Norm al % probability B F AF A Effect 8-4

255 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Response: Camber StDev ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B F AF Residual Cor Total The Model F-value of. implies the model is significant. There is only a.8% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 8.5 R-Squared.8 Mean 5.35 Adj R-Squared.776 C.V. 3.5 Pred R-Squared.5773 PRESS Adeq Precision 9.56 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Lam Temp B-Lam Time F-Fire DP AF Final Equation in Terms of Coded Factors: Camber StDev = * A -8.9 * B * F * A * F Final Equation in Terms of Actual Factors: Camber StDev = * Lam Temp -.55 * Lam Time * Fire DP -.84 * Lam Temp * Fire DP (e) If it is important to reduce camber as much as possible, what recommendations would you make? 8-43

256 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY One Factor Plot One Factor Plot Camber Avg 3.75 Camber Avg Lam Temp Lam Pres One Factor Plot One Factor Plot Camber Avg 3.75 Camber Avg Fire Tim e Fire DP 8-44

257 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Cam ber StDev Interaction Graph Fire DP DESIGN-EXPERT Plot Cam ber StDev One Factor Plot X = A: Lam Temp Y = F: Fire DP F-. F+ 6. Actual Factors B: Lam Time = 7.5 C: Lam Pres = D: Fire Temp = 6. E: Fire Time = 3.5 Camber StDev X = B: Lam Time Actual Factors A: Lam Temp = 65. C: Lam Pres = 7.5 D: Fire Temp = 6. E: Fire Tim e = 3.5 F: Fire DP = Camber StDev Lam Temp Lam Time Run A and C at the low level and E and F at the high level. B at the low level enables a lower variation without affecting the average camber. 8-5 A spin coater is used to apply photoresist to a bare silicon wafer. This operation usually occurs early in the semiconductor manufacturing process, and the average coating thickness and the variability in the coating thickness has an important impact on downstream manufacturing steps. Six variables are used in the experiment. The variables and their high and low levels are as follows: Factor Low Level High Level Final Spin Speed 735 rpm 665 rpm Acceleration Rate 5 Volume of Resist Applied 3 cc 5 cc Time of Spin 4 s 6 s Resist Batch Variation Batch Batch Exhaust Pressure Cover Off Cover On The experimenter decides to use a 6- design and to make three readings on resist thickness on each test wafer. The data are shown in table 8-9. Table 8-9 A B C D E F Resist Thick ness Run Volume Batch Time Speed Acc. Cover Left Center Right Avg. Range Off Off Off Off Off Off On Off Off On On Off On On On On On Off

258 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Off Off On On Off On On Off On Off On On On Off (a) Verify that this is a 6- design. Discuss the alias relationships in this design. I=ABCDEF. This is a resolution VI design where main effects are aliased with five-factor interactions and two-factor interactions are aliased with four-factor interactions. (b) What factors appear to affect average resist thickness? Factors B, D, and E appear to affect the average resist thickness. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Model B Error C Model D Model E Error F Error AB Error AC Error AD Error AE Error AF Error BC Error BD Error BE Error BF Error CD Error CE Error CF Error DE Error DF Error EF Error ABC Error ABD Aliased Error ABE Error ABF Error ACD Error ACE Aliased Error ACF Aliased Error ADE Error ADF Aliased Error AEF Aliased Error BCD Error BCE Error BCF Aliased Error BDE Error BDF Error BEF Aliased Error CDE Aliased Error CDF Aliased Error CEF

259 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error DEF Aliased Lenth's ME Lenth's SME DESIGN-EXPERT Plot Thick Avg Normal plot A: Volume B: Batch C: Tim e D: Speed E: Acc F: Cover Norm al % probability D E B Effect Design Expert Output Response: Thick Avg ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 6.54E+5 3.8E <. significant B D 3.43E E <. E.677E+5.677E <. Residual Cor Total 7.3E+53 The Model F-value of 79. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev R-Squared.8946 Mean Adj R-Squared.8833 C.V..8 Pred R-Squared.863 PRESS.6E+5 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept B-Batch D-Speed E-Acc Final Equation in Terms of Coded Factors: Thick Avg = * B * D * E Final Equation in Terms of Actual Factors: Batch Batch Thick Avg =

260 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Speed -.95 * Acc Batch Batch Thick Avg = * Speed -.95 * Acc (c) Since the volume of resist applied has little effect on average thickness, does this have any important practical implications for the process engineers? Yes, less material could be used. (d) Project this design into a smaller design involving only the significant factors. Graphically display the results. Does this aid in interpretation? Cube Graph Thick Avg D Speed E+ Acc D E- B- B+ Batch The cube plot usually assists the experimenter in drawing conclusions. (e) Use the range of resist thickness as a response variable. Is there any indication that any of these factors affect the variability in resist thickness? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Error C Error D Model E Model F Model AB Error AC Error AD Error AE Model AF Error BC Error BD Error BE Model BF Error CD

261 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error CE Error CF Error DE Error DF Model EF Error ABC Aliased Error ABD Aliased Error ABE Model ABF Error ACD Error ACE Aliased Error ACF Aliased Error ADE Error ADF Error AEF 8.99 Error BCD Aliased Error BCE Aliased Error BCF Aliased Error BDE Error BDF Error BEF Aliased Error CDE Aliased Error CDF Aliased Error CEF Error DEF Lenth's ME 9.54 Lenth's SME DESIGN-EXPERT Plot Thick StDev Normal plot A: Volume B: Batch C: Tim e D: Speed E: Acc F: Cover Normal % probability B BF AB AF A EF F ABF 5 E Effect Design Expert Output Response: Thick StDev ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B E F AB AF BF EF ABF

262 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residual Cor Total The Model F-value of.48 implies the model is significant. There is only a 4.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev R-Squared.53 Mean 6.56 Adj R-Squared.3 C.V Pred R-Squared -.5 PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Volume B-Batch E-Acc F-Cover AB AF BF EF ABF Final Equation in Terms of Coded Factors: Thick StDev = * A +.6 * B -.69 * E * F +.3 * A * B +.88 * A * F +.63 * B * F +4. * E * F +4.5 * A * B * F Final Equation in Terms of Actual Factors: Batch Batch Cover Off Thick StDev = * Volume * Acc Batch Batch Cover Off Thick StDev = * Volume * Acc Batch Batch Cover On Thick StDev = * Volume +.75 * Acc Batch Batch Cover On Thick StDev = * Volume +.75 * Acc 8-5

263 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The model here for variability isn t very strong. Notice the small value of R, and in particular, the adjusted R. Often we find that obtaining a good model for a response that expresses variability isn t as easy as finding a satisfactory model for a response that essentially measures the mean. (f) Where would you recommend that the process engineers run the process? Considering only the average thickness results, the engineers could use factors B, D and E to put the process mean at target. Then the engineer could consider the other factors on the range model to try to set the factors to reduce the variation in thickness at that mean. 8-6 Harry and Judy Peterson-Nedry (two friends of the author) own a vineyard in Oregon. They grow several varieties of grapes and manufacture wine. Harry and Judy have used factorial designs for process and product development in the winemaking segment of their business. This problem describes the experiment conducted for their 985 Pinot Noir. Eight variables, shown below, were originally studied in this experiment: Variable Low Level High Level A Pinot Noir Clone Pommard Wadenswil B Oak Type Allier Troncais C Age of Barrel Old New D Yeast/Skin Contact Champagne Montrachet E Stems None All F Barrel Toast Light Medium G Whole Cluster None % H Fermentation Temperature Low (75 F Max) High (9 F Max) Harry and Judy decided to use a 84 IV design with 6 runs. The wine was taste-tested by a panel of experts on 8 March 986. Each expert ranked the 6 samples of wine tasted, with rank being the best. The design and taste-test panel results are shown in Table 8-3. Table 8-3 Run A B C D E F G H HPN JPN CAL DCM RGB y bar s (a) What are the alias relationships in the design selected by Harry and Judy? E = BCD, F = ACD, G = ABC, H = ABD Defining Contrast : I = BCDE = ACDF = ABEF = ABCG = ADEG = BDFG = CEFG = ABDH = ACEH = BCFH = DEFH = CDGH = BEGH = AFGH = ABCDEFGH Aliases: A = BCG = BDH = BEF = CDF = CEH = DEG = FGH 8-5

264 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY B = ACG = ADH = AEF = CDE = CFH = DFG = EGH C = ABG = ADF = AEH = BDE = BFH = DGH = EFG D = ABH = ACF = AEG = BCE = BFG = CGH = EFH E = ABF = ACH = ADG = BCD = BGH = CFG = DFH F = ABE = ACD = AGH = BCH = BDG = CEG = DEH G = ABC = ADE = AFH = BDF = BEH = CDH = CEF H = ABD = ACE = AFG = BCF = BEG = CDG = DEF AB = CG = DH = EF AC = BG = DF = EH AD = BH = CF = EG AE = BF = CH = DG AF = BE = CD = GH AG = BC = DE = FH AH = BD = CE = FG (b) Use the average ranks ( y ) as a response variable. Analyze the data and draw conclusions. You will find it helpful to examine a normal probability plot of effect estimates. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B Error C Model D Error E Model F Model G Error H Error AB Error AC Model AD Error AE Error AF Error AG Error AH Lenth's ME 6.73 Lenth's SME.39 DESIGN-EXPERT Plot Taste Avg Normal plot A: A B: B C: C D: D E: E F: F G: G H: H Normal % probability D AD F G Effect 8-5

265 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Taste Avg ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A D F G AD Residual Cor Total 5. 5 The Model F-value of 4.57 implies the model is significant. There is only a.98% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..77 R-Squared.6957 Mean 8.8 Adj R-Squared.5435 C.V Pred R-Squared.9 PRESS 96.4 Adeq Precision 7.57 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A D-D F-F G-G AD Final Equation in Terms of Coded Factors: Taste Avg = * A -.99 * D -.3 * F +.89 * G -.9 * A * D Factors D, F, G and the AD interaction are important. Factor A is added to the model to preserve hierarchy. Notice that the AD interaction is aliased with other two-factor interactions that could also be important. So the interpretation of the two-factor interaction is somewhat uncertain. Normally, we would add runs to the design to isolate the significant interactions, but that won t work very well here because each experiment requires a full growing season. In other words, it would require a very long time to add runs to dealias the alias chain of interest. 8-53

266 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Taste Avg 5. Interaction Graph D X = A: A Y = D: D Design Points.95 D D D D Actual Factors B: B = B C: C = C E: E = E F: F = F G: G = G H: H = H Tas te Avg A A A One Factor Plot One Factor Plot Tas te Avg 8.7 Tas te Avg F F G G F G (c) Use the standard deviation of the ranks (or some appropriate transformation such as log s) as a response variable. What conclusions can you draw about the effects of the eight variables on variability in wine quality? 8-54

267 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Ln(Taste StDev) Normal plot A: A B: B C: C D: D E: E F: F G: G H: H Normal % probability Warning! No term s are s ele Effect There do not appear to be any significant factors. (d) After looking at the results, Harry and Judy decide that one of the panel members (DCM) knows more about beer than he does about wine, so they decide to delete his ranking. What affect would this have on the results and on conclusions from parts (b) and (c)? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Error B Error C Model D Error E Model F Model G Error H Error AB Error AC Model AD Error AE Error AF Error AG Error AH Lenth's ME Lenth's SME

268 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Taste Avg Normal plot A: A B: B C: C D: D E: E F: F G: G H: H Normal % probability D F AD A G Effect Design Expert Output Response: Taste Avg ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A D F G AD Residual Cor Total The Model F-value of 3. implies there is a 6.46% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 3.3 R-Squared.64 Mean 8.5 Adj R-Squared.4 C.V Pred R-Squared -.4 PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A D-D F-F G-G AD Final Equation in Terms of Coded Factors: Taste Avg = * A -.5 * D -.9 * F +.47 * G -.75 * A * D The results are the same for average taste without DCM as they were with DCM. 8-56

269 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Taste StDev Normal plot A: A B: B C: C D: D E: E F: F G: G H: H Normal % probability Warning! No term s are s ele Effect The standard deviation response is much the same with or without DCM s responses. Again, there are no significant factors. (e) Suppose that just before the start of the experiment, Harry and Judy discovered that the eight new barrels they ordered from France for use in the experiment would not arrive in time, and all 6 runs would have to be made with old barrels. If Harry and Judy just drop column C from their design, what does this do to the alias relationships? Do they need to start over and construct a new design? The resulting design is a 7 3 IV BEGH = AFGH. with defining relations: I = ABEF = ADEG = BDFG = ABDH = DEFH = (f) Harry and Judy know from experience that some treatment combinations are unlikely to produce good results. For example, the run with all eight variables at the high level generally results in a poorly rated wine. This was confirmed in the 8 March 986 taste test. They want to set up a new design to make the run with all eight factors at the high level. What design would you suggest? By changing the sign of any of the design generators, a design that does not include the principal fraction will be generated. This will give a design without an experimental run combination with all of the variables at the high level. 8-7 In an article in Quality Engineering ( An Application of Fractional Factorial Experimental Designs, 988, Vol. pp. 9-3) M.B. Kilgo describes an experiment to determine the effect of CO pressure (A), CO temperature (B), peanut moisture (C), CO flow rate (D), and peanut particle size (E) on the total yield of oil per batch of peanuts (y). The levels she used for these factors are as follows: A B C D E Coded Pressure Temp Moisture Flow Particle Size Level (bar) (C) (% by weight) (liters/min) (mm) She conducted the 6-run fractional factorial experiment shown below: A B C D E y

270 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) What type of design has been used? Identify the defining relation and the alias relationships. A 5 V, 6-run design, with I= -ABCDE. A(-ABCDE)= -BCDE B(-ABCDE)= -ACDE C(-ABCDE)= -ABDE D(-ABCDE)= -ABCE E(-ABCDE)= -ABCD AB(-ABCDE)= -CDE AC(-ABCDE)= -BDE AD(-ABCDE)= -BCE AE(-ABCDE)= -BCD BC(-ABCDE)= -ADE BD(-ABCDE)= -ACE BE(-ABCDE)= -ACD CD(-ABCDE)= -ABE CE(-ABCDE)= -ABD DE(-ABCDE)= -ABC A = -BCDE B = -ACDE C = -ABDE D = -ABCE E = -ABCD AB = -CDE AC = -BDE AD = -BCE AE = -BCD BC = -ADE BD = -ACE BE = -ACD CD = -ABE CE = -ABD DE = -ABC (b) Estimate the factor effects and use a normal probability plot to tentatively identify the important factors. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Model B Error C Error D Model E Error AB Error AC Error AD Error AE Error BC Error BD Error BE Error CD Error CE Error DE Lenth's ME.5676 Lenth's SME

271 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Yield Half Normal plot A: Pressure B: Temperature C: M oisture D: Flow E: Particle Size Half Normal % probability B E Effect (c) Perform an appropriate statistical analysis to test the hypothesis that the factors identified in part above have a significant effect on the yield of peanut oil. Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant B E <. Residual Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 8.4 R-Squared.949 Mean 54.5 Adj R-Squared.98 C.V. 5.8 Pred R-Squared.87 PRESS Adeq Precision 8.7 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept B-Temperature E-Particle Size (d) Fit a model that could be used to predict peanut oil yield in terms of the factors that you have identified as important. Design Expert Output Final Equation in Terms of Coded Factors: Yield = * B +.5 * E Final Equation in Terms of Actual Factors: Yield = 8-59

272 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Temperature * Particle Size (e) Analyze the residuals from this experiment and comment on model adequacy. The residual plots are satisfactory. There is a slight tendency for the variability of the residuals to increase with the predicted value of y. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted Residuals vs. Temperature Residuals vs. Particle Size Residuals Residuals Tem perature Particle Size 8-8 A 6-run fractional factorial experiment in factors on sand-casting of engine manifolds was conducted by engineers at the Essex Aluminum Plant of the Ford Motor Company and described in the article Evaporative Cast Process 3. Liter Intake Manifold Poor Sandfill Study, by D. Becknell (Fourth Symposium on Taguchi Methods, American Supplier Institute, Dearborn, MI, 986, pp. -3). The purpose was to determine which of factors has an effect on the proportion of defective castings. The design and the resulting proportion of nondefective castings p observed on each run are shown below. 8-6

273 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY This is a resolution III fraction with generators E=CD, F=BD, G=BC, H=AC, J=AB, and K=ABC. Assume that the number of castings made at each run in the design is. F&T s Run A B C D E F G H J K p arcsin Modification (a) Find the defining relation and the alias relationships in this design. I=CDE=BDF=BCG=ACH=ABJ=ABCK=BCEF=BDEG=ADEH=ABCDEJ=ABDEK=CDFG=ABCDFH = ADFJ=ACDFK=ABGH=ACGJ=AGK=BCHJ=BHK=CKJ (b) Estimate the factor effects and use a normal probability plot to tentatively identify the important factors. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B Error C Error D Error E Model F Error G Error H Error J Model K Error AB Aliased Error AC Aliased Error AD E Error AE Error AF Error BE Error DK Lenth's ME.345 Lenth's SME

274 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot p Normal plot A: A B: B C: C D: D E: E F: F G: G H: H J: J K: K Normal % probability K F Effect (c) Fit an appropriate model using the factors identified in part (b) above. Design Expert Output Response: p ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant F K Residual E-3 Cor Total.4 5 The Model F-value of.7 implies the model is significant. There is only a.8% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..63 R-Squared.65 Mean.89 Adj R-Squared.5645 C.V. 7.9 Pred R-Squared.48 PRESS.79 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept F-F K-K Final Equation in Terms of Coded Factors: p = * F +.5 * K Final Equation in Terms of Actual Factors: p = * F * K (d) Plot the residuals from this model versus the predicted proportion of nondefective castings. Also prepare a normal probability plot of the residuals. Comment on the adequacy of these plots. 8-6

275 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The residual versus predicted plot identifies an inequality of variances. This is likely caused by the response variable being a proportion. A transformation could be used to correct this. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted (e) In part (d) you should have noticed an indication that the variance of the response is not constant (considering that the response is a proportion, you should have expected this). The previous table also shows a transformation on P, the arcsin square root, that is a widely used variance stabilizing transformation for proportion data (refer to the discussion of variance stabilizing transformations is Chapter 3). Repeat parts (a) through (d) above using the transformed response and comment on your results. Specifically, are the residuals plots improved? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B.5.5E E-5 Error C Error D Error E Model F Error G Error H Error J Model K Error AD Error AF Error BE Error DH Error DK Lenth's ME.535 Lenth's SME.4684 As with the original analysis, factors F and K remain significant with a slight increase with the R. 8-63

276 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot arcsin Normal plot A: A B: B C: C D: D E: E F: F G: G H: H J: J K: K Normal % probability F K Effect Design Expert Output Response: arcsin ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant F K Residual.6 3. Cor Total.46 5 The Model F-value of.59 implies the model is significant. There is only a.9% chance that a "Model F-Value" this large could occur due to noise. Std. Dev.. R-Squared.6595 Mean.8 Adj R-Squared.67 C.V Pred R-Squared.484 PRESS.4 Adeq Precision 8.93 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept F-F K-K Final Equation in Terms of Coded Factors: arcsin = * F +.97 * K Final Equation in Terms of Actual Factors: arcsin = * F * K The inequality of variance has improved; however, there remain hints of inequality in the residuals versus predicted plot and the normal probability plot now appears to be irregular. 8-64

277 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted (f) There is a modification to the arcsin square root transformation, proposed by Freeman and Tukey ( Transformations Related to the Angular and the Square Root, Annals of Mathematical Statistics, Vol., 95, pp. 67-6) that improves its performance in the tails. F&T s modification is: arcsin npˆ arcsin n npˆ n Rework parts (a) through (d) using this transformation and comment on the results. (For an interesting discussion and analysis of this experiment, refer to Analysis of Factorial Experiments with Defects or Defectives as the Response, by S. Bisgaard and H.T. Fuller, Quality Engineering, Vol. 7, 994-5, pp ) Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Error B.5 6.5E-8.466E-5 Error C Error D Error E Model F Error G Error H Error J Model K Error AD Error AF Error BE Error DH Error DK Lenth's ME.9348 Lenth's SME As with the prior analysis, factors F and K remain significant. 8-65

278 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot F&T Transform Normal plot A: A B: B C: C D: D E: E F: F G: G H: H J: J K: K Normal % probability F K Effect Design Expert Output Response: F&T Transform ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant F K Residual.5 3. Cor Total.44 5 The Model F-value of.67 implies the model is significant. There is only a.9% chance that a "Model F-Value" this large could occur due to noise. Std. Dev.. R-Squared.66 Mean.7 Adj R-Squared.688 C.V Pred R-Squared.4864 PRESS.3 Adeq Precision 8. Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept F-F K-K Final Equation in Terms of Coded Factors: F&T Transform = * F +.95 * K Final Equation in Terms of Actual Factors: F&T Transform = * F * K The residual plots appears as they did with the arcsin square root transformation. 8-66

279 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted.4487 Normal % probability Residuals Residual Predicted 8-9 A 6-run fractional factorial experiment in 9 factors was conducted by Chrysler Motors Engineering and described in the article Sheet Molded Compound Process Improvement, by P.I. Hsieh and D.E. Goodwin (Fourth Symposium on Taguchi Methods, American Supplier Institute, Dearborn, MI, 986, pp. 3-). The purpose was to reduce the number of defects in the finish of sheet-molded grill opening panels. The design, and the resulting number of defects, c, observed on each run, is shown below. This is a resolution III fraction with generators E=BD, F=BCD, G=AC, H=ACD, and J=AB. F&T s Run A B C D E F G H J c c Modification (a) Find the defining relation and the alias relationships in this design. I=BDE=BCDF=CEF=ACG=ABCDEG=ABDEG=AEFG=ACDH=ABCEH=ABFH=ADEFH=DGH= BEGH=BCRG=CDEFGH=ABJ=ADEJ=ACDFJ=ABCEFJ=BCGJ=CDEGJ=DEGJ=BEFGJ=BCDHJ= CEHJ=FHJ=BDEFHJ=ABDGHJ=AEGHJ=ACEGJ=ABCDEFGHJ (b) Estimate the factor effects and use a normal probability plot to tentatively identify the important factors. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept 8-67

280 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Model A Model B Model C Model D Error E Model F Model G Error H Error J Model AD Error AE Model AF Error AH Model BC Model BG Lenth's ME Lenth's SME DESIGN-EXPERT Plot c Normal plot A: A B: B C: C D: D E: E F: F G: G H: H J: J Normal % probability F BG D A AD BC AF Effect (c) Fit an appropriate model using the factors identified in part (b) above. Design Expert Output Response: c ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C D F G AD AF BC BG Residual Cor Total The Model F-value of 8.6 implies the model is significant. There is only a.9% chance that a "Model F-Value" this large could occur due to noise. Std. Dev R-Squared

281 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Mean.6 Adj R-Squared.9478 C.V Pred R-Squared.8 PRESS 87.4 Adeq Precision 7.77 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A B-B C-C D-D F-F G-G AD AF BC BG Final Equation in Terms of Coded Factors: c = * A -.94 * B -.8 * C -7.9 * D -8.3 * F -.6 * G +5.8 * A * D * A * F * B * C -6.3 * B * G Final Equation in Terms of Actual Factors: c = * A * B -.85 * C * D * F -.65 * G * A * D * A * F * B * C * B * G (d) Plot the residuals from this model versus the predicted number of defects. Also, prepare a normal probability plot of the residuals. Comment on the adequacy of these plots. 8-69

282 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted There is a significant problem with inequality of variance. This is likely caused by the response variable being a count. A transformation may be appropriate. (e) In part (d) you should have noticed an indication that the variance of the response is not constant (considering that the response is a count, you should have expected this). The previous table also shows a transformation on c, the square root, that is a widely used variance stabilizing transformation for count data (refer to the discussion of variance stabilizing transformations in Chapter 3). Repeat parts (a) through (d) using the transformed response and comment on your results. Specifically, are the residual plots improved? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Model B Error C Model D Error E Model F Model G Error H Error J Error AD Error AE Error AF Error AH Error BC Model BG Lenth's ME.7978 Lenth's SME The analysis of the data with the square root transformation identifies only D, F, the BG interaction as being significant. The original analysis identified factor A and several two factor interactions as being significant. 8-7

283 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot sq rt Normal plot A: A B: B C: C D: D E: E F: F G: G H: H J: J Normal % probability F D BG Effect Design Expert Output Response: sqrt ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant B D F G BG Residual 7..7 Cor Total The Model F-value of 6.67 implies the model is significant. There is only a.56% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..3 R-Squared.7694 Mean.3 Adj R-Squared.654 C.V Pred R-Squared.497 PRESS 44.5 Adeq Precision 8.4 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept B-B D-D F-F G-G BG Final Equation in Terms of Coded Factors: sqrt = * B -.8 * D -.3 * F -.9 * G -.8 * B * G Final Equation in Terms of Actual Factors: sqrt = 8-7

284 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * B -.85 * D * F -.95 * G -.85 * B * G The residual plots are acceptable; although, there appears to be a slight u shape to the residuals versus predicted plot. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted (f) There is a modification to the square root transformation proposed by Freeman and Tukey ( Transformations Related to the Angular and the Square Root, Annals of Mathematical Statistics, Vol., 95, pp. 67-6) that improves its performance. F&T s modification to the square root transformation is: c c Rework parts (a) through (d) using this transformation and comment on the results. (For an interesting discussion and analysis of this experiment, refer to Analysis of Factorial Experiments with Defects or Defectives as the Response, by S. Bisgaard and H.T. Fuller, Quality Engineering, Vol. 7, 994-5, pp ) Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Error A Model B Error C Model D Error E Model F Model G Error H Error J Error AD Error AE Error AF Error AH

285 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error BC Model BG Lenth's ME.4 Lenth's SME As with the square root transformation, factors D, F, and the BG interaction remain significant. DESIGN-EXPERT Plot F&T Normal plot A: A B: B C: C D: D E: E F: F G: G H: H J: J Norm al % probability F D BG Effect Design Expert Output Response: F&T ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant B D F G BG Residual Cor Total The Model F-value of 5.73 implies the model is significant. There is only a.95% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..3 R-Squared.74 Mean.5 Adj R-Squared.67 C.V Pred R-Squared.3373 PRESS 44.7 Adeq Precision 7.86 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept B-B D-D F-F G-G BG Final Equation in Terms of Coded Factors: F&T = * B 8-73

286 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -. * D -. * F -. * G -.77 * B * G Final Equation in Terms of Actual Factors: F&T = * B * D -.5 * F -.5 * G -.77 * B * G The following interaction plots appear as they did with the square root transformation; a slight u shape is observed in the residuals versus predicted plot. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted 8-3 An experiment is run in a semiconductor factory to investigate the effect of six factors on transistor 6 gain. The design selected is the IV shown below. Standard Run Order Order A B C D E F Gain

287 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Use a normal plot of the effects to identify the significant factors. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Error D Error E Error F Model AB Model AC Error AD Error AE Error AF Error BD Error BF Error ABD Error ABF Lenth's ME Lenth's SME 9.57 DESIGN-EXPERT Plot Gain Normal plot A: A B: B C: C D: D E: E F: F Norm al % probability C AC AB B A Effect (b) Conduct appropriate statistical tests for the model identified in part (a). Design Expert Output Response: Gain ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. C <. AB <. AC Residual Cor Total The Model F-value of 39.6 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. 8-75

288 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Std. Dev R-Squared.997 Mean Adj R-Squared.9876 C.V..39 Pred R-Squared.9788 PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A B-B C-C AB AC Final Equation in Terms of Coded Factors: Gain = * A * B -5.3 * C * A * B -4. * A * C Final Equation in Terms of Actual Factors: Gain = * A * B -5.5 * C * A * B -4.5 * A * C (c) Analyze the residuals and comment on your findings. The residual plots are acceptable. The normality and equality of variance assumptions are verified. There does not appear to be any trends or interruptions in the residuals versus run order plot. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted 8-76

289 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Run Residuals vs. A Residuals.5 Residuals Run Number A Residuals vs. B Residuals vs. C Residuals.5 Residuals B C Residuals vs. D Residuals vs. E Residuals.5 Residuals D E 8-77

290 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. F Residuals F (d) Can you find a set of operating conditions that produce gain of 5 5? Yes, see the graphs below. DESIGN-EXPERT Plot Gain 596 Interaction Graph B DESIGN-EXPERT Plot Gain 596 Interaction Graph C X = A: A Y = B: B X = A: A Y = C: C B- -. B+. Actual Factors C: C =. D: D =. E: E =. F: F = Gain 53 C- -. C+. Actual Factors B: B =. D: D =. E: E =. F: F = Gain A A 8-78

291 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Overlay Plot X = A: A Y = B: B. Overlay Plot Actual Factors C: C = -. D: D =. E: E =. F: F =..5 B. Gain: 55 Gain: A 8-3 Heat treating is often used to carbonize metal parts, such as gears. The thickness of the carbonized layer is a critical output variable from this process, and it is usually measured by performing a carbon 6 analysis on the gear pitch (top of the gear tooth). Six factors were studied on a IV design: A = furnace temperature, B = cycle time, C = carbon concentration, D = duration of the carbonizing cycle, E = carbon concentration of the diffuse cycle, and F = duration of the diffuse cycle. The experiment is shown below: Standard Run Order Order A B C D E F Pitch (a) Estimate the factor effects and plot them on a normal probability plot. Select a tentative model. Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Error B Model C Model D Model E Error F Error AB Error AC Error AD Error AE

292 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error BD Model CD Model DE Error ABD Error ABF Lenth's ME Lenth's SME 3.39 Factors A, C, D, E and the two factor interactions CD and DE appear to be significant. The model can be found in the Design Expert Output below. DESIGN-EXPERT Plot Pitch Normal plot A: A B: B C: C D: D E: E F: F Norm al % probability DE C CD D E A Effect (b) Perform appropriate statistical tests on the model. Design Expert Output Response: Pitch ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. C D <. E <. CD DE Residual Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 7. R-Squared.988 Mean Adj R-Squared.968 C.V. 5.3 Pred R-Squared.9393 PRESS 479. Adeq Precision 8.68 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A C-C D-D

293 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY E-E CD DE Final Equation in Terms of Coded Factors: Pitch = * A -6.5 * C +8.5 * D +7.5 * E +7.5 * C * D -. * D * E Final Equation in Terms of Actual Factors: Pitch = * A -6.5 * C +8.5 * D +7.5 * E +7.5 * C * D -. * D * E (c) Analyze the residuals and comment on model adequacy. The residual plots are acceptable. The normality and equality of variance assumptions are verified. There does not appear to be any trends or interruptions in the residuals versus run order plot. The plots of the residuals versus factors C and E identify reduced variation at the lower level of both variables while the plot of residuals versus factor F identifies reduced variation at the upper level. Because C and E are significant factors in the model, this might not affect the decision on the optimum solution for the process. However, factor F is not included in the model and may be set at the upper level to reduce variation. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted 8-8

294 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Run Residuals vs. A Residuals -. 5 Residuals Run Number A Residuals vs. B Residuals vs. C Residuals -. 5 Residuals B C Residuals vs. D Residuals vs. E Residuals -. 5 Residuals D E 8-8

295 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. F Residuals F (d) Interpret the results of this experiment. Assume that a layer thickness of between 4 and 6 is desirable. The graphs below identify a region that is acceptable between 4 and 6. DESIGN-EXPERT Plot Pitch 93 Interaction Graph D DESIGN-EXPERT Plot Pitch 93 Interaction Graph E X = C: C Y = D: D X = D: D Y = E: E D- -. D+. Actual Factors A: A =. B: B =. E: E =. F: F = Pitch 3.5 E- -. E+. Actual Factors A: A =. B: B =. C: C =. F: F = Pitch C D 8-83

296 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Overlay Plot X = C: C Y = D: D Coded Factors A: A =.5 B: B =. E: E =. F: F =...5 Overlay Plot Pitch: 6 D. Pitch: C 8-3 Five factors are studied in the irregular fractional factorial design of resolution V shown below. Standard Run Order Order A B C D E Gain (a) Analyze the data from this experiment. What factors influence the response y? Design Expert Output Term Effect SumSqr % Contribtn Model Intercept Model A Model B Model C Model D Error E Model AB Model AC Error AD

297 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error AE Error BC E-5.668E-5 Error BD Error BE Error CD Error CE Error DE Error ABC Error ABD Error ABE Aliased Error ACD Error ACE Aliased Error ADE Error BCD Error BCE Error BDE Error CDE Lenth's ME Lenth's SME Factors A, B, C, D, and the AB and AC interactions appear to be significant. DESIGN-EXPERT Plot y Normal plot A: A B: B C: C D: D E: E Normal % probability AC D AB A B C Effect Design Expert Output Response: y ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B <. C <. D <. AB <. AC <. Residual Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..46 R-Squared.999 Mean 9.97 Adj R-Squared.989 C.V..3 Pred R-Squared

298 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY PRESS 7.6 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-A B-B C-C D-D AB AC Final Equation in Terms of Coded Factors: y = * A +. * B -.8 * C +.7 * D +.77 * A * B -.97 * A * C Final Equation in Terms of Actual Factors: y = * A * B -.85 * C * D * A * B * A * C DESIGN-EXPERT Plot y 9.3 Interaction Graph B DESIGN-EXPERT Plot y 9.3 Interaction Graph C X = A: A Y = B: B X = A: A Y = C: C B- -. B+. Actual Factors C: C =. D: D =. E: E =. y C- -. C+. Actual Factors B: B =. D: D =. E: E =. y A A 8-86

299 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot y 9.3 One Factor Plot X = D: D Actual Factors A: A =. B: B =. C: C =. E: E =. y D (b) Analyze the residuals. Comment on model adequacy. The residual plots are acceptable. The normality and equality of variance assumptions are verified. There does not appear to be any trends or interruptions in the residuals versus run order plot. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted 8-87

300 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals vs. Run Residuals vs. A Residuals Residuals Run Number A Residuals vs. B Residuals vs. C Residuals Residuals B C Residuals vs. D Residuals vs. E Residuals Residuals D E 8-88

301 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 9 Three-Level and Mixed-Level Factorial and Fractional Factorial Design Solutions 9- The effects of developer concentration (A) and developer time (B) on the density of photographic plate film are being studied. Three strengths and three times are used, and four replicates of a 3 factorial experiment are run. The data from this experiment follow. Analyze the data using the standard methods for factorial experiments. Development Time (minutes) Developer Concentration 4 8 % % % Design Expert Output Response: Data ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B AB Residual Lack of Fit. Pure Error Cor Total The Model F-value of.66 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Concentration and time are significant. The interaction is not significant. By letting both A and B be treated as numerical factors, the analysis can be performed as follows: Design Expert Output Response: Data ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B A B AB Residual Lack of Fit not significant Pure Error Cor Total

302 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The Model F-value of 7.87 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. 9- Compute the I and J components of the two-factor interaction in Problem 9-. B 7 A AB Totals = 77, 78, 7; SS AB. 39 IAB AB Totals = 78, 74, 74; SS AB. 89 J AB SS AB I AB J AB An experiment was performed to study the effect of three different types of 3-ounce bottles (A) and three different shelf types (B) -- smooth permanent shelves, end-aisle displays with grilled shelves, and beverage coolers -- on the time it takes to stock ten -bottle cases on the shelves. Three workers (factor C) were employed in this experiment, and two replicates of a 3 3 factorial design were run. The observed time data are shown in the following table. Analyze the data and draw conclusions. Replicate I Replicate Worker Bottle Type Permanent EndAisle Cooler Permanent EndAisle Cooler Plastic mm glass mm glass Plastic mm glass mm glass Plastic mm glass mm glass Design Expert Output Response: Time ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. C <. AB AC BC ABC Residual Lack of Fit. Pure Error Cor Total The Model F-value of 3.6 implies the model is significant. There is only 9-

303 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B, C, BC are significant model terms. Factors B and C, shelf type and worker, and the BC interaction are significant. For the shortest time regardless of worker chose the permanent shelves. This can easily be seen in the interaction plot below. DESIGN-EXPERT Plot Time 6.56 Interaction Graph Shelf Type X = C: Worker Y = B: Shelf Type Design Points 5.73 B Permanent B End Aisle B3 Cooler Actual Factor A: Bottle Type = 8mm glass Tim e Worker 9-4 A medical researcher is studying the effect of lidocaine on the enzyme level in the heart muscle of beagle dogs. Three different commercial brands of lidocaine (A), three dosage levels (B), and three dogs (C) are used in the experiment, and two replicates of a 3 3 factorial design are run. The observed enzyme levels follow. Analyze the data from this experiment. Replicate I Replicate Lidocaine Dosage Dog Dog Brand Strength Design Expert Output Response: Enzyme Level ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B <. C AB AC BC ABC

304 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residual Lack of Fit Pure Error Cor Total The Model F-value of 6.99 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B are significant model terms. The dosage is significant. 9-5 Compute the I and J components of the two-factor interactions for Example 9-. A B I totals = 74,75,6 J totals = -8,3,-8 I(AB) = 6.78 J(AB) = 674. SS AB = 63.9 A C I totals = -,34,-77 J totals = 5,4,- I(AC) = J(AC) = 635. SS AC = B C I totals = -5,79,38 J totals =-53,87,3 I(BC) = 473. J(BC) = SS BC = An experiment is run in a chemical process using a 3 factorial design. The design factors are temperature and pressure, and the response variable is yield. The data that result from this experiment are shown below. Pressure, psig Temperature, C , , , , , , , , , 83.4 (a) Analyze the data from this experiment by conducting an analysis of variance. What conclusions can you draw? 9-4

305 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B AB Pure Error Cor Total The Model F-value of 4.37 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B are significant model terms. Temperature and pressure are significant. Their interaction is not. An alternate analysis is performed below with the A and B treated as numeric factors: Design Expert Output Response: Yield ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B A B AB Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 7.9 implies the model is significant. There is only a.7% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case A, B, AB are significant model terms. (b) Graphically analyze the residuals. Are there any concerns about underlying assumptions or model adequacy? 9-5

306 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted Residuals vs. Pressure Residuals vs. Temperature Residuals Residuals Pressure Temperature The plot of residuals versus pressure shows a decreasing funnel shape indicating a non-constant variance. (c) Verify that if we let the low, medium and high levels of both factors in this experiment take on the levels -,, and +, then a least squares fit to a second order model for yield is y x 8. 4x 7. 7x 7. 86x 7. 69x x The coefficients can be found in the following table of computer output. Design Expert Output Final Equation in Terms of Coded Factors: Yield = * A +.4 * B * A * B * A * B 9-6

307 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (d) Confirm that the model in part (c) can be written in terms of the natural variables temperature (T) and pressure (P) as y T8. 59P. 7T. 96P. 384TP The coefficients can be found in the following table of computer output. Design Expert Output Final Equation in Terms of Actual Factors: Yield = * Pressure * Temperature -.96 * Pressure -.77 * Temperature * Pressure * Temperature (e) Construct a contour plot for yield as a function of pressure and temperature. Based on the examination of this plot, where would you recommend running the process.. Yield B: Temperature A: Pres s ure Run the process in the oval region indicated by the yield of (a) Confound a 3 3 design in three blocks using the ABC component of the three-factor interaction. Compare your results with the design in Figure 9-7. L = X + X + X 3 Block Block Block 3 9-7

308 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The new design is a 8 rotation around the Factor B axis. (b) Confound a 3 3 design in three blocks using the AB C component of the three-factor interaction. Compare your results with the design in Figure 9-7. L = X + X + X 3 Block Block Block 3 The new design is a 8 rotation around the Factor C axis. (c) Confound a 3 3 design three blocks using the ABC component of the three-factor interaction. Compare your results with the design in Figure 9-7. L = X + X + X 3 Block Block Block 3 The new design is a 9 rotation around the Factor C axis along with switching layer and layer in the C axis. (d) After looking at the designs in parts (a), (b), and (c) and Figure 9-7, what conclusions can you draw? All four designs are relatively the same. The only differences are rotations and swapping of layers. 9-8 Confound a 3 4 design in three blocks using the AB CD component of the four-factor interaction. 9-8

309 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY L = X + X + X 3 + X 4 Block Block Block Consider the data from the first replicate of Problem 9-3. Assuming that all 7 observations could not be run on the same day, set up a design for conducting the experiment over three days with AB C confounded with blocks. Analyze the data. Block Block Block 3 = 3.45 = 4.7 = 4. = 4.38 = 4.6 = 4.4 = 5. = 4.4 = 5.7 = 4.3 = 4.7 = 4.8 = 4.96 = 4.8 = 4.5 = 4.86 = 3.94 = 4.53 = 6.5 = 4.99 = 6. = 5.4 = 6.3 = 4.85 = 5.67 = 5.8 = 5.48 Totals-> = Design Expert Output Response: Time ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block.3. Model significant A B C AB AC BC Residual Cor Total The Model F-value of 4.7 implies the model is significant. There is only a 4.4% chance that a "Model F-Value" this large could occur due to noise. Values of "Prob > F" less than.5 indicate model terms are significant. In this case B are significant model terms. 9-9

310 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 9- Outline the analysis of variance table for the 3 4 design in nine blocks. Is this a practical design? Source DF A B C D AB 4 AC 4 AD 4 BC 4 BD 4 CD 4 ABC (AB C,ABC,AB C ) 6 ABD (ABD,AB D,ABD ) 6 ACD (ACD,ACD,AC D ) 6 BCD (BCD,BC D,BCD ) 6 ABCD 6 Blocks (ABC,AB C,AC D,BC D ) 8 Total 8 Any experiment with 8 runs is large. Instead of having three full levels of each factor, if two levels of each factor could be used, then the overall design would have 6 runs plus some center points. This twolevel design could now probably be run in or 4 blocks, with center points in each block. Additional curvature effects could be determined by augmenting the experiment with the axial points of a central composite design and additional enter points. The overall design would be less than 8 runs. 9- Consider the data in Problem 9-3. If ABC is confounded in replicate I and ABC is confounded in replicate II, perform the analysis of variance. L = X + X + X 3 L = X + X + X Block Block Block 3 Block Block Block 3 = 3.45 = 4.8 = 4.8 = 3.36 = 3.5 = 3.68 = 5.5 = 4.53 = 4.38 = 4.44 = 4.39 = 4.4 = 4.85 = 5.67 = 6.3 = 4.7 = 4.65 = 4.75 = 5.48 = 6.5 = 4.99 = 6.38 = 5.88 = 6. = 4.3 = 4.7 = 4.5 = 3.88 = 3.65 = 4.4 = 4.6 = 5.7 = 4.86 = 4.49 = 4.59 = 4.9 = 4.96 = 4.7 = 4. = 4.85 = 5.58 = 5.3 = 3.94 = 4.4 = 5. = 4.37 = 4.9 = 4.6 = 6. = 5.4 = 5.8 = 4.8 = 4.48 = 4.8 The sums of squares for A, B, C, AB, AC, and BC are calculated as usual. The only sums of squares presenting difficulties with calculations are the four components of the ABC interaction (ABC, ABC, AB C, and AB C ). ABC is computed using replicate I and ABC is computed using replicate II. AB C and AB C are computed using data from both replicates. We will show how to calculate AB C and AB C from both replicates. Form a two-way table of A x B at each level of C. Find the I(AB) and J(AB) totals for each third of the A x B table. A C B I J

311 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The I and J components for each third of the above table are used to form a new table of diagonal totals. C I(AB) J(AB) I Totals: I Totals: 85.6,85.6, ,85.3,83. J Totals: J Totals: 85.73,83.6, ,85.6,85.3 Now, AB C = I[C x I(AB)] = (85. 6 ) (85. 6 ) (83. 3 ) ( ) and, AB C = J[C x I(AB)]= (85. ) (83. ) (84. ) ( ) If it were necessary, we could find ABC as ABC = I[C x J(AB)] and ABC as J[C x J(AB)]. However, these components must be computed using the data from the appropriate replicate. The analysis of variance table: Source SS DF MS F Replicates.6696 Blocks within Replicates.38 4 A B C AB < AC < BC ABC (rep I).45.6 < ABC (rep II)..5.5 AB C AB C Error Total Consider the data from replicate I in Problem 9-3. Suppose that only a one-third fraction of this design with I=ABC is run. Construct the design, determine the alias structure, and analyze the design. 9-

312 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The design is,,,,,,,,. The alias structure is: A = BC = AB C B = AC = AB C C = AB = ABC AB = AC = BC C A B Source SS DF A.5 B.3 C.8 AB.3 Total From examining Figure 9-9, what type of design would remain if after completing the first 9 runs, one of the three factors could be dropped? A full 3 factorial design results. 9-4 Construct a 3 4 IV The 7 runs for this design are as follows: design with I=ABCD. Write out the alias structure for this design. A = AB C D = BCD B = AB CD = ACD C = ABC D = ABD D = ABCD = ABC AB = ABC D = CD AB = AC D = BC D AC = AB CD = BD AC = AB D = BC D BC = AB C D = AD BC = AB D = AC D BD = AB C = ACD CD = ABC = ABCD AD = AB C = BCD 9-

313 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 9-5 Verify that the design in Problem 9-4 is a resolution IV design. The design in Problem 9-4 is a Resolution IV design because no main effect is aliased with a component of a two-factor interaction, but some two-factor interaction components are aliased with each other. 9-6 Construct a 3 5- design with I=ABC and I=CDE. Write out the alias structure for this design. What is the resolution of this design? The complete defining relation for this design is : I = ABC = CDE = ABC DE = ABD E This is a resolution III design. The defining contrasts are L = X + X + X 3 and L = X 3 + X 4 + X 5. To find the alias of any effect, multiply the effect by I and I. For example, the alias of A is: A = AB C = ACDE = AB CDE = AB DE = BC = AC D E = BC DE = BD E 9-7 Construct a design, and verify that is a resolution III design. Use the generators I = AC D, I = AB C E, I = BC F, I = AB CG, I = ABCH, and I = ABJ To find the alias of any effect, multiply the effect by I and I. For example, the alias of C is: C = C(BC F ) = BF, At least one main effect is aliased with a component of a two-factor interaction. 9-8 Construct a 4 x 3 design confounded in two blocks of 6 observations each. Outline the analysis of variance for this design. Design is a 4 x 3, with ABC at two levels, and Z at 4 levels. Represent Z with two pseudo-factors D and E as follows: 9-3

314 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Pseudo- Factors Z D E Z = () Z = d Z 3 = e Z 4 = de The 4 x 3 is now a 5 in the factors A, B, C, D and E. Confound ABCDE with blocks. We have given both the letter notation and the digital notation for the treatment combinations. Block Block () = a = ab = b = ac = c = bc = abc = abcd = bcd = abce = bce = cd = acd = ce = ace = de = 3 ade = 3 abde = 3 bde = 3 bcde = 3 abcde = 3 be = abd = ad = abe = ae = d = acde = 3 e = bd = cde = 3 Source DF A B C Z (D+E+DE) 3 AB AC AZ (AD+AE+ADE) 3 BC BZ (BD+BE+BDE) 3 CZ (CD+CE+CDE) 3 ABC ABZ (ABD+ABE+ABDE) 3 ACZ (ACD+ACE+ACDE) 3 BCZ (BCD+BCE+BCDE) 3 ABCZ (ABCD+ABCE) Blocks (or ABCDE) Total Outline the analysis of variance table for a 3 factorial design. Discuss how this design may be confounded in blocks. 9-4

315 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Suppose we have n replicates of a 3 factorial design. A and B are at levels, and C and D are at 3 levels. Source DF Components for Confounding A A B B C C D D AB AB AC AC AD AD BC BD BD CD,CD CD 4 ABC ABC ABD ABD ACD,ACD ACD 4 BCD,BCD BCD 4 ABCD,ABCD ABCD 4 Error 36(n-) Total 36n- Confounding in this series of designs is discussed extensively by Margolin (967). The possibilities for a single replicate of the 3 design are: blocks of 8 observations 6 blocks of 6 observations 3 blocks of observations 9 blocks of 4 observations 4 blocks of 9 observations For example, one component of the four-factor interaction, say ABCD, could be selected to confound the design in 3 blocks for observations each, while to confound the design in blocks of 8 observations 3 each we would select the AB interaction. Cochran and Cox (957) and Anderson and McLean (974) discuss confounding in these designs. 9- Starting with a 6-run 4 design, show how two three-level factors can be incorporated in this experiment. How many two-level factors can be included if we want some information on two-factor interactions? Use column A and B for one three-level factor and columns C and D for the other. Use the AC and BD columns for the two, two-level factors. The design will be of resolution V. 9- Starting with a 6-run 4 design, show how one three-level factor and three two-level factors can be accommodated and still allow the estimation of two-factor interactions. Use columns A and B for the three-level factor, and columns C and D and ABCD for the three two-level factors. This design will be of resolution V. 9- In Problem 9-6, you met Harry and Judy Peterson-Nedry, two friends of the author who have a winery and vineyard in Newberg, Oregon. That problem described the application of two-level fractional factorial designs to their 985 Pinor Noir product. In 987, they wanted to conduct another Pinot Noir experiment. The variables for this experiment were Variable Clone of Pinot Noir Berry Size Levels Wadenswil, Pommard Small, Large 9-5

316 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Fermentation temperature 8F, 85F, 9/8F, 9F Whole Berry None, % Maceration Time days, days Yeast Type Assmanhau, Champagne Oak Type Troncais, Allier Harry and Judy decided to use a 6-run two-level fractional factorial design, treating the four levels of fermentation temperature as two two-level variables. As in Problem 8-6, they used the rankings from a taste-test panel as the response variable. The design and the resulting average ranks are shown below: Berry Ferm. Whole Macer. Yeast Oak Average Run Clone Size Temp. Berry Time Type Type Rank (a) Describe the aliasing in this design. The design is a resolution IV design such that the main effects are aliased with three factor interactions. Design Expert Output Term Aliases Intercept ABCG ABDH ABEF ACDF ACEH ADEG AFGH BCDE BCFH BDFG BEGH CDGH CEFG DEFH A BCG BDH BEF CDF CEH DEG FGH ABCDE B ACG ADH AEF CDE CFH DFG EGH C ABG ADF AEH BDE BFH DGH EFG D ABH ACF AEG BCE BFG CGH EFH E ABF ACH ADG BCD BGH CFG DFH F ABE ACD AGH BCH BDG CEG DEH G ABC ADE AFH BDF BEH CDH CEF H ABD ACE AFG BCF BEG CDG DEF AB CG DH EF ACDE ACFH ADFG AEGH BCDF BCEH BDEG BFGH AC BG DF EH ABDE ABFH ADGH AEFG BCDH BCEF CDEG CFGH AD BH CF EG ABCE ABFG ACGH AEFH BCDG BDEF CDEH DFGH AE BF CH DG ABCD ABGH ACFG ADFH BCEG BDEH CDEF EFGH AF BE CD GH ABCH ABDG ACEG ADEH BCFG BDFH CEFH DEFG AG BC DE FH ABDF ABEH ACDH ACEF BDGH BEFG CDFG CEGH AH BD CE FG ABCF ABEG ACDG ADEF BCGH BEFH CDFH DEGH (b) Analyze the data and draw conclusions. All of the main effects except Yeast and Oak are significant. The Macer Time is the most significant. None of the interactions were significant. 9-6

317 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Rank Normal plot A: Clone B: Berry Size C: Ferm T emp D: Ferm T emp E: Wh o le B erry F: Macer Time G: Yeast H: Oak Normal % probability A E D C B F Effect Design Expert Output Response: Rank ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B C D E F <. Residual Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev.. R-Squared.9669 Mean 8.5 Adj R-Squared.9449 C.V. 3.5 Pred R-Squared.8954 PRESS Adeq Precision 9.7 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Clone B-Berry Size C-Ferm Temp D-Ferm Temp E-Whole Berry F-Macer Time Final Equation in Terms of Coded Factors: Rank = * A +.75 * B +.75 * C +.88 * D -.87 * E +4. * F 9-7

318 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (c) What comparisons can you make between this experiment and the 985 Pinot Noir experiment from Problem 8-6? The experiment from Problem 8-6 indicates that yeast, barrel, whole cluster and the clone x yeast interactions were significant. This experiment indicates that maceration time, whole berry, clone and fermentation temperature are significant. 9-3 An article by W.D. Baten in the 956 volume of Industrial Quality Control described an experiment to study the effect of three factors on the lengths of steel bars. Each bar was subjected to one of two heat treatment processes, and was cut on one of four machines at one of three times during the day (8 am, am, or 3 pm). The coded length data are shown below (a) Analyze the data from this experiment assuming that the four observations in each cell are replicates. The Machine effect (C) is significant, the Heat Treat Process (B) is also significant, while the Time of Day (A) is not significant. None of the interactions are significant. Time of Day 8am am 3 pm Heat Treat Process Machine Design Expert Output Response: Length ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B C <. AB AC BC ABC Pure Error Cor Total The Model F-value of 4.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..49 R-Squared.5688 Mean 3.96 Adj R-Squared

319 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY C.V Pred R-Squared.334 PRESS Adeq Precision 7. Coefficient Standard 95% CI 95% CI Term Estimate DF Error Low High VIF Intercept A[] A[] B-Process C[] C[] C[3] A[]B A[]B A[]C[] A[]C[] A[]C[] A[]C[] A[]C[3] A[]C[3] BC[] BC[] BC[3] A[]BC[] A[]BC[] A[]BC[] A[]BC[] A[]BC[3] A[]BC[3] Final Equation in Terms of Coded Factors: Length = * A[] -.65 * A[] -.67 * B -.54 * C[] +.9 * C[] -3.8 * C[3] +. * A[]B +.6 * A[]B +.3 * A[]C[] -.7 * A[]C[] -.39 * A[]C[] -. * A[]C[] +.4 * A[]C[3] +.77 * A[]C[3] -.5 * BC[] -.46 * BC[] +.46 * BC[3] -.94 * A[]BC[] -.44 * A[]BC[] +. * A[]BC[] -. * A[]BC[] -.43 * A[]BC[3] +.6 * A[]BC[3] (b) Analyze the residuals from this experiment. Is there any indication that there is an outlier in one cell? If you find an outlier, remove it and repeat the analysis from part (a). What are your conclusions? Standard Order 84, Time of Day at 3:pm, Heat Treat #, Machine #, and length of, appears to be an outlier. 9-9

320 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted The following analysis was performed with the outlier described above removed. As with the original analysis, Machine is significant and Heat Treat Process is also significant, but now Time of Day, factor A, is also significant with an F-value of 3.5 (the P-value is just above.5). Design Expert Output Response: Length ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A B C <. AB AC BC ABC Pure Error Cor Total. 94 The Model F-value of 4.89 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..36 R-Squared.63 Mean 4. Adj R-Squared.4878 C.V. 59. Pred R-Squared.3 PRESS 75.7 Adeq Precision Coefficient Standard 95% CI 95% CI Term Estimate DF Error Low High VIF Intercept z A[] A[] B-Process C[] C[] C[3] A[]B A[]B A[]C[] A[]C[] E-3 A[]C[] A[]C[]

321 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A[]C[3] A[]C[3] BC[] BC[] BC[3] A[]BC[] E A[]BC[] A[]BC[] A[]BC[] A[]BC[3] A[]BC[3] Final Equation in Terms of Coded Factors: Length = * A[] -.73 * A[] -.58 * B -.63 * C[] +.8 * C[] -3.7 * C[3] -.76 * A[]B +.5 * A[]B +.4 * A[]C[] -.8 * A[]C[] -.65 * A[]C[] -.36 * A[]C[] +.33 * A[]C[3] +.86 * A[]C[3] -.34 * BC[] -. * BC[] +.37 * BC[3] E-3 * A[]BC[] -.35 * A[]BC[] -.5 * A[]BC[] -.36 * A[]BC[] -.34 * A[]BC[3] +.69 * A[]BC[3] The following residual plots are acceptable. Both the normality and constant variance assumptions are satisfied Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted 9-

322 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (c) Suppose that the observations in the cells are the lengths (coded) of bars processed together in heat treating and then cut sequentially (that is, in order) on the three machines. Analyze the data to determine the effects of the three factors on mean length. The analysis with all effects and interactions included: Design Expert Output Response: Length ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model A B C AB AC BC ABC Pure Error. Cor Total The by removing the three factor interaction from the model and applying it to the error, the analysis identifies factor C as being significant and B as being mildly significant. Design Expert Output Response: Length ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B C AB AC BC Residual Cor Total When removing the remaining insignificant factors from the model, C, Machine, is the most significant factor while B, Heat Treat Process, is moderately significant. Factor A, Time of Day, is not significant. Design Expert Output Response: Avg ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant B C <. Residual Cor Total The Model F-value of 3.43 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..4 R-Squared.7387 Mean 3.96 Adj R-Squared.6837 C.V Pred R-Squared.583 PRESS 6.53 Adeq Precision

323 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Term Coefficient Estimate DF Standard Error 95% CI Low 95% CI High VIF Intercept B-Process C[] C[] C[3] Final Equation in Terms of Coded Factors: Avg = * B -.54 * C[] +.9 * C[] -3.8 * C[3] The following residual plots are acceptable. Both the normality and uniformity of variance assumptions are verified. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted (d) Calculate the log variance of the observations in each cell. Analyze the average length and the log variance of the length for each of the bars cut at each machine/heat treatment process combination. What conclusions can you draw? Factor B, Heat Treat Process, has an affect on the log variance of the observations while Factor A, Time of Day, and Factor C, Machine, are not significant at the 5 percent level. However, A is significant at the percent level, so Tome of Day has some effect on the variance. Design Expert Output Response: Log(Var) ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B C AB BC Residual.. 9-3

324 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Cor Total 4. 3 The Model F-value of.5 implies there is a 6.48% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..3 R-Squared.6967 Mean.65 Adj R-Squared.486 C.V. 49. Pred R-Squared -.33 PRESS 4.86 Adeq Precision Coefficient Standard 95% CI 95% CI Term Estimate DF Error Low High VIF Intercept A[] A[] B-Process E-3.9. C[] C[] C[3] A[]B E-3 A[]B BC[] BC[] BC[3] Final Equation in Terms of Coded Factors: Log(Var) = * A[] -.6 * A[] +.4 * B +. * C[] +.66 * C[] -.9 * C[3] -. * A[]B +.4 * A[]B -.8 * BC[] -.5 * BC[] +.4 * BC[3] The following residual plots are acceptable. Both the normality and uniformity of variance assumptions are verified. Normal plot of residuals Residuals vs. Predicted Norm al % probability Residuals Residual Predicted 9-4

325 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (e) Suppose the time at which a bar is cut really cannot be controlled during routine production. Analyze the average length and the log variance of the length for each of the bars cut at each machine/heat treatment process combination. What conclusions can you draw? The analysis of the average length is as follows: Design Expert Output Response: Avg ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model A B AB Pure Error. Cor Total Because the Means Square of the AB interaction is much less than the main effects, it is removed from the model and placed in the error. The average length is strongly affected by Factor B, Machine, and moderately affected by Factor A, Heat Treat Process. The interaction effect was small and removed from the model. Design Expert Output Response: Avg ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B Residual Cor Total The Model F-value of 5. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..6 R-Squared.979 Mean 3.96 Adj R-Squared.93 C.V. 5.3 Pred R-Squared.799 PRESS 7.75 Adeq Precision 3.89 Coefficient Standard 95% CI 95% CI Term Estimate DF Error Low High VIF Intercept A-Process B[] B[] B[3] Final Equation in Terms of Coded Factors: Avg = * A -.54 * B[] +.9 * B[] -3.8 * B[3] The following residual plots are acceptable. Both the normality and uniformity of variance assumptions are verified. 9-5

326 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted The Log(Var) is analyzed below: Design Expert Output Response: Log(Var) ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model A.9.9 B AB Pure Error. Cor Total.3 7 Because the AB interaction has the smallest Mean Square, it was removed from the model and placed in the error. From the following analysis of variance, neither Heat Treat Process, Machine, nor the interaction affect the log variance of the length. Design Expert Output Response: Log(Var) ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B Residual Cor Total.3 7 The "Model F-value" of.7 implies the model is not significant relative to the noise. There is a 34.4 % chance that a "Model F-value" this large could occur due to noise. Std. Dev..8 R-Squared.6949 Mean.79 Adj R-Squared.88 C.V..9 Pred R-Squared PRESS.69 Adeq Precision 3.99 Coefficient Standard 95% CI 95% CI Term Estimate DF Error Low High VIF Intercept A-Process

327 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY B[] B[] B[3] Final Equation in Terms of Coded Factors: Log(Var) = * A +.5 * B[] +.3 * B[] -. * B[3] The following residual plots are acceptable. Both the normality and uniformity of variance assumptions are verified. Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted 9-7

328 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter Fitting Regression Models Solutions - The tensile strength of a paper product is related to the amount of hardwood in the pulp. Ten samples are produced in the pilot plant, and the data obtained are shown in the following table. Strength Percent Hardwood Strength Percent Hardwood (a) Fit a linear regression model relating strength to percent hardwood. Minitab Output Regression Analysis: Strength versus Hardwood The regression equation is Strength = Hardwood Predictor Coef SE Coef T P Constant Hardwood S =.3 R-Sq = 97.% R-Sq(adj) = 96.6% PRESS = R-Sq(pred) = 94.9% Regression Plot Strength = Hardwood S =.3 R-Sq = 97. % R-Sq(adj) = 96.6 % 9 Strength Hardwood 3 (b) Test the model in part (a) for significance of regression. Minitab Output -

329 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates No evidence of lack of fit (P >.) (c) Find a 95 percent confidence interval on the parameter. The 95 percent confidence interval is: ˆ t, n p se ˆ ˆ t seˆ, n p A plant distills liquid air to produce oxygen, nitrogen, and argon. The percentage of impurity in the oxygen is thought to be linearly related to the amount of impurities in the air as measured by the pollution count in part per million (ppm). A sample of plant operating data is shown below. Purity(%) Pollution count (ppm) (a) Fit a linear regression model to the data. Minitab Output Regression Analysis: Purity versus Pollution The regression equation is Purity = Pollution Predictor Coef SE Coef T P Constant Pollutio S =.477 R-Sq = 87.4% R-Sq(adj) = 86.4% PRESS = R-Sq(pred) = 8.77% -

330 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Regression Plot Purity = Pollution S = R-Sq = 87.4 % R-Sq(adj) = 86.4 % Purity Pollution. (b) Test for significance of regression. Minitab Output Analysis of Variance Source DF SS MS F P Regression Residual Error Total No replicates. Cannot do pure error test. No evidence of lack of fit (P >.) (c) Find a 95 percent confidence interval on. The 95 percent confidence interval is: ˆ t, n p se ˆ ˆ t seˆ, n p Plot the residuals from Problem - and comment on model adequacy. -3

331 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot of the Residuals (response is Strength) Normal Score Residual 3 Residuals Versus the Fitted Values (response is Strength) 3 Residual Fitted Value 9-4

332 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus the Order of the Data (response is Strength) 3 Residual Observation Order There is nothing unusual about the residual plots. The underlying assumptions have been met. -4 Plot the residuals from Problem - and comment on model adequacy. Normal Probability Plot of the Residuals (response is Purity) Normal Score Residual..5-5

333 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus the Fitted Values (response is Purity).5 Residual Fitted Value Residuals Versus the Order of the Data (response is Purity).5 Residual Observation Order 4 There is nothing unusual about the residual plots. The underlying assumptions have been met. -5 Using the results of Problem -, test the regression model for lack of fit. Minitab Output Analysis of Variance Source DF SS MS F P Regression Residual Error

334 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Lack of Fit Pure Error Total rows with no replicates No evidence of lack of fit (P >.) -6 A study was performed on wear of a bearing y and its relationship to x = oil viscosity and x = load. The following data were obtained. y x x (a) Fit a multiple linear regression model to the data. Minitab Output Regression Analysis: Wear versus Viscosity, Load The regression equation is Wear = Viscosity -.54 Load Predictor Coef SE Coef T P VIF Constant Viscosit Load S = 5.5 R-Sq = 86.% R-Sq(adj) = 77.% PRESS = R-Sq(pred) =.3% (b) Test for significance of regression. Minitab Output Analysis of Variance Source DF SS MS F P Regression Residual Error Total 5 4. No replicates. Cannot do pure error test. Source DF Seq SS Viscosit 4.4 Load 9. * Not enough data for lack of fit test (c) Compute t statistics for each model parameter. What conclusions can you draw? Minitab Output Regression Analysis: Wear versus Viscosity, Load The regression equation is Wear = Viscosity -.54 Load Predictor Coef SE Coef T P VIF Constant

335 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Viscosit Load S = 5.5 R-Sq = 86.% R-Sq(adj) = 77.% PRESS = R-Sq(pred) =.3% The t-tests are shown in part (a). Notice that overall regression is significant (part(b)), but neither variable has a large t-statistic. This could be an indicator that the regressors are nearly linearly dependent. -7 The brake horsepower developed by an automobile engine on a dynomometer is thought to be a function of the engine speed in revolutions per minute (rpm), the road octane number of the fuel, and the engine compression. An experiment is run in the laboratory and the data that follow are collected. Brake Horsepower rpm Road Octane Number Compression (a) Fit a multiple linear regression model to the data. Minitab Output Regression Analysis: Horsepower versus rpm, Octane, Compression The regression equation is Horsepower = rpm Octane +.87 Compression Predictor Coef SE Coef T P VIF Constant rpm Octane Compress S = 8.8 R-Sq = 8.7% R-Sq(adj) = 73.4% PRESS = R-Sq(pred) =.33% (b) Test for significance of regression. What conclusions can you draw? Minitab Output Analysis of Variance Source DF SS MS F P Regression Residual Error Total 3. r No replicates. Cannot do pure error test. Source DF Seq SS rpm Octane 3.56 Compress Lack of fit test Possible interactions with variable Octane (P-Value =.8) Possible lack of fit at outer X-values (P-Value =.) Overall lack of fit test is significant at P =. -8

336 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (c) Based on t tests, do you need all three regressor variables in the model? Yes, all of the regressor variables are important. -8 Analyze the residuals from the regression model in Problem -7. Comment on model adequacy. Normal Probability Plot of the Residuals (response is Horsepow) Normal Score Residual Residuals Versus the Fitted Values (response is Horsepow) Residual Fitted Value

337 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus the Order of the Data (response is Horsepow) Residual Observation Order 8 The normal probability plot is satisfactory, as is the plot of residuals versus run order (assuming that observation order is run order). The plot of residuals versus predicted response exhibits a slight bow shape. This could be an indication of lack of fit. It might be useful to consider adding some ineraction terms to the model. -9 The yield of a chemical process is related to the concentration of the reactant and the operating temperature. An experiment has been conducted with the following results. Yield Concentration Temperature (a) Suppose we wish to fit a main effects model to this data. Set up the X X matrix using the data exactly as it appears in the table (b) Is the matrix you obtained in part (a) diagonal? Discuss your response. -

338 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The X X is not diagonal, even though an orthogonal design has been used. The reason is that we have worked with the natural factor levels, not the orthogonally coded variables. (c) Suppose we write our model in terms of the usual coded variables.5 x Conc,.5 x 65 Temp 5 Set up the X X matrix for the model in terms of these coded variables. Is this matrix diagonal? Discuss your response The X X matrix is diagonal because we have used the orthogonally coded variables. (d) Define a new set of coded variables. x Conc,. x 5 Temp 3 Set up the X X matrix for the model in terms of this set of coded variables. Is this matrix diagonal? Discuss your response The X X is not diagonal, even though an orthogonal design has been used. The reason is that we have not used orthogonally coded variables. (e) Summarize what you have learned from this problem about coding the variables. If the design is orthogonal, use the orthogonal coding. This not only makes the analysis somewhat easier, but it also results in model coefficients that are easier to interpret because they are both dimensionless and uncorrelated. -

339 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY - Consider the 4 factorial experiment in Example 6-. Suppose that the last observation in missing. Reanalyze the data and draw conclusions. How do these conclusions compare with those from the original example? The regression analysis with the one data point missing indicates that the same effects are important. Minitab Output Regression Analysis: Rate versus A, B, C, D, AB, AC, AD, BC, BD, CD The regression equation is Rate = A +.5 B C + 7. D -.5 AB AC + 8. AD +.87 BC -.5 BD -.87 CD Predictor Coef SE Coef T P VIF Constant A B C D AB AC AD BC BD CD S = R-Sq = 97.6% R-Sq(adj) = 9.6% PRESS = 75. R-Sq(pred) = 65.9% Analysis of Variance Source DF SS MS F P Regression Residual Error Total No replicates. Cannot do pure error test. Source DF Seq SS A 44.4 B 4. C 6.86 D AB.6 AC 5.63 AD 94.5 BC 6.7 BD.7 CD. -

340 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot of the Residuals (response is Rate) Normal Score Residual 5 Residuals Versus the Fitted Values (response is Rate) 5 Residual Fitted Value 8 9 Residuals Versus the Order of the Data (response is Rate) 5 Residual Observation Order 4-3

341 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The residual plots are acceptable; therefore, the underlying assumptions are valid. - Consider the 4 factorial experiment in Example 6-. Suppose that the last two observations are missing. Reanalyze the data and draw conclusions. How do these conclusions compare with those from the original example? The regression analysis with the one data point missing indicates that the same effects are important. Minitab Output Regression Analysis: Rate versus A, B, C, D, AB, AC, AD, BC, BD, CD The regression equation is Rate = A +.87 B C D -.66 AB AC AD +.5 BC +. BD +.75 CD Predictor Coef SE Coef T P VIF Constant A B C D AB AC AD BC BD CD S = 4.73 R-Sq = 98.7% R-Sq(adj) = 94.% PRESS = R-Sq(pred) = 7.% Analysis of Variance Source DF SS MS F P Regression Residual Error Total No replicates. Cannot do pure error test. Source DF Seq SS A B.5 C D 76. AB.7 AC 7.53 AD 738. BC 4.9 BD 6. CD 4.5-4

342 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot of the Residuals (response is Rate) Normal Score Residual 3 Residuals Versus the Fitted Values (response is Rate) 3 Residual Fitted Value 8 9 Residuals Versus the Order of the Data (response is Rate) 3 Residual Observation Order 4-5

343 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The residual plots are acceptable; therefore, the underlying assumptions are valid. - Given the following data, fit the second-order polynomial regression model y x x x x xx y x x After you have fit the model, test for significance of regression. Minitab Output Regression Analysis: y versus x, x, x^, x^, xx The regression equation is y = x +.7 x x^ +. x^ xx Predictor Coef SE Coef T P VIF Constant x x x^ x^ xx S = 6.4 R-Sq = 99.4% R-Sq(adj) = 98.9% PRESS = 37.7 R-Sq(pred) = 96.4% r Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates Source DF Seq SS x 55. x 95.3 x^.9 x^ 5.8 xx

344 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Normal Probability Plot of the Residuals (response is y) Normal Score Residual 5 Residuals Versus the Fitted Values (response is y) 5 Residual -5 7 Fitted Value 7 Residuals Versus the Order of the Data (response is y) 5 Residual Observation Order 8-7

345 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -3 (a) Consider the quadratic regression model from Problem -. Compute t statistics for each model parameter and comment on the conclusions that follow from the quantities. Minitab Output Predictor Coef SE Coef T P VIF Constant x x x^ x^ xx x is the only model parameter that is statistically significant with a t-value of 3.5. A logical model might also include x to preserve model hierarchy. (b) Use the extra sum of squares method to evaluate the value of the quadratic terms, the model. x, x and x x to The extra sum of squares due to is SS R SS,, SS, SS SS,, R R R, SS sum of squares of regression for the model in Problem - = R SS R F SS R =345.3, SS R, MSE Since F.5,3, , then the addition of the quadratic terms to the model is significant. The P-values indicate that it s probably the term x that is responsible for this. R -4 Relationship between analysis of variance and regression. Any analysis of variance model can be expressed in terms of the general linear model y = X +, where the X matrix consists of zeros and ones. Show that the single-factor model y, i=,,3, j=,,3,4 can be written in general linear model form. Then ij i ij (a) Write the normal equations the model in Chapter 3. ( XX ) ˆ Xy and compare them with the normal equations found for The normal equations are ( XX ) ˆ Xy -8

346 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ˆ y ˆ y ˆ y 4 ˆ 3 y which are in agreement with the results of Chapter 3. (b) Find the rank of XX. Can ( XX ) be obtained? XXis a 4 x 4 matrix of rank 3, because the last three columns add to the first column. Thus (X X) - does not exist. (c) Suppose the first normal equation is deleted and the restriction n ˆ i i is added. Can the resulting system of equations be solved? If so, find the solution. Find the regression sum of squares ˆXy, and compare it to the treatment sum of squares in the single-factor model. 3 Imposing n ˆi yields the normal equations i ˆ y ˆ y ˆ y 4ˆ 3 y The solution to this set of equations is y.. ˆ y ˆ y This solution was found be solving the last three equations for ˆ i, yielding substituting in the first equation to find ˆ y.. The regression sum of squares is a i SS R ˆ X y =.. i... y y y.... i. y.. i y i..... y an a i y n y an ˆ i y i ˆ, and then with a degrees of freedom. This is the same result found in Chapter 3. For more discussion of the relationship between analysis of variance and regression, see Montgomery and Peck (99). a i y i. n. -5 Suppose that we are fitting a straight line and we desire to make the variance of as small as possible. Restricting ourselves to an even number of experimental points, where should we place these points so as to minimize V ˆ? (Note: Use the design called for in this exercise with great caution because, even though it minimized V ˆ, it has some undesirable properties; for example, see Myers and -9

347 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY - Montgomery (995). Only if you are very sure the true functional relationship is linear should you consider using this design. Since S xx V ˆ, we may minimize ˆ V by making S xx as large as possible. S xx is maximized by spreading out the x j s as much as possible. The experimenter usually has a region of interest for x. If n is even, n/ of the observations should be run at each end of the region of interest. If n is odd, then run one of the observations in the center of the region and the remaining (n-)/ at either end. -6 Weighted least squares. Suppose that we are fitting the straight line x y, but the variance of the y s now depends on the level of x; that is, n i w x y V i i,...,,, where the w i are known constants, often called weights. Show that if we choose estimates of the regression coefficients to minimize the weighted sum of squared errors given by n i i i i x y w, the resulting least squares normal equations are n i n i i i i n i i i y w x w w ˆ ˆ n i n i i i i i n i i i i y x w x w x w ˆ ˆ The least squares normal equations are found: ˆ ˆ ˆ ˆ n i i i n i i i n i i i w x x y L w x y L w x y L which simplify to i n i i n i n i i i i n i i n i n i i i y x w w x w x y w w x w ˆ ˆ ˆ ˆ -7 Consider the 4 IV design discussed in Example -5.

348 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY - (a) Suppose you elect to augment the design with the single run selected in that example. Find the variances and covariances of the regression coefficients in the model (ignoring blocks): x x x x x x x x y X X' ) ( X'X (b) Are there any other runs in the alternate fraction that Any other run from the alternate fraction will dealias AB from CD. (c) Suppose you augment the design with four runs suggested in Example -5. Find the variance and the covariances of the regression coefficients (ignoring blocks) for the model in part (a). Choose 4 runs that are one of the quarter fractions not used in the principal half fraction. X X'

349 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY X' X X' X (d) Considering parts (a) and (c), which augmentation strategy would you prefer and why? If you only have the resources to run one more run, then choose the one-run augmentation. But if resources are not scarce, then augment the design in multiples of two runs, to keep the design orthogonal. Using four runs results in smaller variances of the regression coefficients and a simpler covariance structure Consider the III. Suppose after running the experiment, the largest observed effects are A + BD, B + AD, and D + AB. You wish to augment the original design with a group of four runs to dealias these effects. (a) Which four runs would you make? Take the first four runs of the original experiment and change the sign on A. Design Expert Output Factor Factor Factor 3 Factor 4 Factor 5 Factor 6 Factor 7 Std Run Block A:x B:x C:x3 D:x4 E:x5 F:x6 G:x7 Block Block Block Block Block Block Block Block Block Block Block Block Main effects and interactions of interest are: x x x4 xx xx4 xx

350 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (b) Find the variances and covariances of the regression coefficients in the model y x x 4 x4 xx 4 x x4 4 x x4 X' X X' X (c) Is it possible to dealias these effects with fewer than four additional runs? It is possible to dealias these effects in only two runs. By utilizing Design Expert s design augmentation function, the runs 9 and (Block ) were generated as follows: Design Expert Output Factor Factor Factor 3 Factor 4 Factor 5 Factor 6 Factor 7 Std Run Block A:x B:x C:x3 D:x4 E:x5 F:x6 G:x7 Block Block Block Block Block Block Block Block Block Block

351 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter Response Surface Methods and Other Approaches to Process Optimization Solutions - A chemical plant produces oxygen by liquefying air and separating it into its component gases by fractional distillation. The purity of the oxygen is a function of the main condenser temperature and the pressure ratio between the upper and lower columns. Current operating conditions are temperature ) - C and pressure ratio ).. Using the following data find the path of steepest ascent. ( ( Temperature (x ) Pressure Ratio (x ) Purity Design Expert Output Response: Purity ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B Curvature not significant Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 6.7 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..4 R-Squared.99 Mean 84. Adj R-Squared.8935 C.V..9 Pred R-Squared.73 PRESS. Adeq Precision.7 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Temperature B-Pressure Ratio Center Point Final Equation in Terms of Coded Factors: Purity = * A +.5 * B Final Equation in Terms of Actual Factors: Purity =

352 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY +.7 * Temperature +.5 * Pressure Ratio From the computer output use the model ŷ x. 5x as the equation for steepest ascent. Suppose we use a one degree change in temperature as the basic step size. Thus, the path of steepest ascent passes through the point (x =, x =) and has a slope.5/.85. In the coded variables, one degree of temperature is equivalent to a step of x /5=.. Thus, x (.5/.85).=.59. The path of steepest ascent is: Coded Variables Natural Variables x x Origin Origin Origin Origin An industrial engineer has developed a computer simulation model of a two-item inventory system. The decision variables are the order quantity and the reorder point for each item. The response to be minimized is the total inventory cost. The simulation model is used to produce the data shown in the following table. Identify the experimental design. Find the path of steepest descent. Item Item Order Reorder Order Reorder Total Quantity (x) Point (x) Quantity (x3) Point (x4) Cost The design is a 4- fractional factorial with generator I=ABCD, and three center points. Design Expert Output Response: Total Cost ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A C D AC AD CD Curvature significant Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 63.6 implies the model is significant. There is only -

353 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY a.3% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 3. R-Squared.99 Mean Adj R-Squared.9765 C.V..48 Pred R-Squared.9593 PRESS 9.5 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Item QTY C-Item QTY D-Item Reorder AC AD CD Center Point Final Equation in Terms of Coded Factors: Total Cost = * A +3.5 * C +7.5 * D -7. * A * C * A * D +9.5 * C * D Final Equation in Terms of Actual Factors: Total Cost = * Item QTY +. * Item QTY * Item Reorder -.4 * Item QTY * Item QTY * Item QTY * Item Reorder +.85 * Item QTY * Item Reorder +.9 * Item QTY * Item Reorder The equation used to compute the path of steepest ascent is ŷ x 3. 5x3 7. 5x4. Notice that even though the model contains interaction, it is relatively common practice to ignore the interactions in computing the path of steepest ascent. This means that the path constructed is only an approximation to the path that would have been obtained if the interactions were considered, but it s usually close enough to give satisfactory results. It is helpful to give a general method for finding the path of steepest ascent. Suppose we have a first-order model in k variables, say ŷ ˆ k ˆ i x i i The path of steepest ascent passes through the origin, x=, and through the point on a hypersphere of radius, R where ŷ is a maximum. Thus, the x s must satisfy the constraint k i x i R To find the set of x s that maximize ŷ subject to this constraint, we maximize -3

354 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY k L ˆ ˆ i xi i i k x i R where is a LaGrange multiplier. From L / xi L /, we find ˆ i xi It is customary to specify a basic step size in one of the variables, say x j, and then calculate as = ˆ j / x j. Then this value of can be used to generate the remaining coordinates of a point on the path of steepest ascent. We demonstrate using the data from this problem. Suppose that we use - units in as the basic step size. Note that a decrease in is called for, because we are looking for a path of steepest decent. Now - units in is equal to -/ = -.5 units change in x. Thus, = ˆ / x = 9.5/(-.5) = -8.5 Consequently, x x ˆ ˆ are the remaining coordinates of points along the path of steepest decent, in terms of the coded variables. The path of steepest decent is shown below: Coded Variables Natural Variables x x x 3 x Origin Origin Origin Verify that the following design is a simplex. Fit the first-order model and find the path of steepest ascent. Position x x x 3 y

355 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4 x x 3 3 x The graphical representation of the design identifies a tetrahedron; therefore, the design is a simplex. Design Expert Output Response: y ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model A B.6.6 C.4.4 Pure Error. Cor Total Std. Dev. R-Squared. Mean 9.55 Adj R-Squared C.V. Pred R-Squared N/A PRESS N/A Adeq Precision. Case(s) with leverage of.: Pred R-Squared and PRESS statistic not defined Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept 9.55 A-x.35. B-x.55. C-x3.6. Final Equation in Terms of Coded Factors: y = * A +.55 * B +.6 * C Final Equation in Terms of Actual Factors: y = * x * x +.6 * x3-5

356 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The first order model is ŷ x. 55x. 6x3. To find the path of steepest ascent, let the basic step size be x 3. Then using the results obtained in the previous problem, we obtain ˆ 3 x3 or. =.6 which yields. 6. Then the coordinates of points on the path of steepest ascent are defined by Therefore, in the coded variables we have: x x ˆ ˆ Coded Variables x x x 3 Origin.6.4. Origin Origin For the first-order model ŷ 6. 5x. 8x. x3 find the path of steepest ascent. The variables are coded as x. Let the basic step size be 3 ˆ 3 x3 or. = x ˆ. x ˆ. Therefore, in the coded variables we have i Then. Coded Variables x x x 3 Origin Origin Origin The region of experimentation for three factors are time ( 4 T 8 min), temperature ( T 3 C), and pressure ( P 5 psig). A first-order model in coded variables has been fit to yield data from a 3 design. The model is -6

357 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ŷ 3 5x x. 5x 3. 5 Is the point T = 85, T = 35, P=6 on the path of steepest ascent? The coded variables are found with the following: 6 T 5 x T P 35 x x T 5 5 x. 5 ˆ x or.5 = ˆ. 5 x. 5 ˆ x3. 75 Coded Variables Natural Variables x x x 3 T T P Origin Origin Origin The point T =85, T =35, and P=6 is not on the path of steepest ascent. 3-6 The region of experimentation for two factors are temperature ( T 3 F) and catalyst feed rate ( C 3 lb/h). A first order model in the usual coded variables has been fit to a molecular weight response, yielding the following model. (a) Find the path of steepest ascent. ŷ 5x 4x T x x C T x ˆ 5 x or 5 ˆ 4 x. 3 5 Coded Variables Natural Variables x x T C Origin

358 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Origin Origin (a) It is desired to move to a region where molecular weights are above 5. Based on the information you have from the experiment, in this region, about how may steps along the path of steepest ascent might be required to move to the region of interest? ŷ x ˆ ˆ x. # Steps The path of steepest ascent is usually computed assuming that the model is truly first-order.; that is, there is no interaction. However, even if there is interaction, steepest ascent ignoring the interaction still usually produces good results. To illustrate, suppose that we have fit the model ŷ 5x x 8x 3x using coded variables (- x +) (a) Draw the path of steepest ascent that you would obtain if the interaction were ignored. Path of Steepest Ascent for Main Effects Model - - X X (b) Draw the path of steepest ascent that you would obtain with the interaction included in the model. Compare this with the path found in part (a). -8

359 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Path of Steepest Ascent for Full Model - - X X -8 The data shown in the following table were collected in an experiment to optimize crystal growth as a function of three variables x, x, and x 3. Large values of y (yield in grams) are desirable. Fit a second order model and analyze the fitted surface. Under what set of conditions is maximum growth achieved? x x x 3 y Design Expert Output Response: Yield ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B C

360 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A B C AB AC BC Residual Lack of Fit not significant Pure Error Cor Total The "Model F-value" of.9 implies the model is not significant relative to the noise. There is a.94 % chance that a "Model F-value" this large could occur due to noise. Std. Dev R-Squared.663 Mean 83.5 Adj R-Squared.3598 C.V Pred R-Squared PRESS Adeq Precision 3.88 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-x B-x C-x A B C AB AC BC Final Equation in Terms of Coded Factors: Yield = * A +.36 * B -.49 * C * A -.43 * B -9.6 * C +.87 * A * B -.63 * A * C * B * C Final Equation in Terms of Actual Factors: Yield = * x * x * x * x * x * x * x * x -.65 * x * x * x * x3 There are so many nonsignificant terms in this model that we should consider eliminating some of them. A reasonable reduced model is shown below. Design Expert Output -

361 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Response: Yield ANOVA for Response Surface Reduced Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant B C B C Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 4.95 implies the model is significant. There is only a.95% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..6 R-Squared.569 Mean 83.5 Adj R-Squared.454 C.V. 5.7 Pred R-Squared.46 PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept B-x C-x B C Final Equation in Terms of Coded Factors: Yield = * B -.49 * C -.6 * B -9.3 * C Final Equation in Terms of Actual Factors: Yield = * x * x * x * x3 The contour plot identifies a maximum near the center of the design space. -

362 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Yield X = B: x Y = C: x3. 8 Yield 85 8 Design Points Actual Factor A: x =. C: x Prediction % Low % High6.9 SE mean SE pred 3.38 X.6 Y B: x -9 The following data were collected by a chemical engineer. The response y is filtration time, x is temperature, and x is pressure. Fit a second-order model. x x y Design Expert Output Response: y ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B A AB Residual Lack of Fit significant Pure Error Cor Total 47. The "Model F-value" of.57 implies the model is not significant relative to the noise. There is a.94 % chance that a "Model F-value" this large could occur due to noise. Std. Dev. 5.7 R-Squared.56 -

363 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Mean 45. Adj R-Squared.3433 C.V..7 Pred R-Squared PRESS Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Temperature B-Pressure A AB Final Equation in Terms of Coded Factors: Time = * A -.79 * B * A +6. * A * B Final Equation in Terms of Actual Factors: Time = * Temperature * Pressure * Temperature +6. * Temperature * Pressure The lack of fit test in the above analysis is significant. Also, the residual plot below identifies an outlier which happens to be standard order number 8. Normal plot of residuals Normal % probability Residual We chose to remove this run and re-analyze the data. Design Expert Output Response: y ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A

364 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY B A AB Residual Lack of Fit not significant Pure Error Cor Total 43. The Model F-value of 3.3 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..84 R-Squared.945 Mean 44.5 Adj R-Squared.938 C.V. 4.3 Pred R-Squared.89 PRESS 8.66 Adeq Precision 8.43 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Temperature B-Pressure A AB Final Equation in Terms of Coded Factors: Time = * A -4.8 * B * A +6. * A * B Final Equation in Terms of Actual Factors: Time = * Temperature * Pressure * Temperature +6. * Temperature * Pressure The lack of fit test is satisfactory as well as the following normal plot of residuals: Normal plot of residuals Norm al % probability Residual -4

365 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) What operating conditions would you recommend if the objective is to minimize the filtration time?. Prediction % Low % High SE mean.97 SE pred.458 X -.68 Y Time B: Press ure A: Temperature (b) What operating conditions would you recommend if the objective is to operate the process at a mean filtration time very close to 46?. 34 Time B: Press ure A: Temperature There are two regions that enable a filtration time of 46. Either will suffice; however, higher temperatures and pressures typically have higher operating costs. We chose the operating conditions at the lower pressure and temperature as shown. - The hexagon design that follows is used in an experiment that has the objective of fitting a secondorder model. x x y 68-5

366 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Fit the second-order model Design Expert Output Response: y ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B A B AB Residual Lack of Fit not significant Pure Error Cor Total The "Model F-value" of.89 implies the model is not significant relative to the noise. There is a 5. % chance that a "Model F-value" this large could occur due to noise. Std. Dev. 5.9 R-Squared.6545 Mean Adj R-Squared.39 C.V. 8. Pred R-Squared -.5 PRESS Adeq Precision 3.75 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-x B-x A B AB Final Equation in Terms of Coded Factors: y = * A +.73 * B +4. * A * B +.5 * A * B (a) Perform the canonical analysis. What type of surface has been found? The full quadratic model is used in the following analysis because the reduced model is singular. -6

367 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Solution Variable Critical Value X X Predicted Value at Solution Eigenvalues and Eigenvectors Variable X X Since both eigenvalues are positive, the response is a minimum at the stationary point. (c) What operating conditions on x and x lead to the stationary point? The stationary point is (x,x ) = (-.6766, -.583) (d) Where would you run this process if the objective is to obtain a response that is as close to 65 as possible?.87 y B: x A: x Any value of x and x that give a point on the contour with value of 65 would be satisfactory. - An experimenter has run a Box-Behnken design and has obtained the results below, where the response variable is the viscosity of a polymer. Level Temp. Agitation Rate Pressure x x x 3 High Middle Low

368 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Fit the second-order model. Run x x x 3 y Design Expert Output Response: Viscosity ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C A B C AB AC BC Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 9.54 implies the model is significant. There is only a.5% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 3.3 R-Squared.945 Mean Adj R-Squared.846 C.V. 5.6 Pred R-Squared.3347 PRESS 63.5 Adeq Precision.45 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Temperatue B-Agitation Rate C-Pressure A B C AB AC BC Final Equation in Terms of Coded Factors: -8

369 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Viscosity = * A +7.6 * B -6. * C -75. * A +9.5 * B * C -9.5 * A * B +9.5 * A * C * B * C Final Equation in Terms of Actual Factors: Viscosity = * Temperatue * Agitation Rate -.6 * Pressure -. * Temperatue +3. * Agitation Rate -.43 * Pressure -.3 * Temperatue * Agitation Rate * Temperatue * Pressure +.4 * Agitation Rate * Pressure (b) Perform the canonical analysis. What type of surface has been found? The system is a saddle point. Solution Variable Critical Value X X X Predicted Value at Solution Eigevalues and Eigevectors Variable X X X (c) What operating conditions on x, x, and x 3 lead to the stationary point? The stationary point is (x, x, x 3 ) = (.8496, ,.7586). This is outside the design region. It would be necessary to either examine contour plots or use numerical optimization methods to find desired operating conditions. (d) What operating conditions would you recommend if it is important to obtain a viscosity that is as close to 6 as possible? -9

370 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot V i sco si ty X = A: Temperatue Y = C: Pressure Design Points.5 Actual Factor B: Agitation Rate = 7.5 C: Pressure Viscosity A: Tem peratue Any point on either of the contours showing a viscosity of 6 is satisfactory. - Consider the three-variable central composite design shown below. Analyze the data and draw conclusions, assuming that we wish to maximize conversion (y ) with activity (y ) between 55 and 6. Time Temperature Catalyst Conversion (%) Activity Run (min) (C) (%) y y Quadratic models are developed for the Conversion and Activity response variables as follows: -

371 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Conversion ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C A B C AB AC <. BC Residual.47.5 Lack of Fit not significant Pure Error Cor Total The Model F-value of.76 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 4.7 R-Squared.999 Mean 78.3 Adj R-Squared.8479 C.V. 6. Pred R-Squared.7566 PRESS 676. Adeq Precision 4.39 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time B-Temperature C-Catalyst A B C AB AC BC Final Equation in Terms of Coded Factors: Conversion = * A +4.4 * B +6. * C -.83 * A +.94 * B -5.9 * C +.3 * A * B +.38 * A * C * B * C Final Equation in Terms of Actual Factors: Conversion = * Time * Temperature * Catalyst * Time * Temperature * Catalyst +.5 * Time * Temperature * Time * Catalyst -

372 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Temperature * Catalyst Design Expert Output Response: Activity ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A <. B C A B C AB AC.. 3.6E BC Residual Lack of Fit significant Pure Error Cor Total The Model F-value of 9.6 implies the model is significant. There is only a.9% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..76 R-Squared.898 Mean 6.5 Adj R-Squared.7945 C.V..9 Pred R-Squared.536 PRESS 4.43 Adeq Precision.9 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time B-Temperature C-Catalyst A B C AB AC BC Final Equation in Terms of Coded Factors: Conversion = * A +.5 * B +.3 * C +.83 * A +.75 * B +.57 * C -.39 * A * B -.38 * A * C +.3 * B * C Final Equation in Terms of Actual Factors: Conversion = * Time * Temperature * Catalyst * Time * Temperature -

373 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Catalyst * Time * Temperature * Time * Catalyst +.35 * Temperature * Catalyst Because many of the terms are insignificant, the reduced quadratic model is fit as follows: Design Expert Output Response: Activity ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. C <. A Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..46 R-Squared.884 Mean 6.5 Adj R-Squared.859 C.V..4 Pred R-Squared.63 PRESS 6.4 Adeq Precision.447 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time C-Catalyst A Final Equation in Terms of Coded Factors: Activity = * A +.3 * C +.8 * A Final Equation in Terms of Actual Factors: Activity = * Time * Catalyst +.83 * Time -3

374 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Conversion X = A: Time Y = C: Catalyst Design Points Actual Factor B: Temperature = Conversion DESIGN-EXPERT Plot Activity X = A: Time Y = C: Catalyst Design Points Actual Factor B: Temperature = Activity C: Catalyst C: Catalyst A: Tim e A: Tim e DESIGN-EXPERT Plot Overlay Plot X = A: Time Y = C: Catalyst. Overlay Plot Design Points Actual Factor B: Temperature = -..5 Conv ersion: 8 C: Catalyst. Activ ity : A: Tim e The contour plots visually describe the models while the overlay plots identifies the acceptable region for the process. -4

375 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -3 A manufacturer of cutting tools has developed two empirical equations for tool life in hours (y ) and for tool cost in dollars (y ). Both models are linear functions of steel hardness (x ) and manufacturing time (x ). The two equations are ŷ ŷ 5x 3 3x x 4x and both equations are valid over the range -.5x.5. Unit tool cost must be below $7.5 and life must exceed hours for the product to be competitive. Is there a feasible set of operating conditions for this process? Where would you recommend that the process be run? The contour plots below graphically describe the two models. The overlay plot identifies the feasible operating region for the process..5 Life.5 Cost B: Tim e B: Tim e A: H ardn es s A: H ardn es s.5 Overlay Plot.75 Cost: 7.5 B: Tim e. Life: x 3 3x A: H ardn es s x 4x

376 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -4 A central composite design is run in a chemical vapor deposition process, resulting in the experimental data shown below. Four experimental units were processed simultaneously on each run of the design, and the responses are the mean and variance of thickness, computed across the four units. x x y s (a) Fit a model to the mean response. Analyze the residuals. Design Expert Output Response: Mean Thick ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B A B AB Residual Lack of Fit not significant Pure Error Cor Total 59.9 The Model F-value of 6. implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 4.3 R-Squared.937 Mean 47.3 Adj R-Squared.873 C.V. 5.5 Pred R-Squared.63 PRESS Adeq Precision.6 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-x B-x A B AB Final Equation in Terms of Coded Factors: Mean Thick = * A * B -7

377 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -.9 * A * B +6.5 * A * B Final Equation in Terms of Actual Factors: Mean Thick = * x * x -.9 * x * x +6.5 * x * x Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted A modest deviation from normality can be observed in the Normal Plot of Residuals; however, not enough to be concerned. (b) Fit a model to the variance response. Analyze the residuals. Design Expert Output Response: Var Thick ANOVA for Response Surface FI Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B AB Residual Lack of Fit not significant Pure Error Cor Total 7.69 The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..78 R-Squared.938 Mean 9.7 Adj R-Squared.948 C.V Pred R-Squared.89 PRESS 7.64 Adeq Precision

378 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Coefficient Estimate DF Standard Error 95% CI Low 95% CI High VIF Intercept A-x B-x AB Final Equation in Terms of Coded Factors: Var Thick = * A -.38 * B -.5 * A * B Final Equation in Terms of Actual Factors: Var Thick = * x * x * x * x Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted The residual plots are not acceptable. A transformation should be considered. If not successful at correcting the residual plots, further investigation into the two apparently unusual points should be made. (c) Fit a model to the ln(s ). Is this model superior to the one you found in part (b)? Design Expert Output Response: Var Thick Transform: Natural log Constant: ANOVA for Response Surface FI Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. B AB Residual E-3 Lack of Fit E not significant Pure Error E-3 Cor Total.7-9

379 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..78 R-Squared.937 Mean.8 Adj R-Squared.974 C.V Pred R-Squared.8797 PRESS.87 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-x B-x AB Final Equation in Terms of Coded Factors: Ln(Var Thick) = * A -.3 * B -.4 * A * B Final Equation in Terms of Actual Factors: Ln(Var Thick) = * x * x * x * x Normal plot of residuals Residuals vs. Predicted Normal % probability Residuals Residual Predicted The residual plots are much improved following the natural log transformation; however, the two runs still appear to be somewhat unusual and should be investigated further. They will be retained in the analysis. (d) Suppose you want the mean thickness to be in the interval 45±5. Find a set of operating conditions that achieve the objective and simultaneously minimize the variance. -3

380 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY. Mean Thick 575. Ln(Var Thick) B: x B: x A: x A: x. Overlay Plot.5 B: x. Ln(Var Thick):. 4 Mean Thick: Mean Thick: A: x The contour plots describe the two models while the overlay plot identifies the acceptable region for the process. (e) Discuss the variance minimization aspects of part (d). Have you minimized total process variance? The within run variance has been minimized; however, the run-to-run variation has not been minimized in the analysis. This may not be the most robust operating conditions for the process. -5 Verify that an orthogonal first-order design is also first-order rotatable. To show that a first order orthogonal design is also first order rotatable, consider V( ŷ ) k ˆ ˆ ˆ V( x ) V( ) x V( ˆ i i i i ) i k i -3

381 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -3 since all covariances between i and j are zero, due to design orthogonality. Furthermore, we have: n ) ˆ V(... ) ˆ V( ) ˆ V( ˆ V k, so k i x i n n ŷ ) V( k i x i n n ŷ ) V( which is a function of distance from the design center (i.e. x=), and not direction. Thus the design must be rotatable. Note that n is, in general, the number of points in the exterior portion of the design. If there are n c centerpoints, then ) n ( n ) ˆ V( c. -6 Show that augmenting a k design with n c center points does not affect the estimates of the i (i=,,..., k), but that the estimate of the intercept is the average of all k + n c observations. In general, the X matrix for the k design with n c center points and the y vector would be: k X The upper half of the matrix is the usual notation of the k design The lower half of the matrix represents the center points (n c rows) c k n n n y y y y k +n c observations. k k c k n X X' g k g g g y X' Grand total of all k +n c observations usual contrasts from k

382 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Therefore, ˆ g k n k c observations, while ˆ g i i, which does k nc not depend on the number of center points, since in computing the contrasts g i, all observations at the center are multiplied by zero., which is the average of all -7 The rotatable central composite design. It can be shown that a second-order design is rotatable if n a b x u iu x ju if a or b (or both) are odd and if / 4 composite design these conditions lead to the factorial portion. n f n u 4 iu x n 3 x u iux ju. Show that for the central for rotatability, where n f is the number of points in The balance between + and - in the factorial columns and the orthogonality among certain column in the X matrix for the central composite design will result in all odd moments being zero. To solve for use the following relations: n n 4 4 x iu n f, u u x iu x ju n f then n f 4 n u 4 4 x 4 iu 4 n n n f f 3 3( n f n u f ) x iu x ju -8 Verify that the central composite design shown below blocks orthogonally. Block Block Block 3 x x x 3 x x x 3 x x x Note that each block is an orthogonal first order design, since the cross products of elements in different columns add to zero for each block. To verify the second condition, choose a column, say column x. Now k u xu , and n= -33

383 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY For blocks and, So m m n u x m 4, n m =6 x x m u n m.3 = and condition is satisfied by blocks and. For block 3, we have m x m , n m = 8, so m n u x x m u nm n 8.4 =.4 And condition is satisfied by block 3. Similar results hold for the other columns. -9 Blocking in the central composite design. Consider a central composite design for k = 4 variables in two blocks. Can a rotatable design always be found that blocks orthogonally? To run a central composite design in two blocks, assign the n f factorial points and the n center points to block and the k axial points plus n center points to block. Both blocks will be orthogonal first order designs, so the first condition for orthogonal blocking is satisfied. The second condition implies that m m x x im im block n f n k n block c c -34

384 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY However, xim n f in block and x im m m in block, so Which gives: n n n f f c k nc n f k n n f n Since 4 if the design is to be rotatable, then the design must satisfy n f c c n f n f k n n f n c c It is not possible to find rotatable central composite designs which block orthogonally for all k. For example, if k=3, the above condition cannot be satisfied. For k=, there must be an equal number of center points in each block, i.e. n c = n c. For k=4, we must have n c = 4 and n c =. - How could a hexagon design be run in two orthogonal blocks? The hexagonal design can be blocked as shown below. There are n c = n c = n c center points with n c even. 6 n Put the points,3,and 5 in block and,4,and 6 in block. Note that each block is a simplex. - Yield during the first four cycles of a chemical process is shown in the following table. The variables are percent concentration (x ) at levels 3, 3, and 3 and temperature (x ) at 4, 4, and 44F. Analyze by EVOP methods. Conditions Cycle () () (3) (4) (5) -35

385 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Cycle: n= Phase Calculation of Averages Calculation of Standard Deviation Operating Conditions () () (3) (4) (5) (i) Previous Cycle Sum Previous Sum S= (ii) Previous Cycle Average Previous Average = (iii) New Observation New S=Range x f k,n (iv) Differences Range= (v) New Sums New Sum S= (vi) New Averages New average S = New Sum S/(n-)= Calculation of Effects A y3 y4 y y5 B y3 y4 y y5 AB y3 y4 y y5 CIM y3 y4 y y5 4y Calculation of Error Limits For New Average: S n For New Effects: S n.78 For CIM: S n Cycle: n= Phase Calculation of Averages Calculation of Standard Deviation Operating Conditions () () (3) (4) (5) (i) Previous Cycle Sum Previous Sum S= (ii) Previous Cycle Average Previous Average = (iii) New Observation New S=Range x f k,n=.38 (iv) Differences Range=4.6 (v) New Sums New Sum S=.38 (vi) New Averages New average S = New Sum S/(n-)=.38 Calculation of Effects A y3 y4 y y5 B y3 y4 y y5 AB y3 y4 y y5 CIM y3 y4 y y5 4y Calculation of Error Limits.95 For New Average: S n.95 For New Effects: S n For CIM: S n Cycle: n=3 Phase Calculation of Averages Calculation of Standard Deviation Operating Conditions () () (3) (4) (5) (i) Previous Cycle Sum Previous Sum S=.38-36

386 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (ii) Previous Cycle Average Previous Average =.38 (iii) New Observation New S=Range x f k,n=.8 (iv) Differences Range=6.5 (v) New Sums New Sum S=3.66 (vi) New Averages New average S = New Sum S/(n-)=.38 Calculation of Effects A y3 y4 y y5 B y3 y4 y y5 AB y3 y4 y y5 CIM y3 y4 y y5 4y Calculation of Error Limits. For New Average: S n. For New Effects: S n For CIM: S n Cycle: n=4 Phase Calculation of Averages Calculation of Standard Deviation Operating Conditions () () (3) (4) (5) (i) Previous Cycle Sum Previous Sum S=3.66 (ii) Previous Cycle Average Previous Average =.83 (iii) New Observation New S=Range x f k,n=.45 (iv) Differences Range=6.63 (v) New Sums New Sum S=6. (vi) New Averages New average S = New Sum S/(n-)=.4 Calculation of Effects A y3 y4 y y5 B y3 y4 y y5 AB y3 y4 y y5 CIM y3 y4 y y5 4y Calculation of Error Limits.4 For New Average: S n.4 For New Effects: S n.78.8 For CIM: S n From studying cycles 3 and 4, it is apparent that A (and possibly B) has a significant effect. A new phase should be started following cycle 3 or 4. - Suppose that we approximate a response surface with a model of order d, such as y=x +, when the true surface is described by a model of order d >d ; that is E(y)= X + X. (a) Show that the regression coefficients are biased, that is, that E( )= +A, where A=(X X ) - X X. A is usually called the alias matrix. -37

387 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -38 Aβ β β X X X X X β X X X β X X β X X X y X X X X y X X β ' ' ' ' ' ' ' ' ' ' E E E ˆ where ' ' X X X X A (a) If d = and d =, and a full k is used to fit the model, use the result in part (a) to determine the alias structure. In this situation, we have assumed the true surface to be first order, when it is really second order. If a full factorial is used for k=, then X = X = and A = Then, E = E ˆ ˆ ˆ The pure quadratic terms bias the intercept. (b) If d =, d = and k=3, find the alias structure assuming that a 3- design is used to fit the model. X = 3 X = and A = Then, E = E ˆ ˆ ˆ ˆ (d) If d =, d =, k=3, and the simplex design in Problem -3 is used to fit the model, determine the alias structure and compare the results with part (c).

388 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY X = 3 X = and A = Then, E = E ˆ ˆ ˆ ˆ Notice that the alias structure is different from that found in the previous part for the 3- design. In general, the A matrix will depend on which simplex design is used. -3 In an article ( Let s All Beware the Latin Square, Quality Engineering, Vol., 989, pp ) J.S. Hunter illustrates some of the problems associated with 3 k-p fractional factorial designs. Factor A is the amount of ethanol added to a standard fuel and factor B represents the air/fuel ratio. The response variable is carbon monoxide (CO) emission in g/m. The design is shown below. Design Observations A B x x y y Notice that we have used the notation system of,, and to represent the low, medium, and high levels for the factors. We have also used a geometric notation of -,, and. Each run in the design is replicated twice. (a) Verify that the second-order model 7. x 4. 5x 4. x 9. x ŷ x x is a reasonable model for this experiment. Sketch the CO concentration contours in the x, x space. In the computer output that follows, the coded factors model is in the -,, + scale. Design Expert Output Response: CO Emis ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. -39

389 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY B <. A B AB <. Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 5.95 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..5 R-Squared.955 Mean 7.83 Adj R-Squared.9363 C.V Pred R-Squared.9 PRESS 69.7 Adeq Precision.95 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Ethanol B-Air/Fuel Ratio A B AB Final Equation in Terms of Coded Factors: CO Emis = * A -7. * B -4.5 * A -4. * B -9. * A * B CO Emis B: Air/Fuel Ratio A: Ethanol (b) Now suppose that instead of only two factors, we had used four factors in a 3 4- fractional factorial design and obtained exactly the same data in part (a). The design would be as follows: Design Observations A B C D x x x 3 x 4 y y -4

390 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Confirm that this design is an L 9 orthogonal array. This is the same as the design in Table -. (c) Calculate the marginal averages of the CO response at each level of the four factors A, B, C, and D. Construct plots of these marginal averages and interpret the results. Do factors C and D appear to have strong effects? Do these factors really have any effect on CO emission? Why is their apparent effect strong? One Factor Plot One Factor Plot CO Emis 76 CO Emis A: A B: B -4

391 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY One Factor Plot One Factor Plot CO Emis 76 CO Emis C: C D: D Both Factors C and D appear to have an effect on CO emission. This is probably because both C and D are aliased with components of interaction involving A and B, and there is a strong AB interaction. (a) The design in part (b) allows the model y to be fitted. Suppose that the true model is y 4 4 i x 4 i i 4 i iixi i xi ii xi i i ij x x Show that if j represents the least squares estimates of the coefficients in the fitted model, then E E E E E E E E E ˆ ˆ 3 4 / ˆ / ˆ / ˆ 4 4 3/ ˆ 3 4 / ˆ / ˆ / 4 ˆ / 3 ij i j -4

392 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -43 Let X = and X = Then, A X X X X ' ' = A = ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ E -4 Suppose that you need to design an experiment to fit a quadratic model over the region i x, i=, subject to the constraint x x. If the constraint is violated, the process will not work properly. You can afford to make no more than n= runs. Set up the following designs: (a) An inscribed CCD with center points at x x x x

393 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a)* An inscribed CCD with center points at x x 5 so that a larger design could be fit within the constrained region. x x (a) An inscribed 3 factorial with center points at x x. 5 x x (a) A D-optimal design. x x

394 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) A modified D-optimal design that is identical to the one in part (c), but with all replicate runs at the design center. x x (a) Evaluate the ( X X) criteria for each design. X X (a) (a)* (b) (c) (d) (a) Evaluate the D-efficiency for each design relative to the D-optimal design in part (c). (a) (a)* (b) (c) (d) D-efficiency 4.5%.64% 49.4%.% 87.3% (a) Which design would you prefer? Why? The offset CCD, (a)*, is the preferred design based on the D-efficiency. Not only is it better than the D- optimal design, (c), but it maintains the desirable design features of the CCD. -45

395 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -5 Consider a 3 design for fitting a first-order model. (a) Evaluate the D-criterion ( X X) for this design. ( X X) =.44E-4 (b) Evaluate the A-criterion tr ( XX) for this design. tr ( XX) =.5 (c) Find the maximum scaled prediction variance for this design. Is this design G-optimal? v x NVar ŷx Nx XX x 4. Yes, this is a G-optimal design. -6 Repeat Problem -5 using a first order model with the two-factor interaction. ( X X) = 4.768E-7 tr ( XX) =.875 v x NVar ŷx Nx XX x 7. Yes, this is a G-optimal design. -7 A chemical engineer wishes to fit a calibration curve for a new procedure used to measure the concentration of a particular ingredient in a product manufactured in his facility. Twelve samples can be prepared, having known concentration. The engineer s interest is in building a model for the measured concentrations. He suspects that a linear calibration curve will be adequate to model the measured concentration as a function of the known concentrations; that is, where x is the actual concentration. Four experimental designs are under consideration. Design consists of 6 runs at known concentration and 6 runs at known concentration. Design consists of 4 runs at concentrations, 5.5, and. Design 3 consists of 3 runs at concentrations, 4, 7, and. Finally, design 4 consists of 3 runs at concentrations and and 6 runs at concentration 5.5. (a) Plot the scaled variance of prediction for all four designs on the same graph over the concentration range. Which design would be preferable, in your opinion? -46

396 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3.5 Scaled Variance of Prediction Design 4 Design 3 Design Design Because it has the lowest scaled variance of prediction at all points in the design space with the exception of 5.5, Design is preferred. (b) For each design calculate the determinant of to the D criterion? ( X X). Which design would be preferred according Design ( X X) Design would be preferred. (c) Calculate the D-efficiency of each design relative to the best design that you found in part b. Design D-efficiency.% 8.65% % 4 7.7% (a) For each design, calculate the average variance of prediction over the set of points given by x =,.5,,.5,...,. Which design would you prefer according to the V-criterion? Average Variance of Prediction Design Actual Coded

397 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design is still preferred based on the V-criterion. (e) Calculate the V-efficiency of each design relative to the best design you found in part (d). (f) What is the G-efficiency of each design? Design V-efficiency.% 88.% 3 8.4% % Design G-efficiency.% 8.% 3 7.4% % -8 Rework Problem -7 assuming that the model the engineer wishes to fit is a quadratic. Obviously, only designs, 3, and 4 can now be considered. 4.5 Scaled Variance of Prediction Design 4 Design 3 Design Based on the plot, the preferred design would depend on the region of interest. Design 4 would be preferred if the center of the region was of interest; otherwise, Design would be preferred. Design ( X X) 4.74E E E-7 Design is preferred based on ( X X). -48

398 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design 4 is preferred. Design D-efficiency.% % % Average Variance of Prediction Design Actual Coded Design V-efficiency 9.89% % 4.% Design G-efficiency.% 3 79.% 4 75.% -9 An experimenter wishes to run a three-component mixture experiment. The constraints are the components proportions are as follows:. x. x.4 x (a) Set up an experiment to fit a quadratic mixture model. Use n=4 runs, with 4 replicates. Use the D- criteria. (a) Draw the experimental design region. Std x x x

399 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A: x B: x..7 C: x3 (c) Set up an experiment to fit a quadratic mixture model with n= runs, assuming that three of these runs are replicated. Use the D-criterion. Std x x x

400 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A: x B: x..7 C: x3 (d) Comment on the two designs you have found. The design points are the same for both designs except that the edge center on the x-x3 edge is not included in the second design. None of the replicates for either design are in the center of the experimental region. The experimental runs are fairly uniformly spaced in the design region. -3 Myers and Montgomery (995) describe a gasoline blending experiment involving three mixture components. There are no constraints on the mixture proportions, and the following run design is used. Design Point x x x 3 y(mpg) 4.5, , , ½ ½ 5. 5 ½ ½ ½ ½ /3 /3 /3 4.8, 4. 8 /3 /6 / /6 /3 /6 3.9 /6 /6 /3 3.7 (a) What type of design did the experimenters use? A simplex centroid design was used. (b) Fit a quadratic mixture model to the data. Is this model adequate? Design Expert Output Response: y ANOVA for Mixture Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F -5

401 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Model Linear Mixture significant AB AC BC Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of 3.9 implies the model is significant. There is only a 4.35% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..47 R-Squared.79 Mean 4.6 Adj R-Squared.574 C.V..93 Pred R-Squared.44 PRESS 5.7 Adeq Precision Coefficient Standard 95% CI 95% CI Component Estimate DF Error Low High A-x B-x C-x AB AC BC Final Equation in Terms of Pseudo Components: y = * A +4.3 * B +3.8 * C +.5 * A * B +. * A * C -.9 * B * C Final Equation in Terms of Real Components: y = * x * x * x * x * x * x * x * x * x3 The quadratic terms appear to be insignificant. The analysis below is for the linear mixture model: Design Expert Output Response: y ANOVA for Mixture Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant Linear Mixture Residual.3.8 Lack of Fit not significant Pure Error Cor Total The Model F-value of.64 implies the model is significant. There is only a.7% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..43 R-Squared.659 Mean 4.6 Adj R-Squared.597 C.V..78 Pred R-Squared.396 PRESS 3.6 Adeq Precision

402 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Coefficient Standard 95% CI 95% CI Component Estimate DF Error Low High A-x B-x C-x Adjusted Adjusted Approx t for H Component Effect DF Std Error Effect= Prob > t A-x B-x C-x Final Equation in Terms of Pseudo Components: y = * A * B +3.9 * C Final Equation in Terms of Real Components: y = * x * x * x3 (c) Plot the response surface contours. What blend would you recommend to maximize the MPG? A: x B: x. y. C: x3 To maximize the miles per gallon, the recommended blend is x =, x =, and x 3 =. -3 Consider the bottle filling experiment in Example 6-. Suppose that the percent carbonation (A) is a noise variable (in coded units ). z (a) Fit the response model to these data. Is there a robust design problem? From the analysis below, the AB interaction appears to have some importance. Because of this, there is opportunity for improvement in the robustness of the process. Design Expert Output -53

403 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Response: Fill Height ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A <. B C AB AC BC ABC Pure Error Cor Total The Model F-value of 6.69 implies the model is significant. There is only a.3% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..79 R-Squared.9359 Mean. Adj R-Squared.8798 C.V Pred R-Squared.7436 PRESS. Adeq Precision 3.46 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Carbination B-Pressure C-Speed AB AC BC ABC Final Equation in Terms of Coded Factors: Fill Height = * A +.3 * B +.88 * C +.38 * A * B +.3 * A * C +.5 * B * C +.5 * A * B * C Final Equation in Terms of Actual Factors: Fill Height = * Carbination +7.8 * Pressure +.8 * Speed -.75 * Carbination * Pressure -.5 * Carbination * Speed -.4 * Pressure * Speed +4.E-3 * Carbination * Pressure * Speed (b) Find the mean model and either the variance model or the POE. The mean model in coded terms is: yx,z.. 3B. 88C. BC E z 5 Contour plots of the mean model and POE are shown below: -54

404 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Fill Height X = B: Pressure Y = C: Speed. Fill Height.5 3 DESIGN-EXPERT Plot POE(Fill Height) X = B: Pressure Y = C: Speed. POE(Fill Height). Coded Factor A: Carbination =..5 Coded Factor A: Carbination =..5.4 C: Speed..5.5 C: Speed B: Press ure B: Press ure (c) Find a set of conditions that result in mean fill deviation as close to zero as possible with minimum transmitted variability from carbonation. The overlay plot below identifies a region that meets these requirements. The Pressure should be set at its low level and the Speed should be set between approximately. and.5 in coded terms. DESIGN-EXPERT Plot Overlay Plot X = B: Pressure Y = C: Speed Coded Factor A: Carbination =...5 Overlay Plot C: Speed. Fill Height: Fill Height: B: Press ure -3 Consider the experiment in Problem -. Suppose that temperature is a noise variable ( z in coded units). Fit response models for both responses. Is there a robust design problem with respect to both responses? Find a set of conditions that maximize conversion with activity between 55 and 6, and that minimize the variability transmitted from temperature. The following is the analysis of variance for the Conversion response. Because of a significant BC interaction, there is some opportunity for improvement in the robustness of the process with regards to Conversion. Design Expert Output -55

405 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Response: Conversion ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C A B C AB AC <. BC Residual.47.5 Lack of Fit not significant Pure Error Cor Total The Model F-value of.76 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 4.7 R-Squared.999 Mean 78.3 Adj R-Squared.8479 C.V. 6. Pred R-Squared.7566 PRESS 676. Adeq Precision 4.39 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time B-Temperature C-Catalyst A B C AB AC BC Final Equation in Terms of Coded Factors: Conversion = * A +4.4 * B +6. * C -.83 * A +.94 * B -5.9 * C +.3 * A * B +.38 * A * C * B * C Final Equation in Terms of Actual Factors: Conversion = * Time * Temperature * Catalyst * Time * Temperature * Catalyst +.5 * Time * Temperature * Time * Catalyst * Temperature * Catalyst -56

406 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The following is the analysis of variance for the Activity response. Because there is not a significant interaction term involving temperature, there is no opportunity for improvement in the robustness of the process with regards to Activity. Design Expert Output Response: Activity ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A <. B C A B C AB AC.. 3.6E BC Residual Lack of Fit significant Pure Error Cor Total The Model F-value of 9.6 implies the model is significant. There is only a.9% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..76 R-Squared.898 Mean 6.5 Adj R-Squared.7945 C.V..9 Pred R-Squared.536 PRESS 4.43 Adeq Precision.9 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time B-Temperature C-Catalyst A B C AB AC BC Final Equation in Terms of Coded Factors: Conversion = * A +.5 * B +.3 * C +.83 * A +.75 * B +.57 * C -.39 * A * B -.38 * A * C +.3 * B * C Final Equation in Terms of Actual Factors: Conversion = * Time * Temperature * Catalyst * Time -57

407 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY * Temperature * Catalyst * Time * Temperature * Time * Catalyst +.35 * Temperature * Catalyst Because many of the terms are insignificant, the reduced quadratic model is fit as follows: Design Expert Output Response: Activity ANOVA for Response Surface Quadratic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model <. significant A <. C <. A Residual Lack of Fit not significant Pure Error Cor Total The Model F-value of implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..46 R-Squared.884 Mean 6.5 Adj R-Squared.859 C.V..4 Pred R-Squared.63 PRESS 6.4 Adeq Precision.447 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Time C-Catalyst A Final Equation in Terms of Coded Factors: Activity = * A +.3 * C +.8 * A Final Equation in Terms of Actual Factors: Activity = * Time * Catalyst +.83 * Time Contour plots of the mean models for the responses along with POE for Conversion are shown below: -58

408 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Conversion X = A: Time Y = C: Catalyst Design Points Actual Factor B: Temperature = Conversion DESIGN-EXPERT Plot Activity X = A: Time Y = C: Catalyst Design Points Actual Factor B: Temperature = Activity C: Catalyst C: Catalyst A: Tim e A: Tim e DESIGN-EXPERT Plot POE(Conversion) X = A: Time Y = C: Catalyst Design Points Actual Factor B: Temperature = POE(Conversion) C: Catalyst A: Tim e The overlay plot shown below identifies a region near the center of the design space that meets the constraints for the process. -59

409 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Overlay Plot X = A: Time Y = B: Temperature. Overlay Plot Design Points Actual Factor C: Catalyst =. B: Temperature.5. POE(Conv ersion): 8 Conv ersion: 8 6 Activ ity : A: Tim e -33 An experiment has been run in a process that applies a coating material to a wafer. Each run in the experiment produced a wafer, and the coating thickness was measured several times at different locations on the wafer. Then the mean y, and standard deviation y of the thickness measurement was obtained. The data [adapted from Box and Draper (987)] are shown in the table below. Run Speed Pressure Distance Mean (y ) Std Dev (y ) (a) What type of design did the experimenters use? Is this a good choice of design for fitting a quadratic model? The design is a 3 3. A better choice would be a 3 central composite design. The CCD gives more information over the design region with fewer points. -6

410 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (b) Build models of both responses. The model for the mean is developed as follows: Design Expert Output Response: Mean ANOVA for Response Surface Reduced Cubic Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model.89E E <. significant A 5.64E E <. B.55E+5.55E <. C 3.E+5 3.E+5.4 <. AB AC BC ABC Residual Cor Total.347E+6 6 The Model F-value of 6.45 implies the model is significant. There is only a.% chance that a "Model F-Value" this large could occur due to noise. Std. Dev R-Squared.957 Mean Adj R-Squared.94 C.V Pred R-Squared.956 PRESS.7E+5 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Speed B-Pressure C-Distance AB AC BC ABC Final Equation in Terms of Coded Factors: Mean = * A +9.4 * B * C * A * B * A * C * B * C * A * B * C Final Equation in Terms of Actual Factors: Mean = * Speed +9.4 * Pressure * Distance * Speed * Pressure * Speed * Distance * Pressure * Distance * Speed * Pressure * Distance The model for the Std. Dev. response is as follows. A square root transformation was applied to correct problems with the normality assumption. -6

411 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Design Expert Output Response: Std. Dev. Transform: Square root Constant: ANOVA for Response Surface Linear Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B C Residual Cor Total The Model F-value of 4.34 implies the model is significant. There is only a.45% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..99 R-Squared.366 Mean 6. Adj R-Squared.783 C.V Pred R-Squared.359 PRESS 79.5 Adeq Precision 7.78 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Speed B-Pressure C-Distance Final Equation in Terms of Coded Factors: Sqrt(Std. Dev.) = * A +. * B +.3 * C Final Equation in Terms of Actual Factors: Sqrt(Std. Dev.) = * Speed +.96 * Pressure * Distance Because Factor A is insignificant, it is removed from the model. The reduced linear model analysis is shown below: Design Expert Output Response: Std. Dev. Transform: Square root Constant: ANOVA for Response Surface Reduced Linear Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant B C Residual Cor Total The Model F-value of 5.4 implies the model is significant. There is only a.6% chance that a "Model F-Value" this large could occur due to noise. Std. Dev. 3.5 R-Squared.34 Mean 6. Adj R-Squared.59 C.V Pred R-Squared.476 PRESS 75.4 Adeq Precision Coefficient Standard 95% CI 95% CI -6

412 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Intercept Estimate 6. DF Error.59 Low 4.79 High 7. VIF B-Pressure C-Distance Final Equation in Terms of Coded Factors: Sqrt(Std. Dev.) = * B +.3 * C Final Equation in Terms of Actual Factors: Sqrt(Std. Dev.) = * Pressure * Distance The following contour plots graphically represent the two models: DESIGN-EXPERT Plot Mean X = B: Pressure Y = C: Distance Design Points Actual Factor A: Speed =. C: Distance Mean DESIGN-EXPERT Plot Sq rt(std. De v.) X = B: Pressure Y = C: Distance Design Points Actual Factor A: Speed =. C: Distance..5. Std. Dev B: Press ure B: Press ure (c) Find a set of optimum conditions that result in the mean as large as possible with the standard deviation less than 6. The overlay plot identifies a region that meets the criteria of the mean as large as possible with the standard deviation less than 6. The optimum conditions in coded terms are approximately Speed =., Pressure =.75 and Distance =

413 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Overlay Plot X = B: Pressure Y = C: Distance Design Points Actual Factor A: Speed =...5 Overlay Plot Std. Dev.: 6 Mean: 7 C: Distance B: Press ure -34 A variation of Example 6-. In example 6- we found that one of the process variables (B=pressure) was not important. Dropping this variable produced two replicates of a 3 design. The data are shown below. C D A(+) A(-) y s , 48 7, , 8 6, , 45, , 7 86, Assume that C and D are controllable factors and that A is a noise factor. (a) Fit a model to the mean response. The following is the analysis of variance with all terms in the model: Design Expert Output Response: Mean ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model A B AB Pure Error. Cor Total Based on the above analysis, the AB interaction is removed from the model and used as error. Design Expert Output Response: Mean ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F -64

414 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Model not significant A B Residual Cor Total The Model F-value of implies there is a 5.5% chance that a "Model F-Value" this large could occur due to noise. Std. Dev..87 R-Squared.9974 Mean 7.9 Adj R-Squared.993 C.V..5 Pred R-Squared.959 PRESS.5 Adeq Precision 3.67 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Concentration B-Stir Rate Final Equation in Terms of Coded Factors: Mean = * A +7.9 * B Final Equation in Terms of Actual Factors: Mean = * Concentration * Stir Rate The following is a contour plot of the mean model:. Mean B: Stir Rate A: Concentration (b) Fit a model to the ln(s ) response. The following is the analysis of variance with all terms in the model: Design Expert Output Response: Variance Transform: Natural log Constant: ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] -65

415 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Source Sum of Squares DF Mean Square F Value Prob > F Model A B.3.3 AB Pure Error. Cor Total Based on the above analysis, the AB interaction is removed from the model and applied to the residual error. Design Expert Output Response: Variance Transform: Natural log Constant: ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model not significant A B Residual Cor Total The "Model F-value" of.94 implies the model is not significant relative to the noise. There is a 38.5 % chance that a "Model F-value" this large could occur due to noise. Std. Dev..8 R-Squared.8545 Mean 5.5 Adj R-Squared.5634 C.V. 5.6 Pred R-Squared PRESS.8 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A-Concentration B-Stir Rate Final Equation in Terms of Coded Factors: Ln(Variance) = * A +.7 * B Final Equation in Terms of Actual Factors: Ln(Variance) = * Concentration +.73 * Stir Rate The following is a contour plot of the variance model in the untransformed form: -66

416 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Variance B: Stir Rate A: Concentration (c) Find operating conditions that result in the mean filtration rate response exceeding 75 with minimum variance. The overlay plot shown below identifies the region required by the process:. Overlay Plot.5 Mean: 75 B: Stir Rate. Variance: A: Concentration (d) Compare your results with those from Example -6 which used the transmission of error approach. How similar are the two answers. The results are very similar. Both require the Concentration to be held at the high level while the stirring rate is held near the middle. -67

417 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter Experiments with Random Factors Solutions - A textile mill has a large number of looms. Each loom is supposed to provide the same output of cloth per minute. To investigate this assumption, five looms are chosen at random and their output is noted at different times. The following data are obtained: Loom Output (lb/min) (a) Explain why this is a random effects experiment. Are the looms equal in output? Use =.5. The looms used in the experiment are a random sample of all the looms in the manufacturing area. The following is the analysis of variance for the data: Minitab Output ANOVA: Output versus Loom Factor Type Levels Values Loom random Analysis of Variance for Output Source DF SS MS F P Loom Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Loom.4 () + 5() Error.48 () (b) Estimate the variability between looms. ˆ MS Model n MS E (c) Estimate the experimental error variance. ˆ MS E. 48 (d) Find a 95 percent confidence interval for. MS L n MS Model E F,a,n a. 88 -

418 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY MS U n MS Model E F L L. 44,a,n a U U (e) Analyze the residuals from this experiment. Do you think that the analysis of variance assumptions are satisfied? There is nothing unusual about the residual plots; therefore, the analysis of variance assumptions are satisfied. Normal Probability Plot of the Residuals (response is Output) Normal Score Residual.. Residuals Versus the Fitted Values (response is Output).. Residual Fitted Value

419 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus Loom (response is Output).. Residual Loom A manufacturer suspects that the batches of raw material furnished by her supplier differ significantly in calcium content. There are a large number of batches currently in the warehouse. Five of these are randomly selected for study. A chemist makes five determinations on each batch and obtains the following data: Batch Batch Batch 3 Batch 4 Batch (a) Is there significant variation in calcium content from batch to batch? Use =.5. Yes, as shown in the Minitab Output below, there is a difference. Minitab Output ANOVA: Calcium versus Batch Factor Type Levels Values Batch random Analysis of Variance for Calcium Source DF SS MS F P Batch Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Batch.397 () + 5() Error.438 () (b) Estimate the components of variance. MS.. Model MS E n 5-3

420 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY MS E. 438 (c) Find a 95 percent confidence interval for MS L n MS MS U n MS Model E Model E. F,a,n a F L L,a,n a U U (d) Analyze the residuals from this experiment. Are the basic analysis of variance assumptions satisfied? There are five residuals that stand out in the normal probability plot. From the Residual vs. Batch plot, we see that one point per batch appears to stand out. A natural log transformation was applied to the data but did not change the results of the residual analysis. Further investigation should probably be performed to determine if these points are outliers. Normal Probability Plot of the Residuals (response is Calcium) Normal Score Residual. -4

421 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus the Fitted Values (response is Calcium). Residual Fitted Value Residuals Versus Batch (response is Calcium). Residual Batch Several ovens in a metal working shop are used to heat metal specimens. All the ovens are supposed to operate at the same temperature, although it is suspected that this may not be true. Three ovens are selected at random and their temperatures on successive heats are noted. The data collected are as follows: Oven Temperature (a) Is there significant variation in temperature between ovens? Use =.5. The analysis of variance shown below identifies significant variation in temperature between the ovens. Minitab Output General Linear Model: Temperature versus Oven Factor Type Levels Values -5

422 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Oven random 3 3 Analysis of Variance for Temperat, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Oven Error Total Expected Mean Squares, using Adjusted SS Source Expected Mean Square for Each Term Oven () () Error () Error Terms for Tests, using Adjusted SS Source Error DF Error MS Synthesis of Error MS Oven () Variance Components, using Adjusted SS Source Estimated Value Oven 53.7 Error (b) Estimate the components of variance. n n i ni a ni 5. MS.. Model MS E n 493. MS E (c) Analyze the residuals from this experiment. Draw conclusions about model adequacy. There is a funnel shaped appearance in the plot of residuals versus predicted value indicating a possible non-constant variance. There is also some indication of non-constant variance in the plot of residuals versus oven. The inequality of variance problem is not severe. Normal Probability Plot of the Residuals (response is Temperat) Normal Score Residual -6

423 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus the Fitted Values (response is Temperat) Residual Fitted Value Residuals Versus Oven (response is Temperat) Residual - Oven 3-4 An article in the Journal of the Electrochemical Society (Vol. 39, No., 99, pp ) describes an experiment to investigate the low-pressure vapor deposition of polysilicon. The experiment was carried out in a large-capacity reactor at Sematech in Austin, Texas. The reactor has several wafer positions, and four of these positions are selected at random. The response variable is film thickness uniformity. Three replicates of the experiments were run, and the data are as follows: Wafer Position Uniformity (a) Is there a difference in the wafer positions? Use =.5. Yes, there is a difference. -7

424 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Minitab Output ANOVA: Uniformity versus Wafer Position Factor Type Levels Values Wafer Po fixed Analysis of Variance for Uniformi Source DF SS MS F P Wafer Po Error Total.4373 Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Wafer Po () + 3Q[] Error.65 () (b) Estimate the variability due to wafer positions. (c) Estimate the random error component. MSTreatment MSE n (d) Analyze the residuals from this experiment and comment on model adequacy. Variability in film thickness seems to depend on wafer position. These observations also show up as outliers on the normal probability plot. Wafer position number appears to have greater variation in uniformity than the other positions. Normal Probability Plot of the Residuals (response is Uniformi) Normal Score Residual -8

425 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus the Fitted Values (response is Uniformi) Residual - 3 Fitted Value 4 Residuals Versus Wafer Po (response is Uniformi) Residual - Wafer Po Consider the vapor deposition experiment described in Problem -4. (a) Estimate the total variability in the uniformity response. ˆ ˆ (b) How much of the total variability in the uniformity response is due to the difference between positions in the reactor? ˆ ˆ ˆ (c) To what level could the variability in the uniformity response be reduced, if the position-to-position variability in the reactor could be eliminated? Do you believe this is a significant reduction? -9

426 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY The variability would be reduced from.37 to ˆ. 65 which is a reduction of approximately: % An article in the Journal of Quality Technology (Vol. 3, No., 98, pp. -4) describes and experiment that investigates the effects of four bleaching chemicals on pulp brightness. These four chemicals were selected at random from a large population of potential bleaching agents. The data are as follows: Chemical Pulp Brightness (a) Is there a difference in the chemical types? Use =.5. The computer output shows that the null hypothesis cannot be rejected. Therefore, there is no evidence that there is a difference in chemical types. Minitab Output ANOVA: Brightness versus Chemical Factor Type Levels Values Chemical random Analysis of Variance for Brightne Source DF SS MS F P Chemical Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Chemical -. () + 5() Error () (b) Estimate the variability due to chemical types. MS Treatment MSE n which agrees with the Minitab output. Because the variance component cannot be negative, this likely means that the variability due to chemical types is zero. (c) Estimate the variability due to random error (d) Analyze the residuals from this experiment and comment on model adequacy. -

427 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Two data points appear to be outliers in the normal probability plot of effects. These outliers belong to chemical types and 3 and should be investigated. There seems to be much less variability in brightness with chemical type 4. Normal Probability Plot of the Residuals (response is Brightne) Normal Score Residual 5 Residuals Versus the Fitted Values (response is Brightne) Residual Fitted Value 8 8 -

428 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Residuals Versus Chemical (response is Brightne) Residual 5-5 Chemical Consider the one-way balanced, random effects method. Develop a procedure for finding a (- ) percent confidence interval for /( ). We know that PL U PL U PL U L U P L U -8 Refer to Problem -. (a) What is the probability of accepting H if is four times the error variance? 4. 6 n 5 4 a N a , from the OC curve. 4 (b) If the difference between looms is large enough to increase the standard deviation of an observation by percent, we wish to detect this with a probability of at least.8. What sample size should be used? a 4 N a P( accept).. P n.. n n 44 -

429 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Trial and Error yields: n P(accept) Choose n 4, therefore N 7-9 An experiment was performed to investigate the capability of a measurement system. Ten parts were randomly selected, and two randomly selected operators measured each part three times. The tests were made in random order, and the data below resulted. (a) Analyze the data from this experiment. Minitab Output ANOVA: Measurement versus Part, Operator Operator Operator Measurements Measurements Part Number Factor Type Levels Values Part random Operator random Analysis of Variance for Measurem Source DF SS MS F P Part Operator Part*Operator Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Part (4) + 3(3) + 6() Operator (4) + 3(3) + 3() 3 Part*Operator (4) + 3(3) 4 Error.5 (4) (b) Find point estimates of the variance components using the analysis of variance method. MS E 5. -3

430 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY MS AB MS A MS E n MS B MS an MS bn All estimates agree with the Minitab output. AB , assume = ˆ ˆ, assume = AB 3 - Reconsider the data in Problem 5-6. Suppose that both factors, machines and operators, are chosen at random. (a) Analyze the data from this experiment. Machine Operator The following Minitab output contains the analysis of variance and the variance component estimates: Minitab Output ANOVA: Strength versus Operator, Machine Factor Type Levels Values Operator random 3 3 Machine random Analysis of Variance for Strength Source DF SS MS F P Operator Machine Operator*Machine Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Operator (4) + (3) + 8() Machine (4) + (3) + 6() 3 Operator*Machine (4) + (3) 4 Error (4) (b) Find point estimates of the variance components using the analysis of variance method. MS AB MS E MS n E

431 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY MS MS an B AB MS MS bn A AB ( ), assume ( ) These results agree with the Minitab variance component analysis. - Reconsider the data in Problem 5-3. Suppose that both factors are random. (a) Analyze the data from this experiment. Minitab Output General Linear Model: Response versus Row, Column Factor Type Levels Values Row random 3 3 Column random Column Factor Row Factor Analysis of Variance for Response, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Row ** Column ** Row*Column ** Error... Total ** Denominator of F-test is zero. Expected Mean Squares, using Adjusted SS Source Expected Mean Square for Each Term Row (4) + (3) + 4.() Column (4) + (3) + 3.() 3 Row*Column (4) + (3) 4 Error (4) Error Terms for Tests, using Adjusted SS Source Error DF Error MS Synthesis of Error MS Row * 4.86 (3) Column * 4.86 (3) 3 Row*Column * * (4) Variance Components, using Adjusted SS Source Estimated Value Row 7.36 Column.6 Row*Column Error. (b) Estimate the variance components. -5

432 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Because the experiment is unreplicated and the interaction term was included in the model, there is no estimate of MS E, and therefore, no estimate of. MS AB MS n MS B MS AB an MS A MS bn These estimates agree with the Minitab output. AB E ˆ ˆ ˆ Suppose that in Problem 5- the furnace positions were randomly selected, resulting in a mixed model experiment. Reanalyze the data from this experiment under this new assumption. Estimate the appropriate model components. The following analysis assumes a restricted model: Minitab Output ANOVA: Density versus Position, Temperature Factor Type Levels Values Position random Temperat fixed Analysis of Variance for Density Temperature ( C) Position Source DF SS MS F P Position Temperat Position*Temperat Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Position (4) + 9() Temperat 3 (4) + 3(3) + 6Q[] 3 Position*Temperat (4) + 3(3) 4 Error (4) MS AB MS n MS E ˆ E ˆ assume 3-6

433 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ˆ MS A bn MS E ˆ These results agree with the Minitab output. -3 Reanalyze the measurement systems experiment in Problem -9, assuming that operators are a fixed factor. Estimate the appropriate model components. The following analysis assumes a restricted model: Minitab Output ANOVA: Measurement versus Part, Operator Factor Type Levels Values Part random Operator fixed Analysis of Variance for Measurem Source DF SS MS F P Part Operator Part*Operator Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Part (4) + 6() Operator 3 (4) + 3(3) + 3Q[] 3 Part*Operator (4) + 3(3) 4 Error.5 (4) MS E ˆ. 5 MS MS E ˆ ˆ assume n 3 MS A MS E ˆ ˆ bn 3 AB These results agree with the Minitab output. -4 In problem 5-6, suppose that there are only four machines of interest, but the operators were selected at random. (a) What type of model is appropriate? A mixed model is appropriate. (b) Perform the analysis and estimate the model components. The following analysis assumes a restricted model: Minitab Output ANOVA: Strength versus Operator, Machine Factor Type Levels Values -7

434 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Operator random 3 3 Machine fixed Analysis of Variance for Strength Source DF SS MS F P Operator Machine Operator*Machine Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Operator (4) + 8() Machine 3 (4) + (3) + 6Q[] 3 Operator*Machine.86 4 (4) + (3) 4 Error 3.79 (4) ˆ MS MS AB A MS E ˆ MS n MS bn These results agree with the Minitab output. E E ˆ ˆ By application of the expectation operator, develop the expected mean squares for the two-factor factorial, mixed model. Use the restricted model assumptions. Check your results with the expected mean squares given in Table - to see that they agree. The sums of squares may be written as SS AB n SS A bn a yi.. y... i b, SSB an y. j. y... j a b yij. yi.. y. j. y... a b n, SSE yijk y... i j i j k y Using the model ijk i j ijk y y y y i... j. ij.... ij, we may find that i i. j j.... j. i. Using the assumptions for the restricted form of the mixed model,.,. j. Substituting these expressions into the sums of squares yields.. i.. ij ij., which imply that -8

435 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY SS SS SS SS A B AB E bn an n a i b j a b i j ) a b n ijk ij. i j k j i.. j. ij i..... i.... ij. Using the assumption that E ijk, V ( ijk ) E ijk i' j' k' squares by its degrees of freedom and take the expectation to produce E E MS E MS E MS A B AB MS E i... j., and bn E a an b n a b i j j a b i E i.... a b ij i. i j, we may divide each sum of Note that EMS B and EMS E are the results given in Table 8-3. We need to simplify MS A E. Consider E MS AB E E MS A MS E E crossproducts E MS A A a since ij is, a bn a bn a i MSA n a i i i i a i bn a a a a a b i NID. Consider E E E MS E MS AB AB MS n AB MS AB a b a b n n a a i j a b b i j i. E ij i. b a b a E and -9

436 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Thus EMS A and MS AB E agree with table Consider the three-factor factorial design in Example -6. Propose appropriate test statistics for all main effects and interactions. Repeat for the case where A and B are fixed and C is random. If all three factors are random there are no exact tests on main effects. We could use the following: A : F B : F MS MS MS MS MS C : F MS A AB B AB C AC MS MS MS MS ABC AC ABC MS MS If A and B are fixed and C is random, the expected mean squares are (assuming the restricted for m of the model): F F R R a b c n Factor i j k l E(MS) i b c n j a c n BC ABC bn an k a b n abn c n ij n ik b n bn jk a n an ijk n n ijkl BC i bcn a j acn b cn These are exact tests for all effects. ji a b -7 Consider the experiment in Example -7. Analyze the data for the case where A, B, and C are random. Minitab Output ANOVA: Drop versus Temp, Operator, Gauge Factor Type Levels Values Temp random Operator random Gauge random 3 3 Analysis of Variance for Drop Source DF SS MS F P Temp x -

437 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Operator x Gauge x Temp*Operator Temp*Gauge Operator*Gauge Temp*Operator*Gauge Error Total x Not an exact F-test. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Temp.44 * (8) + (7) + 8(5) + 6(4) + 4() Operator * (8) + (7) + 6(6) + 6(4) + 8() 3 Gauge -.64 * (8) + (7) + 6(6) + 8(5) + 4(3) 4 Temp*Operator (8) + (7) + 6(4) 5 Temp*Gauge (8) + (7) + 8(5) 6 Operator*Gauge (8) + (7) + 6(6) 7 Temp*Operator*Gauge (8) + (7) 8 Error.43 (8) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS Temp (4) + (5) - (7) Operator (4) + (6) - (7) 3 Gauge (5) + (6) - (7) Since all three factors are random there are no exact tests on main effects. Minitab uses an approximate F test for the these factors. -8 Derive the expected mean squares shown in Table -4. F R R R a b c n Factor i j k l E(MS) i b c n n bn j a c n an acn k a b n an abn ij c n n cn ik b n n bn jk a n an ijk n n ijkl cn bcn a i -9 Consider a four-factor factorial experiment where factor A is at a levels, factor B is at b levels, factor C is at c levels, factor D is at d levels, and there are n replicates. Write down the sums of squares, the degrees of freedom, and the expected mean squares for the following cases. Do exact tests exist for all effects? If not, propose test statistics for those effects that cannot be directly tested. Assume the restricted model on all cases. You may use a computer package such as Minitab. The four factor model is: -

438 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY yijklh i j k l ij ik il jk jl kl ijklh ijk ijl jkl ikl ijkl To simplify the expected mean square derivations, let capital Latin letters represent the factor effects or variance components. For example, A bcdn i, or B acdn. a (a) A, B, C, and D are fixed factors. F F F F R a b c d n Factor i j k l h E(MS) i b c d n A j a c d n B k a b d n C l a b c n D ( ) ij c d n AB ( ) ik b d n AC ( ) il b c n AD ( ) jk a d n BC ( ) jl a c n BD ( ) kl a b n CD ( ) ijk d n ABC ( ) ijl c n ABD ( ) jkl a n BCD ( ) ikl b n ACD ( ) ijkl n ABCD ( ijkl) h There are exact tests for all effects. The results can also be generated in Minitab as follows: Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B fixed H L C fixed H L D fixed H L Analysis of Variance for y Source DF SS MS F P A B C D A*B A*C A*D B*C B*D C*D A*B*C A*B*D

439 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A*C*D B*C*D A*B*C*D Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) A 6 (6) + 6Q[] B 6 (6) + 6Q[] 3 C 6 (6) + 6Q[3] 4 D 6 (6) + 6Q[4] 5 A*B 6 (6) + 8Q[5] 6 A*C 6 (6) + 8Q[6] 7 A*D 6 (6) + 8Q[7] 8 B*C 6 (6) + 8Q[8] 9 B*D 6 (6) + 8Q[9] C*D 6 (6) + 8Q[] A*B*C 6 (6) + 4Q[] A*B*D 6 (6) + 4Q[] 3 A*C*D 6 (6) + 4Q[3] 4 B*C*D 6 (6) + 4Q[4] 5 A*B*C*D 6 (6) + Q[5] 6 Error.38 (6) (b) A, B, C, and D are random factors. R R R R R a b c d n Factor i j k l h E(MS) i b c d n ABCD ACD ABD ABC AD AC AB A j a c d n ABCD BCD ABD ABC BD BC AB B k a b d n ABCD ACD BCD ABC AB BC CD C l a b c n ABCD ACD BCD ABD BD AD CD D ( ) ij c d n ABCD ABC ABD AB ( ) ik b d n ABCD ABC ACD AC ( ) il b c n ABCD ABD ACD AD ( ) jk a d n ABCD ABC BCD BC ( ) jl a c n ABCD ABD BCD BD ( ) kl a b n ABCD ACD BCD CD ( ) ijk d n ABCD ABC ( ) ijl c n ABCD ABD ( ) jkl a n ABCD BCD ( ) ikl b n ABCD ACD ( ) ijkl n ABCD ( ijkl) h No exact tests exist on main effects or two-factor interactions. For main effects use statistics such as: AF : MS MS MS MS MS MS MS MS A ABC ABD ACD AB AC AD ABCD For testing two-factor interactions use statistics such as: AB: F MS MS AB ABC MS MS ABCD ABD The results can also be generated in Minitab as follows: -3

440 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A random H L B random H L C random H L D random H L Analysis of Variance for y Source DF SS MS F P A ** B.3.3 ** C x D.3.3 ** A*B x A*C x A*D x B*C x B*D x C*D x A*B*C A*B*D A*C*D B*C*D A*B*C*D Error Total x Not an exact F-test. ** Denominator of F-test is zero. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) A.75 * (6) + (5) + 4(3) + 4() + 4() + 8(7) + 8(6) + 8(5) + 6() B.375 * (6) + (5) + 4(4) + 4() + 4() + 8(9) + 8(8) + 8(5) + 6() 3 C -.5 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(8) + 8(6) + 6(3) 4 D.375 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(9) + 8(7) + 6(4) 5 A*B -3.5 * (6) + (5) + 4() + 4() + 8(5) 6 A*C. * (6) + (5) + 4(3) + 4() + 8(6) 7 A*D -3.5 * (6) + (5) + 4(3) + 4() + 8(7) 8 B*C. * (6) + (5) + 4(4) + 4() + 8(8) 9 B*D -3.5 * (6) + (5) + 4(4) + 4() + 8(9) C*D. * (6) + (5) + 4(4) + 4(3) + 8() A*B*C. 5 (6) + (5) + 4() A*B*D (6) + (5) + 4() 3 A*C*D. 5 (6) + (5) + 4(3) 4 B*C*D. 5 (6) + (5) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS A.56 * (5) + (6) + (7) - () - () - (3) + (5) B.56 * (5) + (8) + (9) - () - () - (4) + (5) 3 C (6) + (8) + () - () - (3) - (4) + (5) 4 D.56 * (7) + (9) + () - () - (3) - (4) + (5) 5 A*B () + () - (5) 6 A*C () + (3) - (5) 7 A*D () + (3) - (5) -4

441 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 8 B*C () + (4) - (5) 9 B*D () + (4) - (5) C*D (3) + (4) - (5) (c) A is fixed and B, C, and D are random. F R R R R a b c d n Factor i j k l h E(MS) i b c d n ABCD ACD ABD ABC AD AC AB A j a c d n BCD ABD BC B k a b d n BCD BC CD C l a b c n BCD BDCD D ( ) ij c d n ABCD ABC ABD AB ( ) ik b d n ABCD ABC ACD AC ( ) il b c n ABCD ABD ACD AD ( ) jk a d n BCD BC ( ) jl a c n BCD BD ( ) kl a b n BCD CD ( ) ijk d n ABCD ABC ( ) ijl c n ABCD ABD ( ) jkl a n BCD ( ) ikl b n ABCD ACD ( ) ijkl n ABCD ( ijkl) h No exact tests exist on main effects or two-factor interactions involving the fixed factor A. To test the fixed factor A use AF : MS MS MS MS MS MS MS MS A ABC ABD ACD AB AC AD ABCD Random main effects could be tested by, for example: DF : MS MS D ABC MS MS ABCD ABD For testing two-factor interactions involving A use: AB: F MS MS AB ABC MS MS ABCD ABD The results can also be generated in Minitab as follows: Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B random H L C random H L D random H L Analysis of Variance for y Source DF SS MS F P A ** B x -5

442 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY C x D x A*B x A*C x A*D x B*C B*D C*D A*B*C A*B*D A*C*D B*C*D A*B*C*D Error Total x Not an exact F-test. ** Denominator of F-test is zero. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) A * (6) + (5) + 4(3) + 4() + 4() + 8(7) + 8(6) + 8(5) + 6Q[] B * (6) + 4(4) + 8(9) + 8(8) + 6() 3 C -.5 * (6) + 4(4) + 8() + 8(8) + 6(3) 4 D * (6) + 4(4) + 8() + 8(9) + 6(4) 5 A*B -3.5 * (6) + (5) + 4() + 4() + 8(5) 6 A*C. * (6) + (5) + 4(3) + 4() + 8(6) 7 A*D -3.5 * (6) + (5) + 4(3) + 4() + 8(7) 8 B*C. 4 (6) + 4(4) + 8(8) 9 B*D. 4 (6) + 4(4) + 8(9) C*D. 4 (6) + 4(4) + 8() A*B*C. 5 (6) + (5) + 4() A*B*D (6) + (5) + 4() 3 A*C*D. 5 (6) + (5) + 4(3) 4 B*C*D (6) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS A.56 * (5) + (6) + (7) - () - () - (3) + (5) B (8) + (9) - (4) 3 C (8) + () - (4) 4 D (9) + () - (4) 5 A*B () + () - (5) 6 A*C () + (3) - (5) 7 A*D () + (3) - (5) (d) A and B are fixed and C and D are random. F F R R R a b c d n Factor i j k l h E(MS) i b c d n ACD AD AC A j a c d n BCD BC BD B k a b d n CD C l a b c n CD D ( ) ij c d n ( ) ik b d n ACD AC ( ) il b c n ACD AD ABCD ABC ABD AB -6

443 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ( ) jk a d n BCD BC ( ) jl a c n BCD BD ( ) kl a b n CD ( ) ijk d n ABCD ABC ( ) ijl c n ABCD ABD ( ) jkl a n BCD ( ) ikl b n ACD ( ) ijkl n ABCD ( ijkl) h There are no exact tests on the fixed factors A and B, or their two-factor interaction AB. The appropriate test statistics are: AF : BF : AB: F MS MS MS MS MS MS A AC B BC AB MS MS MS MS ABC ACD AD BCD MS MS BD ABCD ABD The results can also be generated in Minitab as follows: Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B fixed H L C random H L D random H L Analysis of Variance for y Source DF SS MS F P A x B x C D A*B x A*C A*D B*C B*D C*D A*B*C A*B*D A*C*D B*C*D A*B*C*D Error Total x Not an exact F-test. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) A * (6) + 4(3) + 8(7) + 8(6) + 6Q[] B * (6) + 4(4) + 8(9) + 8(8) + 6Q[] 3 C -.5 (6) + 8() + 6(3) -7

444 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4 D (6) + 8() + 6(4) 5 A*B * (6) + (5) + 4() + 4() + 8Q[5] 6 A*C. 3 (6) + 4(3) + 8(6) 7 A*D. 3 (6) + 4(3) + 8(7) 8 B*C. 4 (6) + 4(4) + 8(8) 9 B*D. 4 (6) + 4(4) + 8(9) C*D (6) + 8() A*B*C. 5 (6) + (5) + 4() A*B*D (6) + (5) + 4() 3 A*C*D (6) + 4(3) 4 B*C*D (6) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS A (6) + (7) - (3) B (8) + (9) - (4) 5 A*B () + () - (5) (e) A, B and C are fixed and D is random. F F F R R a b c d n Factor i j k l h E(MS) i b c d n AD A j a c d n BD B k a b d n CD C l a b c n D ( ) ij c d n ABD AB ( ) ik b d n ACD AC ( ) il b c n AD ( ) jk a d n BCD BC ( ) jl a c n BD ( ) kl a b n CD ( ) ijk d n ABCD ABC ( ) ijl c n ABD ( ) jkl a n BCD ( ) ikl b n ACD ( ) ijkl n ABCD ( ijkl) h There are exact tests for all effects. The results can also be generated in Minitab as follows: Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B fixed H L C fixed H L D random H L Analysis of Variance for y Source DF SS MS F P -8

445 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A B C D A*B A*C A*D B*C B*D C*D A*B*C A*B*D A*C*D B*C*D A*B*C*D Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) A 7 (6) + 8(7) + 6Q[] B 9 (6) + 8(9) + 6Q[] 3 C (6) + 8() + 6Q[3] 4 D (6) + 6(4) 5 A*B (6) + 4() + 8Q[5] 6 A*C 3 (6) + 4(3) + 8Q[6] 7 A*D (6) + 8(7) 8 B*C 4 (6) + 4(4) + 8Q[8] 9 B*D (6) + 8(9) C*D (6) + 8() A*B*C 5 (6) + (5) + 4Q[] A*B*D (6) + 4() 3 A*C*D (6) + 4(3) 4 B*C*D (6) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) - Reconsider cases (c), (d) and (e) of Problem -9. Obtain the expected mean squares assuming the unrestricted model. You may use a computer package such as Minitab. Compare your results with those for the restricted model. A is fixed and B, C, and D are random. Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B random H L C random H L D random H L Analysis of Variance for y Source DF SS MS F P A ** B.3.3 ** C x D.3.3 ** A*B x A*C x A*D x B*C x B*D x C*D x A*B*C A*B*D

446 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A*C*D B*C*D A*B*C*D Error Total x Not an exact F-test. ** Denominator of F-test is zero. Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) A * (6) + (5) + 4(3) + 4() + 4() + 8(7) + 8(6) + 8(5) + Q[] B.375 * (6) + (5) + 4(4) + 4() + 4() + 8(9) + 8(8) + 8(5) + 6() 3 C -.5 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(8) + 8(6) + 6(3) 4 D.375 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(9) + 8(7) + 6(4) 5 A*B -3.5 * (6) + (5) + 4() + 4() + 8(5) 6 A*C. * (6) + (5) + 4(3) + 4() + 8(6) 7 A*D -3.5 * (6) + (5) + 4(3) + 4() + 8(7) 8 B*C. * (6) + (5) + 4(4) + 4() + 8(8) 9 B*D -3.5 * (6) + (5) + 4(4) + 4() + 8(9) C*D. * (6) + (5) + 4(4) + 4(3) + 8() A*B*C. 5 (6) + (5) + 4() A*B*D (6) + (5) + 4() 3 A*C*D. 5 (6) + (5) + 4(3) 4 B*C*D. 5 (6) + (5) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS A.56 * (5) + (6) + (7) - () - () - (3) + (5) B.56 * (5) + (8) + (9) - () - () - (4) + (5) 3 C (6) + (8) + () - () - (3) - (4) + (5) 4 D.56 * (7) + (9) + () - () - (3) - (4) + (5) 5 A*B () + () - (5) 6 A*C () + (3) - (5) 7 A*D () + (3) - (5) 8 B*C () + (4) - (5) 9 B*D () + (4) - (5) C*D (3) + (4) - (5) A and B are fixed and C and D are random. Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B fixed H L C random H L D random H L Analysis of Variance for y Source DF SS MS F P A x B x C x D.3.3 ** A*B x A*C x A*D x -3

447 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY B*C x B*D x C*D x A*B*C A*B*D A*C*D B*C*D A*B*C*D Error Total x Not an exact F-test. ** Denominator of F-test is zero. Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) A * (6) + (5) + 4(3) + 4() + 4() + 8(7) + 8(6) + Q[,5] B * (6) + (5) + 4(4) + 4() + 4() + 8(9) + 8(8) + Q[,5] 3 C -.5 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(8) + 8(6) + 6(3) 4 D.375 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(9) + 8(7) + 6(4) 5 A*B * (6) + (5) + 4() + 4() + Q[5] 6 A*C. * (6) + (5) + 4(3) + 4() + 8(6) 7 A*D -3.5 * (6) + (5) + 4(3) + 4() + 8(7) 8 B*C. * (6) + (5) + 4(4) + 4() + 8(8) 9 B*D -3.5 * (6) + (5) + 4(4) + 4() + 8(9) C*D. * (6) + (5) + 4(4) + 4(3) + 8() A*B*C. 5 (6) + (5) + 4() A*B*D (6) + (5) + 4() 3 A*C*D. 5 (6) + (5) + 4(3) 4 B*C*D. 5 (6) + (5) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS A (6) + (7) - (3) B (8) + (9) - (4) 3 C (6) + (8) + () - () - (3) - (4) + (5) 4 D.56 * (7) + (9) + () - () - (3) - (4) + (5) 5 A*B () + () - (5) 6 A*C () + (3) - (5) 7 A*D () + (3) - (5) 8 B*C () + (4) - (5) 9 B*D () + (4) - (5) C*D (3) + (4) - (5) (e) A, B and C are fixed and D is random. Minitab Output ANOVA: y versus A, B, C, D Factor Type Levels Values A fixed H L B fixed H L C fixed H L D random H L Analysis of Variance for y Source DF SS MS F P A

448 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY B C D.3.3 ** A*B A*C A*D x B*C B*D x C*D x A*B*C A*B*D A*C*D B*C*D A*B*C*D Error Total x Not an exact F-test. ** Denominator of F-test is zero. Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) A 7 (6) + (5) + 4(3) + 4() + 8(7) + Q[,5,6,] B 9 (6) + (5) + 4(4) + 4() + 8(9) + Q[,5,8,] 3 C (6) + (5) + 4(4) + 4(3) + 8() + Q[3,6,8,] 4 D.375 * (6) + (5) + 4(4) + 4(3) + 4() + 8() + 8(9) + 8(7) + 6(4) 5 A*B (6) + (5) + 4() + Q[5,] 6 A*C 3 (6) + (5) + 4(3) + Q[6,] 7 A*D -3.5 * (6) + (5) + 4(3) + 4() + 8(7) 8 B*C 4 (6) + (5) + 4(4) + Q[8,] 9 B*D -3.5 * (6) + (5) + 4(4) + 4() + 8(9) C*D. * (6) + (5) + 4(4) + 4(3) + 8() A*B*C 5 (6) + (5) + Q[] A*B*D (6) + (5) + 4() 3 A*C*D. 5 (6) + (5) + 4(3) 4 B*C*D. 5 (6) + (5) + 4(4) 5 A*B*C*D (6) + (5) 6 Error.375 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS 4 D.56 * (7) + (9) + () - () - (3) - (4) + (5) 7 A*D () + (3) - (5) 9 B*D () + (4) - (5) C*D (3) + (4) - (5) - In Problem 5-7, assume that the three operators were selected at random. Analyze the data under these conditions and draw conclusions. Estimate the variance components. Minitab Output ANOVA: Score versus Cycle Time, Operator, Temperature Factor Type Levels Values Cycle Ti fixed Operator random 3 3 Temperat fixed 3 35 Analysis of Variance for Score Source DF SS MS F P Cycle Ti Operator Temperat

449 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Cycle Ti*Operator Cycle Ti*Temperat Operator*Temperat Cycle Ti*Operator*Temperat Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Cycle Ti 4 (8) + 6(4) + 8Q[] Operator (8) + 8() 3 Temperat 6 (8) + 9(6) + 7Q[3] 4 Cycle Ti*Operator (8) + 6(4) 5 Cycle Ti*Temperat 7 (8) + 3(7) + 9Q[5] 6 Operator*Temperat.63 8 (8) + 9(6) 7 Cycle Ti*Operator*Temperat (8) + 3(7) 8 Error (8) The following calculations agree with the Minitab results: MS MS MS ABC BC MS E ˆ MS n MS an AC E MS bn MSC MS abn E E E ˆ ˆ ˆ ˆ Consider the three-factor model yijk i j k ijk ij jk Assuming that all the factors are random, develop the analysis of variance table, including the expected mean squares. Propose appropriate test statistics for all effects. Source DF E(MS) A a- c bc B b- c a ac C c- a ab AB (a-)(b-) c BC (b-)(c-) a Error (AC + ABC) b(a-)(c-) Total abc- There are exact tests for all effects except B. To test B, use the statistic F MS MS B AB MS MS E BC -33

450 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -3 The three-factor model for a single replicate is ( ) ( ) ( ) ( ) y ijk i j k ij jk ik ijk ijk If all the factors are random, can any effects be tested? If the three-factor interaction and the ( ) ij interaction do not exist, can all the remaining effects be tested. The expected mean squares are found by referring to Table -9, deleting the line for the error term ( ijk ) l and setting n=. The three-factor interaction now cannot be tested; however, exact tests exist for the twofactor interactions and approximate F tests can be conducted for the main effects. For example, to test the main effect of A, use F MS MS A AB MS MS ABC AC If ( ) ijk and ( ) ij can be eliminated, the model becomes yijk i j k ijk ij jk ik ijk For this model, the analysis of variance is Source DF E(MS) A a- b bc B b- a ac C c- a b ab AC (a-)(c-) b BC (b-)(c-) a Error (AB + ABC) c(a-)(b-) Total abc- There are exact tests for all effect except C. To test the main effect of C, use the statistic: F MS MS C BC MS MS E AC -4 In Problem 5-6, assume that both machines and operators were chosen randomly. Determine the power of the test for detecting a machine effect such that, where is the variance component for the machine factor. Are two replicates sufficient? an n If, then an estimate of 379., and an estimate of n 745., from the analysis of variance table. Then -34

451 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY and the other OC curve parameters are 3 and 6. This results in 75. approximately, with 5., or 9. with.. Two replicates does not seem sufficient. -5 In the two-factor mixed model analysis of variance, show that Cov ii'. a Since (constant) we have i ij a i V Cov, a ij a a! a Cov ij, i' a! a! a aa Cov ij, i' j Cov, ij i' j a a ij, i' j a V ij, which implies that i ij i' j j for -6 Show that the method of analysis of variance always produces unbiased point estimates of the variance component in any random or mixed model. Let g be the vector of mean squares from the analysis of variance, chosen so that E(g) does not contain any fixed effects. Let be the vector of variance components such that E( g) A, where A is a matrix of constants. Now in the analysis of variance method of variance component estimation, we equate observed and expected mean squares, i.e. Since - A always exists then, E g = As ˆ s A s = EA g A Eg= A As s Thus is an unbiased estimator of. This and other properties of the analysis of variance method are discussed by Searle (97a). - g -7 Invoking the usual normality assumptions, find an expression for the probability that a negative estimate of a variance component will be obtained by the analysis of variance method. Using this result, write a statement giving the probability that in a one-factor analysis of variance. Comment on the usefulness of this probability statement. -35

452 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Suppose MS MS, where MS i for i=, are two mean squares and c is a constant. The c probability that ˆ (negative) is MS MS E MS E MS EMS P ˆ P MS MS P P P Fu, v MS MS EMS EMS EMS where u is the number of degrees of freedom for MS and v is the number of degrees of freedom for MS. For the one-way model, this equation reduces to P PF P F nk a, Na a, N a n where k. Using arbitrary values for some of the parameters in this equation will give an experimenter some idea of the probability of obtaining a negative estimate of ˆ. -8 Analyze the data in Problem -9, assuming that the operators are fixed, using both the unrestricted and restricted forms of the mixed models. Compare the results obtained from the two models. The restricted model is as follows: Minitab Output ANOVA: Measurement versus Part, Operator Factor Type Levels Values Part random Operator fixed Analysis of Variance for Measurem Source DF SS MS F P Part Operator Part*Operator Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Part (4) + 6() Operator 3 (4) + 3(3) + 3Q[] 3 Part*Operator (4) + 3(3) 4 Error.5 (4) The second approach is the unrestricted mixed model. Minitab Output ANOVA: Measurement versus Part, Operator Factor Type Levels Values Part random Operator fixed -36

453 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Analysis of Variance for Measurem Source DF SS MS F P Part Operator Part*Operator Error Total Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) Part (4) + 3(3) + 6() Operator 3 (4) + 3(3) + Q[] 3 Part*Operator (4) + 3(3) 4 Error.5 (4) Source Sum of Squares DF Mean Square A a-= E(MS) F-test F i i n bn a a MS A F.69 MS AB B b-=9.85 n an AB (a-)(b-)=9.685 n Error Total nabc-=59 MS B F 8.8 MS AB MS AB F.4 MS E In the unrestricted model, the F-test for B is different. The F-test for B in the unrestricted model should generally be more conservative, since MS AB will generally be larger than MS E. However, this is not the case with this particular experiment. -9 Consider the two-factor mixed model. Show that the standard error of the fixed factor mean (e.g. A) is / bn MS AB. The standard error is often used in Duncan s Multiple Range test. Duncan s Multiple Range Test requires the variance of the difference in two means, say V y i y.. where rows are fixed and columns are random. Now, assuming all model parameters to be independent, we have the following: b b yi.. ym.. i m ij mj ijk j b b j m.. bn b n j k bn b n j k mjk and V yi.. ym.. n b bn b b b bn bn bn bn -37

454 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Since MS AB estimates n, we would use MS AB bn as the standard error to test the difference. However, the table of ranges for Duncan s Multiple Range test already include the constant. -3 Consider the variance components in the random model from Problem -9. (a) Find an exact 95 percent confidence interval on. femse fems, f E, f E (b) Find approximate 95 percent confidence intervals on the other variance components using the Satterthwaite method. and are negative, and the Satterthwaithe method does not apply. The confidence interval on is r MS MS B MS B B MS an AB MS MS b a b AB AB r ˆ E r O, r, r Use the experiment described in Problem 5-6 and assume that both factor are random. Find an exact 95 percent confidence interval on. Construct approximate 95 percent confidence interval on the other variance components using the Satterthwaite method. MS E femse fems, f E, f E E

455 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Satterthwaite Method: r MS MS MS AB AB AB a b MS E n MS E MS df r E E r, r, r , this variance component does not have a confidence interval using Satterthwaite s Method. r MS MS MS A A A MS bn MS AB MS a a b AB AB r ˆ r, r, r 3 (. 648)( 9. 98) (. 648)( 9. 98) Consider the three-factor experiment in Problem 5-7 and assume that operators were selected at random. Find an approximate 95 percent confidence interval on the operator variance component. MSC MSE abn MSC MSE r MSC MS c df E E r ˆ r, r, r

456 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY -33 Rework Problem -3 using the modified large-sample approach described in Section -7.. Compare the two sets of confidence intervals obtained and discuss. G G F. 5, 9,. H F. 95, 9i,. 95, 9 9 ij V V V L L L MS MS an B AB O ˆ O F, f, f G F, f, f H G i j c MS F, fi, f j i j B H 6 c MS AB G c c MS B MS 6 AB L V L Rework Problem -3 using the modified large-sample method described in Section -7.. Compare this confidence interval with he one obtained previously and discuss. G F H F G V V V ij L L L. 95, 36, 3,.,. 95, MSC MS abn E ˆ F, f, f G F, f, f H G i c j MS B F, fi, f j i j H 8 c MS AB G c c MS B MS L V L AB

457 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 3 Nested and Split-Plot Designs Solutions In this chapter we have not shown residual plots and other diagnostics to conserve space. A complete analysis would, of course, include these model adequacy checking procedures. 3- A rocket propellant manufacturer is studying the burning rate of propellant from three production processes. Four batches of propellant are randomly selected from the output of each process and three determinations of burning rate are made on each batch. The results follow. Analyze the data and draw conclusions. Process Process Process 3 Batch Minitab Output ANOVA: Burn Rate versus Process, Batch Factor Type Levels Values Process fixed 3 3 Batch(Process) random Analysis of Variance for Burn Rat Source DF SS MS F P Process Batch(Process) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Process (3) + 3() + Q[] Batch(Process) (3) + 3() 3 Error 8.9 (3) There is no significant effect on mean burning rate among the different processes; however, different batches from the same process have significantly different burning rates. 3- The surface finish of metal parts made on four machines is being studied. An experiment is conducted in which each machine is run by three different operators and two specimens from each operator are collected and tested. Because of the location of the machines, different operators are used on each machine, and the operators are chosen at random. The data are shown in the following table. Analyze the data and draw conclusions. Machine Machine Machine 3 Machine 4 Operator Minitab Output ANOVA: Finish versus Machine, Operator 3-

458 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Type Levels Values Machine fixed Operator(Machine) random 3 3 Analysis of Variance for Finish Source DF SS MS F P Machine Operator(Machine) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Machine (3) + () + 6Q[] Operator(Machine) (3) + () 3 Error 84.5 (3) There is a slight effect on surface finish due to the different processes; however, the different operators running the same machine have significantly different surface finish. 3-3 A manufacturing engineer is studying the dimensional variability of a particular component that is produced on three machines. Each machine has two spindles, and four components are randomly selected from each spindle. These results follow. Analyze the data, assuming that machines and spindles are fixed factors. Machine Machine Machine 3 Spindle Minitab Output ANOVA: Variability versus Machine, Spindle Factor Type Levels Values Machine fixed 3 3 Spindle(Machine) fixed Analysis of Variance for Variabil Source DF SS MS F P Machine Spindle(Machine) Error Total 3 6. There is a significant effects on dimensional variability due to the machine and spindle factors. 3-4 To simplify production scheduling, an industrial engineer is studying the possibility of assigning one time standard to a particular class of jobs, believing that differences between jobs is negligible. To see if this simplification is possible, six jobs are randomly selected. Each job is given to a different group of three operators. Each operator completes the job twice at different times during the week, and the following results were obtained. What are your conclusions about the use of a common time standard for all jobs in this class? What value would you use for the standard? Job Operator Operator Operator

459 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Minitab Output ANOVA: Time versus Job, Operator Factor Type Levels Values Job random Operator(Job) random 3 3 Analysis of Variance for Time Source DF SS MS F P Job Operator(Job) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Job 4.76 (3) + () + 6() Operator(Job) (3) + () 3 Error.539 (3) The jobs differ significantly; the use of a common time standard would likely not be a good idea. 3-5 Consider the three-stage nested design shown in Figure 3-5 to investigate alloy hardness. Using the data that follow, analyze the design, assuming that alloy chemistry and heats are fixed factors and ingots are random. Alloy Chemistry Heats 3 3 Ingots Minitab Output ANOVA: Hardness versus Alloy, Heat, Ingot Factor Type Levels Values Alloy fixed Heat(Alloy) fixed 3 3 Ingot(Alloy Heat) random Analysis of Variance for Hardness Source DF SS MS F P Alloy Heat(Alloy) Ingot(Alloy Heat) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Alloy 3 (4) + (3) + Q[] Heat(Alloy) 3 (4) + (3) + 4Q[] 3 Ingot(Alloy Heat) (4) + (3) 4 Error (4) 3-3

460 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Alloy hardness differs significantly due to the different heats within each alloy. 3-6 Reanalyze the experiment in Problem 3-5 using the unrestricted form of the mixed model. Comment on any differences you observe between the restricted and unrestricted model results. You may use a computer software package. Minitab Output ANOVA: Hardness versus Alloy, Heat, Ingot Factor Type Levels Values Alloy fixed Heat(Alloy) fixed 3 3 Ingot(Alloy Heat) random Analysis of Variance for Hardness Source DF SS MS F P Alloy Heat(Alloy) Ingot(Alloy Heat) Error Total Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) Alloy 3 (4) + (3) + Q[,] Heat(Alloy) 3 (4) + (3) + Q[] 3 Ingot(Alloy Heat) (4) + (3) 4 Error (4) 3-7 Derive the expected means squares for a balanced three-stage nested design, assuming that A is fixed and that B and C are random. Obtain formulas for estimating the variance components. F R R R a b c n Factor i j k l E(MS) i b c n n ji cn () c n n cn k ij ( ) n n ( ijk) l bcn a i MS E ˆ MSC B MS n E ˆ MS BA MSCB cn The expected mean squares can be generated in Minitab as follows: Minitab Output ANOVA: y versus A, B, C Factor Type Levels Values A fixed - B(A) random - C(A B) random - Analysis of Variance for y Source DF SS MS F P 3-4

461 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A B(A) C(A B) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) A (4) + (3) + 4() + 8Q[] B(A) -. 3 (4) + (3) + 4() 3 C(A B) (4) + (3) 4 Error 5.75 (4) 3-8 Repeat Problem 3-7 assuming the unrestricted form of the mixed model. You may use a computer software package. Comment on any differences you observe between the restricted and unrestricted model analysis and conclusions. Minitab Output ANOVA: y versus A, B, C Factor Type Levels Values A fixed - B(A) random - C(A B) random - Analysis of Variance for y Source DF SS MS F P A B(A) C(A B) Error Total Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) A (4) + (3) + 4() + Q[] B(A) -. 3 (4) + (3) + 4() 3 C(A B) (4) + (3) 4 Error 5.75 (4) In this case there is no difference in results between the restricted and unrestricted models. 3-9 Derive the expected means squares for a balanced three-stage nested design if all three factors are random. Obtain formulas for estimating the variance components. Assume the restricted form of the mixed model. R R R R a b c n Factor i j k l E(MS) i b c n n cn bcn ji () c n n cn k ij ( ) n n ( ijk) l MS E MS C( B) n MS E MS MS B( A) C( B) cn MS A MS bcn B( A) The expected mean squares can be generated in Minitab as follows: 3-5

462 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Minitab Output ANOVA: y versus A, B, C Factor Type Levels Values A random - B(A) random - C(A B) random - Analysis of Variance for y Source DF SS MS F P A B(A) C(A B) Error Total Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) A -.5 (4) + (3) + 4() + 8() B(A) -. 3 (4) + (3) + 4() 3 C(A B) (4) + (3) 4 Error 5.75 (4) 3- Verify the expected mean squares given in Table 3-. F F R a b n Factor i j l E(MS) i b n bn a j i n n ab ijkl i j i R R R a b n Factor i j l E(MS) i b n n bn j i n n ijkl F R R a b n Factor i j l E(MS) i b n n j i n n ijkl bn a i 3- Unbalanced designs. Consider an unbalanced two-stage nested design with b j levels of B under the ith level of A and n ij replicates in the ijth cell. 3-6

463 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY (a) Write down the least squares normal equations for this situation. Solve the normal equations. The least squares normal equations are: n.. ˆ a i n ˆ i. i b i a b i i j nij ˆ j i y i ni. ˆ ni. ˆ i n ˆ ij j i yi.., for i,,..., a j j i nij ˆ nij ˆ ˆ i nij j i yij., for i,,..., a and j,,..., b i There are +a+b equations in +a+b unknowns. However, there are a+linear dependencies in these equations, and consequently, a+ side conditions are needed to solve them. Any convenient set of a+ linearly independent equations can be used. The easiest set is, i, for i=,,,a. Using these conditions we get, i,... ji () y ij. as the solution to the normal equations. See Searle (97) for a full discussion. (b) Construct the analysis of variance table for the unbalanced two-stage nested design. The analysis of variance table is Source SS DF A a yi.. y... n n a- B Error Total a a i i. b i i j b n i ij i j k y n y ij. ij ijk a.. y n i.. i i. a b i i j a b n y... i ij yijk n i j k.. (c) Analyze the following data, using the results in part (b). Factor A Factor B y n ij. ij b.-a n..-b n..- Note that a=, b =, b =3, b.=b +b =5, n =3, n =, n =4, n =3 and n 3 =3 Source SS DF MS 3-7

464 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY A.3.3 B Error Total The analysis can also be performed in Minitab as follows. The adjusted sum of squares is utilized by Minitab s general linear model routine. Minitab Output General Linear Model: y versus A, B Factor Type Levels Values A fixed B(A) fixed 5 3 Analysis of Variance for y, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P A B(A) Error Total Variance components in the unbalanced two-stage nested design. Consider the model yijk i j i kij i,,...,a j,,...,b k,,...,n ij where A and B are random factors. Show that E E MS E MS A B A c MS E N N a c where a bi nij N n i j i. c b a a bi a nij n i j i. i c a c i i. c j a n b i n ij N 3-8

465 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY See Variance Component Estimation in the -way Nested Classification, by S.R. Searle, Annals of Mathematical Statistics, Vol. 3, pp. 6-66, 96. A good discussion of variance component estimation from unbalanced data is in Searle (97a). 3-3 A process engineer is testing the yield of a product manufactured on three machines. Each machine can be operated at two power settings. Furthermore, a machine has three stations on which the product is formed. An experiment is conducted in which each machine is tested at both power settings, and three observations on yield are taken from each station. The runs are made in random order, and the results follow. Analyze this experiment, assuming all three factors are fixed. Machine Machine Machine 3 Station Power Setting Power Setting y The linear model is ijkl i j kj ijk l ij ik( j ) Minitab Output ANOVA: Yield versus Machine, Power, Station Factor Type Levels Values Machine fixed 3 3 Power fixed Station(Machine) fixed 3 3 Analysis of Variance for Yield Source DF SS MS F P Machine Power Station(Machine) Machine*Power Power*Station(Machine) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Machine 6 (6) + 8Q[] Power 6 (6) + 7Q[] 3 Station(Machine) 6 (6) + 6Q[3] 4 Machine*Power 6 (6) + 9Q[4] 5 Power*Station(Machine) 6 (6) + 3Q[5] 6 Error.636 (6) 3-4 Suppose that in Problem 3-3 a large number of power settings could have been used and that the two selected for the experiment were chosen randomly. Obtain the expected mean squares for this situation and modify the previous analysis appropriately. R F F R Factor i j k l E(MS) i j j 3-9

466 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ( ) ij 3 3 k j 9 ( ) 3 ( ) ik ( j ) 3 ( ijk) l 3 k( j ) The analysis of variance and the expected mean squares can be completed in Minitab as follows: Minitab Output ANOVA: Yield versus Machine, Power, Station Factor Type Levels Values Machine fixed 3 3 Power random Station(Machine) fixed 3 3 Analysis of Variance for Yield 3 Source DF SS MS F P Machine Power Station(Machine) Machine*Power Power*Station(Machine) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Machine 4 (6) + 9(4) + 8Q[] Power (6) + 7() 3 Station(Machine) 5 (6) + 3(5) + 6Q[3] 4 Machine*Power (6) + 9(4) 5 Power*Station(Machine).65 6 (6) + 3(5) 6 Error.6359 (6) 3-5 Reanalyze the experiment in Problem 3-4 assuming the unrestricted form of the mixed model. You may use a computer software program to do this. Comment on any differences between the restricted and unrestricted model analysis and conclusions. ANOVA: Yield versus Machine, Power, Station Factor Type Levels Values Machine fixed 3 3 Power random Station(Machine) fixed 3 3 Analysis of Variance for Yield Source DF SS MS F P Machine Power Station(Machine) Machine*Power Power*Station(Machine) Error Total Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) Machine 4 (6) + 3(5) + 9(4) + Q[,3] Power (6) + 3(5) + 9(4) + 7() 3 Station(Machine) 5 (6) + 3(5) + Q[3] 4 Machine*Power (6) + 3(5) + 9(4) 5 Power*Station(Machine).65 6 (6) + 3(5) 6 Error.6359 (6) 3-

467 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY There are differences between several of the expected mean squares. However, the conclusions that could be drawn do not differ in any meaningful way from the restricted model analysis. 3-6 A structural engineer is studying the strength of aluminum alloy purchased from three vendors. Each vendor submits the alloy in standard-sized bars of.,.5, or. inches. The processing of different sizes of bar stock from a common ingot involves different forging techniques, and so this factor may be important. Furthermore, the bar stock if forged from ingots made in different heats. Each vendor submits two tests specimens of each size bar stock from the three heats. The resulting strength data follow. Analyze the data, assuming that vendors and bar size are fixed and heats are random. Vendor Vendor Vendor 3 Heat Bar Size: inch / inch inch y kj ( ik j ijk l ijkl i j ij ) Minitab Output ANOVA: Strength versus Vendor, Bar Size, Heat Factor Type Levels Values Vendor fixed 3 3 Heat(Vendor) random 3 3 Bar Size fixed Analysis of Variance for Strength Source DF SS MS F P Vendor Heat(Vendor) Bar Size Vendor*Bar Size Bar Size*Heat(Vendor) Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Vendor (6) + 6() + 8Q[] Heat(Vendor).7 6 (6) + 6() 3 Bar Size 5 (6) + (5) + 8Q[3] 4 Vendor*Bar Size 5 (6) + (5) + 6Q[4] 5 Bar Size*Heat(Vendor).6 6 (6) + (5) 6 Error.4 (6) 3-7 Reanalyze the experiment in Problem 3-6 assuming the unrestricted form of the mixed model. You may use a computer software program to do this. Comment on any differences between the restricted and unrestricted model analysis and conclusions. Minitab Output ANOVA: Strength versus Vendor, Bar Size, Heat Factor Type Levels Values Vendor fixed 3 3 Heat(Vendor) random

468 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Bar Size fixed Analysis of Variance for Strength Source DF SS MS F P Vendor Heat(Vendor) Bar Size Vendor*Bar Size Bar Size*Heat(Vendor) Error Total Source Variance Error Expected Mean Square for Each Term component term (using unrestricted model) Vendor (6) + (5) + 6() + Q[,4] Heat(Vendor).63 5 (6) + (5) + 6() 3 Bar Size 5 (6) + (5) + Q[3,4] 4 Vendor*Bar Size 5 (6) + (5) + Q[4] 5 Bar Size*Heat(Vendor).6 6 (6) + (5) 6 Error.4 (6) There are some differences in the expected mean squares. However, the conclusions do not differ from those of the restricted model analysis. 3-8 Suppose that in Problem 3-6 the bar stock may be purchased in many sizes and that the three sizes are actually used in experiment were selected randomly. Obtain the expected mean squares for this situation and modify the previous analysis appropriately. Use the restricted form of the mixed model. Minitab Output ANOVA: Strength versus Vendor, Bar Size, Heat Factor Type Levels Values Vendor fixed 3 3 Heat(Vendor) random 3 3 Bar Size random Analysis of Variance for Strength Source DF SS MS F P Vendor x Heat(Vendor) Bar Size Vendor*Bar Size Bar Size*Heat(Vendor) Error Total x Not an exact F-test. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Vendor * (6) + (5) + 6(4) + 6() + 8Q[] Heat(Vendor).63 5 (6) + (5) + 6() 3 Bar Size. 5 (6) + (5) + 8(3) 4 Vendor*Bar Size (6) + (5) + 6(4) 5 Bar Size*Heat(Vendor).6 6 (6) + (5) 6 Error.4 (6) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS Vendor () + (4) - (5) Notice that a Satterthwaite type test is used for vendor. 3-

469 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 3-9 Steel in normalized by heating above the critical temperature, soaking, and then air cooling. This process increases the strength of the steel, refines the grain, and homogenizes the structure. An experiment is performed to determine the effect of temperature and heat treatment time on the strength of normalized steel. Two temperatures and three times are selected. The experiment is performed by heating the oven to a randomly selected temperature and inserting three specimens. After minutes one specimen is removed, after minutes the second specimen is removed, and after 3 minutes the final specimen is removed. Then the temperature is changed to the other level and the process is repeated. Four shifts are required to collect the data, which are shown below. Analyze the data and draw conclusions, assume both factors are fixed. Temperature (F) Shift Time(minutes) This is a split-plot design. Shifts correspond to blocks, temperature is the whole plot treatment, and time is the subtreatments (in the subplot or split-plot part of the design). The expected mean squares and analysis of variance are shown below. R F F R 4 3 Factor i j k l E(MS) i (blocks) 3 6 j (temp) ij 3 3 / j k (time) 4 8 / k ik jk 4 / 3 ijk ijkl (not estimable) jk The following Minitab Output has been modified to display the results of the split-plot analysis. Minitab will calculate the sums of squares correctly, but the expected mean squares and the statistical tests are not, in general, correct. Notice that the Error term in the analysis of variance is actually the three factor interaction. Minitab Output ANOVA: Strength versus Shift, Temperature, Time Factor Type Levels Values Shift random Temperat fixed 5 6 Time fixed 3 3 Analysis of Variance for Strength 3-3

470 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Standard Split Plot Source DF SS MS F P F P Shift Temperat Shift*Temperat Time Shift*Time Temperat*Time Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Shift.9 7 (7) + 6() Temperat 3 (7) + 3(3) + Q[] 3 Shift*Temperat (7) + 3(3) 4 Time 5 (7) + (5) + 8Q[4] 5 Shift*Time (7) + (5) 6 Temperat*Time 7 (7) + 4Q[6] 7 Error (7) 3- An experiment is designed to study pigment dispersion in paint. Four different mixes of a particular pigment are studied. The procedure consists of preparing a particular mix and then applying that mix to a panel by three application methods (brushing, spraying, and rolling). The response measured is the percentage reflectance of the pigment. Three days are required to run the experiment, and the data obtained follow. Analyze the data and draw conclusions, assuming that mixes and application methods are fixed. Mix Day App Method This is a split plot design. Days correspond to blocks, mix is the whole plot treatment, and method is the subtreatment (in the subplot or split plot part of the design). The expected mean squares are: R F F R Factor i j k l E(MS) i (blocks) 4 3 j (temp) ij 3 3 / j 3 k (time) / k ik 4 4 jk 3 / 6 ijk ijkl (not estimable) 3 jk The following Minitab Output has been modified to display the results of the split-plot analysis. Minitab will calculate the sums of squares correctly, but the expected mean squares and the statistical tests are not, 3-4

471 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY in general, correct. Notice that the Error term in the analysis of variance is actually the three factor interaction. Minitab Output ANOVA: Reflectance versus Day, Mix, Method Factor Type Levels Values Day random 3 3 Mix fixed Method fixed 3 3 Analysis of Variance for Reflecta Standard Split Plot Source DF SS MS F P F P Day Mix Day*Mix Method Day*Method Mix*Method Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Day.46 7 (7) + () Mix 3 (7) + 3(3) + 9Q[] 3 Day*Mix (7) + 3(3) 4 Method 5 (7) + 4(5) + Q[4] 5 Day*Method (7) + 4(5) 6 Mix*Method 7 (7) + 3Q[6] 7 Error.733 (7) 3- Repeat Problem 3-, assuming that the mixes are random and the application methods are fixed. The expected mean squares are: R R F R Factor i j k l E(MS) i (blocks) 4 3 j (temp) 3 3 ij k (time) 3 4 ik 4 jk 3 ijk 4 / k 4 3 ijkl (not estimable) The F-tests are the same as those in Problem 3-. The following Minitab Output has been edited to display the results of the split-plot analysis. Minitab will calculate the sums of squares correctly, but the expected mean squares and the statistical tests are not, in general, correct. Again, the Error term in the analysis of variance is actually the three factor interaction. Minitab Output ANOVA: Reflectance versus Day, Mix, Method 3-5

472 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Factor Type Levels Values Day random 3 3 Mix random Method fixed 3 3 Analysis of Variance for Reflecta Standard Split Plot Source DF SS MS F P F P Day Mix Day*Mix Method x 6.6. Day*Method Mix*Method Error Total x Not an exact F-test. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Day. 3 (7) + 3(3) + () Mix.34 3 (7) + 3(3) + 9() 3 Day*Mix.76 7 (7) + 3(3) 4 Method * (7) + 3(6) + 4(5) + Q[4] 5 Day*Method (7) + 4(5) 6 Mix*Method (7) + 3(6) 7 Error.73 (7) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS 4 Method (5) + (6) - (7) 3- Consider the split-split-plot design described in example 3-3. Suppose that this experiment is conducted as described and that the data shown below are obtained. Analyze and draw conclusions. Technician 3 Blocks Dose Strengths Wall Thickness Using the computer output, the F-ratios were calculated by hand using the expected mean squares found in Table 3-8. The following Minitab Output has been edited to display the results of the split-plot analysis. Minitab will calculate the sums of squares correctly, but the expected mean squares and the statistical tests are not, in general, correct. Notice that the Error term in the analysis of variance is actually the four factor interaction. Minitab Output 3-6

473 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ANOVA: Time versus Day, Tech, Dose, Thick Factor Type Levels Values Day random Tech fixed 3 3 Dose fixed 3 3 Thick fixed Analysis of Variance for Time Standard Split Plot Source DF SS MS F P F P Day Tech Day*Tech Dose Day*Dose Tech*Dose Day*Tech*Dose Thick Day*Thick Tech*Thick Day*Tech*Thick Dose*Thick Day*Dose*Thick Tech*Dose*Thick Error Total Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Day (5) + 36() Tech 3 (5) + (3) + 48Q[] 3 Day*Tech.84 5 (5) + (3) 4 Dose 5 (5) + (5) + 48Q[4] 5 Day*Dose (5) + (5) 6 Tech*Dose 7 (5) + 4(7) + 6Q[6] 7 Day*Tech*Dose (5) + 4(7) 8 Thick 9 (5) + 9(9) + 36Q[8] 9 Day*Thick (5) + 9(9) Tech*Thick (5) + 3() + Q[] Day*Tech*Thick.5 5 (5) + 3() Dose*Thick 3 (5) + 3(3) + Q[] 3 Day*Dose*Thick (5) + 3(3) 4 Tech*Dose*Thick 5 (5) + 4Q[4] 5 Error (5) 3-3 Rework Problem 3-, assuming that the dosage strengths are chosen at random. Use the restricted form of the mixed model. The following Minitab Output has been edited to display the results of the split-plot analysis. Minitab will calculate the sums of squares correctly, but the expected mean squares and the statistical tests are not, in general, correct. Again, the Error term in the analysis of variance is actually the four factor interaction. Minitab Output ANOVA: Time versus Day, Tech, Dose, Thick Factor Type Levels Values Day random Tech fixed 3 3 Dose random 3 3 Thick fixed Analysis of Variance for Time Standard Split Plot Source DF SS MS F P F P Day Tech Day*Tech Dose

474 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Day*Dose Tech*Dose Day*Tech*Dose Thick x Day*Thick Tech*Thick x.6.84 Day*Tech*Thick Dose*Thick Day*Dose*Thick Tech*Dose*Thick Error Total x Not an exact F-test. Source Variance Error Expected Mean Square for Each Term component term (using restricted model) Day (5) + (5) + 36() Tech * (5) + 4(7) + 6(6) + (3) + 48Q[] 3 Day*Tech (5) + 4(7) + (3) 4 Dose (5) + (5) + 48(4) 5 Day*Dose.59 5 (5) + (5) 6 Tech*Dose (5) + 4(7) + 6(6) 7 Day*Tech*Dose.78 5 (5) + 4(7) 8 Thick * (5) + 3(3) + () + 9(9) + 36Q[8] 9 Day*Thick (5) + 3(3) + 9(9) Tech*Thick * (5) + 4(4) + 3() + Q[] Day*Tech*Thick.5 5 (5) + 3() Dose*Thick (5) + 3(3) + () 3 Day*Dose*Thick (5) + 3(3) 4 Tech*Dose*Thick (5) + 4(4) 5 Error (5) * Synthesized Test. Error Terms for Synthesized Tests Source Error DF Error MS Synthesis of Error MS Tech (3) + (6) - (7) 8 Thick (9) + () - (3) Tech*Thick () + (4) - (5) The expected mean squares can also be shown as follows: R F R F R Factor i j k h l E(MS) i j ( 48 / ) j ij 3 4 k ik 3 4 jk 4 4 ijk h ih 3 3 jh ijh 3 kh / h / jh

475 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY ikh 3 jkh 4 ijkh 3 4 ijkl There are no exact tests on technicians j, dosage strengths k, wall thickness h, or the technician x wall thickness interaction. The approximate F-tests are as follows: jh H : j = p q MS F MS B AB MS MS ABC BC MS MS MS 6 Do not reject H : j = B B ABC ABC MS MS MS MS AB AB BC BC MS H : k = MS F MS p q C CD MS MS ACD AD MS MS MS C C 6 Reject H : k = H : h = p ACD ACD MS MS MS MS CD CD MS F MS D MS 9 CD MS AD AD MS ACD AD MS MS MS D D 3 ACD ACD MS

476 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY q MS MS MS CD CD 6 MS 9 AD AD Reject H : h = jh H : = MS F MS BD BCD MS MS ABCD ABD jh F<, Do not reject H : = 3-4 Suppose that in Problem 3- four technicians had been used. Assuming that all the factors are fixed, how many blocks should be run to obtain an adequate number of degrees of freedom on the test for differences among technicians? The number of degrees of freedom for the test is (a-)(4-)=3(a-), where a is the number of blocks used. Number of Blocks (a) DF for test At least three blocks should be run, but four would give a better test. 3-5 Consider the experiment described in Example 3-3. Demonstrate how the order in which the treatments combinations are run would be determined if this experiment were run as (a) a split-split-plot, (b) a split-plot, (c) a factorial design in a randomized block, and (d) a completely randomized factorial design. (a) Randomization for the split-split plot design is described in Example 3-3. (b) In the split-plot, within a block, the technicians would be the main treatment and within a blocktechnician plot, the combinations of dosage strength and wall thickness would be run in random order. The design would be a two-factor factorial in a split-plot. (c) To run the design in a randomized block, the 36 combinations of technician, dosage strength, and wall thickness would be ran in random order within each block. The design would be a three factor factorial in a randomized block. (d) The blocks would be considered as replicates, and all 44 observations would be 4 replicates of a three factor factorial. 3-

477 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Chapter 4 Other Design and Analysis Topics Solutions 4- Reconsider the experiment in Problem 5-. Use the Box-Cox procedure to determine if a transformation on the response is appropriate (or useful) in the analysis of the data from this experiment. DESIGN-EXPERT Plot Crack Growth Lambda Current = Best =. Low C.I. = -.44 High C.I. =.56 Recommend transform: Log (Lambda = ) Ln(ResidualSS) Box-Cox Plot for Power Transforms Lambda With the value of lambda near zero, and since the confidence interval does not include one, a natural log transformation would be appropriate. 4- In example 6-3 we selected a log transformation for the drill advance rate response. Use the Box- Cox procedure to demonstrate that this is an appropriate data transformation. 4-

478 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Advance Rate Lambda Current = Best = -.3 Low C.I. = -.79 High C.I. =.3 Recommend transform: Log (Lambda = ) Ln(ResidualSS) Box-Cox Plot for Power Transforms Lambda Because the value of lambda is very close to zero, and the confidence interval does not include one, the natural log was the correct transformation chosen for this analysis. 4-3 Reconsider the smelting process experiment in Problem 8-3, where a 6-3 fractional factorial design was used to study the weight of packing material stuck to carbon anodes after baking. Each of the eight runs in the design was replicated three times and both the average weight and the range of the weights at each test combination were treated as response variables. Is there any indication that that a transformation is required for either response? DESIGN-EXPERT Plot Weight Box-Cox Plot for Power Transforms DESIGN-EXPERT Plot Range Box-Cox Plot for Power Transforms Lambda Current = Best =.33 Low C.I. = -.7 High C.I. = Lambda Cu rre n t = Best =.58 Low C.I. = -.74 High C.I. = Recommend transform: None (Lambda = ) Ln(ResidualSS) 9.47 Recommend transform: None (Lambda = ) Ln(ResidualSS) Lambda Lambda There is no indication that a transformation is required for either response. 4-4 In Problem 8-4 a replicated fractional factorial design was used to study substrate camber in semiconductor manufacturing. Both the mean and standard deviation of the camber measurements were used as response variables. Is there any indication that a transformation is required for either response? 4-

479 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Camber Avg Box-Cox Plot for Power Transforms DESIGN-EXPERT Plot Cam ber StDev Box-Cox Plot for Power Transforms Lambda Current = Best = -.3 Low C.I. = -.79 High C.I. = Lambda Cu rre n t = Best =.57 Low C.I. = -.3 High C.I. = Recommend transform: Log (Lambda = ) Ln(ResidualSS).49 Recommend transform: None (Lambda = ) Ln(ResidualSS) Lambda Lambda The Box-Cox plot for the Camber Average suggests a natural log transformation should be applied. This decision is based on the confidence interval for lambda not including one and the point estimate of lambda being very close to zero. With a lambda of approximately.5, a square root transformation could be considered for the Camber Standard Deviation; however, the confidence interval indicates that no transformation is needed. 4-5 Reconsider the photoresist experiment in Problem 8-5. Use the variance of the resist thickness at each test combination as the response variable. Is there any indication that a transformation is required? DESIGN-EXPERT Plot Thick StDev Lambda Current = Best = -.4 Low C.I. = -.77 High C.I. =.76 Recommend transform: Log (Lambda = ) Ln(ResidualSS) Box-Cox Plot for Power Transforms Lambda With the point estimate of lambda near zero, and the confidence interval for lambda not inclusive of one, a natural log transformation would be appropriate. 4-3

480 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4-6 In the grill defects experiment described in Problem 8-9 a variation of the square root transformation was employed in the analysis of the data. Use the Box-Cox method to determine if this is the appropriate transformation. DESIGN-EXPERT Plot c Lambda Current = Best = -.6 Low C.I. = -.69 High C.I. =.74 Recommend transform: Log (Lambda = ) k =.56 (used to make response values positive) Ln(ResidualSS) Box-Cox Plot for Power Transforms Lambda Because the confidence interval for the minimum lambda does not include one, the decision to use a transformation is correct. Because the lambda point estimate is close to zero, the natural log transformation would be appropriate. This is a stronger transformation than the square root. 4-7 In the central composite design of Problem -4, two responses were obtained, the mean and variance of an oxide thickness. Use the Box-Cox method to investigate the potential usefulness of transformation for both of these responses. Is the log transformation suggested in part (c) of that problem appropriate? DESIGN-EXPERT Plot Mean Thick Box-Cox Plot for Power Transforms DESIGN-EXPERT Plot Var Thick Box-Cox Plot for Power Transforms Lambda Current = Best = -. Low C.I. = High C.I. = Lambda Cu rre n t = Best = -.47 Low C.I. = -.85 High C.I. = Recommend transform: None (Lambda = ) Ln(ResidualSS) 8.4 Recommend transform: None (Lambda = ) Ln(ResidualSS) Lambda Lambda The Box-Cox plot for the Mean Thickness model suggests that a natural log transformation could be applied; however, the confidence interval for lambda includes one. Therefore, a transformation would 4-4

481 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY have a minimal effect. The natural log transformation applied to the Variance of Thickness model appears to be acceptable; however, again the confidence interval for lambda includes one. 4-8 In the 3 3 factorial design of Problem -33 one of the responses is a standard deviation. Use the Box-Cox method to investigate the usefulness of transformations for this response. Would your answer change if we used the variance of the response? DESIGN-EXPERT Plot Std. Dev. Lambda Current = Best =.9 Low C.I. =. High C.I. =.6 Box-Cox Plot for Power Transforms.3 9. Recommend transform: Square Root (Lambda =.5) k =.58 (used to make response values positive) Ln(ResidualSS) Lambda Because the confidence interval for lambda does not include one, a transformation should be applied. The natural log transformation should not be considered due to zero not being included in the confidence interval. The square root transformation appears to be acceptable. However, notice that the value of zero is very close to the lower confidence limit, and the minimizing value of lambda is between and.5. It is likely that either the natural log or the square root transformation would work reasonably well. 4-9 Problem -34 suggests using the ln(s ) as the response (refer to part b). Does the Box-Cox method indicate that a transformation is appropriate? 4-5

482 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY DESIGN-EXPERT Plot Variance Lambda Current = Best = -.7 Low C.I. = -.53 High C.I. = -.7 Recommend transform: Inverse (Lambda = -) Ln(ResidualSS) Box-Cox Plot for Power Transforms Lambda Because the confidence interval for lambda does not include one, a transformation should be applied. The confidence interval does not include zero; therefore, the natural log transformation is inappropriate. With the point estimate of lambda at.7, the reciprocal transformation is appropriate. 4- A soft drink distributor is studying the effectiveness of delivery methods. Three different types of hand trucks have been developed, and an experiment is performed in the company s methods engineering laboratory. The variable of interest is the delivery time in minutes (y); however, delivery time is also strongly related to the case volume delivered (x). Each hand truck is used four times and the data that follow are obtained. Analyze the data and draw the appropriate conclusions. Use =.5. Hand Truck Type 3 3 y x y x y x From the analysis performed in Minitab, hand truck does not have a statistically significant effect on delivery time. Volume, as expected, does have a significant effect. Minitab Output General Linear Model: Time versus Truck Factor Type Levels Values Truck fixed 3 3 Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Volume Truck Error Total Term Coef SE Coef T P Constant Volume

483 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY 4- Compute the adjusted treatment means and the standard errors of the adjusted treatment means for the data in Problem 4-. S S S adj y y ˆ x x i. i adj y.. adj 5 y.. adj 86 adj.y. adj.y. adj.y3. y 3.. S adj.yi. MS E n i... x x i. E xx The solutions can also be obtained with Minitab as follows: Minitab Output Least Squares Means for Time Truck Mean SE Mean The sums of squares and products for a single-factor analysis of covariance follow. Complete the analysis and draw appropriate conclusions. Use =.5. Source of Degrees of Sums of Squares and Products Variation Freedom x xy x Treatment Error 6 55 Total 5 75 Sums of Squares & Products Adjusted Source df x xy y y df MS F Treatment Error Total

484 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Adjusted Treat Treatments differ only at %. 4-3 Find the standard errors of the adjusted treatment means in Example 4-4. From Example 4-4 y , adj y. 44., adj y S S S adj.y. adj.y. adj.y Four different formulations of an industrial glue are being tested. The tensile strength of the glue when it is applied to join parts is also related to the application thickness. Five observations on strength (y) in pounds and thickness (x) in. inches are obtained for each formulation. The data are shown in the following table. Analyze these data and draw appropriate conclusions. Glue Formulation y x y x y x y x From the analysis performed in Minitab, glue formulation does not have a statistically significant effect on strength. As expected, glue thickness does affect strength. Minitab Output General Linear Model: Strength versus Glue Factor Type Levels Values Glue fixed Analysis of Variance for Strength, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Thick Glue Error Total Term Coef SE Coef T P Constant Thick Unusual Observations for Strength 4-8

485 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Obs Strength Fit SE Fit Residual St Resid R R denotes an observation with a large standardized residual. Expected Mean Squares, using Adjusted SS Source Expected Mean Square for Each Term Thick (3) + Q[] Glue (3) + Q[] 3 Error (3) Error Terms for Tests, using Adjusted SS Source Error DF Error MS Synthesis of Error MS Thick (3) Glue (3) Variance Components, using Adjusted SS Source Estimated Value Error Compute the adjusted treatment means and their standard errors using the data in Problem 4-4. adj yi. yi. ˆ xi. x.. adj y adj y adj y adj S S S S y 4.. adj.y. adj.y. adj.y3. adj.y4. S adj.yi. MS E n x x i. E xx The adjusted treatment means can also be generated in Minitab as follows: Minitab Output Least Squares Means for Strength Glue Mean SE Mean

486 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY An engineer is studying the effect of cutting speed on the rate of metal removal in a machining operation. However, the rate of metal removal is also related to the hardness of the test specimen. Five observations are taken at each cutting speed. The amount of metal removed (y) and the hardness of the specimen (x) are shown in the following table. Analyze the data using and analysis of covariance. Use =.5. Cutting Speed (rpm) 4 4 y x y x y x As shown in the analysis performed in Minitab, there is no difference in the rate of removal between the three cutting speeds. As expected, the hardness does have an impact on rate of removal. Minitab Output General Linear Model: Removal versus Speed Factor Type Levels Values Speed fixed 3 4 Analysis of Variance for Removal, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Hardness Speed Error Total Term Coef SE Coef T P Constant Hardness Speed Unusual Observations for Removal Obs Removal Fit SE Fit Residual St Resid R R denotes an observation with a large standardized residual. Expected Mean Squares, using Adjusted SS Source Expected Mean Square for Each Term Hardness (3) + Q[] Speed (3) + Q[] 3 Error (3) Error Terms for Tests, using Adjusted SS Source Error DF Error MS Synthesis of Error MS Hardness. 8.7 (3) Speed. 8.7 (3) Variance Components, using Adjusted SS Source Estimated Value 4-

487 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY Error Means for Covariates Covariate Mean StDev Hardness Least Squares Means for Removal Speed Mean SE Mean Show that in a single factor analysis of covariance with a single covariate a (-) percent confidence interval on the i th adjusted treatment mean is y i. ˆ xi. x.. t,a n MS E n x x Using this formula, calculate a 95 percent confidence interval on the adjusted mean of machine in Example 4-4. The (-) percent interval on the i th adjusted treatment mean would be since y ˆ x x ˆ xi. x.. t,an S adjy i. yi. i. i... is an estimator of the i th adjusted treatment mean. The standard error of the adjusted treatment mean is found as follows: adj.y V y ˆ x x i. E xx.. V y x x V ˆ V i. i. i... i. i... Since the y i. and are independent. From regression analysis, we havev ˆ E xx. Therefore, V adj.y i. n x x x x i. E.. xx n i. E xx.. Replacing by its estimator MS E, yields y Vˆ adj.y S adj.y i. i. MS E MS x n E n i. x E xx.. or x x Substitution of this result into i. i..., a( n) adjy i. i. E xx.. ˆ x x t S will produce the desired confidence interval. A 95% confidence interval on the mean of machine would be found as follows: 4-

488 Solutions from Montgomery, D. C. () Design and Analysis of Experiments, Wiley, NY y ˆ i. xi. x adj.yi t adj.yi S. 4., Therefore, , where denotes the true adjusted mean of treatment one. 4-8 Show that in a single-factor analysis of covariance with a single covariate, the standard error of the difference between any two adjusted treatment means is adj.y The variance of this statistic is S Adjyi. Adjy adj.y MS j. E n y ˆ x x x x i. E xx.. ˆ x y x i. j. i. i... j. j.. y ˆ i. adj. y j. yi. y j. xi. x j. adj ˆ y y x x V y V y x x V ˆ i. j. i. j. i. j. i. j x x x x V. n n i. E j. xx n Replacing by its estimator MS E,, and taking the square root yields the standard error i. E xx j... S Adjyi. Adjy j. MS E n x x i. E xx Discuss how the operating characteristic curves for the analysis of variance can be used in the analysis of covariance. To use the operating characteristic curves, fixed effects case, we would use as the parameter, a n The test has a- degrees of freedom in the numerator and a(n-)- degrees of freedom in the denominator. i 4-

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Minitab Tutorials for Design and Analysis of Experiments. Table of Contents

Minitab Tutorials for Design and Analysis of Experiments. Table of Contents Table of Contents Introduction to Minitab...2 Example 1 One-Way ANOVA...3 Determining Sample Size in One-way ANOVA...8 Example 2 Two-factor Factorial Design...9 Example 3: Randomized Complete Block Design...14

More information

Analysis of Variance (ANOVA) Using Minitab

Analysis of Variance (ANOVA) Using Minitab Analysis of Variance (ANOVA) Using Minitab By Keith M. Bower, M.S., Technical Training Specialist, Minitab Inc. Frequently, scientists are concerned with detecting differences in means (averages) between

More information

One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.

Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0. Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

An analysis method for a quantitative outcome and two categorical explanatory variables.

An analysis method for a quantitative outcome and two categorical explanatory variables. Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Paired 2 Sample t-test

Paired 2 Sample t-test Variations of the t-test: Paired 2 Sample 1 Paired 2 Sample t-test Suppose we are interested in the effect of different sampling strategies on the quality of data we recover from archaeological field surveys.

More information

CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13. Experimental Design and Analysis of Variance CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

2 Sample t-test (unequal sample sizes and unequal variances)

2 Sample t-test (unequal sample sizes and unequal variances) Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing

More information

Analysis of Variance. MINITAB User s Guide 2 3-1

Analysis of Variance. MINITAB User s Guide 2 3-1 3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

ABSORBENCY OF PAPER TOWELS

ABSORBENCY OF PAPER TOWELS ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Data Analysis Tools. Tools for Summarizing Data

Data Analysis Tools. Tools for Summarizing Data Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate 1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups

ANOVA ANOVA. Two-Way ANOVA. One-Way ANOVA. When to use ANOVA ANOVA. Analysis of Variance. Chapter 16. A procedure for comparing more than two groups ANOVA ANOVA Analysis of Variance Chapter 6 A procedure for comparing more than two groups independent variable: smoking status non-smoking one pack a day > two packs a day dependent variable: number of

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation

Chapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/2004 Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

More information

Assessing Measurement System Variation

Assessing Measurement System Variation Assessing Measurement System Variation Example 1: Fuel Injector Nozzle Diameters Problem A manufacturer of fuel injector nozzles installs a new digital measuring system. Investigators want to determine

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Randomized Block Analysis of Variance

Randomized Block Analysis of Variance Chapter 565 Randomized Block Analysis of Variance Introduction This module analyzes a randomized block analysis of variance with up to two treatment factors and their interaction. It provides tables of

More information

Skewed Data and Non-parametric Methods

Skewed Data and Non-parametric Methods 0 2 4 6 8 10 12 14 Skewed Data and Non-parametric Methods Comparing two groups: t-test assumes data are: 1. Normally distributed, and 2. both samples have the same SD (i.e. one sample is simply shifted

More information

Gage Studies for Continuous Data

Gage Studies for Continuous Data 1 Gage Studies for Continuous Data Objectives Determine the adequacy of measurement systems. Calculate statistics to assess the linearity and bias of a measurement system. 1-1 Contents Contents Examples

More information

Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005

Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005 Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005 Philip J. Ramsey, Ph.D., Mia L. Stephens, MS, Marie Gaudard, Ph.D. North Haven Group, http://www.northhavengroup.com/

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Getting Started with Minitab 17

Getting Started with Minitab 17 2014 by Minitab Inc. All rights reserved. Minitab, Quality. Analysis. Results. and the Minitab logo are registered trademarks of Minitab, Inc., in the United States and other countries. Additional trademarks

More information

Case Study Call Centre Hypothesis Testing

Case Study Call Centre Hypothesis Testing is often thought of as an advanced Six Sigma tool but it is a very useful technique with many applications and in many cases it can be quite simple to use. Hypothesis tests are used to make comparisons

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected] www.excelmasterseries.com

More information

Unit 31: One-Way ANOVA

Unit 31: One-Way ANOVA Unit 31: One-Way ANOVA Summary of Video A vase filled with coins takes center stage as the video begins. Students will be taking part in an experiment organized by psychology professor John Kelly in which

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science [email protected]

Statistiek II. John Nerbonne. October 1, 2010. Dept of Information Science j.nerbonne@rug.nl Dept of Information Science [email protected] October 1, 2010 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated measures ANOVA. 4 Correlation and regression. 5 Multiple regression. 6 Logistic

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Unit 27: Comparing Two Means

Unit 27: Comparing Two Means Unit 27: Comparing Two Means Prerequisites Students should have experience with one-sample t-procedures before they begin this unit. That material is covered in Unit 26, Small Sample Inference for One

More information

Hypothesis Testing. Steps for a hypothesis test:

Hypothesis Testing. Steps for a hypothesis test: Hypothesis Testing Steps for a hypothesis test: 1. State the claim H 0 and the alternative, H a 2. Choose a significance level or use the given one. 3. Draw the sampling distribution based on the assumption

More information

Multiple-Comparison Procedures

Multiple-Comparison Procedures Multiple-Comparison Procedures References A good review of many methods for both parametric and nonparametric multiple comparisons, planned and unplanned, and with some discussion of the philosophical

More information

A) 0.1554 B) 0.0557 C) 0.0750 D) 0.0777

A) 0.1554 B) 0.0557 C) 0.0750 D) 0.0777 Math 210 - Exam 4 - Sample Exam 1) What is the p-value for testing H1: µ < 90 if the test statistic is t=-1.592 and n=8? A) 0.1554 B) 0.0557 C) 0.0750 D) 0.0777 2) The owner of a football team claims that

More information

MEAN SEPARATION TESTS (LSD AND Tukey s Procedure) is rejected, we need a method to determine which means are significantly different from the others.

MEAN SEPARATION TESTS (LSD AND Tukey s Procedure) is rejected, we need a method to determine which means are significantly different from the others. MEAN SEPARATION TESTS (LSD AND Tukey s Procedure) If Ho 1 2... n is rejected, we need a method to determine which means are significantly different from the others. We ll look at three separation tests

More information

STAT 145 (Notes) Al Nosedal [email protected] Department of Mathematics and Statistics University of New Mexico. Fall 2013

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013 STAT 145 (Notes) Al Nosedal [email protected] Department of Mathematics and Statistics University of New Mexico Fall 2013 CHAPTER 18 INFERENCE ABOUT A POPULATION MEAN. Conditions for Inference about mean

More information

Fat Content in Ground Meat: A statistical analysis

Fat Content in Ground Meat: A statistical analysis Volume 25: Mini Workshops 385 Fat Content in Ground Meat: A statistical analysis Mary Culp Canisius College Biology Department 2001 Main Street Buffalo, NY 14208-1098 [email protected] Mary Culp has been

More information

TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics

TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science & Statistics BA (Mod) Enter Course Title Trinity Term 2013 Junior/Senior Sophister ST7002

More information

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it:

EXCEL Analysis TookPak [Statistical Analysis] 1. First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it: EXCEL Analysis TookPak [Statistical Analysis] 1 First of all, check to make sure that the Analysis ToolPak is installed. Here is how you do it: a. From the Tools menu, choose Add-Ins b. Make sure Analysis

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB DIRECTIONS: Welcome! Your TA will help you apply your Calculator and MINITAB to review Business Stats, and will

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

START Selected Topics in Assurance

START Selected Topics in Assurance START Selected Topics in Assurance Related Technologies Table of Contents Introduction Some Statistical Background Fitting a Normal Using the Anderson Darling GoF Test Fitting a Weibull Using the Anderson

More information

10. Analysis of Longitudinal Studies Repeat-measures analysis

10. Analysis of Longitudinal Studies Repeat-measures analysis Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

Review #2. Statistics

Review #2. Statistics Review #2 Statistics Find the mean of the given probability distribution. 1) x P(x) 0 0.19 1 0.37 2 0.16 3 0.26 4 0.02 A) 1.64 B) 1.45 C) 1.55 D) 1.74 2) The number of golf balls ordered by customers of

More information

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996) MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL by Michael L. Orlov Chemistry Department, Oregon State University (1996) INTRODUCTION In modern science, regression analysis is a necessary part

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

Mind on Statistics. Chapter 13

Mind on Statistics. Chapter 13 Mind on Statistics Chapter 13 Sections 13.1-13.2 1. Which statement is not true about hypothesis tests? A. Hypothesis tests are only valid when the sample is representative of the population for the question

More information