Chapter 4 and 5 solutions

Transcription

1 Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory, and only three trials can be run on any day. Because days could represent a potential source of variability, the experimenter decides to use a randomized block design. Observations are taken for four days, and the data are shown here. Analyze the data from this experiment (use α = 0.05) and draw conclusions. Days Solution Design Expert Output Response: Growth ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A Residual Cor Total The Model F value of 40.7 implies the model is significant. There is only a 0.03% chance that a "Model F Value" this large could occur due to noise. Std. Dev..94 R Squared

2 Mean Adj R Squared C.V Pred R Squared PRESS Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Mean Standard Error Mean Standard t for H0 Treatment Difference DF Error Coeff=0 Prob > t 1 vs vs vs There is a difference between the means of the three solutions. The Fisher LSD procedure indicates that solution 3 is significantly different than the other two Consider the hardness testing experiment described in Section 4.1. Suppose that the experiment was conducted as described and the following Rockwell C scale data (coded by subtracting 40 units) obtained: Coupon Tip

3 (a) Analyize the data from this experiment. There is a difference between the means of the four tips. Design Expert Output Response: Hardness ANOVA for Selected Factorial Model Analysis of variance table [Terms added sequentially (first to last)] Sum of Mean F Source Squares DF Square Value Prob > F Bock Model significant A Residual E 003 Cor Total The Model F value of implies the model is significant. There is only a 0.09% chance that a "Model F Value" this large could occur due to noise. Std. Dev R Squared Mean 9.63 Adj R Squared C.V Pred R Squared PRESS 0.5 Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Mean Standard Error

4 Mean Standard t for H0 Treatment Difference DF Error Coeff=0 Prob > t 1 vs vs vs vs vs vs (b) Use the Fisher LSD method to make comparisons among the four tips to determine specifically which tips differ in mean hardness readings. Based on the LSD bars in the Design Expert plot below, the mean of tip 4 differs from the means of tips 1,, and 3. The LSD method identifies a marginal difference between the means of tips and One Factor Plot 9.95 Hardness A: Tip

5 (c) Analyze the residuals from this experiment. The residual plots below do not identify any violations to the assumptions. Normal Plot of vs. Predicted Normal % Probability Residual Predicted vs. Tip Tip 4.8. A consumer products company relies on direct mail marketing pieces as a major component of its advertising campaigns. The company has three different designs for a new brochure and want to evaluate their effectiveness, as there are substantial differences in costs between the three designs. The company decides to test the three designs by mailing 5,000 samples of each to potential customers in four different regions of the country. Since there are known regional differences in the customer base, regions are considered as blocks. The number of responses to each mailing is shown below.

6 Region Design NE NW SE SW (a) Analyze the data from this experiment. The residuals of the analsysis below identify concerns with the normality and equality of variance assumptions. As a result, a square root transformation was applied as shown in the second ANOVA table. The residuals of both analysis are presented for comparison in part (c) of this problem. The analysis concludes that there is a difference between the mean number of responses for the three designs. Design Expert Output Response: Number of responses ANOVA for Selected Factorial Model Analysis of variance table [Terms added sequentially (first to last)] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A Residual Cor Total 1.45E The Model F value of implies the model is significant. There is only a 0.0% chance that a "Model F Value" this large could occur due to noise.

7 Std. Dev R Squared Mean Adj R Squared C.V Pred R Squared PRESS Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Mean Standard Error Mean Standard t for H0 Treatment Difference DF Error Coeff=0 Prob > t 1 vs vs vs Design Expert Output for Transformed Data Response: Number of responsestransform:square root Constant: 0 ANOVA for Selected Factorial Model Analysis of variance table [Terms added sequentially (first to last)] Sum of Mean F Source Squares DF Square Value Prob > F Block Model significant A Residual Cor Total

8 The Model F value of implies the model is significant. There is only a 0.01% chance that a "Model F Value" this large could occur due to noise. Std. Dev R Squared Mean 18.5 Adj R Squared C.V Pred R Squared PRESS 1.05 Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Mean Standard Error Mean Standard t for H0 Treatment Difference DF Error Coeff=0 Prob > t 1 vs vs vs < (b) Use the Fisher LSD method to make comparisons among the three designs to determine specifically which designs differ in mean response rate. Based on the LSD bars in the Design Expert plot below, designs 1 and 3 do not differ; however, design is different than designs 1 and 3.

9 One Factor Plot Sqrt(Number of responses) A: Design (c) Analyze the residuals from this experiment. The first set of residual plots presented below represent the untransformed data. Concerns with normality as well as inequality of variance are presented. The second set of residual plots represent transformed data and do not identify significant violations of the assumptions. The residuals vs. design plot indicates a slight inequality of variance; however, not a strong violation and an improvement over the non transformed data. Normal Plot of vs. Predicted Normal % Probability Residual Predicted

10 vs. Design Design The following are the square root transformed data residual plots. Normal Plot of vs. Predicted Normal % Probability Residual Predicted

11 vs. Design Design 4.1. An aluminum master alloy manufacturer produces grain refiners in ingot form. The company produces the product in four furnaces. Each furnace is known to have its own unique operating characteristics, so any experiment run in the foundry that involves more than one furnace will consider furnaces as a nuisance variable. The process engineers suspect that stirring rate impacts the grain size of the product. Each furnace can be run at four different stirring rates. A randomized block design is run for a particular refiner and the resulting grain size data is as follows. Furnace Stirring Rate (a) Is there any evidence that stirring rate impacts grain size? Design Expert Output Response: Grain Size ANOVA for Selected Factorial Model

12 Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Block Model not significant A Residual Cor Total The "Model F value" of 0.85 implies the model is not significant relative to the noise. There is a % chance that a "Model F value" this large could occur due to noise. Std. Dev..95 R Squared 0.13 Mean 7.69 Adj R Squared C.V Pred R Squared PRESS 46.7 Adeq Precision Treatment Means (Adjusted, If Necessary) Estimated Mean Standard Error Mean Standard t for H0 Treatment Difference DF Error Coeff=0 Prob > t 1 vs vs vs

13 vs vs vs The analysis of variance shown above indicates that there is no difference in mean grain size due to the different stirring rates. (b) Graph the residuals from this experiment on a normal probability plot. Interpret this plot. Normal plot of residuals 99 Normal % probability Residual The plot indicates that normality assumption is valid. (c) Plot the residuals versus furnace and stirring rate. Does this plot convey any useful information?

14 vs. Stirring Rate vs. Furnace Stirring Rate Furnace The variance is consistent at different stirring rates. Not only does this validate the assumption of uniform variance, it also identifies that the different stirring rates do not affect variance. (d) What should the process engineers recommend concerning the choice of stirring rate and furnace for this particular grain refiner if small grain size is desirable? There really is no effect due to the stirring rate The effect of five different ingredients (A, B, C, D, E) on reaction time of a chemical process is being studied. Each batch of new material is only large enough to permit five runs to be made. Furthermore, each run requires approximately 1 1/ hours, so only five runs can be made in one day. The experimenter decides to run the experiment as a Latin square so that day and batch effects can be systematically controlled. She obtains the data that follow. Analyze the data from this experiment (use α = 0.05) and draw conclusions. Day Batch A=8 B=7 D=1 C=7 E=3 C=11 E= A=7 D=3 B=8 3 B=4 A=9 C=10 E=1 D=5

15 4 D=6 C=8 E=6 B=6 A=10 5 E=4 D= B=3 A=8 C=8 The Minitab output below identifies the ingredients as having a significant effect on reaction time. Minitab Output General Linear Model Factor Type Levels Values Batch random Day random Catalyst fixed 5 A B C D E Analysis of Variance for Time, using Adjusted SS for Tests Source DF Seq SS Adj SS Adj MS F P Catalyst Batch Day Error Total The yield of a chemical process is being studied. The two most important variables are thought to be the pressure and the temperature. Three levels of each factor are selected, and a factorial experiment with two replicates is performed. The yield data follow: Pressure Temperature

16 (a) Analyze the data and draw conclusions. Use α = Both pressure (A) and temperature (B) are significant, the interaction is not. Response:Surface Finish Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B AB Residual Lack of Fit Pure Error Cor Total The Model F value of 8.00 implies the model is significant. There is only a 0.6% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B are significant model terms.

17 Values greater than indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. (b) Prepare appropriate residual plots and comment on the model s adequacy. The residual plots show no serious deviations from the assumptions. vs. Predicted Normal plot of residuals E Normal % probability E Predicted Residual vs. Temperature vs. Pressure E E Temperature Pressure

18 (c) Under what conditions would you operate this process? DESIGN-EXPERT Plot Yield Interaction Graph Temperature X = A: Pressure Y = B: Temperature Design Points B1 150 B 160 B3 170 Yield Pressure Set pressure at 15 and Temperature at the high level, 170 degrees C, as this gives the highest yield. The standard analysis of variance treats all design factors as if they were qualitative. In this case, both factors are quantitative, so some further analysis can be performed. In Section 5.5, we show how response curves and surfaces can be fit to the data from a factorial experiment with at least one quantative factor. Since both factors in this problem are quantitative and have three levels, we can fit linear and quadratic effects of both temperature and pressure, exactly as in Example 5.5 in the text. The Design Expert output, including the response surface plots, now follows. Response:Surface Finish Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model < significant A B

19 A < B AB Residual Lack of Fit 7.639E E not significant Pure Error Cor Total The Model F value of implies the model is significant. There is only a 0.01% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B, A, B are significant model terms. Values greater than indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. Std. Dev. 0.1 R-Squared Mean Adj R-Squared C.V Pred R-Squared PRESS 0.4 Adeq Precision Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A Pressure B Temperature E A B AB E

20 Final Equation in Terms of Coded Factors: Yield = * A * B 0.41 * A +0.4 * B * A * B Final Equation in Terms of Actual Factors: Yield = * Pressure * Temperature E 003 * Pressure E 003 * Temperature E 004 * Pressure * Temperature

21 Yield 90.7 B: Temperature Yield A: Pressure A: Pres s ure B: Temperature An engineer suspects that the surface finish of a metal part is influenced by the feed rate and the depth of cut. She selects three feed rates and four depths of cut. She then conducts a factorial experiment and obtains the following data: Depth of Cut (in) Feed Rate (in/min)

22 (a) Analyze the data and draw conclusions. Use α = The depth (A) and feed rate (B) are significant, as is the interaction (AB). Response: Surface Finish Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model < significant A < B < AB Residual Lack of Fit Pure Error Cor Total The Model F value of implies the model is significant. There is only a 0.01% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B, AB are significant model terms. (b) Prepare appropriate residual plots and comment on the model s adequacy.

23 The residual plots shown indicate nothing unusual. vs. Predicted Normal plot of residuals Normal % probability Predicted Residual vs. Feed Rate vs. Depth of Cut Feed Rate Depth of Cut (c) Obtain point estimates of the mean surface finish at each feed rate. Feed Rate Average

24 DESIGN-EXPERT Plot Surface Finish X = B: Feed Rate 114 One Factor Plot Warning! Factor involved in an interaction. Actual Factor A: Depth of Cut = Average Surface Finish Feed Rate (d) Find P values for the tests in part (a). The P values are given in the computer output in part (a) For the data in Problem 5.3, compute a 95 percent interval estimate of the mean difference in response for feed rates of 0.0 and 0.5 in/min. We wish to find a confidence interval on μ 1 μ, where μ 1 is the mean surface finish for 0.0 in/min and μ is the mean surface finish for 0.5 in/min. y y t MS E ( n 1) μ 1 μ y1.. y.. + tα, ab( n 1) n α, ab ) MS n E (8.7) ( ) ± (.064) = 16 ± Therefore, the 95% confidence interval for μ μ is ±

25 5.6. An article in Industrial Quality Control (1956, pp. 5 8) describes an experiment to investigate the effect of the type of glass and the type of phosphor on the brightness of a television tube. The response variable is the current necessary (in microamps) to obtain a specified brightness level. The data are as follows: Glass Phosphor Type Type (a) Is there any indication that either factor influences brightness? Use α = Both factors, phosphor type (A) and Glass type (B) influence brightness. Response: Current in microamps Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model < significant A B <

26 AB Residual Lack of Fit Pure Error Cor Total The Model F value of implies the model is significant. There is only a 0.01% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B are significant model terms. (b) Do the two factors interact? Use α = There is no interaction effect. (c) Analyze the residuals from this experiment. The residual plot of residuals versus phosphor content indicates a very slight inequality of variance. It is not serious enough to be of concern, however.

27 vs. Predicted Normal plot of residuals Normal % probability Predicted Residual vs. Glass Type vs. Phosphor Type Glass Type Phosphor Type 5.7. Johnson and Leone (Statistics and Experimental Design in Engineering and the Physical Sciences, Wiley 1977) describe an experiment to investigate the warping of copper plates. The two factors studied were the temperature and the copper content of the plates. The response variable was a measure of the amount of warping. The data were as follows: Copper Content (%) Temperature ( C)

28 50 17,0 16,1 4, 8,7 75 1,9 18,13 17,1 7, ,1 18,1 5,3 30,3 15 1,17 3,1 3, 9,31 (a) Is there any indication that either factor affects the amount of warping? Is there any interaction between the factors? Use α = Both factors, copper content (A) and temperature (B) affect warping, the interaction does not. Response: Warping Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model < significant A < B AB Residual Lack of Fit Pure Error Cor Total The Model F value of 9.5 implies the model is significant. There is only a 0.01% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B are significant model terms.

29 (b) Analyze the residuals from this experiment. There is nothing unusual about the residual plots. Normal plot of residuals vs. Predicted Normal % probability E E Residual Predicted vs. Copper Content vs. Temperature E E Copper Content Temperature (c) Plot the average warping at each level of copper content and compare them to an appropriately scaled t distribution. Describe the differences in the effects of the different levels of copper content on warping. If low warping is desirable, what level of copper content would you specify?

30 Design Expert Output Factor Name Level Low Level High Level A Copper Content B Temperature Average Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Factor Name Level Low Level High Level A Copper Content B Temperature Average Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Factor Name Level Low Level High Level A Copper Content B Temperature Average Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high Warping Factor Name Level Low Level High Level A Copper Content B Temperature Average Prediction SE Mean 95% CI low 95% CI high SE Pred 95% PI low 95% PI high

31 Warping Use a copper content of 40 for the lowest warping. MS S = E = = 0.9 b 8 Scaled t Distribution Cu=40 Cu=60 Cu=80 Cu= Warping (d) Suppose that temperature cannot be easily controlled in the environment in which the copper plates are to be used. Does this change your answer for part (c)? Use a copper of content of 40. This is the same as for part (c).

32 DESIGN-EXPERT Plot Warping X = A: Copper Content Y = B: Temperature Design Points Interaction Graph Temperature B1 50 B 75 B3 100 B4 15 Warping Copper Content 5.9. A mechanical engineer is studying the thrust force developed by a drill press. He suspects that the drilling speed and the feed rate of the material are the most important factors. He selects four feed rates and uses a high and low drill speed chosen to represent the extreme operating conditions. He obtains the following results. Analyze the data and draw conclusions. Use α = (A) Feed Rate (B) Drill Speed

33 Design Expert Output Response: Force ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A < B AB Residual E 003 Lack of Fit Pure Error E 003 Cor Total The Model F value of implies the model is significant. There is only a 0.05% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B, AB are significant model terms. The factors speed and feed rate, as well as the interaction is important.

34 DESIGN-EXPERT Plot Force Interaction Graph Drill Speed X = B: Feed Rate Y = A: Drill Speed Design Points A1 15 A Force Feed Rate The standard analysis of variance treats all design factors as if they were qualitative. In this case, both factors are quantitative, so some further analysis can be performed. In Section 5.5, we show how response curves and surfaces can be fit to the data from a factorial experiment with at least one quantative factor. Since both factors in this problem are quantitative, we can fit polynomial effects of both speed and feed rate, exactly as in Example 5.5 in the text. The Design Expert output with only the significant terms retained, including the response surface plots, now follows. Response: Force Design Expert Output ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model significant A B B Residual E 003 Lack of Fit significant Pure Error E 003 Cor Total

35 The Model F value of implies the model is significant. There is only a 0.08% chance that a "Model F Value" this large could occur due to noise. Values of "Prob > F" less than indicate model terms are significant. In this case A, B are significant model terms. Values greater than indicate the model terms are not significant. If there are many insignificant model terms (not counting those required to support hierarchy), model reduction may improve your model. Std. Dev R-Squared Mean.77 Adj R-Squared C.V..9 Pred R-Squared PRESS 0.14 Adeq Precision 9.69 Coefficient Standard 95% CI 95% CI Factor Estimate DF Error Low High VIF Intercept A Drill Speed B Feed Rate B Final Equation in Terms of Coded Factors: Force = * A * B * B Final Equation in Terms of Actual Factors:

36 Force = E 003 * Drill Speed * Feed Rate * Feed Rate Force B: Feed Rate Force A: Drill Speed B: Feed Rate A: Drill Speed