Statistics EXAM II through 2003

Save this PDF as:
 WORD  PNG  TXT  JPG

Size: px
Start display at page:

Download "Statistics 641 - EXAM II - 1999 through 2003"

Transcription

1 Statistics EXAM II through 2003 December 1, 1999 I. (40 points ) Place the letter of the best answer in the blank to the left of each question. (1) In testing H 0 : µ 5 vs H 1 : µ > 5, the P-value of the data was If α=.01 and the true value of µ was µ=7, then the decision based on the data A. was a Type I error. B. was a Type II error. C. was correct. D. cannot be determined. E. all of the above (2) The P-value of the computed value of a test statistic is A. the weight of evidence in favor of H 1. B. the largest value of α for which the observed data will reject H o. C. the smallest value of α for which the observed data will reject H o. D. the probability of observing a less extreme value of the test statistic. (3) A 95% confidence interval for µ was calculated to be (15,27). Then 95% represents: A. the probability of a type I error. B. the probability that µ is between 15 and 27. C. the probability that we obtain a sample which will yield an interval containing µ. D. the probability that µ 10. (4) The most crucial of the conditions imposed on the sampled data and the populations in order for the pooled t-test to be valid is A. normality. B. equal variance. C. independence. D. all three conditions are equally important. E. none of the conditions are crucial (5) In a hypotheses test of H o : µ 5 vs H 1 : µ < 5, with σ known, if the sample size remains constant, but the level α is increased from.01 to.05, then the probability of a Type II error at µ=4, A. increases. B. decreases. C. remains the same. D. may increase or decrease depending on the sample size. E. cannot be determined with the given information. 1

2 (6) The reason that experimental units are paired in a study to compare the average responses of two treatments A. is to reduce the degrees of freedom of the t-test. B. is to reduce the variance of the difference in the two sample means. C. is to increase the degrees of freedom of the t-test. D. is to make the difference in the two sample means normally distributed. (7) The Wilcoxon rank sum statistic is preferred to the pooled t-test A. if the population distributions are normally distributed. B. for all continuous distributions. C. if the population distributions are symmetric. D. if the population distributions have equal variance. E. if the population distributions have very heavy tails. (8) A 95/99 tolerance interval for a normal population A. has a higher level of confidence than a 95% confidence interval on the population mean. B. is a 95% estimate of µ and a 99% estimate of σ. C. an estimate of a region of values which will contain between 95% and 99% of the population values. D. is a region of values for which we are 99% confident that the region contains 95% of the population values. E. is based on the central limit theorem. (9) An experimenter wants to test H o : F = F o, where F is a process cdf. Which one of the following statements is TRUE? A. The Chi-squared GOF test is the preferred test statistic. B. The most powerful test statistic depends on the shape of F o. C. The Anderson-Darling test has greater power than any other test. D. The Shapiro-Wilk test has greater power than the Chi-squared test. E. The Chi-squared GOF test can only be used when F o is discrete. (10) If f(y; θ) is a pdf which is symmetric about θ, then, amongst the three test statistics discussed in class, the test statistic having greatest power A. is the Wilcoxon Signed Rank test. B. is the sign test C. is the t-test D. depends on the form of f(y; θ) 2

3 II. (28 points) In the following problems, (A) state the null and alternative hypotheses, (B) give the formula for the test statistic but do not compute its value, (C) set-up the rejection region by selecting the proper value from the appropriate table. (1) Two different types of fabrics (A and B) are to be compared on a Martingale wear tester. The wear tester is known to be quite variable from run to run. Thus, on each run of the tester, the operator evaluates a sample of Fabric A and a sample of Fabric B. The weight losses (in milligrams) from seven runs are as follows: Run : X S Fabric A: Fabric B: Is there significant evidence (α = 0.01) that Fabric A has a smaller average weight loss than Fabric B? A. H o : H a : B. Test statistic: C. Rejection Region: (2) An experiment is run to study the effects of PCB, an industrial contaminant, on the reproductive ability of owls. The shell thickness of eggs produced by owls exposed to PCB are compared to the shell thickness of eggs produced by owls which did not have PCB exposures. From previous studies, the shell thickness of eggs has been normally distributed. Owl X S PCB-Exposed: UnExposed: Is there significant(α =.05) evidence that the PCB exposed owls have thinner egg shells than those of the unexposed owls? A. H o : H a : B. Test statistic: C. Rejection Region: (3) Scientists think that robots will play a crucial role in factories in the next 20 years. Suppose that in an experiment to determine whether the use of robots to weave computer cables is feasible, a robot was used to assemble 10 cables and an experienced worker also assembled 10 cables. The cables were examined and the number of defectives on each cable was recorded. The 10 data points for each method were plotted in a normal probability plot and the plot indicated a very heavy-tailed distribution. The data is given here: Cable Mean St.Dev. Robot Human Does this data support the assertion that the average number of defectives per assembled cable is less for robots than for humans? Use α = A. H o : H a : B. Test statistic: C. Rejection Region: 3

4 III. (32 points) A company is developing a new cooling fan for diesel engines. From previous studies involving the old fan, the exponential distribution provided an adequate model for the time until failure of the fan. The existing fan has an average time to failure of 25,000 hours. The company developing the new cooling fan wants to determine if the new fan has a longer average time to failure than the old fan. (1) Suppose 20 of the new fans are evaluated in an accelerated life testing experiment and their times to failure T 1,..., T 20 are determined. Describe a graphical technique to evaluate whether the failure times from the accelerated life testing follow an exponential distribution. Be sure to define all your terms and explicitly label both axes on your graphs. (2) Suppose we have determined that the times to failure from the accelerated life testing procedures satisfy an exponential distribution and the average time to failure of the 20 fans is T = 32, 381 hours using the accelerated life testing procedures. Construct 95% confidence interval for the average time to failure for the new fans. Hint: If T 1,..., T n are iid exponential random variables with mean λ then 2n T λ has a Chi-square distribution with d.f. = 2n. (3) Is there sufficient evidence (α =.05), using the data from part (2) of this problem, that the new fan has a longer average time to failure than the fan currently in use? Compute the p-value of your test. (4) If the average time to failure of the new fan is 30,000 hours, compute the probability that the test in part (3) of this problem will detect that the average time to failure is greater than 25,000 hours based on testing 20 new fans. 4

5 EXAM II, December 1, 2000 I. (40 points) One of the major sources of sulfur dioxide air pollution is coal powered utility plants. Currently only 10% of such power plants meet federal EPA air pollution standards. Several major changes have been made in the power plants, such as using low-sulfur coal, burning the coal at a higher temperature, and installing new scrubbers on the air stacks of the plants. One year after imposing the changes, the EPA randomly selected 25 power plants for a length examination. The investigators found that 5 of the 25 power plants met the EPA air pollution standards, whereas 20 power plants did not meet the standards. 1. Place a 95% confidence interval on the proportion of power plants currently meeting the EPA air pollution standards. 2. Is there significant evidence at the α =.05 level that the proportion of power plants meeting the EPA air pollution standards has increased since the major changes were made in the power plants. 3. Compute the P-value for the test statistic obtained in part Evaluate the power function of the test constructed in part 2 for the proportions.05,.1,.2,.3,.4,.5, and.6. Use these values to sketch a rough plot of the power function. 5. The researchers are interested in a much larger study. How large must the sample size n be in order that the researchers can be 95% confident that the sample estimator of the proportion of power plants meeting the standard is within.05 of the true value? II. (30 points) A company has designed a new type of braking system for sports utility vehicles. To evaluate the effectivenes of the new system, they place n of the braking systems on a test device and recorded the time to failure of the braking systems: Y 1,..., Y n. The pdf of the random variables f(y; θ), depends on a unknown parameter θ. For each of the following situations, state whether the given statement is TRUE or FALSE. If the statement is FALSE, explain VERY BRIEFLY why the statement is FALSE. 1. In order to evaluate the reliability of the braking systems a 95/95 tolerance interval for the population of times to failure for the brakes is to be constructed. In order to construct the interval, it is necessary that the functional form of f(y; θ) be known. 2. The researchers obtain a point estimator of θ. They state that the sampling distribution of the estimator can be adequately approximated by a normal distribution if the sample size, n, is large enough. 3. Under appropriate weather conditions, the pdf f(y; θ) is symmetric about θ. In this type of situation, the sample mean would have a smaller mean squared error (MSE) than the sample median as an estimator of θ. III. (30 points ) Place the letter of the BEST answer in the BLANK to the LEFT of each question. (1) In testing H 0 : π.4 vs H a : π <.4, the P-value of the data was If α=.05 and the true value of π was π=.6, then the decision based on the data A) was a Type I error. B) was a Type II error. C) was correct. D) cannot be determined. 5

6 (2) The P-value of the computed value of a test statistic is A) the probability of observing a less extreme value of the test statistic. B) the largest value of α for which the observed data will reject H o. C) the weight of evidence in favor of H a. D) the smallest value of α for which the observed data will reject H o. (3) An industrial process produces piston rings having a nominal diameter of 9 cm. A 95% confidence interval for the mean diameter of the piston rings produced during July was calculated and a 95%/95% tolerance interval was calculated for the diameters of the piston rings produced during July. A) The probability is.95 that the mean diameter will fall within the confidence interval. B) The width of the 95% confidence interval is generally narrower than the width of the 95%/95% tolerance interval and hence is a more precise estimator. C) If the engineer wanted to set limits such that 95% of the output was within these limits, then the tolerance interval would be more informative than the confidence interval. D) The tolerance interval for the piston diameters could be used to determine if the mean diameter for July s output was equal to 9 cm or not. (4) Suppose we have two normal populations and we want to compare the means of the populations. Random samples of size 9 are selected from each population. The researcher is certain that the standard deviation of the first population is at least 8 times larger than the standard deviation of the second population. The most appropriate procedure for testing if the two population means differ is A. the pooled variance t-test. B. the Wilcoxon rank sum test. C. to transform the data and use a pooled variance t-test. D. the separate variance t-test using the Satterthwaite correction for the degrees of freedom. E. none of the procedures are very useful in this situation since the sample sizes are too small. (5) A biochemist is attempting to estimate the typical length of time it takes a drug to reach the kidney of a mature rat. There are several possible estimators of this parameter. In attempting to select the best estimator the biochemist should A. select the estimator with the smallest average squared distance from the parameter. B. select the estimator with the smallest variance since it would be the most consistent estimator. C. select the estimator with the smallest bias or smallest variance. D. always select the unbiased estimator since on the average it would equal the parameter. E. ask a statistician for advice. (6) In a study to compare the average responses of two drugs for the treatment of heart worms, the researchers were concerned that the available dogs for the study ranged from 1 year to 12 years of age and weighed from 10 kilograms to 55 kilograms. How can the effect of the difference in experimental units be reduced so as to not mask any significant difference in the treatments? A. A procedure based on the ranks of the responses should be used as a test statistic. B. A transformation of the data prior to using a pooled t-test would be effective in reducing variability. C. The separate variance t-test would adjust the test statistic for the unequal variances. D. The dogs should be grouped into similar pairs of dogs prior to assigning the two treatments. E. The effect of the differences in the dogs is not a problem if the treatments are randomly assigned to the dogs. 6

7 (7) The Wilcoxon rank sum test statistic is called a distribution-free test statistic since A. its sampling distribution under H o does not depend on the shape of the population distributions. B. it can be used even if the population distributions are non-normal. C. its sampling distribution is non-normal for small sample sizes. D. its sampling distribution does not require the variances to be equal. E. it has greater power than the pooled t-test when the population distributions are non-normal. (8) The random assignment of treatments to experimental units is crucial in designed experiments A. since it eliminates the effects of nontreatment factors on the experimental responses. B. since the effect of nontreatment factors are averaged over all experimental units, no matter the treatment. C. since it allows us to estimate the amount that nontreatment factors affect the average response. population values. D. since the randomization prevents the experimenter from knowing which experimental units are assigned to which treatments. reasons are correct. (9) An experimenter wants to test H o : F = F o, where F is a process cdf. Which ONE of the following statements is FALSE? A. The Chi-squared GOF test statistic can be used even when F is an absolutely continuous cdf. B. The Cramer-von-Mises test statistic is generally a better test than the Kolmogorov-Smirnov test statistic. C. The Anderson-Darling test has greater power than the Chi-squared GOF test for all choices of F o. D. The Shapiro-Wilk test has greater power than the Anderson-Darling test if F has a normal cdf. E. The Kolmogorov-Smirnov test is not an appropriate test statistic when F o is discrete. (10) If f(y; θ) is a pdf which is symmetric about θ, then, amongst the three test statistics discussed in class, the test statistic having greatest power A. is the Wilcoxon signed rank test. B. is the sign test C. is the t-test D. depends on the form of f(y; θ) E. the tests have essentially the same power 7

8 November 21, 2001 I. (35 points) A company that manufacturers silicon wafers for computer chips is concerned with both the mean thickness of the chips and the fluctuation in the thickness of the chips. In order to monitor the thickness, a random sample of 20 chips is selected every hour and the thickness is measured on each of the chips. The process is considered to be in control provided the process mean, µ, is 200 mm and the process standard deviation, σ is less than or equal to 2.5 mm. The company s statistician develops a test to evaluate whether the process standard deviation is greater than 2.5 mm. She plots the power curve of the test in order to evaluate its performance. The curve is given here: Power Curve for Standard Deviation Power Process Standard Deviation Use the above graph to answer questions a.-d. a. What is the level of significance of the test whose power curve is depicted above? b. What is the probability of a Type I error if σ = 2.4? c. What is the probability of a Type II error if σ = 2.4? d. What is the probability of a Type II error if σ = 3? e. What is the practical consequence to the company if the test commits a Type I error? f. What is the practical consequence to the company if the test commits a Type II error? 8

9 g. The random sample of 20 chips yields a sample standard deviation of Compute the p-value of the test statistic and determine if the process standard deviation is outside of its specification. II. (25 points) The company also needs to meet the specification that the mean thickness of the wafers must be at least 200 mm. The company wants to develop a test which has level of significance of 5% to determine if the mean thickness is less than 200 mm. a. Suppose the sample size is 20 units and we develop a level 0.05 test based on n=20. What is the chance the test will detect that the wafer mean thickness is less than 200 mm when the process mean is 199 mm and mm. (Assume that σ = 2.5 mm.) b. Sketch the power curve for the test. c. What sample size is needed so that the test in part a. will have a 80% chance to detect that the wafer mean thickness is less than 200 mm when the true mean thickness is at most mm? III. (4 points each) Place the letter of the best answer in the blank to the left of each question. (1) Suppose that the company s process engineer informed you that in most samples of 20 wafers that the distribution of wafer thicknesses is highly skewed to the right. In using the Chi-square test of whether σ is greater than 2.5 mm with α = 0.05, A. the actual level of significance will be greater than B. the actual level of significance will be less than C. the actual level of significance will be very close to D. it is completely unknown what the effect will be. (2) The p-value of the observed value of a test statistic is A. the probability of making the correct decision. B. the weight of evidence that the alternative hypothesis is true. C. the smallest level of significance for which the observed data will reject the null hypothesis. D. the probability of making a Type I error. (3) The sample estimator ˆθ of a population parameter θ is unbiased. Unbiased means that the estimator ˆθ A. is 95% certain of being close to θ. B. has a sampling distribution which is symmetric about θ. C. has a smaller mean squared error than a biased estimator. D. has a sampling distribution with mean value θ. E. all the above 9

10 (4) A 95/95 tolerance interval is to be constructed for a population having pdf f( ). A. The tolerance interval is always wider than a 95% confidence interval for the population parameter. B. It is necessary to specify the family for f( ) in order to construct the tolerance interval. C. The distribution-free tolerance interval will generally be wider than the tolerance interval based on a specified family for f( ). D. The normal based tolerance interval will have approximately the correct probabilities provided the sample size is large enough for the central limit theorem to be valid. E. All of the above. (5) In an α = 0.05 test of H o : µ 12 versus H 1 : µ > 12, the probability of a Type II error is A. greater at µ = 13 than at µ = 14. B. smaller at µ = 13 than at µ = 14. C. the same at µ = 13 as at µ = 14. D. always less than (6) In testing H o : µ µ o versus H 1 : µ < µ o, where µ is the median of a population having a symmetric pdf, f(), A. the power of the t-test is greater than the power of the Wilcoxon signed-rank test. B. the power of the t-test is greater than the power of the sign test. C. the power of the t-test is less than the power of the Wilcoxon signed-rank test. D. the power of the t-test is less than the power of the sign test. E. None of the above (7) The observations X 1,, X n are positively correlated with correlation greater than 0.8. An α = 0.05, t-test = X µ o s/ n of H o : µ µ o versus H 1 : µ > µ o will have A. maximum probability of Type I error equal to B. maximum probability of Type I error less than C. maximum probability of Type I error greater than D. maximum probability of Type I cannot be computed. (8) A study is to be conducted to estimate the mean conductivity (in ohms) of a new alloy. What sample size is needed to ensure that the sample mean will estimate the average conductivity to within 5 ohms with a reliability of 99%. Conductivity has a normal distribution with a standard deviation of approximately 15. A. 538 B. 30 C. 60 D. 49 E. cannot be determined with the given information 10

11 (9) A 95/99 tolerance interval for a normal population is A. a 95% C.I. for µ and a 99% C.I. for σ. B. an estimate of the population mean in which we are 99% confident that the sample mean is within 95% of σ from the population mean. C. an estimate of a region of values which will contain between 95% and 99% of the population values. D. a region of values for which we are 99% confident that the region will contain at least 95% of the population values.. (10) In testing H o : σ 2.3 versus H 1 : σ < 2.3, the p-value of the test statistic was computed to be If the level of significance was α = 0.05 and the true value of σ = 1.8 then the decision based on the data A. was a Type I error. B. was a Type II error. C. was a correct decision. D. depends on the power of the test. 11

12 November 21, 2002 I. (36 points) Suppose X 1,, X n are n observations from a population. (A) Suppose the X i s are iid and the population distribution is normal with µ and σ unknown. Describe how to compute the power curve for testing the hypotheses: H o : µ µ o versus H a : µ > µ o. (B) Suppose the X i s are iid but the population distribution is symmetric with very heavy-tails. If n is relatively small, describe an alternative test statistic to the one used in part (A) for testing the hypotheses: H o : µ µ o versus H a : µ > µ o. (C) Suppose the X i s are iid and the population distribution is normal. Determine the smallest sample size necessary for an α = 0.05 test of the hypotheses: H o : µ µ o versus H a : µ > µ o to have power at least.8 whenever µ > µ o +.75 σ. (D) Suppose X 1,, X n s are iid and the population distribution is very skewed to the right. Describe an interval of values for which you are 95% confident that the interval contains 90% of the population values. (E) Describe a procedure to determine if the X i s are positively correlated? (F) Suppose the population distribution is normal. What is the effect of the positive correlation on the standard test of the hypotheses: H o : µ µ o versus H a : µ > µ o? Justify your answer. II. (24 points) A company produces a product whose time to failure T has an exponential distribution with average time to failure λ = 25 (in thousands of hours). They make changes to the product and want to determine if the changes have an effect on the distribution of the average times to failure. An accelerated life test is performed on a random sample of 15 units produced after the product changes yielding a sample mean T = 40 (thousands of hours). (A) Describe a graphical technique to evaluate whether the failure times from the accelerated life tests follow an exponential distribution. Make sure to label your axes. (B) Construct a 90% confidence interval for the average time to failure. You may assume that T 1,, T 15 are iid exponential with T = 40. Hint: If T 1,, T n are iid exponential with average value λ, then d.f. = 2n. 2n T λ has a Chi-squared distribution with (C) Use the confidence interval you constructed in part (B) to test the hypothesis that the average time to failure after the changes is greater than 25. What is the level of significance of your test? (D) Construct a 95% prediction interval for the time to failure, T, of a single unit produced after the changes were made to the product. You may assume that T 1,, T 15 are iid exponential with T = 40. Hint: T and T are independent and T 1,, T n are iid exponential with average value λ, thus Chi-squared distribution with d.f. = 2n. III. (4 points each) Place the letter of the best answer in the blank to the left of each question. 2n T λ has a (1) Suppose that X 1,, X n are highly positively correlated with a N(µ, σ 2 ) distribution. A 95% confidence interval for µ was constructed using the formula X ± (t α/2,n 1 )(s/ n). The true coverage probability of this confidence interval A. is B. is much less than C. is very close to D. is much greater than

13 E. may be greater or less than (2) The p-value of the observed value of a test statistic is A. the probability of making the correct decision. B. the smallest level of significance for which the observed data will reject the null hypothesis. C. the largest level of significance for which the observed data will reject the null hypothesis. D. the probability of making a Type I error. E. the probability of making a Type II error. (3) There are two sample estimators ˆθ 1 and ˆθ 2 of a population parameter θ. ˆθ 1 is unbiased and ˆθ 2 is biased. A. ˆθ1 is always preferred to ˆθ 2 B. ˆθ1 is preferred to ˆθ 2 because it is a more accurate estimator than ˆθ 2. C. ˆθ2 is preferred to ˆθ 1 if it has a smaller variance than ˆθ 1. D. ˆθ2 is preferred to ˆθ 1 if it has a smaller variance and smaller bias than ˆθ 1. E. ˆθ1 is preferred to ˆθ 2 if it has a smaller variance than ˆθ 2. (4) A 95/95 tolerance interval is constructed for a population having pdf f( ) and mean µ. A. The tolerance interval is wider than a 95% confidence interval for µ. B. It is necessary to specify the family for f( ) in order to construct the tolerance interval. C. If we know that the population distribution is normal but use a Distribution-free tolerance interval, then the tolerance interval will contain more than 95% of the population values. D. A normal based tolerance interval will have approximately the correct probabilities for any pdf provided the sample size is large enough for the central limit theorem to be valid. E. All of the above. (5) In an α = 0.05 test of H o : µ 12 versus H 1 : µ < 12, the probability of a Type II error is A. greater at µ = 13 than at µ = 14. B. smaller at µ = 10 than at µ = 11. C. the same at µ = 10 as at µ = 11. D. always less than E. cannot be determined because my dog ate my noncentral t-tables (6) In testing H o : µ µ o versus H 1 : µ < µ o, where µ is the finite expected value for a population having a symmetric pdf, f( ), A. the power of the t-test is greater than the power of the Wilcoxon signed-rank test. B. the power of the t-test is greater than the power of the sign test. C. the power of the t-test is less than the power of the Wilcoxon signed-rank test. D. the power of the t-test is less than the power of the sign test. E. None of the above (7) The Anderson-Darling statistic is preferred to the Chi-square statistic in testing H o : F = F o based on a random sample X 1,, X n from a continuous cdf F because A. the Anderson-Darling is easier to compute. B. the number of degrees of freedom must be approximated for the Chi-squared test. C. the probability of Type I error is greater for the Chi-squared test. 13

14 D. the Anderson-Darling test has greater power. E. all of the above (8) A study is to be conducted to estimate the mean conductivity (in ohms) of a new alloy. What sample size is needed to ensure that the sample mean will estimate the average conductivity to within 10 ohms with a reliability of 99%. Conductivity has a normal distribution with a standard deviation of approximately 30. A. 35 B. 49 C. 60 D. 538 E. cannot be determined since σ is unknown (9) The Kolmogorov-Smirnov, Cramer von Mises, and Anderson-Darling statistics are referred to as Distribution- Free tests for testing H o : F = F o when the population cdf F o is completely specified because A. the three test statistics have the same level of significance. B. any unknown parameters have very little effect on their null distribution. C. the null distribution of the three tests do not depend on the particular form of F o. D. the power function is identical for the three tests.. (10) In testing H o : µ 5 vs H a : µ > 5, the P-value of the test statistic was computed to be If the level of significance was α =.10, and the true value of µ was µ = 4. The decision based on the data A. was a Type I error. B. was a Type II error. C. was a Type III error. D. was correct. E. was either a Type I error or a correct decision. 14

15 November 18, 2003 I. (25 points) Suppose Y 1,, Y 60 are the distances (in thousands of miles) to failure of 60 transmissions produced by a supplier of truck transmissions. The data is given on the next page. (A) Determine a warranty value such that you are 90% confident that at least 75% of all transmissions produced from the production line will have miles to failure greater than the warranty value. (B) Predict with 90% confidence the number of miles to failure of a transmission randomly selected from the production facility. (C) Estimate with 90% confidence the proportion of transmissions from the production facility having miles to failure greater than 50,000 miles. (D) Suppose the Y i s are iid but the population distribution is multimodal due to the supplier having several production facilities with varying levels of quality. Determine a warranty value such that you are 90% confident that at least 75% of all transmissions produced from the production facilities will have miles to failure greater than the warranty value. The following data is the failure times of the 60 transmissions in thousands of miles: The following summary statistics were computed from the 60 failure times: Ȳ = S = Y (1) = ˆQ(.25) = ˆQ(.5) = ˆQ(.75) = Y(60) =

16 Normal Quantile Plot for Failure Times Failure Times Standard Normal Quantiles II. (75 points) Answers each of the following questions in 20 words or less. (1) Suppose that X 1,, X n is a sample from a N(µ, σ 2 ) distribution but the X i s are highly positively correlated. A 95% confidence interval for µ was constructed using the formula X ± (t α/2,n 1 ) S n. What is the effect of the correlation on the true coverage probability of this confidence interval? (2) There are two sample estimators ˆθ 1 and ˆθ 2 of a population parameter θ. ˆθ1 is unbiased and ˆθ 2 is biased. Which of the two estimators is preferred? Justify your answer. (3) A 95/95 tolerance interval is to be constructed for a population having pdf f( ). Is the following statement true: A normal based tolerance interval will have approximately the correct probabilities provided the sample size is large enough for the central limit theorem to be valid.? Justify your answer. (4) Why is the Anderson-Darling statistic preferred to the Chi-square statistic in testing whether a population cdf is F o based on a random sample X 1,, X n from a continuous cdf F? (5) A study is to be conducted to estimate the mean conductivity (in ohms) of a new alloy. What sample size is needed to ensure that the sample mean will estimate the average conductivity to within 10 ohms with a reliability of 99%. Conductivity has a normal distribution with a standard deviation of approximately

17 (6) Explain why the Kolmogorov-Smirnov statistic, is referred to as Distribution-Free method for evaluating whether a population cdf F is equal to F o when F o is completely specified. (7) The Agresti-Coull C.I. for a proportion is somewhat more complex than the standard asymptotic C.I. for a proportion. Why would you recommend using the Agresti-Coull C.I.? (8) What are the two major sources of error in using the bootstrap procedure to the estimate the percentiles of a pivot? (9) In order to construct a 95% C.I. for the mean µ of a population, a random sample X 1,..., X 31 was selected from a process having pdf f( ). Because the sample size is relatively large, the biologist used X S ± t.025 n as the C.I. for µ. You plotted the data and noticed that the data was highly right skewed. What problems may exist in using this C.I.? (10) Referring to question (9), the problem encountered with the interval is due to possible correlation between X and S. If the biologist can not provide you with any information about the population distribution other than the data, describe a method to determine the degree of correlation between X and S using the observed data from the study. (11) Suppose X 1,..., X 20 are iid with a highly right skewed distribution. The transformation Y = g(x) = X yields a Shapiro-Wilk p-value = A 95% C.I. for µ Y is [2.86, 3.97]. Why is [(2.86) 2, (3.97) 2 ] NOT an appropriate 95% C.I. for µ X? (12) For each of the following sentences, state whether the sentence is true or false. If false, explain why. (A). The sampling distribution of an estimator of a parameter θ will have approximately a normal distribution if the sample size is large enough. (B). The bootstrap procedure for constructing a C.I. for a parameter θ is always preferred to using a distribution-based procedure in constructing the C.I. because the distribution-based procedure depends on various conditions being valid. (C). If the population pdf f(y; θ) is symmetric about θ, then the sample mean is a better estimator of θ than is the sample median. (D). A 95% C.I. for a parameter θ is given by (2.75, 3.25). This means that there is a.95 probability that θ is between 2.75 and

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Hypothesis Testing COMP 245 STATISTICS. Dr N A Heard. 1 Hypothesis Testing 2 1.1 Introduction... 2 1.2 Error Rates and Power of a Test...

Hypothesis Testing COMP 245 STATISTICS. Dr N A Heard. 1 Hypothesis Testing 2 1.1 Introduction... 2 1.2 Error Rates and Power of a Test... Hypothesis Testing COMP 45 STATISTICS Dr N A Heard Contents 1 Hypothesis Testing 1.1 Introduction........................................ 1. Error Rates and Power of a Test.............................

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Sampling and Hypothesis Testing

Sampling and Hypothesis Testing Population and sample Sampling and Hypothesis Testing Allin Cottrell Population : an entire set of objects or units of observation of one sort or another. Sample : subset of a population. Parameter versus

More information

Hypothesis testing for µ:

Hypothesis testing for µ: University of California, Los Angeles Department of Statistics Statistics 13 Elements of a hypothesis test: Hypothesis testing Instructor: Nicolas Christou 1. Null hypothesis, H 0 (always =). 2. Alternative

More information

Dongfeng Li. Autumn 2010

Dongfeng Li. Autumn 2010 Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis

More information

Null Hypothesis H 0. The null hypothesis (denoted by H 0

Null Hypothesis H 0. The null hypothesis (denoted by H 0 Hypothesis test In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test (or test of significance) is a standard procedure for testing a claim about a property

More information

Hypothesis Testing or How to Decide to Decide Edpsy 580

Hypothesis Testing or How to Decide to Decide Edpsy 580 Hypothesis Testing or How to Decide to Decide Edpsy 580 Carolyn J. Anderson Department of Educational Psychology University of Illinois at Urbana-Champaign Hypothesis Testing or How to Decide to Decide

More information

Chi-Square Test. Contingency Tables. Contingency Tables. Chi-Square Test for Independence. Chi-Square Tests for Goodnessof-Fit

Chi-Square Test. Contingency Tables. Contingency Tables. Chi-Square Test for Independence. Chi-Square Tests for Goodnessof-Fit Chi-Square Tests 15 Chapter Chi-Square Test for Independence Chi-Square Tests for Goodness Uniform Goodness- Poisson Goodness- Goodness Test ECDF Tests (Optional) McGraw-Hill/Irwin Copyright 2009 by The

More information

Chapter Additional: Standard Deviation and Chi- Square

Chapter Additional: Standard Deviation and Chi- Square Chapter Additional: Standard Deviation and Chi- Square Chapter Outline: 6.4 Confidence Intervals for the Standard Deviation 7.5 Hypothesis testing for Standard Deviation Section 6.4 Objectives Interpret

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

How to Conduct a Hypothesis Test

How to Conduct a Hypothesis Test How to Conduct a Hypothesis Test The idea of hypothesis testing is relatively straightforward. In various studies we observe certain events. We must ask, is the event due to chance alone, or is there some

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

9-3.4 Likelihood ratio test. Neyman-Pearson lemma

9-3.4 Likelihood ratio test. Neyman-Pearson lemma 9-3.4 Likelihood ratio test Neyman-Pearson lemma 9-1 Hypothesis Testing 9-1.1 Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental

More information

Introduction to Hypothesis Testing. Point estimation and confidence intervals are useful statistical inference procedures.

Introduction to Hypothesis Testing. Point estimation and confidence intervals are useful statistical inference procedures. Introduction to Hypothesis Testing Point estimation and confidence intervals are useful statistical inference procedures. Another type of inference is used frequently used concerns tests of hypotheses.

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

MINITAB ASSISTANT WHITE PAPER

MINITAB ASSISTANT WHITE PAPER MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Null Hypothesis Significance Testing Signifcance Level, Power, t-tests. 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom

Null Hypothesis Significance Testing Signifcance Level, Power, t-tests. 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Null Hypothesis Significance Testing Signifcance Level, Power, t-tests 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Simple and composite hypotheses Simple hypothesis: the sampling distribution is

More information

Statistical Inference and t-tests

Statistical Inference and t-tests 1 Statistical Inference and t-tests Objectives Evaluate the difference between a sample mean and a target value using a one-sample t-test. Evaluate the difference between a sample mean and a target value

More information

7 Hypothesis testing - one sample tests

7 Hypothesis testing - one sample tests 7 Hypothesis testing - one sample tests 7.1 Introduction Definition 7.1 A hypothesis is a statement about a population parameter. Example A hypothesis might be that the mean age of students taking MAS113X

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

3. Nonparametric methods

3. Nonparametric methods 3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Hypothesis Testing Level I Quantitative Methods. IFT Notes for the CFA exam

Hypothesis Testing Level I Quantitative Methods. IFT Notes for the CFA exam Hypothesis Testing 2014 Level I Quantitative Methods IFT Notes for the CFA exam Contents 1. Introduction... 3 2. Hypothesis Testing... 3 3. Hypothesis Tests Concerning the Mean... 10 4. Hypothesis Tests

More information

Using pivots to construct confidence intervals. In Example 41 we used the fact that

Using pivots to construct confidence intervals. In Example 41 we used the fact that Using pivots to construct confidence intervals In Example 41 we used the fact that Q( X, µ) = X µ σ/ n N(0, 1) for all µ. We then said Q( X, µ) z α/2 with probability 1 α, and converted this into a statement

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

Standard Deviation Calculator

Standard Deviation Calculator CSS.com Chapter 35 Standard Deviation Calculator Introduction The is a tool to calculate the standard deviation from the data, the standard error, the range, percentiles, the COV, confidence limits, or

More information

Module 5 Hypotheses Tests: Comparing Two Groups

Module 5 Hypotheses Tests: Comparing Two Groups Module 5 Hypotheses Tests: Comparing Two Groups Objective: In medical research, we often compare the outcomes between two groups of patients, namely exposed and unexposed groups. At the completion of this

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

1 SAMPLE SIGN TEST. Non-Parametric Univariate Tests: 1 Sample Sign Test 1. A non-parametric equivalent of the 1 SAMPLE T-TEST.

1 SAMPLE SIGN TEST. Non-Parametric Univariate Tests: 1 Sample Sign Test 1. A non-parametric equivalent of the 1 SAMPLE T-TEST. Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties

More information

Hypothesis testing. Power of a test. Alternative is greater than Null. Probability

Hypothesis testing. Power of a test. Alternative is greater than Null. Probability Probability February 14, 2013 Debdeep Pati Hypothesis testing Power of a test 1. Assuming standard deviation is known. Calculate power based on one-sample z test. A new drug is proposed for people with

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete

More information

BIOSTATISTICS QUIZ ANSWERS

BIOSTATISTICS QUIZ ANSWERS BIOSTATISTICS QUIZ ANSWERS 1. When you read scientific literature, do you know whether the statistical tests that were used were appropriate and why they were used? a. Always b. Mostly c. Rarely d. Never

More information

Outline of Topics. Statistical Methods I. Types of Data. Descriptive Statistics

Outline of Topics. Statistical Methods I. Types of Data. Descriptive Statistics Statistical Methods I Tamekia L. Jones, Ph.D. (tjones@cog.ufl.edu) Research Assistant Professor Children s Oncology Group Statistics & Data Center Department of Biostatistics Colleges of Medicine and Public

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Confidence Intervals for Cp

Confidence Intervals for Cp Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

ACTM State Exam-Statistics

ACTM State Exam-Statistics ACTM State Exam-Statistics For the 25 multiple-choice questions, make your answer choice and record it on the answer sheet provided. Once you have completed that section of the test, proceed to the tie-breaker

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

UCLA STAT 13 Statistical Methods - Final Exam Review Solutions Chapter 7 Sampling Distributions of Estimates

UCLA STAT 13 Statistical Methods - Final Exam Review Solutions Chapter 7 Sampling Distributions of Estimates UCLA STAT 13 Statistical Methods - Final Exam Review Solutions Chapter 7 Sampling Distributions of Estimates 1. (a) (i) µ µ (ii) σ σ n is exactly Normally distributed. (c) (i) is approximately Normally

More information

Exact Confidence Intervals

Exact Confidence Intervals Math 541: Statistical Theory II Instructor: Songfeng Zheng Exact Confidence Intervals Confidence intervals provide an alternative to using an estimator ˆθ when we wish to estimate an unknown parameter

More information

Chapter 9, Part A Hypothesis Tests. Learning objectives

Chapter 9, Part A Hypothesis Tests. Learning objectives Chapter 9, Part A Hypothesis Tests Slide 1 Learning objectives 1. Understand how to develop Null and Alternative Hypotheses 2. Understand Type I and Type II Errors 3. Able to do hypothesis test about population

More information

Inferential Statistics

Inferential Statistics Inferential Statistics Sampling and the normal distribution Z-scores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are

More information

[Chapter 10. Hypothesis Testing]

[Chapter 10. Hypothesis Testing] [Chapter 10. Hypothesis Testing] 10.1 Introduction 10.2 Elements of a Statistical Test 10.3 Common Large-Sample Tests 10.4 Calculating Type II Error Probabilities and Finding the Sample Size for Z Tests

More information

Chapter 9: Hypothesis Testing Sections

Chapter 9: Hypothesis Testing Sections Chapter 9: Hypothesis Testing Sections 9.1 Problems of Testing Hypotheses Skip: 9.2 Testing Simple Hypotheses Skip: 9.3 Uniformly Most Powerful Tests Skip: 9.4 Two-Sided Alternatives 9.6 Comparing the

More information

2 Sample t-test (unequal sample sizes and unequal variances)

2 Sample t-test (unequal sample sizes and unequal variances) Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

fifty Fathoms Statistics Demonstrations for Deeper Understanding Tim Erickson

fifty Fathoms Statistics Demonstrations for Deeper Understanding Tim Erickson fifty Fathoms Statistics Demonstrations for Deeper Understanding Tim Erickson Contents What Are These Demos About? How to Use These Demos If This Is Your First Time Using Fathom Tutorial: An Extended Example

More information

One-Sample t-test. Example 1: Mortgage Process Time. Problem. Data set. Data collection. Tools

One-Sample t-test. Example 1: Mortgage Process Time. Problem. Data set. Data collection. Tools One-Sample t-test Example 1: Mortgage Process Time Problem A faster loan processing time produces higher productivity and greater customer satisfaction. A financial services institution wants to establish

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

More information

HypoTesting. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

HypoTesting. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question. Name: Class: Date: HypoTesting Multiple Choice Identify the choice that best completes the statement or answers the question. 1. A Type II error is committed if we make: a. a correct decision when the

More information

The Standard Normal distribution

The Standard Normal distribution The Standard Normal distribution 21.2 Introduction Mass-produced items should conform to a specification. Usually, a mean is aimed for but due to random errors in the production process we set a tolerance

More information

CHAPTER 11 SECTION 2: INTRODUCTION TO HYPOTHESIS TESTING

CHAPTER 11 SECTION 2: INTRODUCTION TO HYPOTHESIS TESTING CHAPTER 11 SECTION 2: INTRODUCTION TO HYPOTHESIS TESTING MULTIPLE CHOICE 56. In testing the hypotheses H 0 : µ = 50 vs. H 1 : µ 50, the following information is known: n = 64, = 53.5, and σ = 10. The standardized

More information

Hypothesis Testing - II

Hypothesis Testing - II -3σ -2σ +σ +2σ +3σ Hypothesis Testing - II Lecture 9 0909.400.01 / 0909.400.02 Dr. P. s Clinic Consultant Module in Probability & Statistics in Engineering Today in P&S -3σ -2σ +σ +2σ +3σ Review: Hypothesis

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Difference of Means and ANOVA Problems

Difference of Means and ANOVA Problems Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly

More information

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Hypothesis Testing: Two Means, Paired Data, Two Proportions Chapter 10 Hypothesis Testing: Two Means, Paired Data, Two Proportions 10.1 Hypothesis Testing: Two Population Means and Two Population Proportions 1 10.1.1 Student Learning Objectives By the end of this

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Notes for STA 437/1005 Methods for Multivariate Data

Notes for STA 437/1005 Methods for Multivariate Data Notes for STA 437/1005 Methods for Multivariate Data Radford M. Neal, 26 November 2010 Random Vectors Notation: Let X be a random vector with p elements, so that X = [X 1,..., X p ], where denotes transpose.

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

2. DATA AND EXERCISES (Geos2911 students please read page 8)

2. DATA AND EXERCISES (Geos2911 students please read page 8) 2. DATA AND EXERCISES (Geos2911 students please read page 8) 2.1 Data set The data set available to you is an Excel spreadsheet file called cyclones.xls. The file consists of 3 sheets. Only the third is

More information

The Goodness-of-Fit Test

The Goodness-of-Fit Test on the Lecture 49 Section 14.3 Hampden-Sydney College Tue, Apr 21, 2009 Outline 1 on the 2 3 on the 4 5 Hypotheses on the (Steps 1 and 2) (1) H 0 : H 1 : H 0 is false. (2) α = 0.05. p 1 = 0.24 p 2 = 0.20

More information

Hypothesis Testing Introduction

Hypothesis Testing Introduction Hypothesis Testing Introduction Hypothesis: A conjecture about the distribution of some random variables. A hypothesis can be simple or composite. A simple hypothesis completely specifies the distribution.

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

ANOVA MULTIPLE CHOICE QUESTIONS. In the following multiple-choice questions, select the best answer.

ANOVA MULTIPLE CHOICE QUESTIONS. In the following multiple-choice questions, select the best answer. ANOVA MULTIPLE CHOICE QUESTIONS In the following multiple-choice questions, select the best answer. 1. Analysis of variance is a statistical method of comparing the of several populations. a. standard

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Sample Size Determination

Sample Size Determination Sample Size Determination Population A: 10,000 Population B: 5,000 Sample 10% Sample 15% Sample size 1000 Sample size 750 The process of obtaining information from a subset (sample) of a larger group (population)

More information

Versions 1a Page 1 of 17

Versions 1a Page 1 of 17 Note to Students: This practice exam is intended to give you an idea of the type of questions the instructor asks and the approximate length of the exam. It does NOT indicate the exact questions or the

More information

9Tests of Hypotheses. for a Single Sample CHAPTER OUTLINE

9Tests of Hypotheses. for a Single Sample CHAPTER OUTLINE 9Tests of Hypotheses for a Single Sample CHAPTER OUTLINE 9-1 HYPOTHESIS TESTING 9-1.1 Statistical Hypotheses 9-1.2 Tests of Statistical Hypotheses 9-1.3 One-Sided and Two-Sided Hypotheses 9-1.4 General

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

Hypothesis Testing Introduction

Hypothesis Testing Introduction Hypothesis Testing Introduction Hypothesis: A conjecture about the distribution of some random variables. For example, a claim about the value of a parameter of the statistical model. A hypothesis can

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Chapter 7 Section 1 Homework Set A

Chapter 7 Section 1 Homework Set A Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the

More information

Power and Sample Size Determination

Power and Sample Size Determination Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 Power 1 / 31 Experimental Design To this point in the semester,

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Exam 3 - Math Solutions - Spring Name : ID :

Exam 3 - Math Solutions - Spring Name : ID : Exam 3 - Math 3200 Solutions - Spring 2013 Name : ID : General instructions: This exam has 14 questions, each worth the same amount. Check that no pages are missing and promptly notify one of the proctors

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Fall 2001 Professor Paul Glasserman B6014: Managerial Statistics 403 Uris Hall Properties of Point Estimates 1 We have already encountered two point estimators: th e

More information

CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13. Experimental Design and Analysis of Variance CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters. Sample Multiple Choice Questions for the material since Midterm 2. Sample questions from Midterms and 2 are also representative of questions that may appear on the final exam.. A randomly selected sample

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence).

More information