Practice Test III Unit 6 Estimation Unit 7 Hypothesis Testing SOLUTIONS

Similar documents
Week 4: Standard Error and Confidence Intervals

Regression Analysis: A Complete Example

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Simple Regression Theory II 2010 Samuel L. Baker

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Simple Linear Regression Inference

Lecture Notes Module 1

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Hypothesis testing - Steps

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Chapter 7 Section 7.1: Inference for the Mean of a Population

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

The correlation coefficient

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Association Between Variables

HYPOTHESIS TESTING: POWER OF THE TEST

3.4 Statistical inference for 2 populations based on two samples

Chapter 2 Probability Topics SPSS T tests

Mind on Statistics. Chapter 12

1.5 Oneway Analysis of Variance

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Topic 8. Chi Square Tests

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Confidence Intervals for One Standard Deviation Using Standard Deviation

Chapter 7: Simple linear regression Learning Objectives

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Variables Control Charts

4. Continuous Random Variables, the Pareto and Normal Distributions

Statistical estimation using confidence intervals

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Population Mean (Known Variance)

Chapter 5 Analysis of variance SPSS Analysis of variance

Study Guide for the Final Exam

MEASURES OF VARIATION

Math 108 Exam 3 Solutions Spring 00

Review. March 21, S7.1 2_3 Estimating a Population Proportion. Chapter 7 Estimates and Sample Sizes. Test 2 (Chapters 4, 5, & 6) Results

Confidence Intervals for Cp

Descriptive Statistics

One-Way Analysis of Variance (ANOVA) Example Problem

Parametric and non-parametric statistical methods for the life sciences - Session I

Two-sample inference: Continuous data

2 Sample t-test (unequal sample sizes and unequal variances)

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Testing Research and Statistical Hypotheses

Factors affecting online sales

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Confidence Intervals for the Difference Between Two Means

II. DISTRIBUTIONS distribution normal distribution. standard scores

Unit 26 Estimation with Confidence Intervals

5.1 Identifying the Target Parameter

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Point Biserial Correlation Tests

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

StatCrunch and Nonparametric Statistics

Difference of Means and ANOVA Problems

Sample Size and Power in Clinical Trials

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Final Exam Practice Problem Answers

Example G Cost of construction of nuclear power plants

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

Chapter 23. Two Categorical Variables: The Chi-Square Test

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Means, standard deviations and. and standard errors

p ˆ (sample mean and sample

Using R for Linear Regression

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

CHAPTER 13. Experimental Design and Analysis of Variance

Test Positive True Positive False Positive. Test Negative False Negative True Negative. Figure 5-1: 2 x 2 Contingency Table

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

12: Analysis of Variance. Introduction

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, cm

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

11. Analysis of Case-control Studies Logistic Regression

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Point and Interval Estimates

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Chapter 23. Inferences for Regression

Name: Date: Use the following to answer questions 3-4:

Hypothesis Testing. Hypothesis Testing

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

August 2012 EXAMINATIONS Solution Part I

Transcription:

PubHlth 540 Introductory Biostatistics Page 1 of 8 1. Practice Test III Unit 6 Estimation Unit 7 Hypothesis Testing SOLUTIONS a. True or False. A hypothesis test for which the type I error occurs with probability α has probability of type II error equal to (1 - α). Answer: FALSE A type I error can only occur when the null hypothesis is true and a type II error can only occur when the alternative hypothesis is true. b. True or False. If a one sided test indicates that the null hypothesis can be rejected at the 5% level, then a two sided test performed on the same set of data is necessarily significant at the 5% level. Answer: FALSE The significance level for a two sided test is the sum of two areas under the curve whereas it is the sum of one area under the curve when the test is one sided. Thus, for a given test statistic value, the two sided p-value will always be larger than the one sided p- value. Accordingly, if a one sided p-value is.05 or close to it, then the two sided p-value cannot necessarily be less than or equal to.05.

PubHlth 540 Introductory Biostatistics Page of 8 c. True or False. For a given sample variance s and sample mean X, a 90% confidence interval for an unknown mean μ is narrower than a 99% confidence interval. Answer: TRUE All other things constant, the confidence coefficient multiplier for a 90% CI is smaller than the confidence coefficient multiplier for a 99% CI. As a result the width of the CI, being [ upper limit lower limit ] will be narrower. For example, 90% and 99% confidence intervals based on the Normal distribution have multipliers 1.645 and.576, respectively. d. True or False. An investigator is performing a t-test for which the assumptions are satisfied could, in the absence of a student s t-distribution tables, use a Normal(0,1) probability table provided the degrees of freedom is sufficiently large. Answer: TRUE Consider Type I Error ( sided) Test Statistic Critical Value of Test Statistic.10 t df=30 1.697 t df=50 1.676 t df=100 1.66 Normal (0,1) 1.645.01 t df=30.75 t df=50.678 t df=100.66 Normal (0,1).576

PubHlth 540 Introductory Biostatistics Page 3 of 8 e. Choose ONE. The meaning of a p-value is i. the power of the test ii. the probability of getting a result as extreme or more extreme than the one observed if the null hypothesis is false. iii. the probability that the null hypothesis is true iv. the probability of making a type II error v. the probability of getting a result as extreme or more extreme than the one observed if the null hypothesis is true. Answer: v. By definition, the pvalue is the probability of obtaining a result as extreme or more extreme as the one observed under the assumption of the null hypothesis.. In September 1991, the Vermont Yankee Rowe nuclear power plant was closed by the Nuclear Regulatory Commission. New England Electric System now buys electricity from Canada to meet the energy needs of residents of Vermont. It is known that the per customer yearly energy demand follows a normal distribution with mean μ =,383 kilowatt hours and variance σ = 1,050,65 kilowatt hours squared. Quebec Power and Light have hired you as a consultant. Their first request is to compute an interval such that 95% of the per customer yearly consumption of energy are contained in this interval. Find the lower and upper limits of the required interval.

PubHlth 540 Introductory Biostatistics Page 4 of 8 Answer: (374, 439) For Z ~ Normal(0,1) the required 95% coverage interval is -1.96 < Z < +1.96 Use the standardization formula backwards to obtain the corresponding coverage interval for a random variable X ~ Normal ( μ = 383 and σ = 1,050,65 ) Thus, for X ~ Normal ( μ = 383 and σ = 1,050,65 ), the required coverage interval is (-1.96) [σ] + μ X (+1.96) [σ] + μ = (-1.96) [105] + 383 X (+1.96) [105] + 383 = 374, 439 3. A report from a clinical trial concludes that difference between treatments is not statistically significant, p < 0.05. What caveat/s must accompany this conclusion? Why is it potentially wrong to interpret this as meaning that there is no clinically important difference between treatments? Answer: The caveats that must accompany this conclusion pertain to the possibility that the alternative is true and that the statistical procedure performed missed its detection. There are several ways in which this might have occurred, including but not limited to the following: (1) the alternative is true and an event of low probability occurred and resulted in an incorrect retention of the null hypothesis () the alternative is true and the sample size was too small so that the power of the statistical test performed was low (3) the alternative is true but the choice of indicator was a very imprecise one so that the power of the statistical test was low

PubHlth 540 Introductory Biostatistics Page 5 of 8 4. Consider testing the null hypothesis that the mean of a normal probability distribution is μ = 98.6. The population standard deviation is known and is equal to σ = 1.5. You draw a sample of size 5 from this distribution and calculate a sample mean. If it is of interest to perform a two sided test of the null hypothesis with a pre-specified type I error equal to 0.0, what values must the sample mean lie between in order for the null hypothesis to be not rejected? Answer: Between 97.9 and 99.9 (1) For Z ~ Normal(0,1) and desired two sided type I error =.0, obtain the critical region for the Z test. To do this, obtain from a Normal (0,1) calculator that Pr [ -.36 < Z < +.36 ] =.98 () Identify the null hypothesis distribution of X n=5 σ 1.5 X n=5 ~ Normal ( μ NULL = 98.6 and σ = = ) under null. X n 5 (3) Solve for critical region values of X n=5 by using the standardization formula backwards. Here, it takes the form (-.36) [σ ] + μ X (+.36) [σ ] + μ X NULL n=5 X NULL = (-.36) [1.5/ 5] + 98.6 X n=5 (+.36) [1.5/ 5] + 98.6 = 97.9, 99.9

PubHlth 540 Introductory Biostatistics Page 6 of 8 5. A randomized trial of diet continuation versus diet termination was performed to evaluate whether diet termination is associated with a reduction in normal development among children with PKU. The outcome measured was Bitnet IQ, a measure of mental retardation. The following summary statistics were obtained from complete data on children: Group Diet Continuation Diet Termination Sample size, n 1 10 Sample mean IQ, X 103.6 99.8 Carry out the appropriate statistical hypothesis test to evaluate the study data. In developing your answer you may assume that the observations are distributed normal in both groups. You may also assume that the population standard deviations of a Bitnet IQ are σ = 13 in the diet continuation population and σ = 17 in the diet termination population. Answer: - H null : μ C = μ T H alternative : μ T < μ C, one sided (X -X ) - [0] σt σ - Z statistic = ; where SE (XT-X C)= + SE (X -X ) n n T C C T C T C (99.8-103.6) σ σ 17 13 - Z = = 0.5793; where 6.56= + = + 6.56 nt nc 10 1 - pvalue = 0.8 T C ~ Normal(0,1) under null - Do not reject. Conclude these data are insufficient evidence of an effect of diet continuation on Bitnet IQ in children with PKU

PubHlth 540 Introductory Biostatistics Page 7 of 8 6. It has been reported anecdotally that the onsets of menses for women living together (for example, a college dormitory) are more synchronized that those of women living separately. An investigator for the Journal of Irreproducible Results decides to investigate this hypothesis. During the month of December 008, the dates of mensis onset were recorded for two samples of women. The data for the first sample consisted of mensis onset histories for 10 women who had lived together continuously in a single dormitory at Smith College in Northampton, MA during 008. The sample variance was calculated to be s = 30.3. The data for the second sample included mensis onset histories for 5 women of comparable ages attending the University of Massachusetts/Amherst, all of whom lived in separate dwellings. The sample variance for this sample was s = 69.7 Using the appropriate statistical hypothesis test, do these data suggest that mensis onset dates are more synchronized for women living together relative to women living separately? Answer: - H null : σ = σ H alternative : σ > σ, one sided - F statistic = separately together Sseparately Stogether 69.7 - F = =.3 30.3 - pvalue = 0.0976 ~ F 4,9 under null separately together - Do not reject. Conclude these data are insufficient evidence of a synchrony of menses among women who live together, relative to women who do not live together.

PubHlth 540 Introductory Biostatistics Page 8 of 8 7. Consider the setting of a single sample of n=16 data values that are a random sample from a normal distribution. Suppose it is of interest to perform a type I error α = 0.01 statistical hypothesis test of H O : μ > 100 versus H A : μ < 100, α = 0.01. Suppose further that σ is unknown. State the appropriate test statistic ( points) and determine the critical region for values of the sample mean X (8 points). Answer: Appropriate test statistic is student s t with df=15 Critical region is X n=16 (-0.65)[S] + 100 (1) For a one sided test with type I error = 0.01, critical region for t-statistic is t-statistic < t df=15;.01 or t-statistic < -.60 X n=16 - μ NULL X n=16-100 () Substitution of t-statistic = = SE(X ˆ yields critical region n=16) S/ 16 X n=16-100.60 X n=16.60( S/ 16 ) + 100 S/ 16 n=16 ( ) X 0.65 S + 100