Suggested solution for exam in MSA830: Statistical Analysis and Experimental Design October 2009

Similar documents
Study Guide for the Final Exam

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

1.5 Oneway Analysis of Variance

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Permutation Tests for Comparing Two Populations

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

NCSS Statistical Software

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Comparing Means in Two Populations

Non-Parametric Tests (I)

Section 13, Part 1 ANOVA. Analysis Of Variance

The Wilcoxon Rank-Sum Test

THE KRUSKAL WALLLIS TEST

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Analysis of Variance ANOVA

12: Analysis of Variance. Introduction

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Fairfield Public Schools

Two-sample inference: Continuous data

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Rank-Based Non-Parametric Tests

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Statistics Review PSY379

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Simple Linear Regression Inference

Randomized Block Analysis of Variance

Permutation & Non-Parametric Tests

Projects Involving Statistics (& SPSS)

Analysis of Variance. MINITAB User s Guide 2 3-1

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

StatCrunch and Nonparametric Statistics

Parametric and non-parametric statistical methods for the life sciences - Session I

NCSS Statistical Software

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

N-Way Analysis of Variance

One-Way Analysis of Variance (ANOVA) Example Problem

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Final Exam Practice Problem Answers

Confidence Intervals for Cp

Statistical tests for SPSS

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

individualdifferences

13: Additional ANOVA Topics. Post hoc Comparisons

Elementary Statistics Sample Exam #3

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

ABSORBENCY OF PAPER TOWELS

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

CHAPTER 13. Experimental Design and Analysis of Variance

t-test Statistics Overview of Statistical Tests Assumptions

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Non-Inferiority Tests for Two Means using Differences

Inference for two Population Means

CHAPTER 14 NONPARAMETRIC TESTS

Other forms of ANOVA

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 5 Analysis of variance SPSS Analysis of variance

Quantitative Methods for Finance

WHERE DOES THE 10% CONDITION COME FROM?

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

An analysis method for a quantitative outcome and two categorical explanatory variables.

Research Methods & Experimental Design

Chapter 7. One-way ANOVA

Unit 26 Estimation with Confidence Intervals

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Non-Inferiority Tests for One Mean

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

CS 147: Computer Systems Performance Analysis

Name: Date: Use the following to answer questions 3-4:

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

II. DISTRIBUTIONS distribution normal distribution. standard scores

statistics Chi-square tests and nonparametric Summary sheet from last time: Hypothesis testing Summary sheet from last time: Confidence intervals

2 Sample t-test (unequal sample sizes and unequal variances)

Data Analysis Tools. Tools for Summarizing Data

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Confidence Intervals for the Difference Between Two Means

3.4 Statistical inference for 2 populations based on two samples

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

UNDERSTANDING THE TWO-WAY ANOVA

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Regression step-by-step using Microsoft Excel

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Transcription:

Petter Mostad Matematisk Statistik Chalmers Suggested solution for exam in MSA830: Statistical Analysis and Experimental Design October 2009 1. (a) To use a t-test, one must assume that both groups of numbers are independent random samples from normal distributions. She may or may not assume equal variances. (b) For process A we get sample mean 6 and sample variance 10/3, and for process B we get sample mean 9 and sample variance /2. One can either choose to assume equal population variances, or not to make this assumption. Assuming equal population variances, the pooled variance estimate becomes and the test statistic becomes 10 s 2 p = 3 3+ 2 4 3+4 = 20 7 = 2.87, 9 6 = 2.646. 20 ( 1 + 1 ) 7 4 Comparing this with t 7,0.02 = 2.36, we conclude that we can reject the null hypothesis at a % significance level. The 9% confidence interval for the differences in population means is 9 3±t 7,0.02 20 7 (1 4 + 1 )=3±2.68. Alternatively, not assuming equal population variances, the test statistic becomes and the degrees of freedom d f= ( 10/3 4 9 6 = 2.98 10 /4+ / 3 2 ( 10/3 ) + /2 2 4 ) 2/3+ ( /2 ) 2/4 = 6.047 6 Comparing 2.98 with t 6,0.02 = 2.447 we again get that we can reject the null hypothesis at a % significance level. The 9% confidence intervals in population means becomes 10/3 9 3±t 6,0.02 + /2 4 = 3±2.83. (c) The results in b) are based on the assumption that the given values are random samples, so that they are assumed independent. Consecutive runs of this process may not be independent. Also, the order in which all the runs are made has not been randomized, meaning that nuisance factors that change over time could influence the result. Finally, with so few observations, it is difficult to verify the assumption of normal distributions.

(d) A possibility is to use an external reference distribution : From the data for process A, Alice could compute averages of consecutive runs with various starting points in the data series. The averages for all such possible starting points constitute a sample with which the average of the five yields for process B could be compared. If the last average is extreme compared to the population of averages for process A, one could use this to reject the null hypothesis that data from process A and process B are interchangeable. More precisely, if the average for process B was higher than the median of the sample values for process A, the p-value would be twice the proportion of the sample values larger than the average for process B, and corresponding in the opposite case. 2. (a) Estimate for the main effect of factor B: 439.2+219.7 331.3+192.9 = 29.8. 6 2 (b) The pooled variance estimate is s 2 p = 429+1740+38+873 = 2690.7, 4 and as each of the 4 sample variances in this average has degrees of freedom, the pooled variance estimate has 20 degrees of freedom. The standard error becomes ( 1 s 2 p 12 + 1 ) = 2690.7/6=21.17, 12 and thus the confidence interval becomes 29.8±t 20,0.02 21.17= 29.8±44.2. For this confidence interval to be valid, one must assume that the random errors in the model are normally distributed with the same variance for each combination of settings of the factors. (c) We have AB AC BC = C = B = A meaning that, for each line, the given interaction effect is confounded with the given main effect. (d) The question is a bit unclear: If Anna should do 24 experiments instead of the ones she has done, she should use a full factorial design with three replications, i.e., doing 3 replications with the following design: A B C - - - - - + - + - - + + + - - + - + + + - + + +

If she should do 24 experiments in addition to the ones she has performed, she should follow the same experimental plan again, but with the signs reversed. 3. The important thing is that each friend should do exactly one test with each soil type and with each seed type. The resulting experimental design is called a Latin Square. For example, if the friends are called A,B,C,D, and E, then a possible experimental plan would be Seed 1 Seed 2 Seed 3 Seed 4 Seed Soil 1 A B C D E Soil 2 B C D E A Soil 3 C D E A B Soil 4 D E A B C Soil E A B C D The reason why this is a good experimental plan is that any effect of the different ways different people take care of the plants will be evened out on both the effect of the seed types and the effect of the soil type. The technique is called blocking. 4. (a) The usual test statistic in this case is s 2 B s 2 A = 18.8.9 = 3.186 and it should be compared with an F distribution with 11 and 9 degrees of freedom. From the provided tables, we can see that F 0.0,11,9 < 3.186<F 0.02,11,9. As we should make a 2-sided test, this means that the p-value is between 0.1 and 0.0. (b) The test is based on the values being random samples from normal distributions. To investigate whether for example the values from population A come from a normal distribution, Albert can make a normal probability plot: If the points fall approximately on a line, it indicates that the values may come from a normal distribution. (c) If it seems unreasonable to assume that both data sets come from normal popoulations, Albert could use for example a permutation test: The difference D between the means of the two sets of values could be compared with a sample of differences between the mean of randomly selected 10 values among the 22 values and the mean of the remaining 12 values. If D was quite large or quite small compared to the sample of such differences, one could reject the null hypothesis that the values come from the same distribution. Alternatively, Albert could use a non-parametric test, such as the Mann-Whitney test (also called the Wilcoxon Rank-sum test).. (a) The sum of squares for the colors can be computed as SS color = 4 ( (31 34) 2 + (36 34) 2 + (33 34) 2 + (30 34) 2 + (40 34) 2) = 264 The sum of squares for the fonts can be computed as SS f ont = ( (32 34) 2 + (37 34) 2 + (33 34) 2 + (34 34) 2) = 70 The total sum of squares can most easily be computed by comparing its definition to the definition of the sample variance: We then see that SS total = ( 4 1)s 2 = 19 29.47386=60

We can then fill out the ANOVA table as follows: Source var. Sum of squ. D.f. Mean square F value p-value Color 264 4 66 3.0 0.02 < p < 0.0 Font 70 3 23.33 1.24 p < 0.2 Error 226 12 18.83 Sum 60 19 (b) The assumptions are that the effects of the fonts and the colors are additive, and that the random variation at each combination of font and color is normally distributed, with the same variance at each combination. (Note: As the data in this case consists of counts, the data can never be exactly normally distributed. However, the approximation to a normal distribution may be close enough for the results of the analysis to be useful.) 6. (a) With the probability of success p and thus the probability of failure 1 p, the probability of observing the exact sequence shown is p (1 p) (1 p) p p (1 p) p (1 p) p p= p 6 (1 p) 4. (b) As Alex will stop his experiments when he has reached 6 successful experiments, his last experiment will necessarily be a success. If he has performed 4 unsuccessful experiments before this, and successful ones, it would not matter among these 9 experiments in which order the successes and failures came. Thus the probability for these 9 first experiments is given by the Binomial formula, while the probability for the last experiment is p. In summary, the probability we are asked compute is ( ) 9 p 9! (1 p) 4 p=!4! p (1 p) 4 p=126p 6 (1 p) 4. Because of the unclear language of the question, the answer ( ) 9 p (1 p) 4 was also accepted. (c) Generalizing the thinking above, we get that the probability that Alex will have to endure k failed experiments before he has finished r successful experiments is ( ) r+k 1 p r (1 p) k = (r+k 1)! r 1 (r 1)!k! pr (1 p) k. 7. (a) False. It is the distribution of the mean of a sample that becomes more and more normally distributed (for samples from distributions where the variance exists). (b) True. As the uniform distribution has a variance, the central limit theorem applies to it. (c) True. As long as the distributions from which the samples are drawn have a variance, the central limit theorem will apply to the means that appear in the t-statistic, making the whole t-statistic approximately normally distributed for large samples. Note also that as the number of degrees of freedom increases, the t distribution approaches the standard normal distribution.

8. The formula for the width of his confidence interval is 2t α/2,n 1 s 1/n where n is the number of observations, s is the sample standard deviation, and 1 α is the confidence level. Looking at the table for the t-distribution, we can see that, approximately, t α/2,49 t α/2,199 for relevantα. We also have that s is approximately equal to the standard deviation of the population of possible measurements, whether he is using 0 or 200 measurments. Thus the main difference in the computation of the length of the confidence interval will be that in the case of 0 measurements it will contain the factor 1/0, while in the case of 200 measurements, it will contain the factor 1/200=( 1/0)/2. Thus the length will be approximately half the previous length, i.e., 230/2=11 kronor.