Chapter 7 Part 2. Hypothesis testing Power


November 6, 2008

All of the normal curves in this handout are sampling distributions.
Goal: To understand the process of hypothesis testing and the relationship of sample size and the form of the alternative hypothesis to power.

Skills:
Will know how and when to conduct a hypothesis test.
Will be able to describe the relationship between the hypothesis testing and confidence interval approaches.
Will know why power is important and how to maximize it.

Contents: Formalization of Hypothesis Testing (Normal distribution and σ known)
2 × 2 table
Two-tailed alternative hypothesis
Confidence interval approach to two-tailed alternative
Comparison of processes for hypothesis testing and CI approach
One-tailed alternative hypothesis
One-sided confidence interval approach to one-tailed alternative
Alternative hypothesis a single number rather than a range of numbers

Figures:
Figure 1: Two-tailed rejection region
Figure 2: p-value
Figure 3: One-tailed rejection region
Figure 4: Rejection region for single number as the alternative
Figure 5: Power for a single number as the alternative
Hypothesis Testing, Part 2

Review: Riverboat Gambler

Hypothesis testing setup:
H0 is the tested or null hypothesis
HA is the alternative hypothesis (sometimes referred to as H1)

                      TRUTH
DECISION      H0 is true                HA is true
Accept H0     Correct decision          Type II or β error
Reject H0     Type I or α error         Correct decision (power)

We also say "Fail to reject H0" instead of saying "Accept H0".
"Accept H0" is equivalent to saying "Reject HA".
"Reject H0" is equivalent to saying "Accept HA".
"H0 is true" is equivalent to "HA is false".
"H0 is false" is equivalent to "HA is true".

First we are going to review what we learned from the Riverboat Gambler. The hypothesis testing process we used for the Riverboat Gambler was:

H0: p = 0.5
HA: p = 0.2

Notice that we picked as the alternative hypothesis what we believed to be the truth (i.e. if we had believed the coin was fair, we never would have set up the testing of the coin). The null hypothesis is set up as a strawman to be rejected so that we can accept the alternative hypothesis, which is our real interest. This is because we have more control over the probability of incorrectly accepting the alternative hypothesis. Remember that the probability of incorrectly accepting the alternative hypothesis is equivalent to the probability of rejecting the null hypothesis when in fact it is true (the cell of the 2 × 2 table where H0 is true and H0 is rejected). The error associated with this is the Type I or α error. But we get to select what we will use as the value of α. So although we hope the alternative hypothesis will be true, we certainly don't want to declare it true when it isn't.

Type I and Type II errors

Let X be the random variable indicating the number of heads in 10 tosses. We selected a critical or rejection region; let us use {0, 1, 9, 10}. (Note that here we have selected the rejection region rather than the α level, but as we will see this is not how things are usually done.) Then

Pr(X = 0 or 1 or 9 or 10 | p = 0.5) = 22/1024 ≈ 0.0215     (Equation 1)
= level of significance, α level, or Type I error (see Table 1, Chapter 7 Part 1)

Notice that Equation 1 gives the probability of landing in the rejection region when the null hypothesis (p = 0.5) is true.

Pr(X = 0 or 1 or 9 or 10 | p = 0.2) = power = 1 − β ≈ 0.3758     (Equation 2)

Equation 2 gives the probability of landing in the rejection region when the null hypothesis is false (here, when the alternative hypothesis p = 0.2 is true).
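Equations 1 and 2 can be checked numerically. The following is a minimal sketch in Python (the handout itself uses Stata); the helper names `binom_pmf` and `prob_in_region` are illustrative and not part of the original handout.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k heads in n tosses when Pr(heads) = p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def prob_in_region(region, n, p):
    """Probability that the number of heads lands in the given region."""
    return sum(binom_pmf(k, n, p) for k in region)


REJECTION_REGION = [0, 1, 9, 10]

# Equation 1: probability of the rejection region when H0 (p = 0.5) is true.
alpha = prob_in_region(REJECTION_REGION, n=10, p=0.5)

# Equation 2: probability of the rejection region when HA (p = 0.2) is true.
power = prob_in_region(REJECTION_REGION, n=10, p=0.2)
beta = 1 - power

print(f"alpha = {alpha:.4f}")
print(f"power = {power:.4f}")
print(f"beta  = {beta:.4f}")
```

The same rejection region is used in both calls; only the assumed value of p changes, which is exactly the difference between the α level and the power.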
p-value

Definition: The p-value is the probability associated with the smallest rejection region that includes the value of the test statistic for the sample (the number of heads you actually got in 10 tosses), under the assumption that the null hypothesis is true (i.e. for the coin tossing problem, assuming the probability of getting heads on a given toss is 0.5).

Example: If x turns out to be 0 (i.e. we flipped the coin 10 times and the result of each of the flips was tails), then the smallest rejection region containing 0 is {0, 10} and the p-value is

Pr(X = 0 or 10 | p = 0.5) = 2/1024 ≈ 0.0020 = p-value

Notice that the p-value and the α level have the same pattern (i.e. the probability of being in a rejection region given the null hypothesis is true), just different rejection regions. If the original setup is two-tailed, then the rejection region associated with the p-value is also two-tailed (which explains why we included 10 when calculating the p-value).

Example: If the test statistic turns out to be 4, then the smallest rejection region containing 4 is {0, 1, 2, 3, 4, 6, 7, 8, 9, 10}. That is, the only number not in the rejection region is 5. So

p-value = Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 | p = 0.5) = 1 − Pr(X = 5 | p = 0.5) = 1 − 0.2461 = 0.7539

Relationship of the p-value and α level

Let us go back to the rejection region {0, 1, 9, 10}. For a discrete distribution, when the test statistic is in the rejection region [i.e. x = 0 is in the region {0, 1, 9, 10}], the p-value is less than or equal to the α level because, for test statistic x = 0,

p-value = Pr(X = 0 or 10 | p = 0.5) ≈ 0.0020

and

α level = Pr(X = 0 or 1 or 9 or 10 | p = 0.5) ≈ 0.0215
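The two p-value examples above can be reproduced by building the smallest two-tailed rejection region that contains the observed count. This is a minimal Python sketch; the function name `two_tailed_p_value` is illustrative, and the symmetric construction of the region is an assumption that matches this handout only because the null value p = 0.5 makes the distribution symmetric about 5.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of exactly k heads in n tosses when Pr(heads) = p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_tailed_p_value(x, n=10, p0=0.5):
    """Return (region, p-value) for the smallest symmetric two-tailed
    rejection region containing x, computed under the null value p0.

    Assumes a symmetric null distribution (p0 = 0.5), as in the coin example.
    """
    center = n * p0
    distance = abs(x - center)
    # Every count at least as far from the center as x is in the region.
    region = [k for k in range(n + 1) if abs(k - center) >= distance]
    return region, sum(binom_pmf(k, n, p0) for k in region)


region_0, p_value_0 = two_tailed_p_value(0)
region_4, p_value_4 = two_tailed_p_value(4)

print(region_0, round(p_value_0, 4))
print(region_4, round(p_value_4, 4))
```

Comparing each p-value with the α level of the region {0, 1, 9, 10} shows the pattern described above: x = 0 lies in the region and its p-value is below α, while x = 4 lies outside and its p-value is above α.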
Notice that {0, 10} ⊂ {0, 1, 9, 10} (⊂ means "is a subset of"). Notice that both the p-value and the α level are based on the null hypothesis (p = 0.5).

When the test statistic is not in the rejection region [i.e. x = 4 is not in the rejection region {0, 1, 9, 10}], then the p-value is greater than the α level. The p-value for the test statistic x = 4 is

p-value = Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 | p = 0.5) ≈ 0.7539

as compared to the

α level = Pr(X = 0 or 1 or 9 or 10 | p = 0.5) ≈ 0.0215

So we have that the α level and the p-value each depend on the null hypothesis and a rejection region, but not necessarily the same rejection region. The power and β error depend on the alternative hypothesis and the rejection region.

We will reject the null hypothesis in favor of the alternative hypothesis if the p-value < α. We will fail to reject the null hypothesis if the p-value > α. We also fail to reject the null hypothesis when the p-value equals α.

Example of hypothesis testing given a normal distribution with σ known

(I think that the probability of actually knowing σ for a given study is smaller than my probability of winning the lottery {and I don't buy lottery tickets}, but the assumption simplifies our example.)

Let us assume that we know that the population of all high school aged kids in Houston has a mean SBP of 125 mm Hg (Rosner labels this population mean μ0) and a known standard deviation of 50 mm Hg. Suppose we obtain a sample of 25 kids from the High School for the Performing Arts (HSPVA) and find that their mean systolic blood pressure is 142 mm Hg.

The question is: Does our sample of 25 kids seem to be from the population with mean SBP = 125 mm Hg, or does it seem more likely that they are from another population?

So our null hypothesis is:

H0: μ = 125 mm Hg

The μ in the null hypothesis above is the mean for the population from which the sample of 25 kids was drawn. The number 125 mm Hg is the mean of the population of all high school aged kids in Houston (i.e. μ0 = 125). So we are asking whether the means of the two populations are the same. If we assume the variances are the same, then we are asking whether the two populations are the same.
There are a number of possible forms for the alternative hypothesis:

1) HA: μ ≠ 125 (two-tailed alternative)
2) HA: μ > 125 (one-tailed, upper tail)
3) HA: μ < 125 (one-tailed, lower tail)
4) HA: μ = 162 (some specific value)

The form of the alternative hypothesis is picked prior to collecting the sample information.

Please note that in the hypothesis testing setup, μ is the mean of the population from which the sample (of 25) was drawn (we hope this population is the same as the one with mean = 125). So μ = 125 essentially asks if the sample was drawn from the same population as the population with mean = 125 (i.e. are the two populations the same). Another way to think of the question is: is the mean of our sample of 25 consistent with the null hypothesis or with the alternative hypothesis?

Let us assume the following:

(1) the two-tailed version (i.e. HA: μ ≠ 125) of the alternative hypothesis
(2) X is the random variable associated with the SBP of all high school kids in Houston (as opposed to the sample from HSPVA)
(3) X ~ N(125, 50²)
(4) the level of significance is 0.05 (i.e. α = 0.05). Note that selecting α and then determining the rejection region formed by that α level is the usual procedure. In the Riverboat Gambler problem we selected the rejection region and then calculated α for pedagogical reasons.

The sample mean (x̄ = 142 mm Hg) of our sample of 25 kids from HSPVA is our test statistic, just like x = 2 heads was the test statistic for the coin toss problem.

Under the null hypothesis (i.e. assuming that the sample of 25 HSPVA students is from the population of all high school students), the distribution of the sample means of all samples of size n = 25 (i.e. the sampling distribution) with σ = 50 (the SD of the population of all Houston kids) is the normal distribution with mean = 125 mm Hg and SD = 10 mm Hg (i.e. 50/√25 = 10). Or, because X ~ N(125, 50²) and n, our sample size, is equal to 25,

X̄ ~ N(125, 50²/25) = N(125, 10²)

In the Riverboat Gambler problem, we selected the rejection region and then found the α level that went with it. We did it in that fashion so it would be clear that the rejection region contained the values that we thought were unlikely to occur if indeed the coin was fair. Another way to decide on the rejection region is to first select the α level (usually something like 0.05 or 0.01) and then find the region (which would be in two pieces for a two-tailed test or one piece for a one-tailed test) whose area is equal to the α level. Selecting α first is the usual way of doing things.

For the current problem we'll choose the α level to be 0.05. Since we are dealing with a two-tailed test, we are looking for the values that cut off the upper and lower 0.025 areas of the curve. We know that on the N(0,1) curve −1.96 and 1.96 cut off the lower and upper 0.025 areas respectively. The question is what points on the N(125, 10²) curve [the distribution for X̄] are the equivalents of −1.96 and 1.96 (i.e. what points are 1.96 standard deviations below and above 125).

(x̄ − 125)/10 = z

If z = −1.96, then x̄ = 125 − (10)(1.96) = 105.4, and if z = 1.96, then x̄ = 125 + (10)(1.96) = 144.6.

This means that the rejection region consists of all x̄ such that x̄ < 105.4 or x̄ > 144.6 (notice that the endpoints are in the acceptance region), and the acceptance region is [105.4, 144.6], where the square brackets indicate that the interval includes the end points.

Another way to say this is:

Pr(X̄ is in the rejection region | μ = 125 and σ = 10) = Pr(X̄ < 105.4 or X̄ > 144.6 | μ = 125 and σ = 10) = 0.05

Note that 142 (the mean SBP of our sample of 25 kids) falls in the acceptance region (see Figure 1 below), so we would say that we accept the null hypothesis (or that we fail to reject the null hypothesis).
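The cutoffs 105.4 and 144.6 can be recomputed from the α level rather than read from a table. The sketch below uses Python's standard library `statistics.NormalDist`; the variable names are illustrative, and the exact quantile (about 1.95996) is used in place of the rounded 1.96, so the cutoffs agree with the handout to about two decimal places.

```python
from math import sqrt
from statistics import NormalDist

mu0 = 125.0    # mean under the null hypothesis (mm Hg)
sigma = 50.0   # known population SD (mm Hg)
n = 25         # sample size
alpha = 0.05   # two-tailed level of significance
x_bar = 142.0  # observed sample mean (mm Hg)

# SD of the sampling distribution of the mean.
se = sigma / sqrt(n)

# Standard normal value cutting off alpha/2 in the upper tail.
z_crit = NormalDist().inv_cdf(1 - alpha / 2)

# Translate -z_crit and +z_crit to the N(125, 10^2) scale.
lower_cutoff = mu0 - z_crit * se
upper_cutoff = mu0 + z_crit * se

# The endpoints belong to the acceptance region, so use strict inequalities.
reject_h0 = x_bar < lower_cutoff or x_bar > upper_cutoff

print(f"SE = {se:.1f}")
print(f"acceptance region = [{lower_cutoff:.1f}, {upper_cutoff:.1f}]")
print(f"reject H0: {reject_h0}")
```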
Yet another way of saying this is that we believe our sample of 25 kids could have come from the normal distribution with μ = 125 and σ = 50 (i.e. the distribution of all Houston kids). "Fail to reject the null hypothesis" is the usual way statisticians report the results, but you'll never see this in a journal article.

Notice that what we did was assume the sample was from the population of all high school kids and then look to see if that made sense, that is, to see if the sample really seemed to fit with the population of all high school students.

Figure 1: Two-tailed rejection region. Normal density with mean = 125 and SD = 10 (horizontal axis: sample means). The vertical lines are at x̄ = 105.4 and x̄ = 144.6. Each striped area is half of the rejection region; each area = 0.025.

The rejection region is the striped area (the sum of the two areas, actually) in Figure 1 above.

To find the p-value that goes with this decision, we need to find the area to the right of 142 under the N(125, 10²) curve and double it because we are dealing with a two-tailed alternative. Remember that to find the p-value you need to find the smallest rejection region that includes 142. This would be the rejection region where 142 is the cutoff (see Figure 2 below). To get the probability associated with the region under the N(125, 10²) curve and to the right of 142, we need to translate 142 to the N(0,1) curve by finding how many standard deviations 142 is from 125.
142 is 1.7 SDs from 125 since (142 − 125)/10 = 1.70.

Figure 2: p-value. Normal density with mean = 125 and SD = 10 (horizontal axis: sample means). The solid vertical lines are at x̄ = 105.4 and x̄ = 144.6 (for the α level) and the dashed line is at x̄ = 142 (for half the p-value). The striped area is half of the p-value. Each area beyond a solid line is half of the rejection region (i.e. 0.025).

According to the tables or Stata, the probability to the right of 1.7 is 0.04457. Since we are dealing with a two-tailed test, the p-value = 2 × 0.04457 ≈ 0.0891. You could get this using Stata:

. di 2*(1 - normal((142 - 125)/10))

So we fail to reject the null hypothesis because the p-value > α. Notice we are using the normal distribution with the hypothesized mean as opposed to the sample mean.
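The same two-tailed p-value can be obtained outside Stata. This is a minimal Python sketch mirroring the Stata expression above, using the standard library `statistics.NormalDist`; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n = 125.0, 50.0, 25
x_bar = 142.0
alpha = 0.05

# Number of standard errors between the sample mean and the null mean.
z = (x_bar - mu0) / (sigma / sqrt(n))

# Area to the right of z under N(0, 1): half of the two-tailed p-value.
upper_tail = 1 - NormalDist().cdf(z)

# Double it because the alternative hypothesis is two-tailed.
p_value = 2 * upper_tail

print(f"z = {z:.2f}")
print(f"upper tail area = {upper_tail:.4f}")
print(f"two-tailed p-value = {p_value:.4f}")
print(f"reject H0: {p_value < alpha}")
```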
A confidence interval approach to the same problem

Another way we could look at this problem (H0: μ = 125 mm Hg versus HA: μ ≠ 125 mm Hg, assuming X follows a normal distribution, σ known to be 50 and α = 0.05) would be through confidence intervals. The 95% CI about the sample mean 142 (i.e. you get the confidence interval about the sample value), given that n = 25 and σ for X is 50 (the original distribution, the population of all Houston kids), would be

(x̄ − z(1−α/2) σ/√n, x̄ + z(1−α/2) σ/√n)
= (142 − (1.96)(50)/√25, 142 + (1.96)(50)/√25)
= (142 − 19.6, 142 + 19.6)
= (122.4, 161.6)

Notice that 125 (the null hypothesis) is in this 95% confidence interval about 142, so 142 would not be considered different from 125 at the α = 0.05 level. This is the same conclusion we reached earlier.

Using the confidence interval, we know the set of values (population means) that 142 does not differ from, not just that 142 doesn't differ from 125. However, we can't calculate the p-value. So we gain something with the confidence interval approach and lose something.

Comparison of processes for hypothesis testing and confidence interval approach

For the confidence interval approach, the process is to find the confidence interval about the sample mean (142) and then to check whether the population mean (125) is in that confidence interval (i.e. you start with the sample).

Using the hypothesis testing approach, you start with the distribution of X̄ under the null hypothesis (i.e. you get an acceptance region about the population mean 125) and look to see if the sample mean (142) lies in this acceptance region.
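The two processes just compared can be placed side by side in code. The sketch below, in Python with illustrative variable names, builds the 95% confidence interval about the sample mean and the acceptance region about the null mean, then checks that they lead to the same decision. It uses the exact normal quantile rather than the rounded 1.96.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n = 125.0, 50.0, 25
x_bar = 142.0
alpha = 0.05

se = sigma / sqrt(n)
half_width = NormalDist().inv_cdf(1 - alpha / 2) * se

# Confidence interval approach: interval centered on the sample mean.
ci = (x_bar - half_width, x_bar + half_width)
null_mean_in_ci = ci[0] <= mu0 <= ci[1]

# Hypothesis testing approach: acceptance region centered on the null mean.
acceptance = (mu0 - half_width, mu0 + half_width)
sample_mean_accepted = acceptance[0] <= x_bar <= acceptance[1]

print(f"95% CI about 142: ({ci[0]:.1f}, {ci[1]:.1f})")
print(f"125 inside the CI: {null_mean_in_ci}")
print(f"142 inside the acceptance region: {sample_mean_accepted}")
```

Both intervals have the same half-width, so 125 lies within 19.6 of 142 exactly when 142 lies within 19.6 of 125, which is why the two approaches agree.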
Same problem but with the alternative hypothesis changed to HA: μ > 125 (i.e. one-tailed, upper tail)

Let us work through the problem again, this time using the #2 form of the alternative hypothesis (i.e. HA: μ > 125) but keeping α = 0.05 and σ for X at 50. Selecting this particular form of the alternative hypothesis means that we will reject the null hypothesis only when the sample mean is too big. Note that the alternative hypothesis is selected prior to the collection of the sample data.

Now, using the hypothesis testing approach, is our sample mean of 142 from a sample of 25 still consistent with the null hypothesis?

We will still use the distribution of sample means (i.e. the distribution of X̄) with mean = 125 and SD = 10, but now the critical region is all in the upper tail (i.e. we reject the null hypothesis and accept the alternative only when the sample mean is too big). So again we have the critical and acceptance regions in terms of the relationship to the population mean (125). This means that the entire 0.05 will be in the upper tail because we reject only if 142 is too big. We know that 1.645 cuts off the upper 5% of the N(0,1) curve. So if we solve the following, we will have the equivalent number for the N(125, 10²) curve:

(x̄ − 125)/10 = z

If z = 1.645, then x̄ = 125 + (10)(1.645) = 141.45.

Figure 3: One-tailed rejection region. Normal density with mean = 125 and SD = 10 (horizontal axis: sample means). The vertical line is at x̄ = 141.45. The striped area is the rejection region; the area = 0.05.

The vertical line in Figure 3 is at 141.45, which cuts off 5% in the upper tail. Notice that 141.45 is smaller than 144.6, which cut off 0.025 in the upper tail. It is also smaller than 142, which cuts off 0.0446 (i.e. half the p-value from the two-tailed test) in the upper tail.

Notice that 142 (the sample mean) is now in the rejection region (i.e. 142 > 141.45) whereas it was in the acceptance region for the two-tailed test.

To obtain the p-value we again solve (142 − 125)/10 = 1.70 and use Stata:

. di 1 - normal(1.7)

So the p-value = 0.0446 (note that we do not multiply by 2 because this is a one-tailed test). Since 142 is in the rejection region, we expected the p-value to be less than α, which is equal to 0.05.

For the two-tailed test the test statistic 142 fell in the acceptance region (so we accepted the null hypothesis, or failed to reject the null hypothesis) and the p-value was equal to 0.0891. But with the one-tailed test the test statistic falls in the rejection region (so we reject the null hypothesis) and the p-value is 0.0446 (note that 0.0446 is half of 0.0891).

Therefore, with the same sample size and the same null hypothesis we manage to go from acceptance of the null hypothesis (two-tailed test) to rejection of the null hypothesis (one-tailed test). Note that this is why people want to use a one-tailed test. But be aware that if you have picked the wrong tail (so that the rejection region is on the lower end of the distribution), you could end up with acceptance with a one-tailed test but rejection with a two-tailed test.

You also need to be aware that it is seldom appropriate to use a one-tailed test. You need to have prior information (i.e. an earlier study done in your lab or published in a journal) that indicates that if the sample mean is different from the population mean, it is because the sample mean is bigger than the population mean (version 2 of the alternative hypotheses) or smaller than the population mean (version 3 of the alternative hypotheses).
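The switch from the two-tailed to the upper-tailed test can be checked numerically. This is a minimal Python sketch with illustrative variable names; it recomputes the one-tailed cutoff and p-value using the standard library `statistics.NormalDist`.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n = 125.0, 50.0, 25
x_bar = 142.0
alpha = 0.05

se = sigma / sqrt(n)

# Upper-tailed test: the whole alpha sits in the upper tail.
z_crit = NormalDist().inv_cdf(1 - alpha)
cutoff = mu0 + z_crit * se
reject_h0 = x_bar > cutoff

# One-tailed p-value: area to the right of the sample mean, not doubled.
p_one_tailed = 1 - NormalDist().cdf((x_bar - mu0) / se)

# Two-tailed p-value for comparison.
p_two_tailed = 2 * p_one_tailed

print(f"cutoff = {cutoff:.2f}")
print(f"reject H0 (upper-tailed): {reject_h0}")
print(f"one-tailed p-value = {p_one_tailed:.4f}")
print(f"two-tailed p-value = {p_two_tailed:.4f}")
```

With the same data, the one-tailed p-value falls below 0.05 while the two-tailed p-value does not, which is the reversal described above.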
Also note in particular that the alternative hypothesis has to be selected prior to your seeing the data. Investigators like to gamble that they know which tail is appropriate because, as we will see later, you can use a smaller sample size. DON'T DO THIS!

Now consider the confidence interval approach, but this time it will be a one-sided confidence interval to go with the one-sided test. We start from a 90% two-sided confidence interval, because a 90% confidence interval leaves 10% outside the interval, with 5% on each side. If you consider only one side of that interval, you have a one-sided 95% confidence interval. The confidence interval is again about the sample mean. The 90% confidence interval about 142 is

(142 − (1.645)(50)/√25, 142 + (1.645)(50)/√25) = (125.55, 158.45)

The interval (125.55, ∞) is the correct one-sided 95% confidence interval.

HA: μ > 125 says we reject the null hypothesis in favor of the alternative hypothesis if the sample mean is too big. This can also be translated as: the population mean is too small. The confidence interval is about the sample mean 142. So for the population mean to be too small, it must be outside the lower bound of the confidence interval. Notice that the tail of this confidence interval is in the same direction as the cutoff for the one-tailed hypothesis test.

The population mean 125 is not in the confidence interval, so we reject the null hypothesis. Notice that for the hypothesis test we reject the null hypothesis when the sample mean is too big. For the confidence interval we reject the null hypothesis when the population mean is too small.

The conclusions with the two-sided hypothesis test and the two-sided confidence interval should be the same provided the α level is the same for both. The same holds for the one-sided hypothesis test and the one-sided confidence interval.

Revisiting the problem with HA as a single number rather than a range of numbers

Now suppose we use the #4 form of the alternative hypothesis (i.e. HA: μ = 162). Now we have two normal distributions for X̄, each with σ = 10, but one with mean = 125 and one with mean = 162. We should note that this sort of problem is not commonly considered.

H0: μ = 125 versus HA: μ = 162
We are going to do a one-tailed, upper-tailed test here (we reject the null hypothesis only when the sample mean is too big).

Why are we doing a one-tailed test here? We want to know if our sample mean of 142 is consistent with 125 or with 162. So we reject the null hypothesis when the sample mean looks more like 162 than 125. Numbers smaller than 125 are not going to cause us to reject the null hypothesis in favor of 162. Only if the numbers are too big would we be willing to think that 162 was more appropriate than 125.

With this one-tailed test we can actually show the area that goes with the power.

Earlier we found that if α = 0.05, the cutoff for this upper-tailed region is 141.45, or

Pr(X̄ > 141.45 | μ = 125) = 0.05

Figure 4: Rejection region. Normal distributions with SD = 10 and means = 125 (sampling distribution according to H0) and 162 (sampling distribution according to HA); horizontal axis: sample means. The line is at x̄ = 141.45. The striped area is the rejection region; area = 0.05.

Power is the probability of being in the rejection region for the null hypothesis when the alternative hypothesis is true (i.e. correctly rejecting the null hypothesis). So the power for this problem is the probability of being to the right of 141.45 (i.e. being in the rejection region for the null hypothesis) but under the alternative hypothesis curve (i.e. the one with mean = 162).
So Pr(X̄ > 141.45 | μ = 162) ≈ 0.9801 = power [see Figure 5 below].

Recall that 141.45 on the normal curve with mean 162 and SD = 10 is equivalent to (141.45 − 162)/10 = −2.055 (i.e. 141.45 is about 2.1 standard deviations below the mean of 162).

Using Stata, we get power = 1 − normal(−2.055).

normal(−2.055) gives the area under the curve with μ = 162 and to the left of the line at 141.45 (i.e. β = 1 − power). It is the area under the rest of the curve (i.e. to the right of 141.45) that is the power (i.e. 1 − β).

Figure 5: Power. Normal distributions with SD = 10 and means = 125 (sampling distribution according to H0) and 162 (sampling distribution according to HA); horizontal axis: sample means. The line is at x̄ = 141.45. The hatched area is the power ≈ 0.9801.
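The power calculation can be wrapped in a small function so that the effect of sample size is visible. The following Python sketch is illustrative only: the function name `power_upper_tailed` and the extra sample sizes 4, 9 and 16 are assumptions added here and do not appear in the handout. It uses the exact normal quantile, so the cutoff differs from 141.45 only beyond the second decimal place.

```python
from math import sqrt
from statistics import NormalDist


def power_upper_tailed(mu0, mu_a, sigma, n, alpha):
    """Power of an upper-tailed z test of H0: mu = mu0 against HA: mu = mu_a.

    The cutoff comes from the sampling distribution under H0; the power is
    the area to the right of that cutoff under the HA sampling distribution.
    """
    se = sigma / sqrt(n)
    cutoff = mu0 + NormalDist().inv_cdf(1 - alpha) * se
    return 1 - NormalDist(mu_a, se).cdf(cutoff)


# The handout's setting: n = 25, so the SD of the sample mean is 10.
power = power_upper_tailed(mu0=125, mu_a=162, sigma=50, n=25, alpha=0.05)
beta = 1 - power
print(f"power = {power:.4f}, beta = {beta:.4f}")

# Hypothetical smaller samples, to show that power grows with sample size.
powers_by_n = {
    n: power_upper_tailed(125, 162, 50, n, 0.05) for n in (4, 9, 16, 25)
}
for n, value in powers_by_n.items():
    print(f"n = {n:2d}: power = {value:.4f}")
```

A larger n shrinks the standard error, which pulls the two sampling distributions apart relative to their spread and moves more of the alternative curve past the cutoff.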
You need to be aware that:

1) When we were looking at the problems above as hypothesis testing problems, we used the population parameters (here population mean = 125 and population SE = 10) to obtain a rejection region and then asked whether the sample statistic (here x̄ = 142) was in the rejection region.

2) However, when considering the problems using the confidence interval approach, we obtained the confidence interval about the sample statistic (x̄ = 142) and then asked if the population parameter (i.e. the population mean 125) was in the confidence interval.

Why do people select a one-tailed versus a two-tailed test?

We saw above that it is possible to reject the null hypothesis using a one-tailed test but fail to reject the null hypothesis using a two-tailed test. So if you were trying to prove that your new drug is better than the standard of care, you might be tempted to use a one-tailed test.

What are the drawbacks to a one-tailed test?

Well, you might have guessed the wrong tail. Suppose that before we had obtained a sample (and this is the way you are supposed to play the game), you said that if your sample of kids differed from the population with respect to SBP, it would be because the SBP of your sample of kids would be too small. This means that in Figure 3 above the rejection region would be the area to the left of x̄ = 125 − 16.45 = 108.55. So x̄ = 142 would be in the acceptance region. The wrong guess has cost you a significant answer.
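The cost of guessing the wrong tail can be seen by running the lower-tailed version of the test on the same data. This is a minimal Python sketch with illustrative variable names; the lower-tailed p-value is computed here for illustration and is not quoted in the handout.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n = 125.0, 50.0, 25
x_bar = 142.0
alpha = 0.05

se = sigma / sqrt(n)
z_crit = NormalDist().inv_cdf(1 - alpha)

# Lower-tailed test (HA: mu < 125): reject only when the sample mean is too small.
lower_cutoff = mu0 - z_crit * se
reject_lower = x_bar < lower_cutoff

# Lower-tailed p-value: area to the LEFT of the observed sample mean.
p_lower = NormalDist().cdf((x_bar - mu0) / se)

# Upper-tailed test on the same data, for contrast.
upper_cutoff = mu0 + z_crit * se
reject_upper = x_bar > upper_cutoff

print(f"lower-tailed cutoff = {lower_cutoff:.2f}, reject H0: {reject_lower}")
print(f"lower-tailed p-value = {p_lower:.4f}")
print(f"upper-tailed cutoff = {upper_cutoff:.2f}, reject H0: {reject_upper}")
```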
More informationThe Philosophy of Hypothesis Testing, Questions and Answers 2006 Samuel L. Baker
HYPOTHESIS TESTING PHILOSOPHY 1 The Philosophy of Hypothesis Testing, Questions and Answers 2006 Samuel L. Baker Question: So I'm hypothesis testing. What's the hypothesis I'm testing? Answer: When you're
More informationChapter Six: Two Independent Samples Methods 1/35
Chapter Six: Two Independent Samples Methods 1/35 6.1 Introduction 2/35 Introduction It is not always practical to collect data in the paired samples configurations discussed previously. The majority of
More informationThe alternative hypothesis,, is the statement that the parameter value somehow differs from that claimed by the null hypothesis. : 0.5 :>0.5 :<0.
Section 8.28.5 Null and Alternative Hypotheses... The null hypothesis,, is a statement that the value of a population parameter is equal to some claimed value. :=0.5 The alternative hypothesis,, is the
More informationHYPOTHESIS TESTING (ONE SAMPLE)  CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE)  CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
More informationTwoSample TTests Assuming Equal Variance (Enter Means)
Chapter 4 TwoSample TTests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one or twosided twosample ttests when the variances of
More informationCHAPTERS 46: Hypothesis Tests Read sections 4.3, 4.5, 5.1.5, Confidence Interval vs. Hypothesis Test (4.3):
CHAPTERS 46: Hypothesis Tests Read sections 4.3, 4.5, 5.1.5, 6.1.3 Confidence Interval vs. Hypothesis Test (4.3): The purpose of a confidence interval is to estimate the value of a parameter. The purpose
More informationHypothesis testing. Power of a test. Alternative is greater than Null. Probability
Probability February 14, 2013 Debdeep Pati Hypothesis testing Power of a test 1. Assuming standard deviation is known. Calculate power based on onesample z test. A new drug is proposed for people with
More informationHypothesis Testing Summary
Hypothesis Testing Summary Hypothesis testing begins with the drawing of a sample and calculating its characteristics (aka, statistics ). A statistical test (a specific form of a hypothesis test) is an
More informationComparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples
Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The
More information6. Statistical Inference: Significance Tests
6. Statistical Inference: Significance Tests Goal: Use statistical methods to check hypotheses such as Women's participation rates in elections in France is higher than in Germany. (an effect) Ethnic divisions
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationHypothesis testing S2
Basic medical statistics for clinical and experimental research Hypothesis testing S2 Katarzyna Jóźwiak k.jozwiak@nki.nl 2nd November 2015 1/43 Introduction Point estimation: use a sample statistic to
More informationTwoSample TTests Allowing Unequal Variance (Enter Difference)
Chapter 45 TwoSample TTests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one or twosided twosample ttests when no assumption
More information9.1 Basic Principles of Hypothesis Testing
9. Basic Principles of Hypothesis Testing Basic Idea Through an Example: On the very first day of class I gave the example of tossing a coin times, and what you might conclude about the fairness of the
More informationWe have discussed methods of point and interval estimation for parameter of interest.
PHP 2510 Hypothesis testing: One sample We have discussed methods of point and interval estimation for parameter of interest. Researchers often have preconceived ideas about what the parameter might be
More informationHypothesis tests I. Onesample ttest, paired ttest
Hypothesis tests I. Onesample ttest, paired ttest 1 Motivating example Two lecturers argue about the mean age of the first year medical students. Lecturer#1 claims that the mean age of the first year
More informationBiostatistics Lab Notes
Biostatistics Lab Notes Page 1 Lab 1: Measurement and Sampling Biostatistics Lab Notes Because we used a chance mechanism to select our sample, each sample will differ. My data set (GerstmanB.sav), looks
More informationAMS 5 TESTS FOR TWO SAMPLES
AMS 5 TESTS FOR TWO SAMPLES Test for difference We will consider the problem of comparing the means of two populations. The main tool to do hypothesis testing in this case is the z test for the difference
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two Means
Lesson : Comparison of Population Means Part c: Comparison of Two Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationSTAB22 section This time z is negative, so to find the Pvalue,
STAB22 section 6.2 6.36 We are trying to find evidence that the new design is an improvement, so that the alternative hypothesis should reflect this. Let µ be the population mean score for all the students
More informationChapter 9 Hypothesis Testing. Developing Null and Alternative Hypotheses Type I and Type II Errors Population Mean: σ Known
Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Type I and Type II Errors Population Mean: σ Known Developing Null and Alternative Hypotheses Hypothesis testing can be used to determine
More informationPROBLEM SET 1. For the first three answer true or false and explain your answer. A picture is often helpful.
PROBLEM SET 1 For the first three answer true or false and explain your answer. A picture is often helpful. 1. Suppose the significance level of a hypothesis test is α=0.05. If the pvalue of the test
More informationWhat is a hypothesis? Testable Falsifiable. Chapter 22 & 23. Using Data to Make Decisions
Chapter 22 & 23 What Is a Test of Significance? Use and Abuse of Statistical Inference Chapter 22 1 What is a hypothesis? Testable Falsifiable 4 Using Data to Make Decisions Examining Confidence Intervals.
More information10 Hypothesis Testing
10 Hypothesis Testing 1 10 Hypothesis Testing 10.1 Introduction Def 1: A hypothesis is a statement about a population parameter θ. Def 2: The two complementary hypotheses in a hypothesis testing problem
More informationCourse Notes  Statistics
EPI546: Fundamentals of Epidemiology and Biostatistics Course Notes  Statistics MSc (Credit to Roger J. Lewis, MD, PhD) Outline: I. Classical Hypothesis (significance) testing A. Type I (alpha) error
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 OneWay ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationX 2 has mean E [ S 2 ]= 2. X i. n 1 S 2 2 / 2. 2 n 1 S 2
Week 11 notes Inferences concerning variances (Chapter 8), WEEK 11 page 1 inferences concerning proportions (Chapter 9) We recall that the sample variance S = 1 n 1 X i X has mean E [ S ]= i =1 n and is
More informationIntro. to Hypothesis Tests
Intro. to Hypothesis Tests Two of the most common types of statistical inference: 1. Confidence intervals Goal is to estimate (and communicate uncertainty in our estimate of) a population parameter. 2.
More informationNotes 5a: Onesample t test
Notes 5a: Onesample t test 1. Purpose Onesample ttest is designed to test whether one sample of data differs from a standard value or a population mean. The data must be quantitative (ratio, interval,
More informationHYPOTHESIS TESTING AND TYPE I AND TYPE II ERROR
HYPOTHESIS TESTING AND TYPE I AND TYPE II ERROR Hypothesis is a conjecture (an inferring) about one or more population parameters. Null Hypothesis (H 0 ) is a statement of no difference or no relationship
More informationConfidence Intervals for Cpk
Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process
More informationHypothesis Testing Level I Quantitative Methods. IFT Notes for the CFA exam
Hypothesis Testing 2014 Level I Quantitative Methods IFT Notes for the CFA exam Contents 1. Introduction... 3 2. Hypothesis Testing... 3 3. Hypothesis Tests Concerning the Mean... 10 4. Hypothesis Tests
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationHypothesis Testing or How to Decide to Decide Edpsy 580
Hypothesis Testing or How to Decide to Decide Edpsy 580 Carolyn J. Anderson Department of Educational Psychology University of Illinois at UrbanaChampaign Hypothesis Testing or How to Decide to Decide
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationSection 12.2, Lesson 3. What Can Go Wrong in Hypothesis Testing: The Two Types of Errors and Their Probabilities
Today: Section 2.2, Lesson 3: What can go wrong with hypothesis testing Section 2.4: Hypothesis tests for difference in two proportions ANNOUNCEMENTS: No discussion today. Check your grades on eee and
More informationProbability, Binomial Distributions and Hypothesis Testing Vartanian, SW 540
Probability, Binomial Distributions and Hypothesis Testing Vartanian, SW 540 1. Assume you are tossing a coin 11 times. The following distribution gives the likelihoods of getting a particular number of
More informationISTA 116 Hypothesis Testing: Binary Data
ISTA 116 Hypothesis Testing: Binary Data November 14, 2013 Types of Errors Example H 1 : Drug is better than a placebo H 0 : Drug no better than a placebo We reject H 0 if the data would be improbable
More informationHow to Conduct a Hypothesis Test
How to Conduct a Hypothesis Test The idea of hypothesis testing is relatively straightforward. In various studies we observe certain events. We must ask, is the event due to chance alone, or is there some
More informationOdds ratio, Odds ratio test for independence, chisquared statistic.
Odds ratio, Odds ratio test for independence, chisquared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
More information6.1 The Elements of a Test of Hypothesis
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 22, 2012 Date of latest update: August 20 Lecture 6: Tests of Hypothesis Suppose you wanted to determine
More informationStep 1 Hypotheses. Hypothesis Test Chapter 9. A Step by Step Guide. Null Hypothesis: Ho always contains equality of some type
Hypothesis Test Chapter 9 A Step by Step Guide If using a printed handout of these slides, the slides should be read left to right, all of top row first, then all of bottom row. If viewing as a slide show,
More informationTHE LOGIC OF HYPOTHESIS TESTING. The general process of hypothesis testing remains constant from one situation to another.
THE LOGIC OF HYPOTHESIS TESTING Hypothesis testing is a statistical procedure that allows researchers to use sample to draw inferences about the population of interest. It is the most commonly used inferential
More informationHypothesis Testing. Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University
Hypothesis Testing Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 AMU / BonTech, LLC, JourniTech Corporation Copyright 2015 Learning Objectives Upon successful
More informationComments on Discussion Sheet 21 and Worksheet 21 ( ) Hypothesis Tests for Population Variances and Ratios of Variances
Comments on Discussion Sheet and Worksheet ( 9. 9.3) Hypothesis Tests for Population Variances and Ratios of Variances Discussion Sheet Hypothesis Tests for Population Variances and Ratios of Variances
More informationpvalues and significance levels (false positive or false alarm rates)
pvalues and significance levels (false positive or false alarm rates) Let's say 123 people in the class toss a coin. Call it "Coin A." There are 65 heads. Then they toss another coin. Call it "Coin B."
More informationLecture 13 More on hypothesis testing
Lecture 13 More on hypothesis testing Thais Paiva STA 111  Summer 2013 Term II July 22, 2013 1 / 27 Thais Paiva STA 111  Summer 2013 Term II Lecture 13, 07/22/2013 Lecture Plan 1 Type I and type II error
More information3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
More informationTesting Hypotheses (and Null Hypotheses)
Testing Hypotheses Overview 5 Steps for testing hypotheses Research and null hypotheses One and twotailed tests Type 1 and Type 2 Errors Z tests and t tests Chapter 13 1 Testing Hypotheses (and ull Hypotheses)
More informationTests concerning one mean : Let us start by discussing EXAMPLE 1 : problem 5 c) from EXAM 2 (example of a hypothesis testing problem).
Week 0 notes Hypothesis testing WEEK 0 page Tests concerning one mean : Let us start by discussing EAMPLE : problem 5 c) from EAM (example of a hypothesis testing problem). 5. The bolt diameter is normally
More informationHypothesis Testing. Experimental Hypotheses. Revision of Important Concepts. New Definition: Sampling Distribution. Hypothesis Testing
Revision of Important Concepts Hypothesis Testing Week 7 Statistics Dr. Sancho Moro http://eustats.pbwiki.com Remember that we used z scores to compute probabilities related to the normal (or gausssian)
More informationChapter 7 Notes  Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:
Chapter 7 Notes  Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a
More informationT adult = 96 T child = 114.
Homework Solutions Do all tests at the 5% level and quote pvalues when possible. When answering each question uses sentences and include the relevant JMP output and plots (do not include the data in your
More informationHypothesis testing for µ:
University of California, Los Angeles Department of Statistics Statistics 13 Elements of a hypothesis test: Hypothesis testing Instructor: Nicolas Christou 1. Null hypothesis, H 0 (always =). 2. Alternative
More informationHypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easytoread notes
More informationStatistical Foundations:
Statistical Foundations: Confidence Intervals Psychology 790 Lecture #11 9/28/2006 Today sclass Hypothesis Testing. Finding pvalues for hypothesis tests (instead of critical values). Confidence intervals.
More informationHypothesis Testing: pvalue
STAT 101 Dr. Kari Lock Morgan Paul the Octopus Hypothesis Testing: SECTION 4.2 andomization distribution http://www.youtube.com/watch?v=3esgpumj9e Hypotheses In 2008, Paul the Octopus predicted 8 World
More informationI. Basics of Hypothesis Testing
Introduction to Hypothesis Testing This deals with an issue highly similar to what we did in the previous chapter. In that chapter we used sample information to make inferences about the range of possibilities
More informationConsequently, an important statistical technique, called the hypothesis test, is structured as an evaluation of two competing hypotheses:
The Hypothesis Test We have observed that population parameters tend to be unknown; even when we attempt to estimate them, we can only determine them inconclusively and with a certain limited degree of
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 April 28 May 3, 2011 Prof. Tesler z and t tests for
More informationIntroduction to Inference Estimating with Confidence
Introduction to Inference Estimating with Confidence IPS Chapter 6.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 6.1) Estimating with confidence Statistical confidence Confidence intervals Confidence
More informationClass 19: Two Way Tables, Conditional Distributions, ChiSquare (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, ChiSquare (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
More information