Chapter 7 Part 2. Hypothesis testing Power


November 6, 2008

All of the normal curves in this handout are sampling distributions.
Goal: To understand the process of hypothesis testing and the relationship of sample size and the form of the alternative hypothesis to power.

Skills:
Will know how and when to conduct a hypothesis test.
Will be able to describe the relationship between the hypothesis testing and confidence interval approaches.
Will know why power is important and how to maximize it.

Contents:
Formalization of hypothesis testing (normal distribution and σ known)
2 × 2 table
Two-tailed alternative hypothesis
Confidence interval approach to two-tailed alternative
Comparison of processes for hypothesis testing and CI approach
One-tailed alternative hypothesis
One-sided confidence interval approach to one-tailed alternative
Alternative hypothesis a single number rather than a range of numbers

Figures:
Figure 1: Two-tailed rejection region
Figure 2: p-value
Figure 3: One-tailed rejection region
Figure 4: Rejection region for single number as the alternative
Figure 5: Power for a single number as the alternative
Hypothesis Testing, Part 2

Review: Riverboat Gambler

Hypothesis testing setup:
H0 is the tested or null hypothesis.
HA is the alternative hypothesis (sometimes referred to as H1).

| DECISION | TRUTH: H0 is true | TRUTH: HA is true |
| Accept H0 | Correct decision | Type II or β error |
| Reject H0 | Type I or α error | Correct decision (power) |

We also say "Fail to reject H0" instead of saying "Accept H0".
"Accept H0" is equivalent to saying "Reject HA".
"Reject H0" is equivalent to saying "Accept HA".
"H0 is true" is equivalent to "HA is false".
"H0 is false" is equivalent to "HA is true".

First we are going to review what we learned from the Riverboat Gambler.

Hypothesis testing process we used for the Riverboat Gambler:
H0: p = 0.5
HA: p = 0.2

Notice that we picked as the alternative hypothesis what we believed to be the truth (i.e. if we had believed the coin was fair, we never would have set up the testing of the coin). The null hypothesis is set up as a straw man to be rejected so that we can accept the alternative hypothesis, which is our real interest. This is because we have more
control over the probability of incorrectly accepting the alternative hypothesis. Remember that the probability of incorrectly accepting the alternative hypothesis is equivalent to the probability of rejecting the null hypothesis when in fact it is true (see the Type I cell of the 2 × 2 table). The error associated with this is the Type I or α error. But we get to select what we will use as the value of α. So although we hope the alternative hypothesis will be true, we certainly don't want to declare it true when it isn't.

Type I and Type II errors

Let X be the random variable indicating the number of heads in 10 tosses. We selected a critical or rejection region; let us use {0, 1, 9, 10}. (Note that here we have selected the rejection region rather than the α level, but as we will see this is not how things are usually done.) Then

Equation 1: $\Pr(X = 0 \text{ or } 1 \text{ or } 9 \text{ or } 10 \mid p = 0.5) = 22/1024 \approx 0.0215$ = level of significance, α level or Type I error (see Table 1, Chapter 7 Part 1)

Notice that Equation 1 gives the probability of landing in the rejection region when the null hypothesis (p = 0.5) is true.

Equation 2: $\Pr(X = 0 \text{ or } 1 \text{ or } 9 \text{ or } 10 \mid p = 0.2) \approx 0.3758$ = power = 1 − β

Equation 2 gives the probability of landing in the rejection region when the null hypothesis is false (i.e. the alternative hypothesis is true).
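Equations 1 and 2 can be checked numerically. The following is a minimal sketch, written in Python rather than the Stata used elsewhere in this handout; the function names `binom_pmf` and `prob_in_region` are illustrative and not part of the original notes.

```python
from math import comb

def binom_pmf(k, n, p):
    """Probability of exactly k heads in n independent tosses with Pr(heads) = p."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def prob_in_region(region, n, p):
    """Probability that the number of heads lands in the given rejection region."""
    return sum(binom_pmf(k, n, p) for k in region)

rejection_region = [0, 1, 9, 10]

# Equation 1: probability of rejecting when the null hypothesis (p = 0.5) is true
alpha = prob_in_region(rejection_region, 10, 0.5)

# Equation 2: probability of rejecting when the alternative (p = 0.2) is true
power = prob_in_region(rejection_region, 10, 0.2)
beta = 1 - power

print(f"alpha = {alpha:.4f}, power = {power:.4f}, beta = {beta:.4f}")
```

The same rejection region is used in both calculations; only the assumed value of p changes, which is what separates the α level from the power.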
p-value

Definition: The p-value is the probability associated with the smallest rejection region that includes the value of the test statistic (the number of heads you actually got in 10 tosses) for the sample, under the assumption that the null hypothesis is true (i.e. for the coin tossing problem, assuming the probability of getting heads on a given toss is 0.5).

Example: If x turns out to be 0 (i.e. we flipped the coin 10 times and the result of each flip was tails), then the smallest rejection region containing 0 is {0, 10} and the p-value is

p-value = $\Pr(X = 0 \text{ or } 10 \mid p = 0.5) = 2/1024 \approx 0.0020$

Notice that the p-value and the α level have the same pattern (i.e. the probability of being in a rejection region given the null hypothesis is true), just different rejection regions. If the original setup is two-tailed, then the rejection region associated with the p-value is also two-tailed (which explains why we included 10 when calculating the p-value).

Example: If the test statistic turns out to be 4, then the smallest rejection region containing 4 is {0, 1, 2, 3, 4, 6, 7, 8, 9, 10}. That is, the only number not in the rejection region is 5. So

p-value = $\Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 \mid p = 0.5) = 1 - \Pr(X = 5 \mid p = 0.5) = 1 - 0.2461 = 0.7539$

Relationship of the p-value and α level

Let us go back to the rejection region {0, 1, 9, 10}. For a categorical distribution, when the test statistic is in the rejection region [i.e. x = 0 is in the region {0, 1, 9, 10}], the p-value is less than or equal to the α level because, for test statistic x = 0,

p-value = $\Pr(X = 0 \text{ or } 10 \mid p = 0.5) \approx 0.0020$ and
α level = $\Pr(X = 0 \text{ or } 1 \text{ or } 9 \text{ or } 10 \mid p = 0.5) \approx 0.0215$
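The "smallest rejection region" definition of the p-value can be sketched in code. This is a minimal Python sketch under the assumption that the null hypothesis is the symmetric p = 0.5 case used in the examples, so the smallest two-tailed region containing x is every count at least as far from 5 as x is; the function name `two_tailed_p_value` is hypothetical.

```python
from math import comb

def two_tailed_p_value(x, n=10):
    """Two-tailed p-value for x heads in n tosses under the null p = 0.5.

    Assumes the symmetric null only: the smallest two-tailed rejection
    region containing x is every count k at least as far from n/2 as x.
    """
    center = n / 2
    distance = abs(x - center)
    region = [k for k in range(n + 1) if abs(k - center) >= distance]
    # Under p = 0.5 every sequence of n tosses has probability 0.5**n
    return sum(comb(n, k) for k in region) * 0.5**n

p_for_0 = two_tailed_p_value(0)   # region {0, 10}
p_for_4 = two_tailed_p_value(4)   # every count except 5

print(f"p-value for x = 0: {p_for_0:.4f}")
print(f"p-value for x = 4: {p_for_4:.4f}")
```

For x = 0 the region is {0, 10}, a subset of {0, 1, 9, 10}, so the p-value cannot exceed the α level of that larger region.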
Notice that $\{0, 10\} \subset \{0, 1, 9, 10\}$, where $\subset$ means "is a subset of". Notice also that both the p-value and the α level are based on the null (p = 0.5) hypothesis.

When the test statistic is not in the rejection region [i.e. x = 4 is not in the rejection region {0, 1, 9, 10}], then the p-value is greater than the α level. The p-value for the test statistic x = 4 is

p-value = $\Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 \mid p = 0.5) = 0.7539$

as compared to

α level = $\Pr(X = 0 \text{ or } 1 \text{ or } 9 \text{ or } 10 \mid p = 0.5) \approx 0.0215$

So we have that the α level and the p-value each depend on the null hypothesis and a rejection region, but not necessarily the same rejection region. The power and β error depend on the alternative hypothesis and the rejection region.

We will reject the null hypothesis in favor of the alternative hypothesis if the p-value < α. We will fail to reject the null hypothesis if the p-value > α. We fail to reject the null hypothesis when the p-value equals α.

Example of hypothesis testing given a normal distribution with σ known

(I think that the probability of actually knowing σ for a given study is smaller than my probability of winning the lottery {and I don't buy lottery tickets}, but the assumption simplifies our example.)

Let us assume that we know that the population of all high school aged kids in Houston has a mean SBP of 125 mm Hg (Rosner labels this population mean μ0) and a known standard deviation of 50 mm Hg. Suppose we obtain a sample of 25 kids from the High School for the Performing Arts (HSPVA) and find that their mean systolic blood pressure is 142 mm Hg. The question is: Does our sample of 25 kids seem to be from the population with mean SBP = 125 mm Hg, or does it seem more likely that they are from another population?

So our null hypothesis is:
H0: μ = 125 mm Hg

The μ in the null hypothesis above is the mean for the population from which the sample of 25 kids was drawn. The number 125 mm Hg is the mean of the population of all high school aged kids in Houston (i.e. μ0 = 125).
So we are asking whether the means of the two populations are the same. If we assume the variances are the same, then we are asking whether the two populations are the same.
There are a number of possible forms for the alternative hypothesis:
1) HA: μ ≠ 125 (two-tailed alternative)
2) HA: μ > 125 (one-tailed, upper tail)
3) HA: μ < 125 (one-tailed, lower tail)
4) HA: μ = 162 (some specific value)

The form of the alternative hypothesis is picked prior to collecting the sample information. Please note that in the hypothesis testing setup, μ is the mean of the population from which the sample (of 25) was drawn (we hope this population is the same as the one with mean = 125). So μ = 125 essentially asks if the sample was drawn from the same population as the population with mean = 125 (i.e. are the two populations the same). Or another way to think of the question is: is the mean of our sample of 25 consistent with the null hypothesis or with the alternative hypothesis?

Let us assume the following:
(1) the two-tailed version (i.e. HA: μ ≠ 125) of the alternative hypothesis
(2) X is the random variable associated with the SBP of all high school kids in Houston (as opposed to the sample from HSPVA)
(3) $X \sim N(125, 50^2)$
(4) the level of significance is 0.05 (i.e. α = 0.05).

Note that selecting α and then determining the rejection region formed by that α level is the usual procedure. In the Riverboat Gambler problem we selected the rejection region and then calculated α for pedagogical reasons.

The sample mean ($\bar{x}$ = 142 mm Hg) of our sample of 25 kids from HSPVA is our test statistic, just like x = 2 heads was the test statistic for the coin toss problem. Under the null hypothesis (i.e. assume that the sample of 25 HSPVA students is from the population of all high school students), the distribution of the sample means of all samples of size n = 25 (i.e. the sampling distribution) with σ = 50 (SD of the population of all Houston kids) is the normal distribution with mean = 125 mm Hg and
SD = 10 mm Hg (i.e. $50/\sqrt{25} = 10$). Or, because $X \sim N(125, 50^2)$ and n, our sample size, is equal to 25, $\bar{X} \sim N(125, 50^2/25) = N(125, 10^2)$.

In the Riverboat Gambler problem, we selected the rejection region and then found the α level that went with it. We did it in that fashion so it would be clear that the rejection region contained the values that we thought were unlikely to occur if indeed the coin was fair. Another way to decide on the rejection region is to first select the α level (usually something like 0.05 or 0.01) and then find the region (which would be in two pieces for a two-tailed test or one piece for a one-tailed test) whose area is equal to the α level. Selecting α first is the usual way of doing things.

For the current problem we'll choose the α level to be 0.05. Since we are dealing with a two-tailed test, we are looking for the values that cut off the upper and lower 0.025 areas of the curve. We know that on the N(0,1) curve −1.96 and 1.96 cut off the lower and upper 0.025 areas respectively. The question is what points on the $N(125, 10^2)$ curve [the distribution for $\bar{X}$] are the equivalents of −1.96 and 1.96 (i.e. what points are 1.96 standard deviations below and above 125).

$(\bar{x} - 125)/10 = z$

If $z = -1.96$, then $\bar{x} = 125 - (10)(1.96) = 105.4$, and if $z = 1.96$, then $\bar{x} = 125 + (10)(1.96) = 144.6$.

This means that the rejection region consists of all $\bar{x}$ such that $\bar{x} < 105.4$ or $\bar{x} > 144.6$ (notice that the endpoints are in the acceptance region), and the acceptance region is [105.4, 144.6], where the square brackets indicate that the interval includes the end points. Another way to say this is:

$\Pr(\bar{X} \text{ is in the rejection region} \mid \mu = 125 \text{ and } \sigma = 10) = \Pr(\bar{X} < 105.4 \text{ or } \bar{X} > 144.6 \mid \mu = 125 \text{ and } \sigma = 10) = 0.05$

Note that 142 (the mean SBP of our sample of 25 kids) falls in the acceptance region (see Figure 1 below), so we would say that we accept the null hypothesis (or that we fail to reject the null hypothesis).
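The two-tailed rejection region can be reproduced with a few lines of code. This is a minimal sketch in Python (an assumption on my part; the handout itself uses Stata), using the standard library's `statistics.NormalDist`; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu0 = 125      # population mean under the null hypothesis (mm Hg)
sigma = 50     # known population SD (mm Hg)
n = 25         # sample size
alpha = 0.05   # level of significance
x_bar = 142    # observed sample mean (mm Hg)

# SD of the sampling distribution of the mean
se = sigma / sqrt(n)

# z value cutting off alpha/2 in each tail of N(0, 1); close to 1.96
z = NormalDist().inv_cdf(1 - alpha / 2)

lower = mu0 - z * se
upper = mu0 + z * se

# Endpoints belong to the acceptance region, so the comparisons are strict
reject = x_bar < lower or x_bar > upper

print(f"SE = {se}, acceptance region = [{lower:.1f}, {upper:.1f}], reject H0: {reject}")
```

Because the code uses the exact quantile rather than the rounded 1.96, the endpoints differ from 105.4 and 144.6 only beyond the first decimal place.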
Yet another way of saying this is that we believe our sample of 25 kids could have come from the normal distribution with μ = 125 and σ = 50 (i.e. the distribution of all Houston kids). "Fail to reject the null hypothesis" is the usual way statisticians report the results, but you'll never see this in a journal article. Notice that what we did was assume the sample was from the population of all high school kids and then look to see if that made sense, or to see if the sample really seemed to fit with the population of all high school students.

Figure 1: Two-tailed rejection region. Normal density with mean = 125 and SD = 10 (horizontal axis: sample means). The vertical lines are at $\bar{x}$ = 105.4 and $\bar{x}$ = 144.6. Each striped area is half of the rejection region; each area = 0.025.

The rejection region is the striped area (the sum of the two areas, actually) in Figure 1 above.

To find the p-value that goes with this decision, we need to find the area to the right of 142 under the N(125, 100) curve and double it, because we are dealing with a two-tailed alternative. Remember that to find the p-value you need to find the smallest rejection region that includes 142. This would be the rejection region where 142 is the cutoff (see Figure 2 below). To get the probability associated with the region under the N(125, 100) curve and to the right of 142, we need to translate 142 to the N(0,1) curve by finding how many standard deviations 142 is from 125.
142 is 1.7 SDs from 125, since $(142 - 125)/10 = 1.70$.

Figure 2: p-value. Normal density with mean = 125 and SD = 10 (horizontal axis: sample means). The solid vertical lines are at $\bar{x}$ = 105.4 and $\bar{x}$ = 144.6 (for the α level) and the dashed line is at $\bar{x}$ = 142 (for half the p-value). The striped area is half of the p-value. Each area beyond a solid line is half of the rejection region (i.e. 0.025).

According to the tables or Stata, the probability to the right of 1.7 is 0.0446. Since we are dealing with a two-tailed test, the p-value = 2(0.0446) ≈ 0.089. You could get this using Stata:

. di 2*(1 - normal((142 - 125)/10))

So we fail to reject the null hypothesis because p-value > α. Notice we are using the normal distribution with the hypothesized mean as opposed to the sample mean.
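For readers without Stata, the same two-tailed p-value can be sketched in Python. This is only an illustrative equivalent of the `di` command above, with `NormalDist().cdf` playing the role of Stata's `normal()`; the variable names are assumptions.

```python
from statistics import NormalDist

mu0 = 125     # hypothesized mean (mm Hg)
se = 10       # SD of the sampling distribution, 50 / sqrt(25)
x_bar = 142   # observed sample mean (mm Hg)
alpha = 0.05

# Number of standard errors between the sample mean and the hypothesized mean
z = (x_bar - mu0) / se

# Area to the right of z under N(0, 1), doubled for the two-tailed alternative
upper_tail = 1 - NormalDist().cdf(z)
p_value = 2 * upper_tail

print(f"z = {z:.2f}, upper tail = {upper_tail:.4f}, two-tailed p-value = {p_value:.4f}")
print("reject H0" if p_value < alpha else "fail to reject H0")
```

Note that the calculation standardizes against the hypothesized mean of 125, not the sample mean, exactly as the text points out.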
A confidence interval approach to the same problem

Another way we could look at this problem (H0: μ = 125 mm Hg versus HA: μ ≠ 125 mm Hg, assuming X follows a normal distribution, σ known to be 50 and α = 0.05) would be through confidence intervals. The 95% CI for 142 (i.e. you get the confidence interval about the sample value), given that n = 25 and σ for X is 50 (the original distribution, the population of all Houston kids), would be

$(\bar{x} - z_{1-\alpha/2}\,\sigma/\sqrt{n},\ \bar{x} + z_{1-\alpha/2}\,\sigma/\sqrt{n}) = (142 - (1.96)(50)/\sqrt{25},\ 142 + (1.96)(50)/\sqrt{25}) = (142 - 19.6,\ 142 + 19.6) = (122.4, 161.6)$

Notice that 125 (the null hypothesis) is in this 95% confidence interval for 142, so 142 would not be considered different from 125 at the α = 0.05 level. This is the same conclusion we reached earlier. Using the confidence interval, we know the set of values (population means) that 142 does not differ from, not just that 142 doesn't differ from 125. However, we can't calculate the p-value. So we gain something with the confidence interval approach and lose something.

Comparison of processes for hypothesis testing and confidence interval approach

For the confidence interval approach, the process is to find the confidence interval for the sample mean (142) and then to check whether the population mean (125) is in that confidence interval (i.e. you start with the sample). Using the hypothesis testing approach, you start with the distribution of $\bar{X}$ under the null hypothesis (i.e. you get an acceptance region about the population mean 125) and look to see if the sample mean (142) lies in this acceptance region.
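The two processes compared above can be placed side by side in code. The sketch below is in Python (an assumption; the handout uses Stata) and the variable names are illustrative. It builds the 95% confidence interval about the sample mean and the acceptance region about the hypothesized mean, then checks that they lead to the same decision.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n, alpha = 125, 50, 25, 0.05
x_bar = 142

# Half-width shared by both intervals: z * sigma / sqrt(n), close to 19.6
half_width = NormalDist().inv_cdf(1 - alpha / 2) * sigma / sqrt(n)

# Confidence interval approach: interval about the sample mean
ci = (x_bar - half_width, x_bar + half_width)
mu0_in_ci = ci[0] <= mu0 <= ci[1]

# Hypothesis testing approach: acceptance region about the hypothesized mean
acceptance = (mu0 - half_width, mu0 + half_width)
x_bar_in_acceptance = acceptance[0] <= x_bar <= acceptance[1]

print(f"95% CI about 142: ({ci[0]:.1f}, {ci[1]:.1f}), contains 125: {mu0_in_ci}")
print(f"Acceptance region about 125: ({acceptance[0]:.1f}, {acceptance[1]:.1f}), "
      f"contains 142: {x_bar_in_acceptance}")
```

Both checks reduce to asking whether 142 and 125 are within one half-width of each other, which is why the two approaches agree when the α level is the same.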
Same problem but with the alternative hypothesis changed to HA: μ > 125 (i.e. one-tailed, upper tail)

Let us work through the problem again, this time using the #2 form of the alternative hypothesis (i.e. HA: μ > 125) but keeping α = 0.05 and σ for X at 50. Selecting this particular form of the alternative hypothesis means that we will reject the null hypothesis only when the sample mean is too big. Note that the alternative hypothesis is selected prior to the collection of the sample data.

Now, using the hypothesis testing approach, is our sample mean of 142 from a sample of 25 still consistent with the null hypothesis? We will still use the distribution of sample means (i.e. the distribution of $\bar{X}$) with mean = 125 and SD = 10, but now the critical region is all in the upper tail (i.e. we reject the null hypothesis and accept the alternative only when the sample mean is too big). So again we have the critical and acceptance regions in terms of the relationship to the population mean (125). This means that the entire 0.05 will be in the upper tail, because we reject only if 142 is too big. We know that 1.645 cuts off the upper 5% of the N(0,1) curve. So if we solve the following, we will have the equivalent number for the $N(125, 10^2)$ curve.

Figure 3: One-tailed rejection region. Normal density with mean = 125 and SD = 10 (horizontal axis: sample means). The vertical line is at $\bar{x}$ = 141.45. The striped area is the rejection region; the area = 0.05.
$(\bar{x} - 125)/10 = z$

If $z = 1.645$, then $\bar{x} = 125 + (10)(1.645) = 141.45$.

The vertical line on the graph in Figure 3 is at 141.45, which cuts off 5% of the upper tail. Notice that 141.45 is smaller than 144.6, which cut off 0.025 in the upper tail. It is also smaller than 142, which cuts off 0.0446 (i.e. half the p-value from the two-tailed test) in the upper tail. Notice that 142 (the sample mean) is now in the rejection region (i.e. 142 > 141.45), whereas it was in the acceptance region for the two-tailed test.

To obtain the p-value we again solve

$(142 - 125)/10 = 1.70$

. di 1 - normal(1.7)

So the p-value = 0.0446 (note that we do not multiply by 2 because this is a one-tailed test). Since 142 is in the rejection region, we expected the p-value to be less than α, which is equal to 0.05.

For the two-tailed test, the test statistic 142 fell in the acceptance region (so we accepted the null hypothesis or failed to reject the null hypothesis) and the p-value was equal to 0.089. But with the one-tailed test, the test statistic falls in the rejection region (so we reject the null hypothesis) and the p-value is 0.0446 (note that 0.0446 is half of 0.089). Therefore, with the same sample size and the same null hypothesis, we manage to go from acceptance of the null hypothesis (two-tailed test) to rejection of the null hypothesis (one-tailed test). Note that this is why people want to use a one-tailed test. But be aware that if you have picked the wrong tail (so that the rejection region is on the lower end of the distribution), you could end up with acceptance with a one-tailed test but rejection with a two-tailed test.

You also need to be aware that it is seldom appropriate to use a one-tailed test. You need to have prior information (i.e. an earlier study done in your lab or published in a journal) that indicates that if the sample mean is different from the population mean, it is because the sample mean is bigger than the population mean (version 2 of the alternative hypotheses) or smaller than the population mean (version 3 of the alternative hypotheses).
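The contrast between the one-tailed and two-tailed results can be sketched numerically. This is a minimal Python illustration (the handout's own tool is Stata); the variable names are assumptions made for the example.

```python
from statistics import NormalDist

mu0, se, alpha = 125, 10, 0.05
x_bar = 142
std_normal = NormalDist()

# One-tailed (upper tail) test: all of alpha sits in the upper tail
z_one = std_normal.inv_cdf(1 - alpha)          # close to 1.645
cutoff_one = mu0 + z_one * se                  # close to 141.45
p_one = 1 - std_normal.cdf((x_bar - mu0) / se)

# Two-tailed test: alpha is split between the two tails
z_two = std_normal.inv_cdf(1 - alpha / 2)      # close to 1.96
cutoff_two = mu0 + z_two * se                  # close to 144.6
p_two = 2 * p_one

print(f"one-tailed: cutoff = {cutoff_one:.2f}, p = {p_one:.4f}, reject = {x_bar > cutoff_one}")
print(f"two-tailed: upper cutoff = {cutoff_two:.2f}, p = {p_two:.4f}, reject = {x_bar > cutoff_two}")
```

The same data give a rejection in one setup and not in the other, which is exactly why the tail has to be chosen before the data are seen.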
Also particularly note that the alternative hypothesis has to be selected prior to your seeing the data. Investigators like to gamble that they know which tail is appropriate because, as we will see later, you can use a smaller sample size. DON'T DO THIS!!!!!!

Now consider the confidence interval approach, but this time it will be
a one-sided confidence interval to go with the one-sided test. We start from a 90% two-sided confidence interval, because a 90% confidence interval leaves 10% outside the interval with 5% on each side, so keeping only one side of it gives a one-sided 95% confidence interval. The confidence interval is again about the sample mean. The 90% confidence interval for 142 is

$(142 - (1.645)(50)/\sqrt{25},\ 142 + (1.645)(50)/\sqrt{25}) = (125.55, 158.45)$

The interval $(125.55, \infty)$ is the correct one-sided 95% confidence interval.

HA: μ > 125 says we reject the null hypothesis in favor of the alternative hypothesis if the sample mean is too big. This can also be translated as "the population mean is too small". The confidence interval is about the sample mean 142. So for the population mean to be too small, it must be outside the lower bound of the confidence interval. Notice that the tail of this confidence interval is in the same direction as the cutoff for the one-tailed hypothesis test. The population mean 125 is not in the confidence interval, so we reject the null hypothesis.

Notice that for the hypothesis test we reject the null hypothesis when the sample mean is too big. For the confidence interval we reject the null hypothesis when the population mean is too small.

The conclusions with the two-sided hypothesis test and the two-sided confidence interval should be the same provided the α level is the same for both. Similarly for the one-sided hypothesis test and one-sided confidence interval.

Revisiting the problem with HA as a single number rather than a range of numbers

Now suppose we use the #4 form of the alternative hypothesis (i.e. HA: μ = 162). Now we have two normal distributions for $\bar{X}$, each with SD = 10, but one with mean = 125 and one with mean = 162. We should note that this sort of problem is not commonly considered.

H0: μ = 125 versus HA: μ = 162
We are going to do a one-tailed, upper-tail test here (we reject the null hypothesis only when the sample mean is too big). Why are we doing a one-tailed test here? We want to know if our sample mean of 142 is consistent with 125 or 162. So we reject the null hypothesis when the sample mean looks more like 162 than 125. Well, numbers smaller than 125 are not going to cause us to reject the null hypothesis in favor of 162. Only if the numbers are too big would we be willing to think that 162 was more appropriate than 125. With this one-tailed test we can actually show the area that goes with the power.

Earlier we found that if α = 0.05, the cutoff for this upper-tail region is 141.45, or

$\Pr(\bar{X} > 141.45 \mid \mu = 125) = 0.05$

Figure 4: Rejection region for a single number as the alternative. Normal distributions with SD = 10 and means = 125 (sampling distribution according to H0) and 162 (sampling distribution according to HA); horizontal axis: sample means. The line is at $\bar{x}$ = 141.45. The striped area is the rejection region; area = 0.05.

Power is the probability of being in the rejection region for the null hypothesis when the alternative hypothesis is true (i.e. correctly rejecting the null hypothesis). So the power for this problem is the probability of being to the right of 141.45 (i.e. being in the rejection region for the null hypothesis) but under the alternative hypothesis curve (i.e. the one with mean = 162).
So $\Pr(\bar{X} > 141.45 \mid \mu = 162) \approx 0.98$ = power [see Figure 5 below].

Recall that 141.45 on the normal curve with mean 162 and SD = 10 is equivalent to $(141.45 - 162)/10 = -2.055$ (i.e. 141.45 is about 2.1 standard deviations below the mean of 162).

Using Stata, we get power = 1 - normal(-2.055) ≈ 0.98.

normal(-2.055) gives the area under the curve with μ = 162 and to the left of the line at 141.45 (i.e. β = 1 − power). It is the area under the rest of the curve (i.e. to the right of 141.45) that is the power (i.e. 1 − β).

Figure 5: Power for a single number as the alternative. Normal distributions with SD = 10 and means = 125 (sampling distribution according to H0) and 162 (sampling distribution according to HA); horizontal axis: sample means. The line is at $\bar{x}$ = 141.45. The hatched area is the power ≈ 0.98.
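The power calculation above can be sketched as a two-step computation: find the cutoff under the null distribution, then measure the area beyond that cutoff under the alternative distribution. The Python below is an illustrative equivalent of the Stata expression, not the handout's own code, and the variable names are assumptions.

```python
from statistics import NormalDist

mu_null = 125   # mean of the sampling distribution under H0
mu_alt = 162    # mean of the sampling distribution under HA
se = 10         # SD of both sampling distributions
alpha = 0.05

# Step 1: the cutoff is set using the null distribution only
cutoff = NormalDist(mu_null, se).inv_cdf(1 - alpha)   # close to 141.45

# Step 2: beta is the area to the left of the cutoff under the alternative curve
beta = NormalDist(mu_alt, se).cdf(cutoff)
power = 1 - beta

print(f"cutoff = {cutoff:.2f}, beta = {beta:.4f}, power = {power:.4f}")
```

The α level fixes the cutoff, and the alternative hypothesis then determines how much of its own curve lies beyond that cutoff.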
You need to be aware that:
1) When we were looking at the problems above as hypothesis testing problems, we used the population parameters (here population mean = 125 and SE = 10) to obtain a rejection region and then asked whether the sample statistic (here $\bar{x}$ = 142) was in the rejection region.
2) However, when considering the problems using the confidence interval approach, we obtained the confidence interval about the sample statistic ($\bar{x}$ = 142) and then asked if the population parameter (i.e. the population mean 125) was in the confidence interval.

Why do people select a one-tailed versus a two-tailed test? We saw above that it is possible to reject the null hypothesis using a one-tailed test but fail to reject the null hypothesis using a two-tailed test. So if you were trying to prove that your new drug is better than the standard of care, you might be tempted to use a one-tailed test.

What are the drawbacks to a one-tailed test? Well, you might have guessed the wrong tail. Suppose that before we had obtained a sample (and this is the way you are supposed to play the game), you said that if your sample of kids differed from the population with respect to SBP, it would be because the SBP of your sample of kids would be too small. This means that in Figure 3 above the rejection region would be the area to the left of $\bar{x} = 125 - (10)(1.645) = 108.55$. So $\bar{x}$ = 142 would be in the acceptance region. The wrong guess has cost you a significant answer.
More informationHypothesis testing S2
Basic medical statistics for clinical and experimental research Hypothesis testing S2 Katarzyna Jóźwiak k.jozwiak@nki.nl 2nd November 2015 1/43 Introduction Point estimation: use a sample statistic to
More informationHomework 5 Solutions
Math 130 Assignment Chapter 18: 6, 10, 38 Chapter 19: 4, 6, 8, 10, 14, 16, 40 Chapter 20: 2, 4, 9 Chapter 18 Homework 5 Solutions 18.6] M&M s. The candy company claims that 10% of the M&M s it produces
More informationPROBLEM SET 1. For the first three answer true or false and explain your answer. A picture is often helpful.
PROBLEM SET 1 For the first three answer true or false and explain your answer. A picture is often helpful. 1. Suppose the significance level of a hypothesis test is α=0.05. If the pvalue of the test
More information6. Statistical Inference: Significance Tests
6. Statistical Inference: Significance Tests Goal: Use statistical methods to check hypotheses such as Women's participation rates in elections in France is higher than in Germany. (an effect) Ethnic divisions
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 OneWay ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationT adult = 96 T child = 114.
Homework Solutions Do all tests at the 5% level and quote pvalues when possible. When answering each question uses sentences and include the relevant JMP output and plots (do not include the data in your
More informationComparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples
Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The
More informationHypothesis Testing Summary
Hypothesis Testing Summary Hypothesis testing begins with the drawing of a sample and calculating its characteristics (aka, statistics ). A statistical test (a specific form of a hypothesis test) is an
More informationHYPOTHESIS TESTING AND TYPE I AND TYPE II ERROR
HYPOTHESIS TESTING AND TYPE I AND TYPE II ERROR Hypothesis is a conjecture (an inferring) about one or more population parameters. Null Hypothesis (H 0 ) is a statement of no difference or no relationship
More informationHow to Conduct a Hypothesis Test
How to Conduct a Hypothesis Test The idea of hypothesis testing is relatively straightforward. In various studies we observe certain events. We must ask, is the event due to chance alone, or is there some
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationLecture 13 More on hypothesis testing
Lecture 13 More on hypothesis testing Thais Paiva STA 111  Summer 2013 Term II July 22, 2013 1 / 27 Thais Paiva STA 111  Summer 2013 Term II Lecture 13, 07/22/2013 Lecture Plan 1 Type I and type II error
More informationProbability, Binomial Distributions and Hypothesis Testing Vartanian, SW 540
Probability, Binomial Distributions and Hypothesis Testing Vartanian, SW 540 1. Assume you are tossing a coin 11 times. The following distribution gives the likelihoods of getting a particular number of
More information15.0 More Hypothesis Testing
15.0 More Hypothesis Testing 1 Answer Questions Type I and Type II Error Power Calculation Bayesian Hypothesis Testing 15.1 Type I and Type II Error In the philosophy of hypothesis testing, the null hypothesis
More informationTwoSample TTests Assuming Equal Variance (Enter Means)
Chapter 4 TwoSample TTests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one or twosided twosample ttests when the variances of
More informationMultiple random variables
Multiple random variables Multiple random variables We essentially always consider multiple random variables at once. The key concepts: Joint, conditional and marginal distributions, and independence of
More informationHypoTesting. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.
Name: Class: Date: HypoTesting Multiple Choice Identify the choice that best completes the statement or answers the question. 1. A Type II error is committed if we make: a. a correct decision when the
More informationSection 12.2, Lesson 3. What Can Go Wrong in Hypothesis Testing: The Two Types of Errors and Their Probabilities
Today: Section 2.2, Lesson 3: What can go wrong with hypothesis testing Section 2.4: Hypothesis tests for difference in two proportions ANNOUNCEMENTS: No discussion today. Check your grades on eee and
More informationBA 275 Review Problems  Week 6 (10/30/0611/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394398, 404408, 410420
BA 275 Review Problems  Week 6 (10/30/0611/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394398, 404408, 410420 1. Which of the following will increase the value of the power in a statistical test
More informationOdds ratio, Odds ratio test for independence, chisquared statistic.
Odds ratio, Odds ratio test for independence, chisquared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
More information22. HYPOTHESIS TESTING
22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty?
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationHypothesis testing. Power of a test. Alternative is greater than Null. Probability
Probability February 14, 2013 Debdeep Pati Hypothesis testing Power of a test 1. Assuming standard deviation is known. Calculate power based on onesample z test. A new drug is proposed for people with
More informationHypothesis Testing. Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University
Hypothesis Testing Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 AMU / BonTech, LLC, JourniTech Corporation Copyright 2015 Learning Objectives Upon successful
More information9Tests of Hypotheses. for a Single Sample CHAPTER OUTLINE
9Tests of Hypotheses for a Single Sample CHAPTER OUTLINE 91 HYPOTHESIS TESTING 91.1 Statistical Hypotheses 91.2 Tests of Statistical Hypotheses 91.3 OneSided and TwoSided Hypotheses 91.4 General
More informationAMS 5 TESTS FOR TWO SAMPLES
AMS 5 TESTS FOR TWO SAMPLES Test for difference We will consider the problem of comparing the means of two populations. The main tool to do hypothesis testing in this case is the z test for the difference
More informationTwoSample TTests Allowing Unequal Variance (Enter Difference)
Chapter 45 TwoSample TTests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one or twosided twosample ttests when no assumption
More informationI. Basics of Hypothesis Testing
Introduction to Hypothesis Testing This deals with an issue highly similar to what we did in the previous chapter. In that chapter we used sample information to make inferences about the range of possibilities
More informationTtest in SPSS Hypothesis tests of proportions Confidence Intervals (End of chapter 6 material)
Ttest in SPSS Hypothesis tests of proportions Confidence Intervals (End of chapter 6 material) Definition of pvalue: The probability of getting evidence as strong as you did assuming that the null hypothesis
More informationIntroduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
More informationHypothesis testing for µ:
University of California, Los Angeles Department of Statistics Statistics 13 Elements of a hypothesis test: Hypothesis testing Instructor: Nicolas Christou 1. Null hypothesis, H 0 (always =). 2. Alternative
More informationHypothesis Testing or How to Decide to Decide Edpsy 580
Hypothesis Testing or How to Decide to Decide Edpsy 580 Carolyn J. Anderson Department of Educational Psychology University of Illinois at UrbanaChampaign Hypothesis Testing or How to Decide to Decide
More informationChapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 81 Overview 82 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 81 Overview 82 Basics of Hypothesis Testing 83 Testing a Claim About a Proportion 85 Testing a Claim About a Mean: s Not Known 86 Testing
More informationTesting: is my coin fair?
Testing: is my coin fair? Formally: we want to make some inference about P(head) Try it: toss coin several times (say 7 times) Assume that it is fair ( P(head)= ), and see if this assumption is compatible
More informationElements of Hypothesis Testing (Summary from lecture notes)
Statistics20090 MINITAB  Lab 1 Large Sample Tests of Hypothesis About a Population Mean We use hypothesis tests to make an inference about some population parameter of interest, for example the mean
More informationHypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easytoread notes
More informationChapter Additional: Standard Deviation and Chi Square
Chapter Additional: Standard Deviation and Chi Square Chapter Outline: 6.4 Confidence Intervals for the Standard Deviation 7.5 Hypothesis testing for Standard Deviation Section 6.4 Objectives Interpret
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationHypothesis Testing Level I Quantitative Methods. IFT Notes for the CFA exam
Hypothesis Testing 2014 Level I Quantitative Methods IFT Notes for the CFA exam Contents 1. Introduction... 3 2. Hypothesis Testing... 3 3. Hypothesis Tests Concerning the Mean... 10 4. Hypothesis Tests
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationClass 19: Two Way Tables, Conditional Distributions, ChiSquare (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, ChiSquare (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
More informationSection: 101 (10am11am) 102 (11am12pm) 103 (1pm2pm) 104 (1pm2pm)
Stat 0 Midterm Exam Instructor: Tessa ChildersDay 1 May 014 Please write your name and student ID below, and circle your section. With your signature, you certify that you have not observed poor or dishonest
More informationChapter 6: t test for dependent samples
Chapter 6: t test for dependent samples ****This chapter corresponds to chapter 11 of your book ( t(ea) for Two (Again) ). What it is: The t test for dependent samples is used to determine whether the
More informationIntroduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
More informationExpected values, standard errors, Central Limit Theorem. Statistical inference
Expected values, standard errors, Central Limit Theorem FPP 1618 Statistical inference Up to this point we have focused primarily on exploratory statistical analysis We know dive into the realm of statistical
More informationChapter 7 Notes  Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:
Chapter 7 Notes  Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a
More informationChapter 21. More About Tests and Intervals. Copyright 2012, 2008, 2005 Pearson Education, Inc.
Chapter 21 More About Tests and Intervals Copyright 2012, 2008, 2005 Pearson Education, Inc. Zero In on the Null Null hypotheses have special requirements. To perform a hypothesis test, the null must be
More information6.1 The Elements of a Test of Hypothesis
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 22, 2012 Date of latest update: August 20 Lecture 6: Tests of Hypothesis Suppose you wanted to determine
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More informationTest of proportion = 0.5 N Sample prop 95% CI z value p value (0.400, 0.466)
STATISTICS FOR THE SOCIAL AND BEHAVIORAL SCIENCES Recitation #10 Answer Key PROBABILITY, HYPOTHESIS TESTING, CONFIDENCE INTERVALS Hypothesis tests 2 When a recent GSS asked, would you be willing to pay
More informationpvalues and significance levels (false positive or false alarm rates)
pvalues and significance levels (false positive or false alarm rates) Let's say 123 people in the class toss a coin. Call it "Coin A." There are 65 heads. Then they toss another coin. Call it "Coin B."
More information3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
More informationAbout Hypothesis Testing
About Hypothesis Testing TABLE OF CONTENTS About Hypothesis Testing... 1 What is a HYPOTHESIS TEST?... 1 Hypothesis Testing... 1 Hypothesis Testing... 1 Steps in Hypothesis Testing... 2 Steps in Hypothesis
More informationIntroduction to the Practice of Statistics Fifth Edition Moore, McCabe Section 8.1 Homework Answers
Introduction to the Practice of Statistics Fifth Edition Moore, McCabe Section 8.1 Homework Answers 8.1 In each of the following circumstances state whether you would use the large sample confidence interval,
More informationExtending Hypothesis Testing. pvalues & confidence intervals
Extending Hypothesis Testing pvalues & confidence intervals So far: how to state a question in the form of two hypotheses (null and alternative), how to assess the data, how to answer the question by
More informationHypothesis Testing. Hypothesis Testing. Inferential Statistics
Making Hypotheses : Example1: Probability distr. Example2: Zdistribution Errors in One vs. Twosided Tests Inferential Statistics Sample Population Observations Statistics Inference Hypothesis testing
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationHypothesis Testing: pvalue
STAT 101 Dr. Kari Lock Morgan Paul the Octopus Hypothesis Testing: SECTION 4.2 andomization distribution http://www.youtube.com/watch?v=3esgpumj9e Hypotheses In 2008, Paul the Octopus predicted 8 World
More informationStatistical Foundations:
Statistical Foundations: Hypothesis Testing Psychology 790 Lecture #10 9/26/2006 Today sclass Hypothesis Testing. An Example. Types of errors illustrated. Misconceptions about hypothesis testing. Upcoming
More informationTesting a claim about a population mean
Introductory Statistics Lectures Testing a claim about a population mean One sample hypothesis test of the mean Department of Mathematics Pima Community College Redistribution of this material is prohibited
More informationTRANSCRIPT: In this lecture, we will talk about both theoretical and applied concepts related to hypothesis testing.
This is Dr. Chumney. The focus of this lecture is hypothesis testing both what it is, how hypothesis tests are used, and how to conduct hypothesis tests. 1 In this lecture, we will talk about both theoretical
More informationHypothesis. Testing Examples and Case Studies. Chapter 23. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.
Hypothesis Chapter 23 Testing Examples and Case Studies Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 23.1 How Hypothesis Tests Are Reported in the News 1. Determine the null hypothesis
More informationThe Wilcoxon RankSum Test
1 The Wilcoxon RankSum Test The Wilcoxon ranksum test is a nonparametric alternative to the twosample ttest which is based solely on the order in which the observations from the two samples fall. We
More informationConfidence intervals, t tests, P values
Confidence intervals, t tests, P values Joe Felsenstein Department of Genome Sciences and Department of Biology Confidence intervals, t tests, P values p.1/31 Normality Everybody believes in the normal
More information