Chapter 7 Part 2. Hypothesis testing Power
|
|
- Delphia Mason
- 7 years ago
- Views:
Transcription
1 Chapter 7 Part 2 Hypothesis testing Power November 6, 2008 All of the normal curves in this handout are sampling distributions
2 Goal: To understand the process of hypothesis testing and the relationship of sample size and the form of the alternative hypothesis to power. Skills: Will know how and when to conduct an hypothesis test. Will be able to describe the relationship between the hypothesis testing and confidence interval approaches. Will know why power is important and how to maimize it. Contents: Formalization of Hypothesis Testing (Normal distribution and σ Known) 2 2 table Page 1 Two-tailed alternative hypothesis Page 5 Confidence interval approach to two-tailed alternative Page 8 Comparison of processes for hypothesis testing and CI approach Page 9 One-tailed alternative hypothesis Page 9 One-sided confidence interval approach to one-tailed alternative Page 11 Alternative hypothesis a single number rather than a range of numbers Page 12 Figures Figure 1: Two-tailed rejection region Page 6 Figure 2: p-value Page 7 Figure 3: One-tailed rejection region Page 10 Figure 4: Rejection region for single number as the alternative Page 13 Figure 5: Power for a single number as the alternative Page 13
3 Hypothesis Testing - Part 2 Review Riverboat Gambler Hypothesis Testing setup: H 0 is the tested or null hypothesis H A is the alternative hypothesis (sometimes referred to as H 1 ) TRUTH H 0 H 0 is true and H 0 is accepted Correct Decision H A DECISION Accept H A is true and H 0 is accepted Type II or β error Reject H 0 H 0 is true and H 0 is rejected Type I or error H A is true and H 0 is rejected Correct decision - power We also say Fail to reject H 0 " instead of saying Accept H 0 " Accept H 0 " is equivalent to saying Reject H A Reject H 0 " is equivalent to saying Accept H A H 0 is true is equivalent to H A is false H 0 is false is equivalent to H A is true First we are going to review what we learned from the Riverboat Gambler. Hypothesis testing process we used for the Riverboat Gambler: H 0 : p = 0.5 H A : p = 0.2 Notice that we picked as the alternative hypothesis what we believed to be the truth (i.e. if we had believed the coin was fair, we never would have set up the testing of the coin). The null hypothesis is set up as a strawman to be rejected so that we can accept the alternative hypothesis which is our real interest. This is because we have more Page -1-
4 control over the probability of incorrectly accepting the alternative hypothesis. Remember that the probability of incorrectly accepting the alternative hypothesis is equivalent to the probability of rejecting the null hypothesis when in fact it is true (see highlighted cell below). The error associated with this is the Type I or error. But we get to select what we will use as the value of. So although we hope the alternative hypothesis will be true, we certainly don t want to declare it true when it isn t. Type I and Type II errors Let X be the random variable indicating the number of heads in 10 tosses. We selected a critical or rejection region - let us use {0,1,9,10}. (Note that here we have selected the rejection region rather than the level, but as we will see this is not how things are usually done.) Then Pr(X= 0 or 1 or 9 or 10 p = 0.5) = Equation 1 level of significance, level or type I error = (see Table 1 Chapter 7 Part 1) Notice that Equation 1 gives the probability of landing in the rejection region when the null hypothesis (p = 0.5) is true. Pr(X = 0 or 1 or 9 or 10 p= 0.2) = power = 1 - β = Equation 2 Equation 2 gives the probability of landing in the rejection region when the null hypothesis is false (e.g. the alternative hypothesis is true). Page -2-
5 p-value Definition: The p-value is the probability associated with the smallest rejection region that includes the value of the test statistic (the number of heads you actually got in 10 tosses) for the sample, under the assumption that the null hypothesis is true (i.e. the coin tossing problem, assuming the probability of getting heads on a given toss is 0.5). Eample: If turns out to be 0 (i.e. we flipped the coin 10 times and the results of each of the flips was tails), then the smallest rejection region containing 0 is {0, 10} and the p-value is Pr(X = 0 or 10 p = 0.5) = p-value = Notice that the p-value and level have the same pattern (i.e. the probability of being in a rejection region given the null hypothesis is true) just different rejection regions. If the original setup is two-tailed, then the rejection region associated with the p-value is also two tailed (which eplains why we included 10 when calculating the p-value). Eample: If the test statistic turns out to be 4, then the smallest rejection region containing 4 is {0, 1, 2, 3, 4, 6, 7, 8, 9, 10}. That is, the only number not in the rejection region is 5. So p-value = Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 p = 0.5) = 1 - Pr(X = 5 p = 0.5) = = Relationship of the p-value and level Let us go back to the rejection region {0, 1, 9, 10}. For a categorical distribution when the test statistic is in the rejection region [ i.e. = 0 is in the region {0,1,9,10}], the p- value is less than or equal to the level because p-value = Pr(X = 0 or 10 p = 0.5) = for test statistic level = Pr(X= 0 or 1 or 9 or 10 p = 0.5) = = 0 and Page -3-
6 Notice that {, 010} {,,, 01910} Notice that both the p-value and the When the test statistic is not in the rejection region [i.e. region {0,1,9,10}], then the p-value is greater than the statistic = 4 is means is a subset of level are based on the null (p = 0.5) hypothesis. = 4 is not in the rejection level. The p-value for the test p-value = Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 p = 0.5) = as compared to the level = Pr(X= 0 or 1 or 9 or 10 p = 0.5) = So we have that the level and the p-value each depend on the null hypothesis and a rejection region, but not necessarily the same rejection region. The power and β error depend on the alternative hypothesis and the rejection region. We will reject the null hypothesis in favor of the alternative hypothesis if the p-value <. We will fail to reject the null hypothesis, if the p-value >. We fail to reject the null hypothesis when the p-value equals Eample of Hypothesis testing given a normal distribution with σ known (I think that the probability of actually knowing σ for a given study is smaller than my probability of winning the lottery {and I don t buy lottery tickets}, but the assumption simplifies our eample): Let us assume that we know that the population of all high school aged kids in Houston has a mean SBP of 125 mm Hg (Rosner labels this population mean μ 0 ) and a known standard deviation of 50 mm Hg. Suppose we obtain a sample of 25 kids from the High School for the Performing Arts (HSPVA) and find that their mean systolic blood pressure is 142 mm Hg. The question is: Does our sample of 25 kids seem to be from the population with mean SBP = 125 mm Hg or does it seem more likely that they are from another population. So our null hypothesis is: H 0 : μ = 125 mm Hg The μ in the null hypothesis above is the mean for the population from which the sample of 25 kids was drawn. The number 125 mm Hg is the mean of the population of all high school aged kids in Houston (i.e. μ 0 = 125). So we are asking are the means of the two populations the same. If we assume the variances are the same, then we are asking are the two populations the same. Page -4-
7 There are a number of possible forms for the alternative hypothesis: 1) H A : μ 125 (two-tailed alternative) 2) H A : μ > 125 (one-tailed upper tail) 3) H A : μ < 125 (one-tailed lower tail) 4) H A : μ = 162 (some specific value) The form of the alternative hypothesis is picked prior to collecting the sample information. Please note that in the hypothesis testing setup, μ is the mean of the population from which the sample (of 25) was drawn (we hope this population is the same as the one with mean = 125). So μ = 125 essentially asks if the sample was drawn from the same population as the population with mean = 125 (i.e. are the two populations the same). Or another way to think of the question is: is the mean of our sample of 25 consistent with the null hypothesis or with the alternative hypothesis? Let us assume the following: (1) the two-tailed version (i.e. H A : μ 125) of the alternative hypothesis (2) X is the random variable associated with the SBP of all high school kids in Houston (as opposed to the sample from HSPVA). (3) X ~ N( 125, 50 2 ) (4) the level of significance is 0.05 (i.e. = 0.05). Note that to select and then determine the rejection region formed by that level is the usual procedure. In the Riverboat gambler problem we selected the rejection region and then calculated for pedagogical reasons. The sample mean ( = 142 mm Hg) of our sample of 25 kids from HSPVA is our test statistic just like = 2 heads was the test statistic for the coin toss problem. Under the null hypothesis (i.e. assume that the sample of 25 HSPVA students is from the population of all high school students) the distribution of the sample means of all samples of size n = 25 (i.e. the sampling distribution) with σ = 25 (SD of the population of all Houston kids) is the normal distribution with mean = 125 mm Hg and Page -5-
8 50 25 X ~ N 125, SD = 10 mm Hg (i.e ). Or because X ~ N( 125, 50 2 ) and n, our sample size, is equal to In the Riverboat gambler problem, we selected the rejection region and then found the level that went with it. We did it in that fashion so it would be clear that the rejection region contained the values that we thought were unlikely to occur if indeed the coin was fair. Another way to decide on the rejection region is to first select the level (usually something like 0.05 or 0.01) and then find the region (which would be in two pieces for a two-tailed test or one piece for a one-tailed test) whose area is equal to the level. Selecting first is the usual way of doing things. For the current problem we ll choose the level to be Since we are dealing with a two-tailed test, we are looking for the values that cutoff the upper and lower area of the curve. We know that on the N(0,1) curve and 1.96 cut off the lower and upper areas respectively. The question is what points on the N(125,10 2 ) [the distribution for X] curve are the equivalents of and 1.96 (i.e. what points are 1.96 standard deviations below and above 125). 125 = = If, then = (10)(1.96) = and if, then = (10)(-1.96) = This means that the rejection region consists of all such that < or >144.6 (Notice that the endpoints are in the acceptance region.) and the acceptance region is [105.4, 144.6] where the square brackets indicate that the interval includes the end points. Another way to say this is: Pr( X is in the rejection region μ = 125 and σ = 10) = Pr( X < or X > μ = 125 and σ = 10) = 0.05 Note that 142 (the mean SBP of our sample of 25 kids) falls in the acceptance region (see Figure 1 below) so we would say that we accept the null hypothesis (or that we fail to reject the null hypothesis). Page -6-
9 Yet another way of saying this is that we believe our sample of 25 kids could have come from the normal distribution with μ = 125 and σ = 50 (i.e. the distribution of all Houston kids). Fail to reject the null hypothesis is the usual way statisticians report the results, but you ll never see this in a journal article. Notice that what we did was assume the sample was from the population of all high school kids and then look to see if that made sense or to see if the sample really seemed to fit with the population of all high school students. Figure 1.04 Normal Density with Mean = 125 and SD = 10 The vertical lines are at = & = Normal density with Mean = 125 and SD = 10 The vertical lines are at = and = The lines are at = & = Sample Means Each stripped area is half of the rejection region. Each area = The rejection region is the stripped area (sum of the two areas actually) in Figure 1 above. To find the p-value that goes with this decision, we need to find the area to the right of 142 under the N(125,100) curve and double it because we are dealing with a two-tailed alternative. Remember to find the p-value you need to find the smallest rejection region that includes 142. This would be the rejection region where 142 is the cutoff (see Figure 2 below). To get the probability associated with the region under the N(125,100) curve and to the right of 142 we need to translate 142 to the N(0,1) curve by finding how many standard deviations 142 is from 125. Page -7-
10 is 1.7 SD s from 125 since 10 = 1.70 Figure 2 Normal Density with Mean = 125 and SD = 10 The vertical lines are at = & = (for a level) and = 142 (for half the p-value).04 Normal density with Mean = 125 and SD = 10 The vertical lines are at = and = The dashed line is at 142 The stripped area is half of the p-value Sample Means Each area beyond the solid line is half of the rejection region (i.e ) According to the tables or Stata, the probability to the right of 1.7 is Since we are dealing with a two-tailed test, the p-value = = You could get this using STATA:. di 2*(1 - normal(( )/10)) So we fail to reject the null hypothesis because p-value >. Notice we are using the normal distribution with the hypothesized mean as opposed to the sample mean. Page -8-
11 A Confidence interval approach to the same problem nother way we could look at this problem (H 0 : μ = 125 mm Hg versus H A : μ 125 mm Hg assuming X follows a normal distribution, σ known to be 50 and = 0.05) would be through confidence intervals: The 95% CI for 142 (i.e. you get the confidence interval about the sample value) given that n = 25 and σ for X is 50 (the original distribution - the population of all Houston kids) would be ( ( z σ / n), + ( z σ / n)) 1 ( 2) 1 ( 2) (. 196)( 50) (. 196)( 50) 142, = ( , ) = (122.4, 161.6) Notice that 125 (the null hypothesis) is in this 95% confidence interval for 142, so 142 would not be considered different from 125 at = 0.05 level. This is the same conclusion we reached earlier. Using the confidence interval, we know the set of values (population means) that 142 does not differ from, not just that 142 doesn t differ from 125. However, we can t calculate the p-value. So we gain something with the confidence interval approach and lose something. Comparison of processes for hypothesis testing and confidence interval approach: For the confidence interval approach the process is to find the confidence interval for the sample mean (142) and then to check whether the population mean (125) is in that confidence interval (i.e. you start with the sample). Using the hypothesis testing approach, you start with the distribution of X under the null hypothesis (i.e. you get an acceptance region about the population mean 125) and look to see if the sample mean (142) lies in this acceptance region. Page -9-
12 Same problem but with the alternative hypothesis changed to H A : μ > 125 (i.e. one-tailed upper tailed). Let us work through the problem again this time using the #2 form of the alternative hypothesis (i.e H A : μ > 125) but keep the = 0.05 and σ for X at 50. Selecting this particular form of the alternative hypothesis means that we will reject the null hypothesis only when the sample mean is too big. Note that the alternative hypothesis is selected prior to the collection of the sample data. Now, using the hypothesis testing approach, is our sample mean of 142 from a sample of 25 still consistent with the null hypothesis? We will still use the distribution of sample means (i.e. distribution of X) with mean = 125 and SD = 10, but now the critical region is all in the upper tail (i.e. we reject the null hypothesis and accept the alternative only when the sample mean is too big). So again we have the critical and acceptance regions in terms of the relationship to the population mean (125). This means that the entire 0.05 will be in the upper tail because we reject only if 142 is too big. We know that cuts off the upper 5% of the N(0,1) curve. So if we solve the following, we will have the equivalent number for the N(125,10 2 ) curve. Figure 3 Normal Density with Mean = 125 and SD = 10 The vertical line is drawn at = One-tailed test. Normal density with Mean = 125 and SD = 10 The vertical line is at The stripped area is the rejection region. The area = X = Sample bar Means Page -10-
13 125 = If, then = The vertical line on the graph below is at which cuts off 5% of the upper tail. Notice that is smaller than that cut off in the upper tail. And also smaller than 142 which cuts off (i.e. half the p-value from the two-tailed test) in the upper tail. Notice that 142 (the sample mean) is now in the rejection region (i.e. 142 > ) whereas it was in the acceptance region for the two-tailed test To obtain the p-value we again solve 10 di 1 - normal(1.7) = = 1.70 So the p-value = (note that we do not multiply by 2 because this is a one-tailed test). Since 142 is in the rejection region, we epected the p-value to be less than which is equal to For the two-tailed test the test statistic 142 fell in the acceptance region (so we accepted the null hypothesis or failed to reject the null hypothesis) and the p-value was equal to But with the one-tailed test the test statistic falls in the rejection region (so we reject the null hypothesis) and the p-value is (note that is half of ). Therefore, with the same sample size and the same null hypothesis we manage to go from acceptance of the null hypothesis (two-tailed test) to rejection of the null hypothesis (one-tailed test). Note that this is why people want to use a one-tailed test. But be aware that if you have picked the wrong tail (so that the rejection region is on the lower end of the distribution), you could end up with acceptance with a one-tailed test but rejection with a two-tailed. You also need to be aware that it is seldom appropriate to use a one-tailed test. You need to have prior information (i.e. an earlier study done in your lab or published in a journal) that indicates that if the sample mean is different from the population mean it is because the sample mean is bigger than the population mean (version 2 of the alternative hypotheses) or smaller than the population mean (version 3 of the alternative hypotheses). Also particularly note that the alternative hypothesis has to be selected prior to your seeing the data. Investigators like to gamble that they know which tail is appropriate because, as we will see later, you can use a smaller sample size. DON T DO THIS!!!!!! Now consider the confidence interval approach but this time it will be Page -11-
14 a one-sided confidence interval to go with the one-sided test. So we consider a 90% two-sided consider below because if you consider only one side of a 90% confidence interval and you 10% outside the confidence interval with 5% on each side. The confidence interval is again about the sample mean. The 90% confidence interval for 142 is (142 (1.645)(50),142 + (1.645)(50) ) = (125.55, ) The interval (125.55, 4) is the correct one-sided 95% confidence interval. H A :μ > 125 says we reject the null hypothesis in favor of the alternative hypothesis if the sample mean is too big. This can also be translated as the population mean is too small. The confidence interval is about the sample mean 142. So for the population mean to be too small it must be outside the lower bound of the confidence interval. Notice that the tail of this confidence interval is in the same direction as the cutoff for the one-tailed hypothesis test. The population mean 125 is not in the confidence interval so we reject the null hypothesis. Notice that for the hypothesis test we reject the null hypothesis when the sample mean is too big. For the confidence interval we reject the null hypothesis when the population mean is too small. The conclusions with the two-sided hypothesis test and the two-sided confidence interval should be the same provided the level is the same for both. Similarly for the one-sided hypothesis test and one-sided confidence interval. Revisiting the problem with H A as a single number rather than a range of numbers. Now suppose we use the #4 form of the alternative hypothesis (i.e that H A: μ = 162). Now we have two normal distributions for X each with σ = 10 but one with mean = 125 and one with mean = 162. We should note that this sort of problem is not commonly considered. H 0 : μ = 125 versus H A : μ = 162 Page -12-
15 We are going to do a one-tailed upper tailed test here (we reject the null hypothesis only when the sample mean is too big). Why are we doing a one-tailed test here? We want to know if our sample mean of 142 is consistent with 125 or 162. So we reject the null hypothesis when the sample means looks more like 162 than 125. Well numbers smaller than 125 are not going to cause us to reject the null hypothesis in favor of 162. Only if the numbers are too big would we be willing to think that 162 was more appropriate than 125. With this one-tailed test we can actually show the area that goes with the power. Earlier we found that if = 0.05, the cutoff for this upper tailed region is or Pr( X > μ = 125) = 0.05 Figure 4 Rejection Region Normal Distributions with SD = 10 and Means = 125 and Sampling Distribution according to H 0 Sampling Distribution according to H A The line is = The stripped area is the rejection region. Area = Sample Means Power is the probability of being in the rejection region for the null hypothesis when the alternative hypothesis is true (i.e. correctly rejecting the null hypothesis). So the power for this problem is the probability of being to the right of (i.e. being in the rejection region for the null hypothesis) but under the alternative hypothesis curve (i.e. the one with mean = 162). Page -13-
16 So Pr( X > μ = 162) = = power [see Figure 5 below]. Recall that on the normal curve with mean 162 and SD = 10 is equivalent to ( )/10 = (i.e is 2.1 standard deviations below the mean of 162) Using Stata, we get power = 1 - normal(-2.055) normal(-2.055) gives the are under the curve with μ = 162 and to the left of the line at (i.e. β = 1 - power). It is the area under the rest of the curve (i.e. to the right of ) that is the power (i.e. 1 - β). Figure 5 Power Normal Distributions with SD = 10 and Means = 125 and Sampling Distribution according to H 0 Sampling Distribution according to H A The hatched area is the power = The line is = Sample Means Page -14-
17 You need to be aware that: 1) When we were looking at the problems above as hypothesis testing problems, we used the population parameters (here population mean = 125 and population SE = 10) to obtain a rejection region and then asked was the sample statistic (here = 142) in the rejection region. 2) However, when considering the problems using the confidence interval approach, we obtained the confidence interval about the sample statistic ( = 142) and then asked if the population parameter (i.e. the population mean 125) was in the confidence interval. Why do people select a one-tailed versus a two-tailed test? We saw above that it is possible to reject the null hypothesis using a one-tailed test but fail to reject the null hypothesis using a two-tailed test. So if you were trying to prove that your new drug is better than the standard of care you might be tempted to use a one-tailed test. What are the drawbacks to a one-tailed test? Well you might have guessed the wrong tail. If before we had obtained a sample (and this is the way you are supposed to play the game), you said if your sample of kids differed from the population with respect to SBP, it would be because the SBP of your sample of kids would be too small. This means that in Figure 3 above the rejection region would be the area to the left of = = would be in the acceptance region. The wrong guess has cost you a significant answer.. So Page -15-
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
More informationIntroduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses
Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the
More informationHYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
More informationHYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
More informationIntroduction to Hypothesis Testing
I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true
More informationHypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationHow To Test For Significance On A Data Set
Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationExperimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test
Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationMONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010
MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times
More informationRecall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationBA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420
BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test
More informationComparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples
Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The
More informationTwo-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
More informationSection 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)
Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis
More informationOdds ratio, Odds ratio test for independence, chi-squared statistic.
Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More informationHypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes
More informationChapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing
Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing
More information22. HYPOTHESIS TESTING
22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty?
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More informationIntroduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
More informationTwo-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
More informationIntroduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationClass 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)
Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the
More information3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
More informationChapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:
Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a
More informationThe Wilcoxon Rank-Sum Test
1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationTesting a claim about a population mean
Introductory Statistics Lectures Testing a claim about a population mean One sample hypothesis test of the mean Department of Mathematics Pima Community College Redistribution of this material is prohibited
More informationp-values and significance levels (false positive or false alarm rates)
p-values and significance levels (false positive or false alarm rates) Let's say 123 people in the class toss a coin. Call it "Coin A." There are 65 heads. Then they toss another coin. Call it "Coin B."
More informationNormal distribution. ) 2 /2σ. 2π σ
Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a
More informationMind on Statistics. Chapter 12
Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference
More informationComparing Means in Two Populations
Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we
More informationStat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum
More informationChapter 2. Hypothesis testing in one population
Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationProjects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationStat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
More informationCONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont
CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency
More informationPearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
More information1 Hypothesis Testing. H 0 : population parameter = hypothesized value:
1 Hypothesis Testing In Statistics, a hypothesis proposes a model for the world. Then we look at the data. If the data are consistent with that model, we have no reason to disbelieve the hypothesis. Data
More informationConfidence Intervals for Cpk
Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationAn Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS
The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice
More informationCorrelational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
More informationLAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
More informationChapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
More informationLab 11. Simulations. The Concept
Lab 11 Simulations In this lab you ll learn how to create simulations to provide approximate answers to probability questions. We ll make use of a particular kind of structure, called a box model, that
More informationCHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
More information2 Sample t-test (unequal sample sizes and unequal variances)
Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing
More informationStatistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/
Statistiek I Proportions aka Sign Tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/34 Proportions aka Sign Test The relative frequency
More informationPoint Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
More information6.3 Conditional Probability and Independence
222 CHAPTER 6. PROBABILITY 6.3 Conditional Probability and Independence Conditional Probability Two cubical dice each have a triangle painted on one side, a circle painted on two sides and a square painted
More informationA POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment
More informationTesting Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
More informationHypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction
Hypothesis Testing PSY 360 Introduction to Statistics for the Behavioral Sciences Reminder of Inferential Statistics All inferential statistics have the following in common: Use of some descriptive statistic
More information16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION
6. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION It is sometimes difficult to directly compute probabilities for a binomial (n, p) random variable, X. We need a different table for each value of
More informationSection 1. Inequalities -5-4 -3-2 -1 0 1 2 3 4 5
Worksheet 2.4 Introduction to Inequalities Section 1 Inequalities The sign < stands for less than. It was introduced so that we could write in shorthand things like 3 is less than 5. This becomes 3 < 5.
More informationDDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript
DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript JENNIFER ANN MORROW: Welcome to "Introduction to Hypothesis Testing." My name is Dr. Jennifer Ann Morrow. In today's demonstration,
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More informationTesting Hypotheses About Proportions
Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine
More informationUnit 26 Estimation with Confidence Intervals
Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference
More informationAP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics
Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,
More informationConfidence Intervals for Cp
Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process
More informationThe Binomial Distribution
The Binomial Distribution James H. Steiger November 10, 00 1 Topics for this Module 1. The Binomial Process. The Binomial Random Variable. The Binomial Distribution (a) Computing the Binomial pdf (b) Computing
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More informationWeek 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
More information"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1
BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationSCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
More informationEstimation of σ 2, the variance of ɛ
Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated
More informationSample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationOutline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test
The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation
More informationStat 5102 Notes: Nonparametric Tests and. confidence interval
Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions
More informationAnalysis of Variance ANOVA
Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.
More informationIndependent samples t-test. Dr. Tom Pierce Radford University
Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of
More informationConfidence Intervals for One Standard Deviation Using Standard Deviation
Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from
More informationHypothesis testing. c 2014, Jeffrey S. Simonoff 1
Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationMA 1125 Lecture 14 - Expected Values. Friday, February 28, 2014. Objectives: Introduce expected values.
MA 5 Lecture 4 - Expected Values Friday, February 2, 24. Objectives: Introduce expected values.. Means, Variances, and Standard Deviations of Probability Distributions Two classes ago, we computed the
More information6: Introduction to Hypothesis Testing
6: Introduction to Hypothesis Testing Significance testing is used to help make a judgment about a claim by addressing the question, Can the observed difference be attributed to chance? We break up significance
More informationA Statistical Analysis of Popular Lottery Winning Strategies
CS-BIGS 4(1): 66-72 2010 CS-BIGS http://www.bentley.edu/csbigs/vol4-1/chen.pdf A Statistical Analysis of Popular Lottery Winning Strategies Albert C. Chen Torrey Pines High School, USA Y. Helio Yang San
More informationAnalysis and Interpretation of Clinical Trials. How to conclude?
www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support
More informationDATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
More informationStats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB
Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB DIRECTIONS: Welcome! Your TA will help you apply your Calculator and MINITAB to review Business Stats, and will
More informationPoint and Interval Estimates
Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number
More informationHYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
More informationMBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
More informationChapter 26: Tests of Significance
Chapter 26: Tests of Significance Procedure: 1. State the null and alternative in words and in terms of a box model. 2. Find the test statistic: z = observed EV. SE 3. Calculate the P-value: The area under
More informationSTA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science
STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science Mondays 2:10 4:00 (GB 220) and Wednesdays 2:10 4:00 (various) Jeffrey Rosenthal Professor of Statistics, University of Toronto
More informationNONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)
NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions
More informationThe overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS 0 400 800 1200 1600 NUMBER OF TOSSES
INTRODUCTION TO CHANCE VARIABILITY WHAT DOES THE LAW OF AVERAGES SAY? 4 coins were tossed 1600 times each, and the chance error number of heads half the number of tosses was plotted against the number
More informationTwo Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
More information