Chapter 7 Part 2. Hypothesis testing Power

Size: px
Start display at page:

Download "Chapter 7 Part 2. Hypothesis testing Power"

Transcription

1 Chapter 7 Part 2 Hypothesis testing Power November 6, 2008 All of the normal curves in this handout are sampling distributions

2 Goal: To understand the process of hypothesis testing and the relationship of sample size and the form of the alternative hypothesis to power. Skills: Will know how and when to conduct an hypothesis test. Will be able to describe the relationship between the hypothesis testing and confidence interval approaches. Will know why power is important and how to maimize it. Contents: Formalization of Hypothesis Testing (Normal distribution and σ Known) 2 2 table Page 1 Two-tailed alternative hypothesis Page 5 Confidence interval approach to two-tailed alternative Page 8 Comparison of processes for hypothesis testing and CI approach Page 9 One-tailed alternative hypothesis Page 9 One-sided confidence interval approach to one-tailed alternative Page 11 Alternative hypothesis a single number rather than a range of numbers Page 12 Figures Figure 1: Two-tailed rejection region Page 6 Figure 2: p-value Page 7 Figure 3: One-tailed rejection region Page 10 Figure 4: Rejection region for single number as the alternative Page 13 Figure 5: Power for a single number as the alternative Page 13

3 Hypothesis Testing - Part 2 Review Riverboat Gambler Hypothesis Testing setup: H 0 is the tested or null hypothesis H A is the alternative hypothesis (sometimes referred to as H 1 ) TRUTH H 0 H 0 is true and H 0 is accepted Correct Decision H A DECISION Accept H A is true and H 0 is accepted Type II or β error Reject H 0 H 0 is true and H 0 is rejected Type I or error H A is true and H 0 is rejected Correct decision - power We also say Fail to reject H 0 " instead of saying Accept H 0 " Accept H 0 " is equivalent to saying Reject H A Reject H 0 " is equivalent to saying Accept H A H 0 is true is equivalent to H A is false H 0 is false is equivalent to H A is true First we are going to review what we learned from the Riverboat Gambler. Hypothesis testing process we used for the Riverboat Gambler: H 0 : p = 0.5 H A : p = 0.2 Notice that we picked as the alternative hypothesis what we believed to be the truth (i.e. if we had believed the coin was fair, we never would have set up the testing of the coin). The null hypothesis is set up as a strawman to be rejected so that we can accept the alternative hypothesis which is our real interest. This is because we have more Page -1-

4 control over the probability of incorrectly accepting the alternative hypothesis. Remember that the probability of incorrectly accepting the alternative hypothesis is equivalent to the probability of rejecting the null hypothesis when in fact it is true (see highlighted cell below). The error associated with this is the Type I or error. But we get to select what we will use as the value of. So although we hope the alternative hypothesis will be true, we certainly don t want to declare it true when it isn t. Type I and Type II errors Let X be the random variable indicating the number of heads in 10 tosses. We selected a critical or rejection region - let us use {0,1,9,10}. (Note that here we have selected the rejection region rather than the level, but as we will see this is not how things are usually done.) Then Pr(X= 0 or 1 or 9 or 10 p = 0.5) = Equation 1 level of significance, level or type I error = (see Table 1 Chapter 7 Part 1) Notice that Equation 1 gives the probability of landing in the rejection region when the null hypothesis (p = 0.5) is true. Pr(X = 0 or 1 or 9 or 10 p= 0.2) = power = 1 - β = Equation 2 Equation 2 gives the probability of landing in the rejection region when the null hypothesis is false (e.g. the alternative hypothesis is true). Page -2-

5 p-value Definition: The p-value is the probability associated with the smallest rejection region that includes the value of the test statistic (the number of heads you actually got in 10 tosses) for the sample, under the assumption that the null hypothesis is true (i.e. the coin tossing problem, assuming the probability of getting heads on a given toss is 0.5). Eample: If turns out to be 0 (i.e. we flipped the coin 10 times and the results of each of the flips was tails), then the smallest rejection region containing 0 is {0, 10} and the p-value is Pr(X = 0 or 10 p = 0.5) = p-value = Notice that the p-value and level have the same pattern (i.e. the probability of being in a rejection region given the null hypothesis is true) just different rejection regions. If the original setup is two-tailed, then the rejection region associated with the p-value is also two tailed (which eplains why we included 10 when calculating the p-value). Eample: If the test statistic turns out to be 4, then the smallest rejection region containing 4 is {0, 1, 2, 3, 4, 6, 7, 8, 9, 10}. That is, the only number not in the rejection region is 5. So p-value = Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 p = 0.5) = 1 - Pr(X = 5 p = 0.5) = = Relationship of the p-value and level Let us go back to the rejection region {0, 1, 9, 10}. For a categorical distribution when the test statistic is in the rejection region [ i.e. = 0 is in the region {0,1,9,10}], the p- value is less than or equal to the level because p-value = Pr(X = 0 or 10 p = 0.5) = for test statistic level = Pr(X= 0 or 1 or 9 or 10 p = 0.5) = = 0 and Page -3-

6 Notice that {, 010} {,,, 01910} Notice that both the p-value and the When the test statistic is not in the rejection region [i.e. region {0,1,9,10}], then the p-value is greater than the statistic = 4 is means is a subset of level are based on the null (p = 0.5) hypothesis. = 4 is not in the rejection level. The p-value for the test p-value = Pr(X = 0, 1, 2, 3, 4, 6, 7, 8, 9, 10 p = 0.5) = as compared to the level = Pr(X= 0 or 1 or 9 or 10 p = 0.5) = So we have that the level and the p-value each depend on the null hypothesis and a rejection region, but not necessarily the same rejection region. The power and β error depend on the alternative hypothesis and the rejection region. We will reject the null hypothesis in favor of the alternative hypothesis if the p-value <. We will fail to reject the null hypothesis, if the p-value >. We fail to reject the null hypothesis when the p-value equals Eample of Hypothesis testing given a normal distribution with σ known (I think that the probability of actually knowing σ for a given study is smaller than my probability of winning the lottery {and I don t buy lottery tickets}, but the assumption simplifies our eample): Let us assume that we know that the population of all high school aged kids in Houston has a mean SBP of 125 mm Hg (Rosner labels this population mean μ 0 ) and a known standard deviation of 50 mm Hg. Suppose we obtain a sample of 25 kids from the High School for the Performing Arts (HSPVA) and find that their mean systolic blood pressure is 142 mm Hg. The question is: Does our sample of 25 kids seem to be from the population with mean SBP = 125 mm Hg or does it seem more likely that they are from another population. So our null hypothesis is: H 0 : μ = 125 mm Hg The μ in the null hypothesis above is the mean for the population from which the sample of 25 kids was drawn. The number 125 mm Hg is the mean of the population of all high school aged kids in Houston (i.e. μ 0 = 125). So we are asking are the means of the two populations the same. If we assume the variances are the same, then we are asking are the two populations the same. Page -4-

7 There are a number of possible forms for the alternative hypothesis: 1) H A : μ 125 (two-tailed alternative) 2) H A : μ > 125 (one-tailed upper tail) 3) H A : μ < 125 (one-tailed lower tail) 4) H A : μ = 162 (some specific value) The form of the alternative hypothesis is picked prior to collecting the sample information. Please note that in the hypothesis testing setup, μ is the mean of the population from which the sample (of 25) was drawn (we hope this population is the same as the one with mean = 125). So μ = 125 essentially asks if the sample was drawn from the same population as the population with mean = 125 (i.e. are the two populations the same). Or another way to think of the question is: is the mean of our sample of 25 consistent with the null hypothesis or with the alternative hypothesis? Let us assume the following: (1) the two-tailed version (i.e. H A : μ 125) of the alternative hypothesis (2) X is the random variable associated with the SBP of all high school kids in Houston (as opposed to the sample from HSPVA). (3) X ~ N( 125, 50 2 ) (4) the level of significance is 0.05 (i.e. = 0.05). Note that to select and then determine the rejection region formed by that level is the usual procedure. In the Riverboat gambler problem we selected the rejection region and then calculated for pedagogical reasons. The sample mean ( = 142 mm Hg) of our sample of 25 kids from HSPVA is our test statistic just like = 2 heads was the test statistic for the coin toss problem. Under the null hypothesis (i.e. assume that the sample of 25 HSPVA students is from the population of all high school students) the distribution of the sample means of all samples of size n = 25 (i.e. the sampling distribution) with σ = 25 (SD of the population of all Houston kids) is the normal distribution with mean = 125 mm Hg and Page -5-

8 50 25 X ~ N 125, SD = 10 mm Hg (i.e ). Or because X ~ N( 125, 50 2 ) and n, our sample size, is equal to In the Riverboat gambler problem, we selected the rejection region and then found the level that went with it. We did it in that fashion so it would be clear that the rejection region contained the values that we thought were unlikely to occur if indeed the coin was fair. Another way to decide on the rejection region is to first select the level (usually something like 0.05 or 0.01) and then find the region (which would be in two pieces for a two-tailed test or one piece for a one-tailed test) whose area is equal to the level. Selecting first is the usual way of doing things. For the current problem we ll choose the level to be Since we are dealing with a two-tailed test, we are looking for the values that cutoff the upper and lower area of the curve. We know that on the N(0,1) curve and 1.96 cut off the lower and upper areas respectively. The question is what points on the N(125,10 2 ) [the distribution for X] curve are the equivalents of and 1.96 (i.e. what points are 1.96 standard deviations below and above 125). 125 = = If, then = (10)(1.96) = and if, then = (10)(-1.96) = This means that the rejection region consists of all such that < or >144.6 (Notice that the endpoints are in the acceptance region.) and the acceptance region is [105.4, 144.6] where the square brackets indicate that the interval includes the end points. Another way to say this is: Pr( X is in the rejection region μ = 125 and σ = 10) = Pr( X < or X > μ = 125 and σ = 10) = 0.05 Note that 142 (the mean SBP of our sample of 25 kids) falls in the acceptance region (see Figure 1 below) so we would say that we accept the null hypothesis (or that we fail to reject the null hypothesis). Page -6-

9 Yet another way of saying this is that we believe our sample of 25 kids could have come from the normal distribution with μ = 125 and σ = 50 (i.e. the distribution of all Houston kids). Fail to reject the null hypothesis is the usual way statisticians report the results, but you ll never see this in a journal article. Notice that what we did was assume the sample was from the population of all high school kids and then look to see if that made sense or to see if the sample really seemed to fit with the population of all high school students. Figure 1.04 Normal Density with Mean = 125 and SD = 10 The vertical lines are at = & = Normal density with Mean = 125 and SD = 10 The vertical lines are at = and = The lines are at = & = Sample Means Each stripped area is half of the rejection region. Each area = The rejection region is the stripped area (sum of the two areas actually) in Figure 1 above. To find the p-value that goes with this decision, we need to find the area to the right of 142 under the N(125,100) curve and double it because we are dealing with a two-tailed alternative. Remember to find the p-value you need to find the smallest rejection region that includes 142. This would be the rejection region where 142 is the cutoff (see Figure 2 below). To get the probability associated with the region under the N(125,100) curve and to the right of 142 we need to translate 142 to the N(0,1) curve by finding how many standard deviations 142 is from 125. Page -7-

10 is 1.7 SD s from 125 since 10 = 1.70 Figure 2 Normal Density with Mean = 125 and SD = 10 The vertical lines are at = & = (for a level) and = 142 (for half the p-value).04 Normal density with Mean = 125 and SD = 10 The vertical lines are at = and = The dashed line is at 142 The stripped area is half of the p-value Sample Means Each area beyond the solid line is half of the rejection region (i.e ) According to the tables or Stata, the probability to the right of 1.7 is Since we are dealing with a two-tailed test, the p-value = = You could get this using STATA:. di 2*(1 - normal(( )/10)) So we fail to reject the null hypothesis because p-value >. Notice we are using the normal distribution with the hypothesized mean as opposed to the sample mean. Page -8-

11 A Confidence interval approach to the same problem nother way we could look at this problem (H 0 : μ = 125 mm Hg versus H A : μ 125 mm Hg assuming X follows a normal distribution, σ known to be 50 and = 0.05) would be through confidence intervals: The 95% CI for 142 (i.e. you get the confidence interval about the sample value) given that n = 25 and σ for X is 50 (the original distribution - the population of all Houston kids) would be ( ( z σ / n), + ( z σ / n)) 1 ( 2) 1 ( 2) (. 196)( 50) (. 196)( 50) 142, = ( , ) = (122.4, 161.6) Notice that 125 (the null hypothesis) is in this 95% confidence interval for 142, so 142 would not be considered different from 125 at = 0.05 level. This is the same conclusion we reached earlier. Using the confidence interval, we know the set of values (population means) that 142 does not differ from, not just that 142 doesn t differ from 125. However, we can t calculate the p-value. So we gain something with the confidence interval approach and lose something. Comparison of processes for hypothesis testing and confidence interval approach: For the confidence interval approach the process is to find the confidence interval for the sample mean (142) and then to check whether the population mean (125) is in that confidence interval (i.e. you start with the sample). Using the hypothesis testing approach, you start with the distribution of X under the null hypothesis (i.e. you get an acceptance region about the population mean 125) and look to see if the sample mean (142) lies in this acceptance region. Page -9-

12 Same problem but with the alternative hypothesis changed to H A : μ > 125 (i.e. one-tailed upper tailed). Let us work through the problem again this time using the #2 form of the alternative hypothesis (i.e H A : μ > 125) but keep the = 0.05 and σ for X at 50. Selecting this particular form of the alternative hypothesis means that we will reject the null hypothesis only when the sample mean is too big. Note that the alternative hypothesis is selected prior to the collection of the sample data. Now, using the hypothesis testing approach, is our sample mean of 142 from a sample of 25 still consistent with the null hypothesis? We will still use the distribution of sample means (i.e. distribution of X) with mean = 125 and SD = 10, but now the critical region is all in the upper tail (i.e. we reject the null hypothesis and accept the alternative only when the sample mean is too big). So again we have the critical and acceptance regions in terms of the relationship to the population mean (125). This means that the entire 0.05 will be in the upper tail because we reject only if 142 is too big. We know that cuts off the upper 5% of the N(0,1) curve. So if we solve the following, we will have the equivalent number for the N(125,10 2 ) curve. Figure 3 Normal Density with Mean = 125 and SD = 10 The vertical line is drawn at = One-tailed test. Normal density with Mean = 125 and SD = 10 The vertical line is at The stripped area is the rejection region. The area = X = Sample bar Means Page -10-

13 125 = If, then = The vertical line on the graph below is at which cuts off 5% of the upper tail. Notice that is smaller than that cut off in the upper tail. And also smaller than 142 which cuts off (i.e. half the p-value from the two-tailed test) in the upper tail. Notice that 142 (the sample mean) is now in the rejection region (i.e. 142 > ) whereas it was in the acceptance region for the two-tailed test To obtain the p-value we again solve 10 di 1 - normal(1.7) = = 1.70 So the p-value = (note that we do not multiply by 2 because this is a one-tailed test). Since 142 is in the rejection region, we epected the p-value to be less than which is equal to For the two-tailed test the test statistic 142 fell in the acceptance region (so we accepted the null hypothesis or failed to reject the null hypothesis) and the p-value was equal to But with the one-tailed test the test statistic falls in the rejection region (so we reject the null hypothesis) and the p-value is (note that is half of ). Therefore, with the same sample size and the same null hypothesis we manage to go from acceptance of the null hypothesis (two-tailed test) to rejection of the null hypothesis (one-tailed test). Note that this is why people want to use a one-tailed test. But be aware that if you have picked the wrong tail (so that the rejection region is on the lower end of the distribution), you could end up with acceptance with a one-tailed test but rejection with a two-tailed. You also need to be aware that it is seldom appropriate to use a one-tailed test. You need to have prior information (i.e. an earlier study done in your lab or published in a journal) that indicates that if the sample mean is different from the population mean it is because the sample mean is bigger than the population mean (version 2 of the alternative hypotheses) or smaller than the population mean (version 3 of the alternative hypotheses). Also particularly note that the alternative hypothesis has to be selected prior to your seeing the data. Investigators like to gamble that they know which tail is appropriate because, as we will see later, you can use a smaller sample size. DON T DO THIS!!!!!! Now consider the confidence interval approach but this time it will be Page -11-

14 a one-sided confidence interval to go with the one-sided test. So we consider a 90% two-sided consider below because if you consider only one side of a 90% confidence interval and you 10% outside the confidence interval with 5% on each side. The confidence interval is again about the sample mean. The 90% confidence interval for 142 is (142 (1.645)(50),142 + (1.645)(50) ) = (125.55, ) The interval (125.55, 4) is the correct one-sided 95% confidence interval. H A :μ > 125 says we reject the null hypothesis in favor of the alternative hypothesis if the sample mean is too big. This can also be translated as the population mean is too small. The confidence interval is about the sample mean 142. So for the population mean to be too small it must be outside the lower bound of the confidence interval. Notice that the tail of this confidence interval is in the same direction as the cutoff for the one-tailed hypothesis test. The population mean 125 is not in the confidence interval so we reject the null hypothesis. Notice that for the hypothesis test we reject the null hypothesis when the sample mean is too big. For the confidence interval we reject the null hypothesis when the population mean is too small. The conclusions with the two-sided hypothesis test and the two-sided confidence interval should be the same provided the level is the same for both. Similarly for the one-sided hypothesis test and one-sided confidence interval. Revisiting the problem with H A as a single number rather than a range of numbers. Now suppose we use the #4 form of the alternative hypothesis (i.e that H A: μ = 162). Now we have two normal distributions for X each with σ = 10 but one with mean = 125 and one with mean = 162. We should note that this sort of problem is not commonly considered. H 0 : μ = 125 versus H A : μ = 162 Page -12-

15 We are going to do a one-tailed upper tailed test here (we reject the null hypothesis only when the sample mean is too big). Why are we doing a one-tailed test here? We want to know if our sample mean of 142 is consistent with 125 or 162. So we reject the null hypothesis when the sample means looks more like 162 than 125. Well numbers smaller than 125 are not going to cause us to reject the null hypothesis in favor of 162. Only if the numbers are too big would we be willing to think that 162 was more appropriate than 125. With this one-tailed test we can actually show the area that goes with the power. Earlier we found that if = 0.05, the cutoff for this upper tailed region is or Pr( X > μ = 125) = 0.05 Figure 4 Rejection Region Normal Distributions with SD = 10 and Means = 125 and Sampling Distribution according to H 0 Sampling Distribution according to H A The line is = The stripped area is the rejection region. Area = Sample Means Power is the probability of being in the rejection region for the null hypothesis when the alternative hypothesis is true (i.e. correctly rejecting the null hypothesis). So the power for this problem is the probability of being to the right of (i.e. being in the rejection region for the null hypothesis) but under the alternative hypothesis curve (i.e. the one with mean = 162). Page -13-

16 So Pr( X > μ = 162) = = power [see Figure 5 below]. Recall that on the normal curve with mean 162 and SD = 10 is equivalent to ( )/10 = (i.e is 2.1 standard deviations below the mean of 162) Using Stata, we get power = 1 - normal(-2.055) normal(-2.055) gives the are under the curve with μ = 162 and to the left of the line at (i.e. β = 1 - power). It is the area under the rest of the curve (i.e. to the right of ) that is the power (i.e. 1 - β). Figure 5 Power Normal Distributions with SD = 10 and Means = 125 and Sampling Distribution according to H 0 Sampling Distribution according to H A The hatched area is the power = The line is = Sample Means Page -14-

17 You need to be aware that: 1) When we were looking at the problems above as hypothesis testing problems, we used the population parameters (here population mean = 125 and population SE = 10) to obtain a rejection region and then asked was the sample statistic (here = 142) in the rejection region. 2) However, when considering the problems using the confidence interval approach, we obtained the confidence interval about the sample statistic ( = 142) and then asked if the population parameter (i.e. the population mean 125) was in the confidence interval. Why do people select a one-tailed versus a two-tailed test? We saw above that it is possible to reject the null hypothesis using a one-tailed test but fail to reject the null hypothesis using a two-tailed test. So if you were trying to prove that your new drug is better than the standard of care you might be tempted to use a one-tailed test. What are the drawbacks to a one-tailed test? Well you might have guessed the wrong tail. If before we had obtained a sample (and this is the way you are supposed to play the game), you said if your sample of kids differed from the population with respect to SBP, it would be because the SBP of your sample of kids would be too small. This means that in Figure 3 above the rejection region would be the area to the left of = = would be in the acceptance region. The wrong guess has cost you a significant answer.. So Page -15-

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Introduction to Hypothesis Testing

Introduction to Hypothesis Testing I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010 MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935) Section 7.1 Introduction to Hypothesis Testing Schrodinger s cat quantum mechanics thought experiment (1935) Statistical Hypotheses A statistical hypothesis is a claim about a population. Null hypothesis

More information

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Odds ratio, Odds ratio test for independence, chi-squared statistic. Odds ratio, Odds ratio test for independence, chi-squared statistic. Announcements: Assignment 5 is live on webpage. Due Wed Aug 1 at 4:30pm. (9 days, 1 hour, 58.5 minutes ) Final exam is Aug 9. Review

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

Hypothesis Testing for Beginners

Hypothesis Testing for Beginners Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

22. HYPOTHESIS TESTING

22. HYPOTHESIS TESTING 22. HYPOTHESIS TESTING Often, we need to make decisions based on incomplete information. Do the data support some belief ( hypothesis ) about the value of a population parameter? Is OJ Simpson guilty?

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so: Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Testing a claim about a population mean

Testing a claim about a population mean Introductory Statistics Lectures Testing a claim about a population mean One sample hypothesis test of the mean Department of Mathematics Pima Community College Redistribution of this material is prohibited

More information

p-values and significance levels (false positive or false alarm rates)

p-values and significance levels (false positive or false alarm rates) p-values and significance levels (false positive or false alarm rates) Let's say 123 people in the class toss a coin. Call it "Coin A." There are 65 heads. Then they toss another coin. Call it "Coin B."

More information

Normal distribution. ) 2 /2σ. 2π σ

Normal distribution. ) 2 /2σ. 2π σ Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a

More information

Mind on Statistics. Chapter 12

Mind on Statistics. Chapter 12 Mind on Statistics Chapter 12 Sections 12.1 Questions 1 to 6: For each statement, determine if the statement is a typical null hypothesis (H 0 ) or alternative hypothesis (H a ). 1. There is no difference

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

Stat 20: Intro to Probability and Statistics

Stat 20: Intro to Probability and Statistics Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015 Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation

More information

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont

CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont CONTINGENCY TABLES ARE NOT ALL THE SAME David C. Howell University of Vermont To most people studying statistics a contingency table is a contingency table. We tend to forget, if we ever knew, that contingency

More information

Pearson's Correlation Tests

Pearson's Correlation Tests Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

More information

1 Hypothesis Testing. H 0 : population parameter = hypothesized value:

1 Hypothesis Testing. H 0 : population parameter = hypothesized value: 1 Hypothesis Testing In Statistics, a hypothesis proposes a model for the world. Then we look at the data. If the data are consistent with that model, we have no reason to disbelieve the hypothesis. Data

More information

Confidence Intervals for Cpk

Confidence Intervals for Cpk Chapter 297 Confidence Intervals for Cpk Introduction This routine calculates the sample size needed to obtain a specified width of a Cpk confidence interval at a stated confidence level. Cpk is a process

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

Correlational Research

Correlational Research Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

More information

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

LAB : THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Period Date LAB : THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Lab 11. Simulations. The Concept

Lab 11. Simulations. The Concept Lab 11 Simulations In this lab you ll learn how to create simulations to provide approximate answers to probability questions. We ll make use of a particular kind of structure, called a box model, that

More information

CHAPTER 14 NONPARAMETRIC TESTS

CHAPTER 14 NONPARAMETRIC TESTS CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences

More information

2 Sample t-test (unequal sample sizes and unequal variances)

2 Sample t-test (unequal sample sizes and unequal variances) Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing

More information

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen. http://www.let.rug.nl/nerbonne/teach/statistiek-i/ Statistiek I Proportions aka Sign Tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/34 Proportions aka Sign Test The relative frequency

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

6.3 Conditional Probability and Independence

6.3 Conditional Probability and Independence 222 CHAPTER 6. PROBABILITY 6.3 Conditional Probability and Independence Conditional Probability Two cubical dice each have a triangle painted on one side, a circle painted on two sides and a square painted

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction

Hypothesis Testing. Reminder of Inferential Statistics. Hypothesis Testing: Introduction Hypothesis Testing PSY 360 Introduction to Statistics for the Behavioral Sciences Reminder of Inferential Statistics All inferential statistics have the following in common: Use of some descriptive statistic

More information

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION

16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION 6. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION It is sometimes difficult to directly compute probabilities for a binomial (n, p) random variable, X. We need a different table for each value of

More information

Section 1. Inequalities -5-4 -3-2 -1 0 1 2 3 4 5

Section 1. Inequalities -5-4 -3-2 -1 0 1 2 3 4 5 Worksheet 2.4 Introduction to Inequalities Section 1 Inequalities The sign < stands for less than. It was introduced so that we could write in shorthand things like 3 is less than 5. This becomes 3 < 5.

More information

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript JENNIFER ANN MORROW: Welcome to "Introduction to Hypothesis Testing." My name is Dr. Jennifer Ann Morrow. In today's demonstration,

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Testing Hypotheses About Proportions

Testing Hypotheses About Proportions Chapter 11 Testing Hypotheses About Proportions Hypothesis testing method: uses data from a sample to judge whether or not a statement about a population may be true. Steps in Any Hypothesis Test 1. Determine

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics

AP: LAB 8: THE CHI-SQUARE TEST. Probability, Random Chance, and Genetics Ms. Foglia Date AP: LAB 8: THE CHI-SQUARE TEST Probability, Random Chance, and Genetics Why do we study random chance and probability at the beginning of a unit on genetics? Genetics is the study of inheritance,

More information

Confidence Intervals for Cp

Confidence Intervals for Cp Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process

More information

The Binomial Distribution

The Binomial Distribution The Binomial Distribution James H. Steiger November 10, 00 1 Topics for this Module 1. The Binomial Process. The Binomial Random Variable. The Binomial Distribution (a) Computing the Binomial pdf (b) Computing

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals. 1 BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

Estimation of σ 2, the variance of ɛ

Estimation of σ 2, the variance of ɛ Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1 Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

MA 1125 Lecture 14 - Expected Values. Friday, February 28, 2014. Objectives: Introduce expected values.

MA 1125 Lecture 14 - Expected Values. Friday, February 28, 2014. Objectives: Introduce expected values. MA 5 Lecture 4 - Expected Values Friday, February 2, 24. Objectives: Introduce expected values.. Means, Variances, and Standard Deviations of Probability Distributions Two classes ago, we computed the

More information

6: Introduction to Hypothesis Testing

6: Introduction to Hypothesis Testing 6: Introduction to Hypothesis Testing Significance testing is used to help make a judgment about a claim by addressing the question, Can the observed difference be attributed to chance? We break up significance

More information

A Statistical Analysis of Popular Lottery Winning Strategies

A Statistical Analysis of Popular Lottery Winning Strategies CS-BIGS 4(1): 66-72 2010 CS-BIGS http://www.bentley.edu/csbigs/vol4-1/chen.pdf A Statistical Analysis of Popular Lottery Winning Strategies Albert C. Chen Torrey Pines High School, USA Y. Helio Yang San

More information

Analysis and Interpretation of Clinical Trials. How to conclude?

Analysis and Interpretation of Clinical Trials. How to conclude? www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB DIRECTIONS: Welcome! Your TA will help you apply your Calculator and MINITAB to review Business Stats, and will

More information

Point and Interval Estimates

Point and Interval Estimates Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Chapter 26: Tests of Significance

Chapter 26: Tests of Significance Chapter 26: Tests of Significance Procedure: 1. State the null and alternative in words and in terms of a box model. 2. Find the test statistic: z = observed EV. SE 3. Calculate the P-value: The area under

More information

STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science

STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science STA 130 (Winter 2016): An Introduction to Statistical Reasoning and Data Science Mondays 2:10 4:00 (GB 220) and Wednesdays 2:10 4:00 (various) Jeffrey Rosenthal Professor of Statistics, University of Toronto

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

The overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS 0 400 800 1200 1600 NUMBER OF TOSSES

The overall size of these chance errors is measured by their RMS HALF THE NUMBER OF TOSSES NUMBER OF HEADS MINUS 0 400 800 1200 1600 NUMBER OF TOSSES INTRODUCTION TO CHANCE VARIABILITY WHAT DOES THE LAW OF AVERAGES SAY? 4 coins were tossed 1600 times each, and the chance error number of heads half the number of tosses was plotted against the number

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information