STT 430/630/ES 760 Lecture Notes, February 23, 2009. Chapter 6: Introduction to Hypothesis Testing


One of the primary uses of statistics is to use data to infer something about a population or a probability model. Populations and probability models are defined by parameters. For instance, let $\mu_1$ and $\mu_2$ be the mean cholesterol levels of people who eat two different diets. We may want to know whether diet makes a difference, on average, in cholesterol levels. To test this, we can collect data on cholesterol levels from people who eat the two different diets and perform a hypothesis test to determine if the mean cholesterol levels differ depending on diet. This chapter introduces the basics of hypothesis testing: terminology, the logic of hypothesis testing, test statistics, and decision making.

1 Introduction

To introduce the topic of hypothesis testing, we begin with an illustrative example.

Example: Body Temperatures. It is widely believed that the average body temperature for healthy adults is 98.6 degrees Fahrenheit. Is this true? Where did the 98.6 degree value come from? Do all healthy people have exactly the same body temperature? A study was conducted a few years ago to examine this belief. The body temperatures of $n = 130$ healthy adults were measured (half male and half female). The average temperature from the sample was found to be $\bar{x} = 98.25$ with standard deviation $s = 0.73$. Do these statistics contradict the belief that the average body temperature is 98.6? If the true average temperature is indeed 98.6 and we obtain a sample of $n = 130$ healthy adults, we would not expect the sample mean to come out exactly equal to 98.6. We observed $\bar{x} = 98.25$. Can this deviation from 98.6 be explained by chance, or is it unlikely that we would observe a value this different from 98.6? Two people debating this issue could come to different conclusions.
What is needed is an objective method to determine if the data contradict the hypothesis that the average body temperature is 98.6. In this example, the parameter of interest is $\mu$, the mean temperature of healthy adults. We want to test a hypothesis about $\mu$. Hypothesis testing starts by proposing a hypothetical value for $\mu$, which we denote by $\mu_0$. The null hypothesis, denoted $H_0$, specifies that $\mu = \mu_0$:

$H_0: \mu = \mu_0.$

In the body temperature example, $\mu_0 = 98.6$ and the null hypothesis is $H_0: \mu = 98.6$. Typically, the null hypothesis represents the status quo. The purpose of many studies is to determine if the data lead us to reject the null hypothesis. The alternative hypothesis, denoted $H_a$, is set up to represent the research goal:

$H_a: \mu \neq \mu_0.$

In the body temperature example, we have $H_a: \mu \neq 98.6$. This is an example of a two-sided alternative because we will reject the null hypothesis if there is evidence that the true mean lies to either side (greater or less than) of the hypothesized mean value of 98.6.

The way the alternative hypothesis is set up depends on the scientific objective at hand. Many alternative hypotheses are one-sided. For example, if we want to determine whether an environmental toxin such as PCB reduces the mean eggshell thickness of pelicans, then we would set up our hypotheses as

$H_0: \mu = \mu_0$ versus $H_a: \mu < \mu_0$,

where $\mu_0$ is the mean thickness for birds not exposed to PCB. On the other hand, if we want to test whether a new drug increases the mean survival time for people suffering from a particular type of cancer, then we would set up our hypotheses as

$H_0: \mu = \mu_0$ versus $H_a: \mu > \mu_0$,

where $\mu_0$ is the mean survival time without the medication.

Once the data are collected and analyzed, a decision has to be made. Should we reject $H_0$ and accept $H_a$? Or is there insufficient evidence to reject $H_0$? When making decisions, there are four possible scenarios and two of them involve errors:

1. Accept $H_0$ when in fact $H_0$ is true (good decision).
2. Accept $H_0$ when in fact $H_0$ is false (an error).
3. Reject $H_0$ when in fact $H_0$ is true (an error).
4. Reject $H_0$ when $H_0$ is false (good decision).

The two possible errors above have names:

Definition. A Type I Error is rejecting the null hypothesis $H_0$ when $H_0$ is true. The probability of committing a type I error is denoted by $\alpha$ and is called the significance level of the test.

Definition. A Type II Error is accepting $H_0$ when $H_0$ is false. The probability of a type II error is denoted by $\beta$.

Definition. The power of a statistical test is $1 - \beta$, which is the probability of rejecting $H_0$ when $H_0$ is false.

When testing hypotheses, we would like the test to have high power, meaning that with high probability it concludes the null hypothesis is false when it really is false. We would also like the probability of a type I error, $\alpha$, to be small.
Unfortunately, for a fixed sample size, making the probability of a type I error smaller makes the test less powerful, and making the test more powerful leads to a higher type I error probability. Therefore, a compromise is needed between these competing goals when performing hypothesis testing. Generally, tests are set up so as to keep the probability of committing a type I error small. Typical values for the significance level $\alpha$, the probability of a type I error, used in practice include 0.05 and 0.01. We do not want to reject a null hypothesis that is true. In the body temperature example, committing a type I error means that one would conclude the average body temperature differs from 98.6 when in fact the average body temperature is 98.6. Most thermometers for humans are marked at 98.6. Imagine throwing all these thermometers out because a scientific study says they are marked wrong and then realizing later that they were actually marked correctly.
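The two error probabilities can be made concrete with a small simulation. The sketch below is only an illustration under simplifying assumptions that are not part of the notes at this point: the population standard deviation is treated as known (so a z-test with a standard normal cut-off is used instead of a t-test), and the values `sigma=0.7`, `n=30`, and the alternative mean 98.3 are hypothetical. It draws many samples, once with $H_0$ true and once with $H_0$ false, and records how often $H_0$ is rejected.

```python
import random
from statistics import NormalDist, fmean


def rejection_rate(true_mean, mu0=98.6, sigma=0.7, n=30,
                   alpha=0.05, trials=2000, seed=1):
    """Fraction of simulated samples in which a two-sided z-test rejects H0.

    Assumes sigma is known; all default values are hypothetical.
    """
    rng = random.Random(seed)
    crit = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided cut-off
    se = sigma / n ** 0.5                       # standard error of the mean
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(true_mean, sigma) for _ in range(n)]
        z = (fmean(sample) - mu0) / se
        if abs(z) > crit:
            rejections += 1
    return rejections / trials


# H0 true: the rejection rate estimates the type I error probability alpha.
type_one = rejection_rate(true_mean=98.6)
# H0 false: the rejection rate estimates the power 1 - beta.
power = rejection_rate(true_mean=98.3)

print(f"estimated type I error rate: {type_one:.3f}")
print(f"estimated power:             {power:.3f}")
```

Rerunning `rejection_rate(true_mean=98.3, alpha=0.01)` should give a lower rejection rate than at `alpha=0.05`, which is the trade-off described above.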

A useful analogy for hypothesis testing is a court of law. The defendant is assumed innocent until proven guilty. Thus, the null hypothesis is that the defendant is innocent and the alternative hypothesis is that the defendant is guilty:

$H_0$: Innocent versus $H_a$: Guilty.

The trial starts and evidence is presented. In the statistical setting, the data are the evidence. Do the data allow us to reject $H_0$ and conclude $H_a$? Convicting an innocent person is committing a type I error: rejecting $H_0$ when it is true. We certainly do not want to convict innocent people, so we set up hypothesis tests to keep the probability of committing this error small. On the other hand, we do not want to let a guilty defendant go free (i.e. commit a type II error). Note that there are two reasons a defendant will not be convicted in practice:

1. The defendant is innocent ($H_0$ is true).
2. The defendant is guilty ($H_0$ is false), but we lack enough evidence to convict (a type II error).

In statistics, lack of evidence corresponds to lack of data. If we do not have much data (i.e. the sample size is too small), then we will lack the evidence needed to reject $H_0$ when it is false. We do not necessarily say a defendant is innocent (accept the null hypothesis) if we fail to convict, because the failure could be due to insufficient evidence (reasonable doubt remains). Similarly, in hypothesis testing, if we do not reject the null hypothesis, we generally refrain from saying that we accept the null hypothesis (to guard against a type II error); instead we may say that we fail to reject $H_0$.

Students often find the logic of hypothesis testing difficult to understand at first. The court of law analogy should help with the understanding.

2 t-test

Now for the mechanics of performing a hypothesis test. The main idea is to compare the observed sample mean $\bar{x}$ to the hypothesized value $\mu_0$ and see if the difference $\bar{x} - \mu_0$ is big or small.
However, in order to have a framework to decide if the difference is big or not, we need to standardize this difference so that we will have a scale for comparisons. The standardized difference is our test statistic $t$:

t-test Statistic:
$$t = \frac{\bar{x} - \mu_0}{s/\sqrt{n}}. \tag{1}$$

Fact: If $H_0$ is true, then the test statistic $t$ in (1) follows a t-distribution on $n - 1$ degrees of freedom, provided the data are from a normal distribution.

Thus, the t-distribution is our reference distribution for making a decision. If our observed test statistic $t$ looks like it came from the t-distribution, then that would be consistent with the null hypothesis. However, if $t$ does not look like it came from a t-distribution (because it is too big), then the standardized difference $t = (\bar{x} - \mu_0)/(s/\sqrt{n})$ is too big to be explained by chance and therefore we would reject the null hypothesis.
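As a minimal sketch of formula (1), the function below computes the t statistic from raw observations using only the Python standard library. The function name `t_statistic` and the eight temperature readings are hypothetical and chosen purely for illustration; they are not the study data.

```python
from math import sqrt
from statistics import fmean, stdev


def t_statistic(sample, mu0):
    """Standardized difference (x-bar - mu0) / (s / sqrt(n)) from formula (1)."""
    n = len(sample)
    xbar = fmean(sample)   # sample mean
    s = stdev(sample)      # sample standard deviation (n - 1 divisor)
    return (xbar - mu0) / (s / sqrt(n))


# Hypothetical readings, for illustration only.
temps = [98.2, 98.6, 97.9, 98.4, 98.0, 98.8, 98.1, 98.3]
t = t_statistic(temps, mu0=98.6)
df = len(temps) - 1        # degrees of freedom of the reference t-distribution

print(f"t = {t:.3f} on {df} degrees of freedom")
```

The sign of `t` shows on which side of the hypothesized mean the sample mean fell; its magnitude is what gets compared with the reference t-distribution.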

We need a cut-off value to decide if our test statistic $t$ is too big.

Definition. The Rejection Region is the range of values of the test statistic $t$ for which $H_0$ is rejected.

To determine the cut-off value, one chooses a significance level $\alpha$ (e.g. $\alpha = 0.05$). Recall that $\alpha$ is the probability of making a type I error (i.e. rejecting $H_0$ when $H_0$ is true). In the body temperature example, our alternative hypothesis is two-sided, which means we will reject $H_0$ if $t$ is too big in the positive or negative direction. Let us denote our cut-off value by $c$. Then we need to choose $c$ such that

$$\alpha = P(\text{Rejecting } H_0 \text{ when } H_0 \text{ is true}) = P(t > c \text{ or } t < -c, \text{ when } \mu = \mu_0).$$

This implies that $c$ must equal the $100(1 - \alpha/2)$th percentile of the t-distribution on $n - 1$ degrees of freedom, denoted $t_{n-1,\,1-\alpha/2}$. Summarizing, we have the following testing procedure:

Two-Sided Test for the Mean of a Normal Distribution

Hypotheses: $H_0: \mu = \mu_0$ versus $H_a: \mu \neq \mu_0$.

Test Statistic: $t = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$.

Rejection Region: Reject $H_0$ if $t > t_{n-1,\,1-\alpha/2}$ or $t < -t_{n-1,\,1-\alpha/2}$. Fail to reject $H_0$ otherwise.

Notice that we had to split the significance level $\alpha$ in two for the two-sided test. For one-sided tests, we do not need to split the significance level because we will only reject $H_0$ if the test statistic lies to one side of the hypothesized mean. Using the same reasoning, we get the following results for one-sided tests:

$H_0: \mu = \mu_0$ versus $H_a: \mu < \mu_0$
Rejection Region: Reject $H_0$ if $t < -t_{n-1,\,1-\alpha}$. Otherwise, fail to reject $H_0$.

$H_0: \mu = \mu_0$ versus $H_a: \mu > \mu_0$
Rejection Region: Reject $H_0$ if $t > t_{n-1,\,1-\alpha}$.

Otherwise, fail to reject $H_0$.

Body Temperature Example continued. We can apply the hypothesis testing procedure to the body temperature example. Let us test the hypothesis using a significance level of $\alpha = 0.05$. Recall that the sample size is $n = 130$. Because it is a two-sided alternative, we shall reject the null hypothesis if our test statistic exceeds $t_{n-1,\,1-\alpha/2} = t_{129,\,0.975}$ in absolute value. Using SAS's tinv function, we find that $t_{129,\,0.975} \approx 1.98$. Thus, if the test statistic $t$ exceeds 1.98 in magnitude, we shall reject the null hypothesis that the average body temperature is $\mu_0 = 98.6$. If $t$ does not exceed 1.98 in magnitude, then we will not reject $H_0$ and conclude that there is insufficient evidence that the mean body temperature differs from 98.6.

The sample mean and standard deviation for the body temperature data are $\bar{x} = 98.25$ and $s = 0.73$ respectively. The test statistic is

$$t = \frac{\bar{x} - \mu_0}{s/\sqrt{n}} = \frac{98.25 - 98.6}{0.73/\sqrt{130}} = -5.455.$$

The test statistic $t = -5.455 < -1.98$ falls in the rejection region, so we reject $H_0$ using a significance level $\alpha = 0.05$. Here is the logic: if $H_0$ is true and the average body temperature is indeed 98.6, then our test statistic $t$ should look like it came from a t-distribution on 129 degrees of freedom; but the observed test statistic is $t = -5.455$. It is very unlikely that this value of $t$ would be produced by a t-distribution on 129 degrees of freedom. Therefore, the assumption that $H_0$ is true does not seem plausible. Hence we reject $H_0$ and accept $H_a$.

Conclusion: At the 5% significance level we reject $H_0$ and conclude that the average body temperature for healthy adults differs from 98.6.

Here are two important notes about the hypothesis test:

1. When stating your conclusion, state the significance level at which you conducted the test so the reader will know the strength of the statistical evidence against the null hypothesis. We rejected $H_0$ using $\alpha = 0.05$. Therefore, with this testing procedure, there is only a 5% probability of incorrectly rejecting a true null hypothesis.

2. Your conclusion should be stated to be consistent with your hypotheses. Recall that we rejected $H_0$ and accepted the alternative hypothesis that $\mu \neq 98.6$. Therefore, our conclusion is that the mean body temperature differs from 98.6. We cannot change our mind after seeing the data and decide we want the alternative hypothesis to be $H_a: \mu < 98.6$ because the sample mean came out lower than 98.6. Doing so would inflate the type I error probability.
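The critical value method for this example can be sketched in a few lines. The helper names `t_from_summary` and `decide` are hypothetical, and the critical value 1.98 is simply passed in (it would come from a table or from a function such as SAS's tinv). The sample mean 98.25 used here is an assumed, rounded value, so the computed statistic (about -5.47) differs slightly from the -5.455 reported in the notes.

```python
from math import sqrt


def t_from_summary(xbar, s, n, mu0):
    """t statistic computed from summary statistics."""
    return (xbar - mu0) / (s / sqrt(n))


def decide(t, crit, alternative="two-sided"):
    """Apply the rejection-region rule.

    crit is the positive critical value: t_{n-1,1-alpha/2} for a
    two-sided test, t_{n-1,1-alpha} for a one-sided test.
    """
    if alternative == "two-sided":
        reject = abs(t) > crit
    elif alternative == "less":       # Ha: mu < mu0
        reject = t < -crit
    elif alternative == "greater":    # Ha: mu > mu0
        reject = t > crit
    else:
        raise ValueError("alternative must be 'two-sided', 'less' or 'greater'")
    return "reject H0" if reject else "fail to reject H0"


# Body temperature example; xbar is an assumed rounded value.
t = t_from_summary(xbar=98.25, s=0.73, n=130, mu0=98.6)
print(f"t = {t:.2f}: {decide(t, crit=1.98)}")
```

Keeping the alternative as an explicit argument makes it harder to switch from a two-sided to a one-sided test after looking at the data, which is the mistake warned against in note 2.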

3 p-values

The method of testing described above is known as the critical value method, since we specified a significance level $\alpha$ which in turn determined a cut-off value or critical value (in this case $t_{n-1,\,1-\alpha/2}$) for deciding whether or not to reject the null hypothesis. To use this method, one needs to decide upon the significance level $\alpha$, and this decision will depend on the type of application. Typically, the value $\alpha = 0.05$ is used. For better assurance against committing a type I error, one could use a smaller value such as $\alpha = 0.01$. Using a smaller $\alpha$ will make it more difficult to reject the null hypothesis, decreasing the power of the test.

A problem with the critical value method is the following. Recall that the critical value in the body temperature example was 1.98. Suppose the test statistic came out to be $t = -1.94$ (instead of $-5.455$). Then the conclusion of the test would be to not reject $H_0$ at $\alpha = 0.05$. However, a value of $t = -1.94$ is right on the border of the rejection region and the evidence is still fairly strong for rejecting $H_0$, but not quite at the $\alpha = 0.05$ level. If we test at $\alpha = 0.05$, all we can state is that we fail to reject $H_0$ at $\alpha = 0.05$, without letting on that the actual strength of evidence is quite strong.

Also note that the actual test statistic value is $t = -5.455$. Could we have rejected $H_0$ using $\alpha = 0.01$, or an even smaller level? What is the smallest significance level at which we could still reject $H_0$? The answer to this question is the p-value. The p-value is the smallest significance level at which we would reject the null hypothesis. For this reason, the p-value is sometimes called the observed significance level. Let $T$ denote a t-random variable on $n - 1$ degrees of freedom. We can define the p-value as follows:

Definition: The p-value is the probability that the test statistic takes the value we observed, or a value more extreme (further from the null hypothesis), if the null hypothesis is true.
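To make the definition concrete, the following sketch approximates a two-sided p-value, that is, the probability that a t-random variable on the given degrees of freedom is at least as far from zero as the observed statistic. The Python standard library has no t-distribution, so the tail area is obtained here by integrating the t density numerically with Simpson's rule. This is a swapped-in technique for illustration, not the method used in the notes, and the function names are hypothetical.

```python
from math import exp, lgamma, log, pi


def t_pdf(x, df):
    """Density of the t-distribution with df degrees of freedom."""
    log_c = lgamma((df + 1) / 2) - lgamma(df / 2) - 0.5 * log(df * pi)
    return exp(log_c - (df + 1) / 2 * log(1 + x * x / df))


def central_area(t, df, steps=2000):
    """Simpson's-rule approximation of P(0 < T < |t|); steps must be even."""
    b = abs(t)
    if b == 0:
        return 0.0
    h = b / steps
    total = t_pdf(0.0, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return total * h / 3


def two_sided_p(t, df):
    """P(T > |t| or T < -|t|), using the symmetry of the t density."""
    return 2 * (0.5 - central_area(t, df))


# Body temperature example: t = -5.455 on 129 degrees of freedom.
p = two_sided_p(-5.455, 129)
print(f"two-sided p-value: {p:.2e}")
```

As a sanity check, `two_sided_p(1.98, 129)` should come out close to 0.05, matching the critical value used in the body temperature example.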
For a two-sided alternative hypothesis, the p-value is computed as:

$$\text{p-value} = P(T > |t| \text{ or } T < -|t|, \text{ when } H_0 \text{ is true}) \quad \text{(Two-Sided p-value)}.$$

The p-value answers the following question: If the null hypothesis is true, how likely is it that the test statistic takes the value we observed or a more extreme value? If this probability is small, then we reject the null hypothesis. If the p-value is not small, then we do not reject the null hypothesis.

Interpreting p-values. Here are some rough guidelines for interpreting p-values which can be used in any testing scenario (not just for testing hypotheses about the mean). Let $p$ denote the p-value of a test:

If $p \le 0.01$, there is very strong evidence against the null hypothesis.
If $0.01 < p \le 0.05$, there is strong evidence against the null hypothesis.
If $0.05 < p \le 0.10$, the evidence against $H_0$ is moderate.
If $0.10 < p \le 0.20$, the evidence against $H_0$ is fairly weak.
If $p > 0.20$, there is no evidence against $H_0$.

In the body temperature example, the p-value is $P(T < -5.455 \text{ or } T > 5.455)$. Because the t-distribution is symmetric about zero, we can write the p-value as $p = 2P(T > |t|) = 2P(T > 5.455)$. We can use SAS to compute this probability for us using the probt function as follows:

    data;
      p = 2*probt(-5.455, 129);
    proc print;
    run;

The probt function computes cumulative probabilities for the t-distribution. You have to specify the degrees of freedom (in this case $129 = 130 - 1$). Because we are performing a two-sided test, we need to multiply the probability by 2 to get the correct p-value.

Alternatively, SAS's Analyst can perform one-sample t-tests for us automatically: Solutions → Analysis → Analyst. In the window that opens, choose Open by SAS name under the File menu. Double-click on the Work file, which contains all the SAS data sets in operation. Open the SAS data set of interest. At the menu at the top, click Statistics → Hypothesis Tests → One Sample t-test for a Mean. In the window that opens, click on the variable of interest. Be sure to specify the hypothesized mean value $\mu_0$ under NULL. Also specify the correct alternative hypothesis ($\neq$, $<$, $>$). The SAS Analyst output for the body temperature example is below:

    N    Mean    Std. Dev.    Std. Error

    Hypothesis Test
      Null hypothesis:  Mean of temp  = 98.6
      Alternative:      Mean of temp ^= 98.6

      t Statistic    Df    Prob > t
                           <.0001

SAS automatically computes the t-test statistic and the corresponding two-tailed p-value of $p < 0.0001$. Note that the p-value is very small and therefore we have very strong evidence against the null hypothesis. Using the p-value when stating the results of a hypothesis test is quite popular because it allows the reader to see the exact strength of the evidence for or against the null hypothesis.

Here are some additional notes on hypothesis tests:

1. If we are testing a hypothesis using the critical value method and a significance level of $\alpha$ is being used, then we will reject the null hypothesis if the p-value is less than $\alpha$.

2. A statistically significant result does not necessarily mean the result is of scientific significance. If very large sample sizes are used, statistical tests will be very powerful and able to detect minor differences from the null hypothesis. A minor deviation from the null hypothesis may not be of scientific interest. For this reason, some prefer to state results using confidence intervals that give a range of plausible values for the parameter of interest.

3. The t-testing procedure requires that the underlying distribution is normal. The t-testing procedure is fairly robust to departures from normality and will produce approximately valid results for nonnormal distributions provided the distribution does not deviate too strongly from normality. For larger sample sizes, the normality assumption can be relaxed more due to the central limit theorem effect. However, if the distribution does deviate strongly from normality, then an alternative testing procedure might be considered, such as a nonparametric test based on ranks or the bootstrap testing procedure. If the distribution is strongly skewed, then perhaps interest should lie with some aspect of the distribution other than the mean. Figure 1 shows a histogram of the body temperature data. The body temperature distribution in Figure 1 looks consistent with a normal distribution, indicating that the t-test procedure should be reliable.

Figure 1: Histogram of the Body Temperature Data (horizontal axis: Temperature (F)).

4. One-Sided Test. If we want to test $H_0: \mu = \mu_0$ versus $H_a: \mu < \mu_0$, then the p-value $= P(T < t)$. If we want to test $H_0: \mu = \mu_0$ versus $H_a: \mu > \mu_0$, then the p-value $= P(T > t)$.

4 Power and Sample Size

In the body temperature example, we had a sample size of $n = 130$. How did the investigators decide upon this sample size? If a scientist is to embark on a research project, one of the questions that needs to be answered is: how many subjects? One of the common requests of statisticians in biostatistical settings is to determine appropriate sample sizes for a study.
Or, if the investigator has a particular sample size in mind, maybe due to budget or time constraints, it would be useful to know the power of the test. Recall that the power of a test is the probability of rejecting the null hypothesis when it is false. High power is desirable and larger sample sizes lead to higher power. The goal of a study that employs hypothesis testing is to determine if the null hypothesis can be rejected and the alternative hypothesis can be accepted. It would be unfortunate if time and energy are put into a study and the results of the study do not allow the rejection of the null hypothesis when the null hypothesis is false. This is analogous to putting a guilty person on trial but having to acquit due to lack of evidence. Therefore, it is important to power a study adequately in the planning stages.

To illustrate a power computation, suppose we are testing $H_0: \mu = \mu_0$ versus the one-tailed alternative $H_a: \mu < \mu_0$. Suppose the true mean is $\mu_1 < \mu_0$ (and therefore the null hypothesis is false). Suppose also, for the sake of argument, that we know the population standard deviation $\sigma$. If $\sigma$ is known, the critical value of the test will be based on a standard normal distribution instead of the t-distribution, and the test statistic is $z = (\bar{x} - \mu_0)/(\sigma/\sqrt{n})$. Let $z_{1-\alpha}$ denote the $100(1-\alpha)$th percentile of the standard normal distribution. If we are testing at a significance level $\alpha$, then the power of the test is

\begin{align*}
\text{Power} &= P(\text{Rejecting } H_0 \mid H_0 \text{ is false}) \\
&= P\left( \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} < -z_{1-\alpha} \,\Big|\, \mu = \mu_1 \right) \\
&= P\left( \frac{\bar{X}}{\sigma/\sqrt{n}} < \frac{\mu_0}{\sigma/\sqrt{n}} - z_{1-\alpha} \,\Big|\, \mu = \mu_1 \right) \\
&= P\left( \frac{\bar{X} - \mu_1}{\sigma/\sqrt{n}} < \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - z_{1-\alpha} \,\Big|\, \mu = \mu_1 \right) \\
&= P\left( Z < \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - z_{1-\alpha} \right) \\
&= \Phi\left( \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - z_{1-\alpha} \right).
\end{align*}

If $\beta$ denotes the probability of a type II error (failing to reject $H_0$ when $H_0$ is false), then

$$\text{Power} = 1 - \beta.$$

From the preceding computation, we have

$$1 - \beta = \Phi\left( \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - z_{1-\alpha} \right),$$

which implies

$$z_{1-\beta} = \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - z_{1-\alpha}.$$

We can solve this equation for $n$ to determine the required sample size for a given power $1 - \beta$:

$$n = \frac{\sigma^2 (z_{1-\beta} + z_{1-\alpha})^2}{(\mu_0 - \mu_1)^2}. \tag{2}$$

This sample size formula also works for one-sided alternatives of the form $H_a: \mu > \mu_0$. Similar power computations can be carried out for two-sided alternatives. Of course, in practice, $\sigma$ has to be estimated and the power computations require probability computations using the non-central t-distribution. However, the above computations allow us to make some observations about power and sample size:
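Formula (2) is easy to evaluate directly. The sketch below does so for the one-sided, known-$\sigma$ setting of the derivation, using `statistics.NormalDist` for the standard normal percentiles. The function name `required_n` and the example inputs (a shift from 98.6 to 98.4 with $\sigma = 0.7$) are illustrative assumptions; because the formula is normal-based, software that uses the non-central t-distribution will typically report somewhat larger sample sizes.

```python
from math import ceil
from statistics import NormalDist


def required_n(mu0, mu1, sigma, alpha=0.05, power=0.80):
    """Sample size from formula (2): one-sided test, sigma assumed known."""
    z = NormalDist().inv_cdf          # standard normal percentile function
    n = sigma ** 2 * (z(power) + z(1 - alpha)) ** 2 / (mu0 - mu1) ** 2
    return ceil(n)                    # round up to a whole subject


# Illustrative inputs only.
for target in (0.80, 0.90):
    n = required_n(mu0=98.6, mu1=98.4, sigma=0.7, power=target)
    print(f"power {target:.2f}: n = {n}")
```

Rounding up with `ceil` is a deliberate choice: rounding down would leave the achieved power slightly below the target.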

1. Increasing the sample size $n$ increases the power. Higher power requires larger sample sizes.

2. As the difference $\mu_0 - \mu_1$ grows bigger, the power increases. That is, if the true mean $\mu_1$ lies far from the hypothesized mean $\mu_0$, then the test is more likely to detect this difference.

3. As the significance level $\alpha$ gets smaller, the power decreases. In order to maintain a given power, larger sample sizes are required for smaller significance levels. This follows because as $\alpha$ gets smaller, the standard normal percentile $z_{1-\alpha}$ gets bigger.

4. If $\sigma$ decreases, then the power increases. Similarly, for a given power, the required sample size decreases as $\sigma$ decreases. If the investigator can collect data in a way that minimizes the variance of the individual observations, then power will be increased.

Typically when computing an adequate sample size, one needs the following:

Specify the desired power. Typical values are around 0.80 or higher.

Specify a guess of the population standard deviation $\sigma$, since data will not be available at the planning stages to estimate $\sigma$ (unless a pilot study is conducted).

Specify the size of the difference (between $\mu_0$ and the true mean) you would like to be able to detect. For instance, in the body temperature example, we may not be interested in detecting if the true mean differs from 98.6 by a tenth of a degree since that is a small difference. However, if the true average body temperature differs from 98.6 by 0.3 degrees, then we might want to have the power to detect this.

The effect size is the standardized difference in the parameters that the test can detect. Some software programs require inputting the effect size. In the old days, statistics books came with numerous tables for estimating power and sample size. Nowadays there are software packages that do power and sample size computations.
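The four observations about power can be checked numerically with the normal-based power expression $\Phi\big((\mu_0 - \mu_1)/(\sigma/\sqrt{n}) - z_{1-\alpha}\big)$ for the lower-tailed test. The sketch below assumes $\sigma$ is known, and the function name and all numeric inputs are hypothetical choices for illustration.

```python
from math import sqrt
from statistics import NormalDist


def power_lower_tailed(mu0, mu1, sigma, n, alpha=0.05):
    """Power of the z-test of H0: mu = mu0 vs Ha: mu < mu0 when mu = mu1."""
    nd = NormalDist()
    shift = (mu0 - mu1) / (sigma / sqrt(n))   # standardized true difference
    return nd.cdf(shift - nd.inv_cdf(1 - alpha))


# Baseline scenario (illustrative values), then one change at a time.
base = power_lower_tailed(98.6, 98.4, 0.7, n=50)
scenarios = {
    "baseline":          base,
    "larger n (100)":    power_lower_tailed(98.6, 98.4, 0.7, n=100),
    "larger difference": power_lower_tailed(98.6, 98.3, 0.7, n=50),
    "smaller alpha":     power_lower_tailed(98.6, 98.4, 0.7, n=50, alpha=0.01),
    "smaller sigma":     power_lower_tailed(98.6, 98.4, 0.5, n=50),
}
for label, value in scenarios.items():
    print(f"{label:18s} power = {value:.3f}")
```

Changing one input at a time, as in the dictionary above, keeps each comparison aligned with one of the numbered observations.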
The website introduced in the last chapter is useful for simple power and sample size computations. In SAS's Analyst, choose Statistics → Sample Size → One-Sample t-test, and then fill in the dialogue box. For the output below, I plugged in $\sigma = 0.7$, a significance level $\alpha = 0.05$, and an alternative hypothesis value of $\mu = 98.4$.

    One-Sample t-test
    Null Mean = 98.6    Alternate Mean = 98.4
    Standard Deviation = .7    Alpha = 0.05
    2-Sided Test

    Power    N

The output shows that with a sample size of $n = 78$, one has a power of 0.70 of rejecting the null hypothesis if the true mean differs from 98.6 by 0.2 degrees. The power jumps to 0.90 when the sample size is $n = 131$.

Confidence Intervals and Two-Sided Tests

We conclude this chapter by noting the following equivalence between a two-sided test with significance level $\alpha$ and a $100(1 - \alpha)\%$ confidence interval: Suppose we are testing $H_0: \mu = \mu_0$ versus $H_a: \mu \neq \mu_0$ at a significance level $\alpha$. Then we will reject $H_0$ if and only if $\mu_0$ does not lie in the $100(1 - \alpha)\%$ confidence interval for $\mu$.

5 Problems

1. An experiment is carried out on $n = 15$ rats where each rat is given a unit dose of an experimental drug. Interest lies in measuring the response time to a neurological stimulus. The mean response time for rats not injected with the drug is 1.2 seconds. The experimenter wants to determine if the mean response time for rats injected with the drug differs from 1.2 seconds. Let $\mu$ denote the mean response time for rats injected with a unit dose of the experimental drug. Do the following parts:
a) Set up an appropriate null and alternative hypothesis for this problem in terms of $\mu$.
b) In the context of this problem, what is a type I error?
c) In the context of this problem, what is a type II error?
d) Suppose the mean response time for the $n = 15$ rats was found to be $\bar{x} = 1.05$ seconds with standard deviation $s = 0.5$ seconds. Compute the t-test statistic for this problem.
e) The hypothesis test is to be conducted at significance level $\alpha$. Draw a picture of the t-density and mark the rejection region for this test.
f) Based on the results from parts (d) and (e), what is the conclusion of your test? Write a sentence summarizing your decision.

2. Which of the following numbers corresponds to a p-value showing strong evidence against the null hypothesis in the significance test $H_0: \mu = \mu_0$ versus $H_a: \mu \neq \mu_0$? (circle the correct answer) a) b) c) d) e) f)

3. Before performing an experiment, a power analysis is conducted to determine the necessary sample size. The hypothesis concerns the mean of a single population. Suppose the required sample size is found to be $n = 50$ using $\alpha = 0.05$ for a power of 80% to detect a difference of $\delta = 1$. What happens to the required sample size if:
a) The power is raised from 80% to 90%? (Circle one) $n$ goes UP or DOWN.
b) The minimum detectable difference goes from $\delta = 1$ to $\delta = 1.5$? (Circle one) $n$ goes UP or DOWN.
c) $\alpha$ goes from 0.05 to (Circle one) $n$ goes UP or DOWN.

5. A study is to be done to determine if the cognitive ability of children living near a lead smelter is negatively impacted by increased exposure to lead. Suppose the average IQ for children in the United States is 100. From a pilot study, the standard deviation was estimated to be $s =$ . Use statistical power/sample size software to answer the following questions.
a) Should a one-sided or two-sided hypothesis test be used?
b) If 80% power is desired to detect a difference of 5 IQ points using a significance level $\alpha = 0.05$, what is the required sample size?
c) Re-do part (b) if 90% power is desired. What happens to the required sample size?
d) Re-do part (b) if $\alpha$ changes from 0.05 to . What happens to the required sample size?
e) Re-do part (b) if one wants 80% power to detect a difference of 10 IQ points. What happens to the required sample size?
f) Suppose a two-tailed test is desired instead of a one-tailed test. Re-do part (b). What happens to the required sample size?
g) Suppose you can only budget $n = 10$ IQ tests on children and you want to be able to detect a difference of 5 points in mean IQ using $\alpha =$ . Would it be worthwhile to perform these IQ evaluations? Compute the power and explain.
h) A study was actually done and the average IQ from $n = 124$ children living near a lead smelter was found to be $\bar{x} = 91.1$ with standard deviation $s =$ . Compute the p-value for this test and state a conclusion based on the p-value.

6. The treatment for patients with prostate cancer depends on whether or not the cancer has spread to surrounding lymph nodes. A surgical procedure (laparectomy) into the abdominal cavity can determine the extent of this nodal involvement. It is hypothesized that prostate cancer patients whose cancer has spread to surrounding lymph nodes will have elevated levels of serum acid phosphatase. The mean level of serum acid phosphatase in prostate cancer patients where the cancer has not spread to surrounding lymph nodes is . Twenty prostate cancer patients whose cancer had spread to surrounding lymph nodes were evaluated. The serum acid levels for these $n = 20$ patients are given in the SAS program below. Use these data to perform an appropriate hypothesis test to determine if the mean serum acid level for prostate cancer patients whose cancer has spread to surrounding lymph nodes is greater than . Define $\mu$ and write down $H_0$ and $H_a$ in terms of $\mu$. Compute the t-test statistic and the p-value for this test. Write a one-paragraph report explaining the results of the statistical test. Give an introductory sentence or two, the statistical results, and the interpretation of the results.

    /*********************************************************
    Data below gives the serum acid phosphatase levels in 20
    prostate cancer patients whose cancer has spread to
    surrounding lymph nodes.
    ***********************************************************/
    data prostate;
      input acid;
      datalines;
    ;
    run;
    proc print;
    run;
    proc means;
    run;
    data new;
      set prostate;
      d = acid ;
    run;
    proc means;
    run;

7. Atrazine is a chemical used in herbicides. Over the last decade, the average atrazine concentration per liter of water in Lake Michigan was 10 ng/l. Efforts have been made to reduce the use of atrazine. A year after these efforts began, a study was conducted to determine if the average atrazine concentration had decreased in Lake Michigan. A random sample of $n = 100$ liters of Lake Michigan water was tested for atrazine a year after the efforts began. The sample mean was found to be $\bar{x} = 9.1$ ng/l with standard deviation $s = 2.3$ ng/l. Do the following parts:

a) Let $\mu$ denote the average atrazine concentration (ng/l) in Lake Michigan a year after the effort to reduce the use of atrazine. Set up the appropriate null and alternative hypotheses in terms of $\mu$.
b) In the context of this problem, what is a type I error?
c) In the context of this problem, what is a type II error?
d) Using a level of significance $\alpha = 0.05$, test the hypothesis stated in part (a). Be sure to compute the test statistic and see if it falls in the critical region. Write a sentence discussing the result of your test.
e) Form a 99% confidence interval for $\mu$.

References

Mackowiak, P. A., Wasserman, S. S., and Levine, M. M. (1992), "A Critical Appraisal of 98.6 Degrees F, the Upper Limit of the Normal Body Temperature, and Other Legacies of Carl Reinhold August Wunderlich," Journal of the American Medical Association, 268.
