The Bonferroni and Šidák Corrections for Multiple Comparisons

Hervé Abdi

In: Neil Salkind (Ed.) (2007). Encyclopedia of Measurement and Statistics. Thousand Oaks (CA): Sage. Address correspondence to: Hervé Abdi, Program in Cognition and Neurosciences, MS: Gr.4.1, The University of Texas at Dallas, Richardson, TX, USA.

1 Overview

The more tests we perform on a set of data, the more likely we are to reject the null hypothesis when it is true (i.e., to make a Type I error). This is a consequence of the logic of hypothesis testing: we reject the null hypothesis if we witness a rare event. But the larger the number of tests, the easier it is to find rare events, and therefore the easier it is to make the mistake of thinking that there is an effect when there is none. This problem is called the inflation of the alpha level. One strategy for protecting against it is to correct the alpha level when performing multiple tests. Making the alpha level more stringent (i.e., smaller) will create fewer errors, but it may also make it harder to detect real effects.

2 The different meanings of alpha

Maybe it is because computers make it easier to run statistical analyses that researchers perform more and more statistical tests on the same set of data. For example, brain imaging researchers will routinely run millions of tests to analyze an experiment. Running so many tests increases the risk of false alarms. To illustrate, imagine the following "pseudo-experiment": I toss 20 coins, and I try to force the coins to fall on heads. I know that, from the "binomial test," the null hypothesis is rejected at the $\alpha = .05$ level if the number of heads is greater than 14. I repeat this experiment 10 times. Suppose that one trial gives the "significant" result of 16 heads versus 4 tails. Did I influence the coins on that occasion? Of course not, because the larger the number of experiments, the greater the probability of detecting a low-probability event (like 16 versus 4). In fact, waiting long enough is a sure way of detecting rare events!

2.1 Probability in the family

A family of tests is the technical term for a series of tests performed on a set of data. In this section we show how to compute the probability of rejecting the null hypothesis at least once in a family of tests when the null hypothesis is true. For convenience, suppose that we set the significance level at $\alpha = .05$. For each test (i.e., one trial in the example of the coins), the probability of making a Type I error is equal to $\alpha = .05$. The events "making a Type I error" and "not making a Type I error" are complementary events (exactly one of them occurs, so their probabilities sum to 1). Therefore the probability of not making a Type I error on one trial is equal to $1 - \alpha = 1 - .05 = .95$. Recall that when two events are independent, the probability of observing these two events together is the product of their probabilities. Thus, if the tests are independent, the probability of not making a Type I error on the first and the second tests is $.95 \times .95 = (1 - .05)^2 = (1 - \alpha)^2$.
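Returning to the coin-tossing example above, the rejection threshold can be checked numerically. The sketch below is an illustration only, not part of the original article: it assumes fair coins and a one-tailed binomial test, and the helper name `upper_tail` is hypothetical.

```python
from math import comb


def upper_tail(n: int, k: int, p: float = 0.5) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# One experiment: 20 fair coins, declared "significant" if more than 14 heads.
p_one = upper_tail(20, 15)

# A threshold of 14 heads or more would exceed the .05 level.
p_14 = upper_tail(20, 14)

# Ten independent experiments: chance that at least one looks "significant".
p_any = 1 - (1 - p_one) ** 10

print(f"P(heads >= 15) = {p_one:.4f}")
print(f"P(heads >= 14) = {p_14:.4f}")
print(f"P(at least one of 10 experiments is significant) = {p_any:.4f}")
```

Because the number of heads is discrete, the actual level of this test is somewhat below the nominal .05, yet repeating the experiment 10 times still makes a "significant" outcome far from rare.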

With 3 tests, we find that the probability of not making a Type I error on all tests is:

$$.95 \times .95 \times .95 = (1 - .05)^3 = (1 - \alpha)^3.$$

For a family of $C$ tests, the probability of not making a Type I error for the whole family is:

$$(1 - \alpha)^C.$$

For our example, the probability of not making a Type I error on the family is

$$(1 - \alpha)^C = (1 - .05)^{10} = .599.$$

Now, what we are looking for is the probability of making one or more Type I errors on the family of tests. This event is the complement of the event "not making a Type I error on the family," and therefore its probability is equal to

$$1 - (1 - \alpha)^C.$$

For our example, we find

$$1 - (1 - .05)^{10} = .401.$$

So, with an $\alpha$ level of .05 for each of the 10 tests, the probability of wrongly rejecting the null hypothesis at least once is .401. This example makes clear the need to distinguish between two meanings of $\alpha$ when performing multiple tests:

- The probability of making a Type I error when dealing only with a specific test. This probability is denoted $\alpha[PT]$ (pronounced "alpha per test"). It is also called the testwise alpha.
- The probability of making at least one Type I error for the whole family of tests. This probability is denoted $\alpha[PF]$ (pronounced "alpha per family of tests"). It is also called the familywise or the experimentwise alpha.
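The relation between the two alphas can be tabulated with a few lines of code. This is a minimal sketch assuming independent tests, as in the derivation above; the function name `alpha_per_family` is hypothetical and chosen only for this illustration.

```python
def alpha_per_family(alpha_pt: float, n_tests: int) -> float:
    """Probability of at least one Type I error in n_tests independent tests,
    each run at the per-test level alpha_pt."""
    return 1 - (1 - alpha_pt) ** n_tests


# Familywise alpha grows quickly with the number of tests at alpha[PT] = .05.
for c in (1, 2, 3, 5, 10, 20, 100):
    print(f"C = {c:3d}   alpha[PF] = {alpha_per_family(0.05, c):.3f}")
```

For $C = 10$ the function reproduces the value of .401 found above.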

2.2 A Monte Carlo illustration

A "Monte Carlo" simulation can illustrate the difference between $\alpha[PT]$ and $\alpha[PF]$. The Monte Carlo technique consists of running a simulated experiment many times using random data. This gives the pattern of results that happens on the basis of chance. Here 6 groups with 100 observations per group were created with data randomly sampled from the same normal population. By construction, $H_0$ is true (i.e., all population means are equal). Call that procedure an experiment. We performed 5 independent tests from these 6 groups. For each test, we computed an F-test. If its probability was smaller than $\alpha = .05$, the test was declared significant (i.e., $\alpha[PT]$ is used). We performed this experiment 10,000 times. Therefore, there were 10,000 experiments, 10,000 families, and $5 \times 10{,}000 = 50{,}000$ tests. The results of this simulation are given in Table 1.

Table 1: Results of a Monte Carlo simulation. Numbers of Type I errors when performing $C = 5$ tests for 10,000 families when $H_0$ is true. How to read the table: for example, 192 families out of 10,000 have 2 Type I errors, which gives $2 \times 192 = 384$ Type I errors.

Number of families        X: Number of Type I      Number of
with X Type I errors      errors per family        Type I errors
7,868                     0                        0
…                         1                        …
192                       2                        384
…                         3                        …
…                         4                        …
…                         5                        …
10,000                    Total                    2,403

Table 1 shows that $H_0$ is rejected for 2,403 tests out of the 50,000 tests performed. From these data, an estimation of $\alpha[PT]$ is computed as:

$$\alpha[PT] = \frac{\text{number of significant tests}}{\text{total number of tests}} = \frac{2{,}403}{50{,}000} \approx .048. \tag{1}$$

This value falls close to the theoretical value of $\alpha = .05$. For 7,868 families, no test reaches significance. Equivalently, for 2,132 families ($10{,}000 - 7{,}868$), at least one Type I error is made. From these data, $\alpha[PF]$ can be estimated as:

$$\alpha[PF] = \frac{\text{number of families with at least 1 Type I error}}{\text{total number of families}} = \frac{2{,}132}{10{,}000} = .2132. \tag{2}$$

This value falls close to the theoretical value of

$$\alpha[PF] = 1 - (1 - \alpha[PT])^C = 1 - (1 - .05)^5 = .226.$$

2.3 How to correct for multiple tests: Šidák, Bonferroni, Boole, Dunn

Recall that the probability of making at least one Type I error for a family of $C$ tests is

$$\alpha[PF] = 1 - (1 - \alpha[PT])^C.$$

This equation can be rewritten as

$$\alpha[PT] = 1 - (1 - \alpha[PF])^{1/C}.$$

This formula, derived assuming independence of the tests, is sometimes called the Šidák equation. It shows that in order to reach a given $\alpha[PF]$ level, we need to adapt the $\alpha[PT]$ values used for each test. Because the Šidák equation involves a fractional power, it is difficult to compute by hand, and therefore several authors derived a simpler approximation, which is known as the Bonferroni (the most popular name), or Boole, or even Dunn approximation. Technically, it is the first (linear) term of a Taylor expansion of the Šidák equation. This approximation gives

$$\alpha[PT] \approx \frac{\alpha[PF]}{C}.$$

Šidák and Bonferroni are linked to each other by the inequality

$$\alpha[PT] = 1 - (1 - \alpha[PF])^{1/C} \geq \frac{\alpha[PF]}{C}.$$

They are, in general, very close to each other, but the Bonferroni approximation is pessimistic: its per-test threshold is never larger than the Šidák value, so it is slightly more conservative. Probably because it is easier to compute, the Bonferroni approximation is better known (and cited more often) than the exact Šidák equation.

The Šidák-Bonferroni equations can be used to find the value of $\alpha[PT]$ when $\alpha[PF]$ is fixed. For example, suppose that you want to perform 4 independent tests, and you want to limit the risk of making at least one Type I error to an overall value of $\alpha[PF] = .05$. You will consider a test significant if its associated probability is smaller than

$$\alpha[PT] = 1 - (1 - \alpha[PF])^{1/C} = 1 - (1 - .05)^{1/4} = .0127.$$

With the Bonferroni approximation, a test reaches significance if its associated probability is smaller than

$$\alpha[PT] = \frac{\alpha[PF]}{C} = \frac{.05}{4} = .0125,$$

which is very close to the exact value of .0127.

2.4 Correction for non-independent tests

The Šidák equation is derived assuming independence of the tests. When they are not independent, it gives a lower bound (cf. Šidák, 1967; Games, 1977), and then:

$$\alpha[PF] \leq 1 - (1 - \alpha[PT])^C.$$
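The worked example with 4 tests and $\alpha[PF] = .05$ can be reproduced with the short sketch below. It is an illustration under the independence assumption of the Šidák equation; the function names `sidak_alpha_pt` and `bonferroni_alpha_pt` are hypothetical.

```python
def sidak_alpha_pt(alpha_pf: float, n_tests: int) -> float:
    """Per-test alpha from the Sidak equation (independent tests)."""
    return 1 - (1 - alpha_pf) ** (1 / n_tests)


def bonferroni_alpha_pt(alpha_pf: float, n_tests: int) -> float:
    """Per-test alpha from the Bonferroni (linear) approximation."""
    return alpha_pf / n_tests


# Compare the two corrections for a familywise level of .05.
for c in (2, 4, 10, 100):
    s = sidak_alpha_pt(0.05, c)
    b = bonferroni_alpha_pt(0.05, c)
    print(f"C = {c:3d}   Sidak = {s:.5f}   Bonferroni = {b:.5f}")
```

In every row the Bonferroni threshold is slightly smaller than the Šidák threshold, which is the sense in which the approximation is conservative.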

As previously, we can use a Bonferroni approximation because:

$$\alpha[PF] < C\alpha[PT].$$

Šidák and Bonferroni are related by the inequality

$$\alpha[PF] \leq 1 - (1 - \alpha[PT])^C < C\alpha[PT].$$

The Šidák and Bonferroni inequalities can also be used to find a correction on $\alpha[PT]$ in order to keep $\alpha[PF]$ fixed. The Šidák inequality gives

$$\alpha[PT] \approx 1 - (1 - \alpha[PF])^{1/C}.$$

This is a conservative approximation, because the following inequality holds:

$$\alpha[PT] \geq 1 - (1 - \alpha[PF])^{1/C}.$$

The Bonferroni approximation gives

$$\alpha[PT] \approx \frac{\alpha[PF]}{C}.$$

2.5 Splitting up $\alpha[PF]$ with unequal slices

With the Bonferroni approximation we can make an unequal allocation of $\alpha[PF]$. This works because with the Bonferroni approximation, $\alpha[PF]$ is the sum of the individual $\alpha[PT]$:

$$\alpha[PF] \approx C\alpha[PT] = \underbrace{\alpha[PT] + \alpha[PT] + \cdots + \alpha[PT]}_{C \text{ times}}.$$

If some tests are judged more important a priori than some others, it is possible to allocate $\alpha[PF]$ unequally (cf. Rosenthal & Rosnow, 1985). For example, suppose we have 3 tests that we want to test with an overall $\alpha[PF] = .05$, and we think that the first test is the most important of the set. Then we can decide to test it with $\alpha[PT] = .04$, and share the remaining value $.01 = .05 - .04$ between the last 2 tests, which will each be evaluated with a value of $\alpha[PT] = .005$. The overall Type I error for the family is equal to $\alpha[PF] = .04 + .005 + .005 = .05$, which was indeed the value we set beforehand. It should be emphasized, however, that the (subjective) importance of the tests and the unequal allocation of the individual $\alpha[PT]$ should be decided a priori for this approach to be statistically valid. An unequal allocation of the $\alpha[PT]$ can also be achieved using the Šidák inequality, but it is more computationally involved.

3 Alternatives to Bonferroni

The Šidák-Bonferroni approach becomes very conservative when the number of comparisons becomes large and when the tests are not independent (e.g., as in brain imaging). Recently, some alternative approaches have been proposed (see Shaffer, 1995, for a review) to make the correction less stringent (e.g., Holm, 1979; Hochberg, 1988). A more recent approach redefines the problem by replacing the notion of $\alpha[PF]$ by the false discovery rate (FDR), which is defined as the expected ratio of the number of Type I errors to the number of significant tests (Benjamini & Hochberg, 1995).

References

[1] Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B, 57.
[2] Games, P.A. (1977). An improved t table for simultaneous control on g contrasts. Journal of the American Statistical Association, 72.
[3] Hochberg, Y. (1988). A sharper Bonferroni procedure for multiple tests of significance. Biometrika, 75.
[4] Holm, S. (1979). A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics, 6.
[5] Rosenthal, R., & Rosnow, R.L. (1985). Contrast analysis: Focused comparisons. Boston: Cambridge University Press.
[6] Shaffer, J.P. (1995). Multiple hypothesis testing. Annual Review of Psychology, 46.
[7] Šidák, Z. (1967). Rectangular confidence regions for the means of multivariate normal distributions. Journal of the American Statistical Association, 62.


Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

WISE Power Tutorial All Exercises

ame Date Class WISE Power Tutorial All Exercises Power: The B.E.A.. Mnemonic Four interrelated features of power can be summarized using BEA B Beta Error (Power = 1 Beta Error): Beta error (or Type II

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao,David Tse Note 11

CS 70 Discrete Mathematics and Probability Theory Fall 2009 Satish Rao,David Tse Note Conditional Probability A pharmaceutical company is marketing a new test for a certain medical condition. According

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript

DDBA 8438: Introduction to Hypothesis Testing Video Podcast Transcript JENNIFER ANN MORROW: Welcome to "Introduction to Hypothesis Testing." My name is Dr. Jennifer Ann Morrow. In today's demonstration,

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

Multiple Hypothesis Testing: The F-test

Multiple Hypothesis Testing: The F-test Matt Blackwell December 3, 2008 1 A bit of review When moving into the matrix version of linear regression, it is easy to lose sight of the big picture and get lost

AIHA Journal 63:567 571 (00) Ms. #333 AUTHORS K. Krishnamoorthy a Thomas Mathew b a Department of Mathematics, University of Louisiana at Lafayette, Lafayette, LA 70504; b Department of Mathematics and

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior

Partial Least Square Regression PLS-Regression

Partial Least Square Regression Hervé Abdi 1 1 Overview PLS regression is a recent technique that generalizes and combines features from principal component analysis and multiple regression. Its goal is

Misuse of statistical tests in Archives of Clinical Neuropsychology publications

Archives of Clinical Neuropsychology 20 (2005) 1053 1059 Misuse of statistical tests in Archives of Clinical Neuropsychology publications Philip Schatz, Kristin A. Jay, Jason McComb, Jason R. McLaughlin

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4

STATISTICS 8, FINAL EXAM NAME: KEY Seat Number: Last six digits of Student ID#: Circle your Discussion Section: 1 2 3 4 Make sure you have 8 pages. You will be provided with a table as well, as a separate

Multiple testing with gene expression array data

Multiple testing with gene expression array data Anja von Heydebreck Max Planck Institute for Molecular Genetics, Dept. Computational Molecular Biology, Berlin, Germany heydebre@molgen.mpg.de Slides partly

CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

Example Hypotheses. Chapter 8-2: Basics of Hypothesis Testing. A newspaper headline makes the claim: Most workers get their jobs through networking

Chapter 8-2: Basics of Hypothesis Testing Two main activities in statistical inference are using sample data to: 1. estimate a population parameter forming confidence intervals 2. test a hypothesis or

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................