Hypothesis testing. Null hypothesis H 0 and alternative hypothesis H 1.

Size: px
Start display at page:

Download "Hypothesis testing. Null hypothesis H 0 and alternative hypothesis H 1."

Transcription

1 Hypothesis testing Null hypothesis H 0 and alternative hypothesis H 1.

2 Hypothesis testing Null hypothesis H 0 and alternative hypothesis H 1. Simple and compound hypotheses. Simple : the probabilistic model is specified completely. Compound : the probabilistic model is not specified completely (generally it will contain parameters to be estimated).

3 Hypothesis testing Null hypothesis H 0 and alternative hypothesis H 1. Simple and compound hypotheses. Simple : the probabilistic model is specified completely. Compound : the probabilistic model is not specified completely (generally it will contain parameters to be estimated). Example 1: we want to test whether data are compatible with the assumption that their true mean is µ 0. Then it could be set as: H 0 : X 1,..., X n N(µ 0, σ0 2 ) and independent. [simple if σ 2 known]; H 1 : X 1,..., X n N(µ, σ 2 0 ) and independent, with µ µ 0. [compound]

4 Rejection region Example 2: we have two groups, and we wish to test whether they can be considered as samples from the same population, or from two populations with different means. General assumption: X 1,..., X n N(µ X, σ 2 ), Y 1,..., Y m N(µ Y, σ 2 ), and independent. H 0 : µ X = µ Y, σ 2 > 0. H 1 : µ X µ Y, σ 2 > 0. Both are compound, but H 0 is simpler than H 1. How does a test work? We select a rejection region C: if data fall in C, we reject H 0 (and accept H 1 ); if data do not fall in C, we accept (do not reject) H 0.

5 Errors of first and second species Error of first species: rejecting H 0 if H 0 is true; Error of second species: accepting H 0 if H 1 is true. A smaller rejection region C decreases error of first species, but increases those of second species; a larger C vice versa.

6 Errors of first and second species Error of first species: rejecting H 0 if H 0 is true; Error of second species: accepting H 0 if H 1 is true. A smaller rejection region C decreases error of first species, but increases those of second species; a larger C vice versa. Solution? We choose a level (the risk we take of errors of 1st species) α [often 5%] and select a region s.t. P(X C H 0 ) = α (probability of error of 1st species).

7 Errors of first and second species Error of first species: rejecting H 0 if H 0 is true; Error of second species: accepting H 0 if H 1 is true. A smaller rejection region C decreases error of first species, but increases those of second species; a larger C vice versa. Solution? We choose a level (the risk we take of errors of 1st species) α [often 5%] and select a region s.t. P(X C H 0 ) = α (probability of error of 1st species). A test will have a certain power (the probability of rejecting H 0 when indeed H 1 is true).

8 Errors of first and second species Error of first species: rejecting H 0 if H 0 is true; Error of second species: accepting H 0 if H 1 is true. A smaller rejection region C decreases error of first species, but increases those of second species; a larger C vice versa. Solution? We choose a level (the risk we take of errors of 1st species) α [often 5%] and select a region s.t. P(X C H 0 ) = α (probability of error of 1st species). A test will have a certain power (the probability of rejecting H 0 when indeed H 1 is true). Generally, we are not able to compute exactly the power of a test (it depends on exact parameter value), but theory selects the most powerful test within a certain level.

9 One-sample test on the mean Example 1: (with σ 2 unknown). H 0 : X 1,..., X n N(µ 0, σ 2 ) and independent, where σ 2 > 0. H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ µ 0, σ 2 > 0.

10 One-sample test on the mean Example 1: (with σ 2 unknown). H 0 : X 1,..., X n N(µ 0, σ 2 ) and independent, where σ 2 > 0. H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ µ 0, σ 2 > 0. The test quantity used is T = X µ 0 S/ n where S 2 is the sample variance. It is natural (and can be justified rigorously) to reject H 0 when T is far away from 0.

11 One-sample test on the mean Example 1: (with σ 2 unknown). H 0 : X 1,..., X n N(µ 0, σ 2 ) and independent, where σ 2 > 0. H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ µ 0, σ 2 > 0. The test quantity used is T = X µ 0 S/ n where S 2 is the sample variance. It is natural (and can be justified rigorously) to reject H 0 when T is far away from 0. How far is far?

12 One-sample test on the mean Example 1: (with σ 2 unknown). H 0 : X 1,..., X n N(µ 0, σ 2 ) and independent, where σ 2 > 0. H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ µ 0, σ 2 > 0. The test quantity used is T = X µ 0 S/ n where S 2 is the sample variance. It is natural (and can be justified rigorously) to reject H 0 when T is far away from 0. How far is far? Under H 0, T follows a t(n 1) distribution. Then we find t α s.t. P( t(n 1) > t α ) = α. Reject H 0 if T > t α, accept it otherwise.

13 Computations It is expected that mean body temperature is 37.0 C. Temperatures were obtained rom 25 individuals: Are these compatible with µ = 37.0?

14 Computations It is expected that mean body temperature is 37.0 C. Temperatures were obtained rom 25 individuals: Are these compatible with µ = 37.0? X = , S = 0.374, T =

15 Computations It is expected that mean body temperature is 37.0 C. Temperatures were obtained rom 25 individuals: Are these compatible with µ = 37.0? p-value = X = , S = 0.374, T =

16 Computations It is expected that mean body temperature is 37.0 C. Temperatures were obtained rom 25 individuals: Are these compatible with µ = 37.0? X = , S = 0.374, T = p-value = (this means that there is about 56% chance of getting a higher (in absolute value) value of T if H 0 is true). [Many statistical programs (e.g. R) return the p-value; the user will know how it compares with α she/he has chosen] Here we accept µ = 37.0 at any reasonable level of α.

17 One-sample test on the mean. II If [unilateral alternative] H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ > µ 0, σ 2 > 0, then the rejection region is for T positive and large. Hence we find t α s.t. P(t(n 1) > t α) = α. Reject H 0 if T > t α, accept it otherwise. Vice versa if the alternative hypothesis is µ < µ 0.

18 One-sample test on the mean. II If [unilateral alternative] H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ > µ 0, σ 2 > 0, then the rejection region is for T positive and large. Hence we find t α s.t. P(t(n 1) > t α) = α. Reject H 0 if T > t α, accept it otherwise. Vice versa if the alternative hypothesis is µ < µ 0. In the example, if H 1 had been µ < 37, p-value = P(t(24) < ) = 0.281

19 One-sample test on the mean. II If [unilateral alternative] H 1 : X 1,..., X n N(µ, σ 2 ) and independent, where µ > µ 0, σ 2 > 0, then the rejection region is for T positive and large. Hence we find t α s.t. P(t(n 1) > t α) = α. Reject H 0 if T > t α, accept it otherwise. Vice versa if the alternative hypothesis is µ < µ 0. In the example, if H 1 had been µ < 37, p-value = P(t(24) < ) = Observations: unilateral alternatives make it easier rejecting the null hypothesis (hence they are seldom used). In practice, border-line results suggest further research.

20 Test on equality of the means Independent samples (e.g. 2 groups with different treatments) X 1,..., X n N(µ X, σ 2 ), Y 1,..., Y m N(µ Y, σ 2 ), and independent. H 0 : µ X = µ Y, σ 2 > 0. H 1 : µ X µ Y, σ 2 > 0.

21 Test on equality of the means Independent samples (e.g. 2 groups with different treatments) X 1,..., X n N(µ X, σ 2 ), Y 1,..., Y m N(µ Y, σ 2 ), and independent. H 0 : µ X = µ Y, σ 2 > 0. H 1 : µ X µ Y, σ 2 > 0. Under H 0 Y i X j N(0, 2σ 2 ) for all i = 1... m, j = 1... n. Hence we test whether the true mean of all these variables is 0.

22 Test on equality of the means Independent samples (e.g. 2 groups with different treatments) X 1,..., X n N(µ X, σ 2 ), Y 1,..., Y m N(µ Y, σ 2 ), and independent. H 0 : µ X = µ Y, σ 2 > 0. H 1 : µ X µ Y, σ 2 > 0. Under H 0 Y i X j N(0, 2σ 2 ) for all i = 1... m, j = 1... n. Hence we test whether the true mean of all these variables is 0. Estimate of σ 2 : SX 2,Y = 1 ( (n 1)S 2 n + m 2 X + (m 1)SY 2 ) weighted mean of S 2 X (variance estimated on X i) and S 2 Y.

23 Test on equality of the means Independent samples (e.g. 2 groups with different treatments) X 1,..., X n N(µ X, σ 2 ), Y 1,..., Y m N(µ Y, σ 2 ), and independent. H 0 : µ X = µ Y, σ 2 > 0. H 1 : µ X µ Y, σ 2 > 0. Under H 0 Y i X j N(0, 2σ 2 ) for all i = 1... m, j = 1... n. Hence we test whether the true mean of all these variables is 0. Estimate of σ 2 : SX 2,Y = 1 ( (n 1)S 2 n + m 2 X + (m 1)SY 2 ) weighted mean of SX 2 (variance estimated on X i) and SY 2. Ȳ X Finally, t-test on T =. s X,Y 1 n + 1 m

24 Test on equality of the means Independent samples (e.g. 2 groups with different treatments) X 1,..., X n N(µ X, σ 2 ), Y 1,..., Y m N(µ Y, σ 2 ), and independent. H 0 : µ X = µ Y, σ 2 > 0. H 1 : µ X µ Y, σ 2 > 0. Under H 0 Y i X j N(0, 2σ 2 ) for all i = 1... m, j = 1... n. Hence we test whether the true mean of all these variables is 0. Estimate of σ 2 : SX 2,Y = 1 ( (n 1)S 2 n + m 2 X + (m 1)SY 2 ) weighted mean of SX 2 (variance estimated on X i) and SY 2. Ȳ X Finally, t-test on T =. s X,Y 1 n + 1 m Assumptions: normality, independence, equality of variances (should be checked [sometimes variable transformations help]).

25 Test of equality of means with independent samples: computations Annual flow of the river Nile at Ashwan has been measured We ask whether the mean of the former 50 years is the same as the mean of the latter 50 years.

26 Test of equality of means with independent samples: computations Annual flow of the river Nile at Ashwan has been measured We ask whether the mean of the former 50 years is the same as the mean of the latter 50 years. Treat each year as an independent sample, though assumption questionable

27 Test of equality of means with independent samples: computations Annual flow of the river Nile at Ashwan has been measured We ask whether the mean of the former 50 years is the same as the mean of the latter 50 years. X (mean of years 1 50) = , Ȳ (mean of years ) = SX 2 = 37, 140, S Y 2 = 12, 106. S X 2,Y = 24, 623.

28 Test of equality of means with independent samples: computations Annual flow of the river Nile at Ashwan has been measured We ask whether the mean of the former 50 years is the same as the mean of the latter 50 years. X (mean of years 1 50) = , Ȳ (mean of years ) = SX 2 = 37, 140, S Y 2 = 12, 106. S X 2,Y = 24, 623. Hence T = , p-value = Reject µ X = µ Y.

29 Test of equality of means with independent samples: computations Annual flow of the river Nile at Ashwan has been measured We ask whether the mean of the former 50 years is the same as the mean of the latter 50 years. X (mean of years 1 50) = , Ȳ (mean of years ) = S 2 X = 37, 140, S 2 Y = 12, 106. S 2 X,Y = 24, 623. Hence T = , p-value = Reject µ X = µ Y. One may question equality of variances... A correction of the t-test works without equal variances: t = , df = , p value =

30 Test on equality of the means. II Paired samples (e.g. same individuals before/after treatment) General assumption Y i X i N(µ, σ 2 ), i = 1... n. No assumption on X i and Y i, but only on their differences (the effect of treatment). H 0 : µ = 0, σ 2 > 0. H 1 : µ 0, σ 2 > 0. This is simply a test that the true mean of W = Y X is 0. It will be easier rejecting H 0 because generally s Y X is much smaller than 2s X,Y. Basic assumption: the effect of treatment is additive (does not depend on the original value of X i ).

31 Test on binomial Observations: n trials with k successes. Are these compatible with probability of success p 0? H 0 : X Bin(n, p 0 ) H 1 : X Bin(n, p) with p p 0. Simplest method: under H 0 by central limit theorem Z = X np 0 np0 (1 p 0 ) N(0, 1) for n large [rule np 0, n(1 p 0 ) > 5]. Set α find z α s.t. P( N(0, 1) > z α ) = α. If Z > z α, reject H 0 ; otherwise accept it. Exact method: make computations with binomial distribution. Note. All previous cases could be studied through confidence intervals. Some authors prefer using only confidence intervals.

32 Chi-square test General chi-square test: We have k types of events that can occur in each trial, with a priori probabilities for them p 0 1,..., p 0 k with p p 0 k = 1. After n trials, we observe n 1 events of type 1, n 2 of 2,..., n k of k, with n n k = n. Are data compatible with expectations? (k = 2 is binomial) Classical test: chi-square: Set E i = np 0 i (expected number of events of type i under H 0 ), X 2 = k (n i E i ) 2 i=1 E i χ 2 (k 1) for n large. Find c α s.t. P(χ 2 (k 1) > c α ) = α. If X 2 > c α, reject H 0 ; accept it otherwise.

33 Chi-square for data fit to a distribution The values p1 0,..., p0 k can be those arising from some distribution. Often the distribution will contain parameters to be estimated (e.g. λ of Poisson). One can use the chi-square: if m parameters are estimated, X 2 χ 2 (k m 1) (of course m < k 1). Example: data (Von Bartkiewicz, 1898) on Prussians soldiers kicked to death by horses: i (deaths) n i (number of corps/years) Total 200

34 Chi-square example (continued) Estimate λ with the sample mean ˆλ = ( )/200 = Compute E i. Join the classes 3 (rule of thumb: E i 5) to obtain: i n i Ê i Total 200 Compute X lower than relevant quantiles of χ 2 (2).

35 Chi-square test of independence A classical use of chi-square is when we observe two qualitative variables. H 0 : variables are independent; H 1 : they are not independent. k levels of X and l levels of Y (if k = l = 2, a 2 2 contingency table). Data are n ij (# of observations with X = i (i = 1... k) and Y = j (j = 1... l). H 0 : P(X = i, Y = j) = p i q j for all i and j p i, i = 1... k 1, q j, j = 1... l 1 to be estimated from data. H 1 : P(X = i, Y = j) p i q j. Examples and computations to be briefly presented in next lecture.

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

Chapter 2 Probability Topics SPSS T tests

Chapter 2 Probability Topics SPSS T tests Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform

More information

MAT 155. Key Concept. September 27, 2010. 155S5.5_3 Poisson Probability Distributions. Chapter 5 Probability Distributions

MAT 155. Key Concept. September 27, 2010. 155S5.5_3 Poisson Probability Distributions. Chapter 5 Probability Distributions MAT 155 Dr. Claude Moore Cape Fear Community College Chapter 5 Probability Distributions 5 1 Review and Preview 5 2 Random Variables 5 3 Binomial Probability Distributions 5 4 Mean, Variance and Standard

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Comparing Multiple Proportions, Test of Independence and Goodness of Fit

Comparing Multiple Proportions, Test of Independence and Goodness of Fit Comparing Multiple Proportions, Test of Independence and Goodness of Fit Content Testing the Equality of Population Proportions for Three or More Populations Test of Independence Goodness of Fit Test 2

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

The Variability of P-Values. Summary

The Variability of P-Values. Summary The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 boos@stat.ncsu.edu August 15, 2009 NC State Statistics Departement Tech Report

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Chapter 23 Inferences About Means

Chapter 23 Inferences About Means Chapter 23 Inferences About Means Chapter 23 - Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300-minute

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Unit 27: Comparing Two Means

Unit 27: Comparing Two Means Unit 27: Comparing Two Means Prerequisites Students should have experience with one-sample t-procedures before they begin this unit. That material is covered in Unit 26, Small Sample Inference for One

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Parametric and non-parametric statistical methods for the life sciences - Session I

Parametric and non-parametric statistical methods for the life sciences - Session I Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

Non Parametric Inference

Non Parametric Inference Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable

More information

Lecture 8. Confidence intervals and the central limit theorem

Lecture 8. Confidence intervals and the central limit theorem Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of

More information

Chapter 7 Section 1 Homework Set A

Chapter 7 Section 1 Homework Set A Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the

More information

University of Chicago Graduate School of Business. Business 41000: Business Statistics

University of Chicago Graduate School of Business. Business 41000: Business Statistics Name: University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas. 2. Throughout

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Section 12 Part 2. Chi-square test

Section 12 Part 2. Chi-square test Section 12 Part 2 Chi-square test McNemar s Test Section 12 Part 2 Overview Section 12, Part 1 covered two inference methods for categorical data from 2 groups Confidence Intervals for the difference of

More information

1. How different is the t distribution from the normal?

1. How different is the t distribution from the normal? Statistics 101 106 Lecture 7 (20 October 98) c David Pollard Page 1 Read M&M 7.1 and 7.2, ignoring starred parts. Reread M&M 3.2. The effects of estimated variances on normal approximations. t-distributions.

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

Lecture 25. December 19, 2007. Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 25. December 19, 2007. Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

HLA data analysis in anthropology: basic theory and practice

HLA data analysis in anthropology: basic theory and practice HLA data analysis in anthropology: basic theory and practice Alicia Sanchez-Mazas and José Manuel Nunes Laboratory of Anthropology, Genetics and Peopling history (AGP), Department of Anthropology and Ecology,

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

More information

This chapter discusses some of the basic concepts in inferential statistics.

This chapter discusses some of the basic concepts in inferential statistics. Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details

More information

Crash Course on Basic Statistics

Crash Course on Basic Statistics Crash Course on Basic Statistics Marina Wahl, marina.w4hl@gmail.com University of New York at Stony Brook November 6, 2013 2 Contents 1 Basic Probability 5 1.1 Basic Definitions...........................................

More information

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1 Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric

More information

Chi-square test Fisher s Exact test

Chi-square test Fisher s Exact test Lesson 1 Chi-square test Fisher s Exact test McNemar s Test Lesson 1 Overview Lesson 11 covered two inference methods for categorical data from groups Confidence Intervals for the difference of two proportions

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. STT315 Practice Ch 5-7 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. 1) The length of time a traffic signal stays green (nicknamed

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013

STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico. Fall 2013 STAT 145 (Notes) Al Nosedal anosedal@unm.edu Department of Mathematics and Statistics University of New Mexico Fall 2013 CHAPTER 18 INFERENCE ABOUT A POPULATION MEAN. Conditions for Inference about mean

More information

Inference for two Population Means

Inference for two Population Means Inference for two Population Means Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison October 27 November 1, 2011 Two Population Means 1 / 65 Case Study Case Study Example

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so: Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a

More information

Introduction. Statistics Toolbox

Introduction. Statistics Toolbox Introduction A hypothesis test is a procedure for determining if an assertion about a characteristic of a population is reasonable. For example, suppose that someone says that the average price of a gallon

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Two-sample hypothesis testing, II 9.07 3/16/2004

Two-sample hypothesis testing, II 9.07 3/16/2004 Two-sample hypothesis testing, II 9.07 3/16/004 Small sample tests for the difference between two independent means For two-sample tests of the difference in mean, things get a little confusing, here,

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance

Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance 14 November 2007 1 Confidence intervals and hypothesis testing for linear regression Just as there was

More information

Chapter 7. One-way ANOVA

Chapter 7. One-way ANOVA Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

P(every one of the seven intervals covers the true mean yield at its location) = 3.

P(every one of the seven intervals covers the true mean yield at its location) = 3. 1 Let = number of locations at which the computed confidence interval for that location hits the true value of the mean yield at its location has a binomial(7,095) (a) P(every one of the seven intervals

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Using Stata for One Sample Tests

Using Stata for One Sample Tests Using Stata for One Sample Tests All of the one sample problems we have discussed so far can be solved in Stata via either (a) statistical calculator functions, where you provide Stata with the necessary

More information

Solutions: Problems for Chapter 3. Solutions: Problems for Chapter 3

Solutions: Problems for Chapter 3. Solutions: Problems for Chapter 3 Problem A: You are dealt five cards from a standard deck. Are you more likely to be dealt two pairs or three of a kind? experiment: choose 5 cards at random from a standard deck Ω = {5-combinations of

More information

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Online 12 - Sections 9.1 and 9.2-Doug Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

More information

Testing differences in proportions

Testing differences in proportions Testing differences in proportions Murray J Fisher RN, ITU Cert., DipAppSc, BHSc, MHPEd, PhD Senior Lecturer and Director Preregistration Programs Sydney Nursing School (MO2) University of Sydney NSW 2006

More information

Rank-Based Non-Parametric Tests

Rank-Based Non-Parametric Tests Rank-Based Non-Parametric Tests Reminder: Student Instructional Rating Surveys You have until May 8 th to fill out the student instructional rating surveys at https://sakai.rutgers.edu/portal/site/sirs

More information

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Be able to explain the difference between the p-value and a posterior

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Opgaven Onderzoeksmethoden, Onderdeel Statistiek Opgaven Onderzoeksmethoden, Onderdeel Statistiek 1. What is the measurement scale of the following variables? a Shoe size b Religion c Car brand d Score in a tennis game e Number of work hours per week

More information

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences. 1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

www.rmsolutions.net R&M Solutons

www.rmsolutions.net R&M Solutons Ahmed Hassouna, MD Professor of cardiovascular surgery, Ain-Shams University, EGYPT. Diploma of medical statistics and clinical trial, Paris 6 university, Paris. 1A- Choose the best answer The duration

More information

Chapter 19 The Chi-Square Test

Chapter 19 The Chi-Square Test Tutorial for the integration of the software R with introductory statistics Copyright c Grethe Hystad Chapter 19 The Chi-Square Test In this chapter, we will discuss the following topics: We will plot

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i. Chapter 3 Kolmogorov-Smirnov Tests There are many situations where experimenters need to know what is the distribution of the population of their interest. For example, if they want to use a parametric

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

CHAPTER 14 NONPARAMETRIC TESTS

CHAPTER 14 NONPARAMETRIC TESTS CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

Hypothesis Testing --- One Mean

Hypothesis Testing --- One Mean Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis

More information

Simulating Chi-Square Test Using Excel

Simulating Chi-Square Test Using Excel Simulating Chi-Square Test Using Excel Leslie Chandrakantha John Jay College of Criminal Justice of CUNY Mathematics and Computer Science Department 524 West 59 th Street, New York, NY 10019 lchandra@jjay.cuny.edu

More information

Chapter 23. Two Categorical Variables: The Chi-Square Test

Chapter 23. Two Categorical Variables: The Chi-Square Test Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise

More information

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4% Hypothesis Testing for a Proportion Example: We are interested in the probability of developing asthma over a given one-year period for children 0 to 4 years of age whose mothers smoke in the home In the

More information

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., CHAPTER 13 Nonparametric and Distribution-Free Statistics Nonparametric tests these test hypotheses that are not statements about population parameters (e.g., 2 tests for goodness of fit and independence).

More information

Hypothesis Testing. Steps for a hypothesis test:

Hypothesis Testing. Steps for a hypothesis test: Hypothesis Testing Steps for a hypothesis test: 1. State the claim H 0 and the alternative, H a 2. Choose a significance level or use the given one. 3. Draw the sampling distribution based on the assumption

More information

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 seema@iasri.res.in Genomics A genome is an organism s

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp. 380-394 1. Does vigorous exercise affect concentration? In general, the time needed for people to complete

More information

Chapter 8 Paired observations

Chapter 8 Paired observations Chapter 8 Paired observations Timothy Hanson Department of Statistics, University of South Carolina Stat 205: Elementary Statistics for the Biological and Life Sciences 1 / 19 Book review of two-sample

More information