Simple example. Simple example. Simple example. Example

Similar documents
Statistical tests for SPSS

Sample Size and Power in Clinical Trials

Difference tests (2): nonparametric

Non-Inferiority Tests for One Mean

NCSS Statistical Software

Nonparametric tests these test hypotheses that are not statements about population parameters (e.g.,

NCSS Statistical Software

Non-Inferiority Tests for Two Means using Differences

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Exact Nonparametric Tests for Comparing Means - A Personal Summary

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

Parametric and non-parametric statistical methods for the life sciences - Session I

Testing Group Differences using T-tests, ANOVA, and Nonparametric Measures

Descriptive Statistics

Non-Parametric Tests (I)

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Nonparametric Statistics

UNIVERSITY OF NAIROBI

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Rank-Based Non-Parametric Tests

Tutorial 5: Hypothesis Testing

Permutation Tests for Comparing Two Populations

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Projects Involving Statistics (& SPSS)

1 Nonparametric Statistics

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Statistics in Medicine Research Lecture Series CSMC Fall 2014

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Bivariate Statistics Session 2: Measuring Associations Chi-Square Test

Chapter G08 Nonparametric Statistics

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

University of Chicago Graduate School of Business. Business 41000: Business Statistics

Recall this chart that showed how most of our course would be organized:

THE KRUSKAL WALLLIS TEST

One-Way Analysis of Variance (ANOVA) Example Problem

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Non Parametric Inference

CHAPTER 12 TESTING DIFFERENCES WITH ORDINAL DATA: MANN WHITNEY U

Math 58. Rumbos Fall Solutions to Review Problems for Exam 2

How To Test For Significance On A Data Set

22. HYPOTHESIS TESTING

COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore*

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Section 6.1 Discrete Random variables Probability Distribution

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Introduction to Quantitative Methods

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217


CHAPTER 11 CHI-SQUARE AND F DISTRIBUTIONS

Hypothesis Testing --- One Mean

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Pearson's Correlation Tests

Introduction to Hypothesis Testing OPRE 6301

Likert Scales. are the meaning of life: Dane Bertram

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Additional sources Compilation of sources:

R&M Solutons

Using Excel for inferential statistics

Study Guide for the Final Exam

Chapter 4: Statistical Hypothesis Testing

WHERE DOES THE 10% CONDITION COME FROM?

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

The Wilcoxon Rank-Sum Test

Research Methods & Experimental Design

Crosstabulation & Chi Square

STAT 35A HW2 Solutions

NAG C Library Chapter Introduction. g08 Nonparametric Statistics

Statistics Review PSY379

Introduction to Statistics and Quantitative Research Methods

Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data. Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York

II. DISTRIBUTIONS distribution normal distribution. standard scores

Point Biserial Correlation Tests

Come scegliere un test statistico

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Tests of Hypotheses Using Statistics

Permutation & Non-Parametric Tests

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

WISE Power Tutorial All Exercises

In the past, the increase in the price of gasoline could be attributed to major national or global

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

$ ( $1) = 40

Two-sample inference: Continuous data

Introduction. Statistics Toolbox

Topic 8. Chi Square Tests

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

StatCrunch and Nonparametric Statistics

WHAT IS A JOURNAL CLUB?

CHAPTER 14 NONPARAMETRIC TESTS

12: Analysis of Variance. Introduction

Transcription:

Simple example Simple example A box contains 1000 balls some white some black It is believed (hypothesized) that there are 990 W and 10 B 5 balls are drawn at random ( with replacement). They were all found to be B Would you believe in the hypotheses? No Because,if the hypotheses is true prob of getting all 5 B is small? Reasonable conclusion wrong reason!! Suppose I have 10 cards numbered 1,2,..10 They are shuffled I find 1 on top, 2 next, 3 in the third positions and so on Did the shuffle result in random ordering? No? All orders have same probability 1/10!. Small prob of observation not equivalent to doubt on hypotheses!! Simple example A box contains 1000 balls some white some black It is believed (hypothesized) that there are 990 W (H 0 )and 10 B or 995 W and 5 B (H 1 ) 5 balls are drawn at random ( with replacement). They were all found to be B which of the two hypotheses would you chose? What if H 1 is 980 W and 20 B? The decision depends on the alternative A medicine is effective only if the concentration of a certain chemical is at least 200 ppm. At the same time the medicine would produce side effects if the concentration exceeds 200 ppm. A sample of 50 observations had an average of 201.5 ppm with a standard deviation of.75 ppm. Do you think the medicine would be effective? Hypotheses: concentration is more than 220 ppm Hypotheses: concentration is less than 220 ppm choose between the above two hypotheses

Testing Statistical Hypotheses Null and Alternative hypotheses Null and Alternative Hypothesis p-value significance level Type I and II errors, Power An assertion about a parameter that we hold as true unless we have sufficient (statistical) evidence against it Alternative Hypotheses H1 Specification of values of a population parameter deemed as possible alternative values A medicine is effective only if the concentration of a certain chemical is at least 200 ppm. At the same time the medicine would produce side effects if the concentration exceeds 200 ppm. A sample of 50 observations had an average of 201.5 ppm with a standard deviation of.75 ppm. Do you think the medicine would be effective? Hypotheses: concentration is more than 220 ppm Hypotheses: concentration is less than 220 ppm Which of the two hypotheses is Null and which is alternative? Let the concentration be µ Null: µ 220 Alternative µ > 220 H 0 : µ 220 H 1 : µ > 220 An automobile manufacturer substitutes a different engine in cars that were known to have an average mpg of 31.5 miles. The manufacturer wants to test if the new engine changes the mpg. A random sample of 100 trial runs gives X = 29.8, s.d = 6.6. Test the appropriate hypotheses at level of significance is α =.05.

Type I and II errors µ: Average mpg of all cars with the current engine. H 0 : µ = 31.5 H 1 : µ 31.5 Suppose we have to choose between hypotheses H 0 and H 1 Truth H 0 True H 0 False Decision Accept H 0 correct Type II error Reject H 0 Type I error correct We would ideally like a procedure than makes the probability both the error zero or small. This is not possible A medicine is effective only if the concentration of a certain chemical is at least 200 ppm. At the same time the medicine would produce side effects if the concentration exceeds 200 ppm. A sample of 50 observations had an average of 201.5 ppm with a standard deviation of.75 ppm. Do you think the medicine would be effective? Let the average concentration be µ Assume the concentration is distributed normally with σ =.75 H 0 : µ 220 H 1 : µ > 220 X from 50 samples is normal with mean µ and s.d.75 50 = 0.106. Large values of X favors H1 220

Neyman Pearson approach Mechanics of Testing Type I error is considered more serious than Type II error. So, choose Significance Level α (Usually 5%, 1%,). This is the probability of Type I error that we are willing to tolerate subject to this level of Type I error, choose a method that minimizes Type II error Suppose µ: Mean of a population H 0 : µ = µ 0 ( a specified value ) H 1 = µ > µ 0 barx average of a sample from the population Under H 0 : X will be close to µ 0 Under H 1 : X will be large So large values of X will evidence against H 0. p-value p-value Suppose that H 0 ( that is if µ = µ 0 ) is true Let x be the observed sample average P( X > x) is called the p-value. If the NULL HYPOTHESES were true, this is the probability of observing as extreme a value of X as has been observed Smaller the p-value, lesser the evidence in favor of H 0 If p-value is less than the specified significance level α, then Reject the NULL

Steps in Hypotheses testing Steps in Hypotheses testing Formulate the NULL and the Alternative Choose a test statistic that will be close to the value specified in the NULL Use the alternative to decide what is the extreme value of the statistic (Large? small?, two sided?) compute p-value (use software) smaller the p-value less evidence for Null hyp make a decision If p-value < α - Reject NULL. An automobile manufacturer substitutes a different engine in cars that were known to have an average mpg of 31.5 miles. The manufacturer wants to test if the new engine changes the mpg. A random sample of 100 trial runs gives X = 29.8, s.d = 6.6. Test the appropriate hypotheses at level of significance α =.05 µ: Average mpg of all cars with the current engine. H 0 : µ = 31.5 H 1 : µ 31.5 A value far from 5 indicates that the null hypotheses is untenable

some tests for testing for the mean of a normal population If s.d is known: Z-test using normal distribution If s.d is not known: t-test using t-distribution test for s.d.: chi-square distribution for testing equality of mean of two populations, use t-test for testing equality of variance : F-test 0.005 0.005 29.8 31.5 33.2 NONPARAMETRIC TESTS Mann-Whitney test Parametic models: concerned parameters of distributions. In order to apply these tests, certain conditions about the distributions must be verified. In practice, these tests are applied when the sampling distributions of the data variables reasonably satisfy the normal model. Non-parametric tests make no assumptions regarding the distributions of the data variables they are adequate to small samples there are nonparametric tests that can be applied to ordinal and/or nominal data The non-parametric tests are, in general, less powerful than their parametric counterparts also known as Wilcoxon-Mann-Whitney or rank-sum test, used to assess whether two independent samples were drawn from the same population

Mann-Whitney test start by assigning ranks to the samples Let the samples be denoted x 1, x 2,..., x n and y 1, y 2,..., y m The ranking of the x i and y i assigns ranks in 1, 2,..., n + m. x i 12 21 15 8 y i 9 13 19 The ranking of x i and y i would then yield the result Variable X Y X Y X Y X Data 8 9 12 13 15 19 21 Rank 1 2 3 4 5 6 7 The test statistic is the sum of the ranks for one of the variables, say X W X = 16 The null hypothesis tested is P(X > Y ) = 1 2 under the null hypothesis, one expects the ranks to be randomly distributed between the x i and y i, therefore resulting in approximately equal average ranks in each of the two samples. For small samples, there are tables with the exact probabilities of W X. For large samples (say m or n above 10), the sampling distribution of W X rapidly approaches the normal distribution