T s and F s. Statistical testing for means. FETP India



Similar documents
THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Chapter 2 Probability Topics SPSS T tests

Section 13, Part 1 ANOVA. Analysis Of Variance

1.5 Oneway Analysis of Variance

Difference of Means and ANOVA Problems

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Simple Linear Regression Inference

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

3.4 Statistical inference for 2 populations based on two samples

Confidence Intervals for the Difference Between Two Means

Statistics Review PSY379

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Mind on Statistics. Chapter 13

Non-Parametric Tests (I)

UNIVERSITY OF NAIROBI

Inference for two Population Means

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

Non-Inferiority Tests for Two Means using Differences

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

Math 108 Exam 3 Solutions Spring 00

Standard Deviation Estimator

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

UNDERSTANDING THE TWO-WAY ANOVA

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Lecture Notes Module 1

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Principles of Hypothesis Testing for Public Health

Two-sample hypothesis testing, II /16/2004

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

12: Analysis of Variance. Introduction

2 Sample t-test (unequal sample sizes and unequal variances)

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Erik Parner 14 September Basic Biostatistics - Day 2-21 September,

DATA INTERPRETATION AND STATISTICS

Sample Size and Power in Clinical Trials

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Odds ratio, Odds ratio test for independence, chi-squared statistic.

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Module 5: Multiple Regression Analysis

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Final Exam Practice Problem Answers

individualdifferences

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

HYPOTHESIS TESTING: POWER OF THE TEST

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

DDBA 8438: The t Test for Independent Samples Video Podcast Transcript

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

13: Additional ANOVA Topics. Post hoc Comparisons

1 Basic ANOVA concepts

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

Skewed Data and Non-parametric Methods

Part 2: Analysis of Relationship Between Two Variables

Comparing Means in Two Populations

Chapter 8: Hypothesis Testing for One Population Mean, Variance, and Proportion

CHAPTER IV FINDINGS AND CONCURRENT DISCUSSIONS

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Chapter 7. One-way ANOVA

Study Guide for the Final Exam

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Psychology 60 Fall 2013 Practice Exam Actual Exam: Next Monday. Good luck!

Week 4: Standard Error and Confidence Intervals

13. Poisson Regression Analysis

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

Linear Models in STATA and ANOVA

Descriptive Statistics

Types of Data, Descriptive Statistics, and Statistical Tests for Nominal Data. Patrick F. Smith, Pharm.D. University at Buffalo Buffalo, New York

Statistiek I. Proportions aka Sign Tests. John Nerbonne. CLCG, Rijksuniversiteit Groningen.

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

II. DISTRIBUTIONS distribution normal distribution. standard scores

PRACTICE PROBLEMS FOR BIOSTATISTICS

Additional sources Compilation of sources:

Independent t- Test (Comparing Two Means)

Projects Involving Statistics (& SPSS)

Data Analysis Tools. Tools for Summarizing Data

Chapter 8 Paired observations

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

t-test Statistics Overview of Statistical Tests Assumptions

TI-Inspire manual 1. Instructions. Ti-Inspire for statistics. General Introduction

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Two-sample inference: Continuous data

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

2 Precision-based sample size calculations

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Unit 31: One-Way ANOVA

Tests for Two Proportions

Pearson s Correlation

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

1.0 Abstract. Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA. Keywords. Rationale and Background:

Transcription:

T s and F s Statistical testing for means FETP India

Competency to be gained from this lecture Test the statistical significance of the difference between two means

Key elements Paired and unpaired data Paired t-test Unpaired t-test F test

Means Proportion Application of the concept of statistical testing Measures of association Paired and unpaired data

Statistical testing for means Means T-test for paired data T-test for unpaired data F-test to test the difference in variances Proportion Measures of association Paired and unpaired data

Comparing unpaired data Concept Comparing a bag of observations against another bag of observations Example The mean height of the children in one class versus versus the height of the children in another Paired and unpaired data

Comparing paired data Concept Comparing pairs of observations that are linked with each other Example Pre and post treatment values of a parameter in a group of subjects Paired and unpaired data

Paired and unpaired t-tests To test the difference between two sample means that are paired (e.g., before and after treatment) or matched (e.g., patients matched for age, sex, etc) Use PAIRED t-test To test the difference between two sample means that are not paired / unmatched Use UNPAIRED (independent) t-test Paired and unpaired data

Example Drug trial Drug A and Drug B Two groups have equal initial blood sugars levels Question: Does the drug have an impact on the blood sugar level? Null hypothesis There is no difference between the mean blood sugar levels before and after treatment Paired and unpaired data

Options available for the example considered Two paired t-tests Each group has an initial and a post treatment values Two paired t-tests are possible for each group This option is adapted to a research question examining the individual relevance of each drug One unpaired test test on final value This option is adapted to a research question comparing the two drugs Paired and unpaired data

Methods to calculate the paired t-test: Concept We test the probability that the difference between the paired data is equal to 0 Paired t-test

Methods to calculate the paired t-test: Formula (1/2) Number of pairs: n Value before Rx: a Value after treatment: b Difference: Mean (d):

Methods to calculate the paired t-test: Formula (2/2) Variance (d):

Illustration of an application of the t-test Drug No of patients Fasting blood sugar (mg%) Initial Final Decrease A 30 178 153 25* B 31 179 119 60* * Statistically Significant ( P < 0.05) Paired t-test

Numerical example of paired t - test Patient number 1 2 3 4 5 6 7 8 9 10 Erythrocyte sedimentation rate - 1 hour (mm) Before Rx (a) After Rx (b) 8 25 10 43 6 38 7 20 10 41 5 48 8 15 9 28 4 35 3 33 Difference (a b) = d 17 33 32 13 31 43 7 19 31 30 Square of difference (d 2 ) 289 1,088 1,024 169 961 1,849 49 361 961 900 Total 326 70 256 7,652 Paired t-test

d = 256 ; n = 10 ; d = 256/10 = 25.6 d 2 = 7652 Variance (s 2 ) = 1 d n 1 ( d n 2 2 ) 1 (256) 7652 10 1 10 = = 122.04 S 2 122.04 s = = = 11.047 d 25.6 s / n 11.047 / 10 t = = = 7.33 with 9 d.f. 2 Paired t-test

Inference Calculated value of t= 7.33 9 degrees of freedom (df) Tabulated value of t (df=9)(0.1%) = 4.781 The value of t-cal exceeding the value of t-tab The treatment had a significant benefit in reducing the erythrocyte sedimentation rate (P < 0.001) The mean erythrocyte sedimentation rate after treatment (7.0 mm) is significantly lower than the mean pre-treatment ESR value (32.6 mm) Paired t-test

Methods to calculate the unpaired t-test: Concept The pooled variance is a weighted average of the two variances If the two sample sizes are equal, the pooled variance is the mean of the two variances The t-table is identical for unpaired and paired data Unpaired t-test

Methods to calculate the unpaired t-test: Formula Sample I Sample II Size n 1 n 2 Mean x 1 x 2 Variance s 2 1 s 2 2 To test the significance of the difference between the two sample means, calculate x1 x2 x x SE( x x2) 1 2 t = = 1 s 2 1 n 1 1 n 2 (n 1-1) s 2 1 + (n 2-1) s 2 2 where s 2 = ------------------------------- (n 1-1) + (n 2-1) t follows a t distribution with (n 1 + n 2-2) df Unpaired t-test

Numerical example of unpaired t -test Comparing the 24-hour total energy expenditure among an obese and a lean group Null hypothesis: There is no difference between the mean energy expenditure between the two groups

t cal > t tab indicate that the mean energy expenditure in obese group (10.3) is significantly (P<0.001) higher than that of lean group (8.1) Unpaired t-test

Underlying assumptions of the unpaired t-test 1. The distributions of x1 and x2 are normal 2. The population variances of x1 and x2 are equal However, minor deviations from these assumptions do not affect the validity of the test Unpaired t-test

Un-paired t-test on paired data It would be inefficient to test paired observations as though they were unpaired Consequences: Underestimation of t - value Overestimation of probability value Undercalling of significant difference Unpaired t-test

Unequal variances Variances in the two samples may differ considerably from one another Example: Two technicians, one experienced (more consistent) and the other relatively inexperienced (more variable) undertake a blood count Both technicians are estimating the same population mean value The more experienced one will have a smaller variability in his readings than the less experienced one F test

Possible course of action for situations with unequal variances No course of action will suit all situations Options: Transform the values to some other scale (e.g. logarithmic) to equalize variances Use specific methods when this is not possible: Modified t test Fisher-Behren s test F test

Variance ratio test (F- test) To test the equality of two variances, s12 and s22, we use a statistical test called the variance ratio test (F-test) Calculate the ratio of the larger variance to the smaller variance s12 i.e., F = ------- (s12 - larger variance) s22 F follows a F-distribution with (n1 1) and (n2 1) degrees of freedom F test

Example of variance ratio test (F-test) Variance in the infected group 10.9 (n1= 10) Variance in the control group 5.9 (n2 = 12) F is calculated as = 10.9 / 5.9 = 1.85 9,11 degrees of freedom (n1 and n2-1) Tabulated F 9,11(5%) = 2.92 A calculated F (Fcal) smaller than the tabulated F (Ftab) indicates that the variances are equal F test

Assumptions of the variance ratio F test The two samples must be independent e.g., Two series of patients and not the same patients tested twice (before and after treatment) Both samples must have come from a normal distribution F test

What test should be used to test the difference between two means? Test the difference between two sample mean values Values are paired / matched Values are unpaired / unmatched Paired t-test Check if variances are equal Equal variances Different variances Unpaired t-test Modified t-test Fisher-Behren test

Key messages Determine whether the data are paired Used paired t-test for paired data Used unpaired t-test for unpaired data if the variances are comparable Test for the difference in variance with F- test and use other tests if variances differ