Erik Parner 14 September Basic Biostatistics - Day 2-21 September,

Size: px
Start display at page:

Download "Erik Parner 14 September 2016. Basic Biostatistics - Day 2-21 September, 2016 1"

Transcription

1 PhD course in Basic Biostatistics Day Erik Parner, Department of Biostatistics, Aarhus University Log-transformation of continuous data Exercise.+.4+Standard- (Triglyceride) Logarithms and exponentials Two independent samples from normal distributions The model, check of the model, estimation Comparing the two means Approximate confidence interval and test Exact confidence interval and test using the t-distribution Comparing two populations using a non-parametric test The Wilcoxon-Mann-Whitney test Two independent samples from normal distributions Type and type errors Statistical power Sample size calculations Basic Biostatistics - Day Overview Data to analyse Type of analysis Unpaired/Paired Type Day Continuous One sample mean Irrelevant Parametric Day Nonparametric Day 3 Two sample mean Non-paired Parametric Day Nonparametric Day Paired Parametric Day 3 Nonparametric Day 3 Regression Non-paired Parametric Day 5 Several means Non-paired Parametric Day 6 Nonparametric Day 6 Binary One sample mean Irrelevant Parametric Day 4 Two sample mean Non-paired Parametric Day 4 Paired Parametric Day 4 Regression Non-paired Parametric Day 7 Time to event One sample: Cumulative risk Irrelevant Nonparametric Day 8 Regression: Rate/hazard ratio Non-paired Semi-parametric Day 8 Basic Biostatistics - Day Log-transformation of continuous data Continuous data with a long tail to the right are often logtransformed to obtain an approximate normal distribution. Recall the triglyceride measurements. Applying a normal based prediction interval (PI) on the original data gives invalid results: e.g. the PI will not have.5% below and above the two limits. The logarithm of the triglyceride measurements follows (approximately) a normal distribution: Density Log-triglyceride Log-triglyceride Inverse Normal % of data Density % of data We then need to transform the results back to the original scale to obtain useful results on the triglyceride measurements. The method presented here relies on the fact that percentiles are preserved when creating a transformation of the data. Basic Biostatistics - Day.5.5 Triglyceride Basic Biostatistics - Day 3 4 Basic Biostatistics - Day - September, 6

2 y y Erik Parner 4 September 6 Logarithmic and exponential functions Both the logarithm and the exponential function are increasing functions. Logarithm 8 6 Exponential Logarithmic and exponential transformations Medians and percentiles are preserved when making a transformation of the data: 5% to the right exp 6 % to the right log.5.5 x - - x Thus ( X) < ( A) X < A ( X) < ( A) exp exp log log Prediction intervals are given by.5 and 97.5 percentile. For a normal distribution the mean is equal to the median =5 percentile. Basic Biostatistics - Day Basic Biostatistics - Day 5 6 Transforming the results Summary Density PI (-.54;-.) CI mean -.77(-.8;-.74) Let Y denote the original observation. If X=log(Y) has a normal distribution with mean=median=m, and standard deviation=s,then Density Log-triglyceride.5.5 Triglyceride PI (.;.99) exp CI median.46 (.44;.48) a valid 95% CI for m will transform into a valid 95% CI for the median of Y = exp(x) a valid 95% PI for X will transform into a valid 95% PI for Y = exp(x) The relation between the means and medians are median( Y) = exp Basic Biostatistics - Day 7 Basic Biostatistics - Day 8 ( m) ( m s ) meany ( ) = exp +.5 Basic Biostatistics - Day - September, 6

3 It can be shown that ( ) sdy ( ) = meany ( ) exp s - Hence the standard deviation of Y depends on the mean of Y. For this reason the standard deviation is rarely used as a measure of the spread of the distribution of the original data in this setting. In this setting the coefficient of variation (cv) is often used as a measure of the spread of the data sdy ( ) cvy ( ) = = exp( s )- meany ( ) Properties logarithm and exponential function The basic properties of the logarithms and exponentials that we will use throughout the course: Product log exp Sum ( a b) = ( a) + ( b) ( a b) = ( a) - ( b) ( a+ b) = ( a) ( b) ( a- b) = ( a) ( b) log log log log log log exp exp exp exp exp exp b b ( a ) = b ( a) ( a b) = ( a) = ( b) log log exp exp exp a Basic Biostatistics - Day 9 Basic Biostatistics - Day Continuous data two sample mean Body temperature versus gender Scientific question: Do the two gender have different normal body temperature? Design: 3 participants were randomly sampled, 65 males and 65 females Data: Measured temperature, gender Summary of the data (the units are degrees Celsius): Gender N(tempC) mean(tempc) sd(tempc) med(tempc) Male Female Basic Biostatistics - Day Body temperature: Plotting the data Figure. Temperature (C) Temperature (C) Male Female Gender Male Female The data looks fine - a few outliers among females? Basic Biostatistics - Day Basic Biostatistics - Day - September, 6 3

4 Body temperature: Checking the normality in each group Figure. Density.5.5 Male Female Graphs by Gender Normality looks ok! Male Inverse Normal Female Inverse Normal Basic Biostatistics - Day 3 A statistical model: Body temperature: The model Two independent samples from normal distributions, i.e. the two samples are independent and each are assumed to be a random sample from a normal distribution:. The observations are independent (knowing one observation will not alter the distribution of the others). The observations come from the same distribution, e.g. they all have the same mean and variance. 3. This distribution is a normal distribution with unknown mean, m i, and standard deviation, s i. N(m i, s i ) Basic Biostatistics - Day 4 Body temperature: Checking the assumptions The first two think about how data was collected!. Independence between groups information on different individuals Independence within groups: Data are from different individuals, so the assumption is probably ok.. In each group: The observations come from the same distribution. Here we can only speculate. Does the body temperature depend on known factors of interest, for example heart rate, time of day, etc.? Body temperature: The estimates The estimates are found like we did day : ( ) ( ) ( ) ( ) ˆ m = ;36.8, sˆ =.388, sem ˆ m =.48 M M M ˆ m = ;36.99, sˆ =.43, sem ˆ m =.5 F F F Observe that the width of the prediction interval is approximately *.96 *.4 C =.6 C, so there is a large variation in body temperature between individuals within each of the two groups We see that the average body temperature is higher among women Basic Biostatistics - Day 5 Basic Biostatistics - Day 6 Basic Biostatistics - Day - September, 6 4

5 Body temperature: Estimating the difference Remember focus is on the difference between the two groups, meaning, we are interested in : d = mf -mm The unknown difference in mean body temperature. This is of course estimated by: dˆ = ˆ m - ˆ m = =.6 F M What about the precision of this estimate? What is the standard error of a difference? Basic Biostatistics - Day 7 The standard error of a difference If we have two independent estimates and, like here, calculate the differences, then the standard error of the difference is given as ( dˆ) = ( ˆ m ˆ ) ( ˆ ) ( ˆ F - mm = mf + mm ) se se se se We note that standard error of a difference between two independent estimates is larger than both of the two standard errors. In the body temperature data we get: and an approx. 95% CI ( ˆ ) se d = =.7 ( dˆ) ( ) dˆ.96 se = =.5;.3 Basic Biostatistics - Day 8 Testing no difference in means ( ) ( ˆ) d :. 63.5;.3 se d =.7 Here we are especially interested in the hypothesis that body temperature is the same for the two gender: Hypothesis: d = d = We can make an approx. test similar to day dˆ-d dˆ zobs = = = =.3 se dˆ se dˆ.7 and find the p-value as We get p=.3% ( ) ( ) ( z ) Pr standard normal obs Exact inference for two independent normal samples Just like in the one sample setting, it is possible to make exact inference based on the t-distribution. And again these are easily made by a computer. Rememberthe model: Two independent samples from normal distributions with means and standard deviations, m, s and m, s M M F F Note, both the means and the standard deviations might be different in the two populations. If one wants to make exact inference, then one has to make the additional assumption: 4. The standard deviations are the same: s M = s F Basic Biostatistics - Day 9 Basic Biostatistics - Day Basic Biostatistics - Day - September, 6 5

6 Exact inference for two independent normal samples Testing the hypothesis : s M = s F This is done by considering the ratio between the two estimated standard deviations: F obs Ø Largest observed standard deviation ø = Œ Smallest observed standard deviation œ º ß A large value of this F-ratio is critical for the hypothesis Thep-value = the probability of observing a F-ratio at least as large as we have observed - given the hypothesis is true! The p-value is here found by using an F-distribution with (n largest -) and (n smallest -) degrees of freedom: ( ( largest smallest ) obs ) p - value = Pr F n -; n - F Basic Biostatistics - Day Exact inference for two independent normal samples Testing the hypothesis : s M = s F Here we have: n ˆ F = 65 sf =.43 Ø.43ø so F obs = Œ = = n ˆ º.388œ M = 65 sm =.388 ß The observed variance (sd ) is 3% higher among women. But could this be explained by sampling variation what is the p-value? To find the p-value we consult an F-distribution with 64=(65-) and 64=(65-) degrees of freedom. We get p-value = 63%.63.3 The difference in the observed standard deviation can be explained by sampling variation. We accept that s M = s F! The fourth assumption is ok! Basic Biostatistics - Day Exact inference for two independent normal samples We now have a common standard deviation : s = s F = s M This is estimated as a weighted average sˆ = sˆ ( n - ) + sˆ ( n -) ( n - ) + ( n -) F F M M F ( - ) + ( - ) ( 65- ) + ( 65-) M = =.4 Based on this we can calculate a revised/updated standard error of the difference: ( dˆ) se = sˆ + =.4 + =.7 n n F M This is not found in the Stata output Exact inference for two independent normal samples dˆ:.63 se d ˆ =.7 ( ) Exact confidence intervals and p-values are found by using a t-distribution with n M + n F - = = 8 d.f. dˆ t se dˆ = =.4; And the exact test: ( ) ( ) dˆ -.63 H : d = tobs = = =.3 se ˆ.7 ( d ) and find the p-value as Pr( t-distribution tobs ) We get p=.% (either from table of standard normal distribution, or from Stata) Basic Biostatistics - Day 3 Basic Biostatistics - Day 4 Basic Biostatistics - Day - September, 6 6

7 Stata: two-sample normal analysis The F-test and t-test are easily done in Stata (more details can be found in the file day.do).. cd "D:\Teaching\BasalBiostat\Lectures\Day" D:\Teaching\BasalBiostat\Lectures\Day. use normtemp.dta, clear. * Checking the normality.. qnorm tempc if sex==, title("male") name(plot, replace). qnorm tempc if sex==, title("female") name(plot3, replace). graph combine plot plot3, name(plotright, replace) col(). sdtest tempc, by(sex) Variance ratio test Group Obs Mean Std.Err. Std.Dev. [95% Conf.Interval] Male Female combined ratio = sd(male) / sd(female) f =.8847 Ho: ratio = degrees of freedom = 64, 64 Ha: ratio < Ha: ratio!= Ha: ratio > Pr(F < f) =.38 *Pr(F < f)=.656 Pr(F > f)=.687 Basic Biostatistics - Day 5 Basic Biostatistics - Day 6. ttest tempc, by(sex) Two-sample t test with equal variances Group Obs Mean Std.Err. Std.Dev. [95%Conf.Interval] Male Female combined diff diff = mean(male) - mean(female) t = -.34 Ho: diff = degrees of freedom = 8 Ha: diff < Ha: diff!= Ha: diff > Pr(T < t) =. Pr( T > t )=.9 Pr(T > t)=.989 Basic Biostatistics - Day 7 Exact inference for two independent normal samples What if you reject the hypothesis of the same sd in the two groups?. This indicates that the variation in the two groups differ! Think about why!!!. Often it is due to the fact that the assumption of normality is not satisfied. Maybe you would do better by making the statistical analysis on another scale, e.g. log. 3. If you still want to compare the means on the original scale you can make approximate inference based on the t-distribution (e.g. ttest tempc, by(sex) unequal ) 4. If you only want to test the hypothesis that the two distributions are located the same place, then can you use the non-parametric Wilcoxon-Mann-Whitney test see later. Basic Biostatistics - Day 8 Basic Biostatistics - Day - September, 6 7

8 Body temperature example - formulations Methods: Data was analyzed as two independent samples from normal distributions based on the Students t. The assumption of normality was checked by a Q-Q plot. Estimates are given with 95% confidence intervals. Results: The mean body temperature was 36.9(36.8;37.)C among women compared to 36.7(36.6;36.8)C among men. The mean was.6(.;.3)c, higher for females and this was statistically significant (p=.3%). Conclusion: Based on this study we conclude that women have a small, but statistically significantly higher mean body temperature than men. Example 7. Birth weight and heavy smoking Scientific question: Does the smoking habits of the mother influence the birth weight of the child? Design and data: (observational) The birth weight (kg) of children born by 4 heavy smokers and 5 non-smokers were recorded. Summary of the data (the units is kg): Group Obs Mean Std. Err. Std. Dev. [95% Conf. Interval] Non-smok Heavy sm Already here we observe, that the average birth weight is smallest among heavy-smokers: difference=45 g Basic Biostatistics - Day 9 Basic Biostatistics - Day 3 Example 7. Birth weight and heavy smoking Plot the data!!!!!! Example 7. Birth weight and heavy smoking Non-smoker Non-smokers Birth weight 3.5 Birth weight 3.5 Density.5 Heavy smoker Inverse Normal Heavy smokers Non-smoker Heavy smoker Smoking habits.5 Non-smoker Heavy smoker Inverse Normal Graphs by Smoking habits Independence,same distribution and normality seems ok. Basic Biostatistics - Day 3 Basic Biostatistics - Day 3 Basic Biostatistics - Day - September, 6 8

9 Example 7. Birth weight and heavy smoking exact inference Compare the standard deviations (using the computer): Ø.463ø Fobs =.64 p 35% from F (3,4) Œ = = º.3584 œ ß We accept that the two standard deviations are identical. and again by computer we get: Difference in mean birth weight:.45(.38;.767) kg Hypothesis: no difference in mean birth weight. p=.6% Conclusionof the test: If there was no difference between the two groups, then it would be almost impossible to observe such a large difference as we have seen hence the hypothesis cannot be true! Basic Biostatistics - Day 33 The birth weight example - formulations Methods - like the body temperature example: Data intervals. Results: The mean birth weight was 3.67(3.48;3.85) kg among nonsmokers compared to 3.74(.97;3.44) kg among heavy smokers. The difference 45(38;767)g was statistically significant (p=.6%). Conclusion: Children born by heavy-smokers have a birth weight, that is statistically significantly smaller, than that of children born by non-smokers. The study has only limited information on the precise size of the association. Furthermore we have not studied the implications of the difference in birth weight or whether the difference could be explained by other factors, like eating habits Basic Biostatistics - Day 34 Non-Parametric test: Wilcoxon-Mann-Whitney test Until now we have only made statistical inference based on a parametric model. E.g. we have focused on estimating the difference between two groups and supplying the estimate with a confidence interval. We have also performed a statistical test of no difference based on the estimate and the standard error aparametric test. There are other types of tests non-parametric tests that are not based on a parametric model. These test are also based on models, but they are not parametric models. We will here look at the Wilcoxon-Mann-Whitney test, which is the non-parametric analogy to the two sample t-test. Basic Biostatistics - Day 35 Non-Parametric test: Wilcoxon-Mann-Whitney test The key feature of all non-parametric tests is, that they are based on the ranks of the data and not the actual values. Heavy smokers Non-smokers Birth Birth weight Rank weight Rank Smallest Number 7 and 8 Basic Biostatistics - Day 36 Basic Biostatistics - Day - September, 6 9

10 Non-Parametric test: Wilcoxon-Mann-Whitney test We can now add the rank in one of the groups, here the heavy smokers: Heavy-smokers observed rank sum=5.5 Hypothesis:The birth weights among heavy-smokers and non-smokers is the same. Assuming the hypothesis is true one can calculate the expected rank sum among the heavy-smokers and standard error of the observed rank sum and calculate a test statistics: z obs Observed ranksum -Expected ranksum = se( Observed ranksum) = = P-value =.9% The p-value is found as Pr( standard normal zobs ) Basic Biostatistics - Day 37 Non-Parametric test: Wilcoxon-Mann-Whitney test We saw that the ranksum among heavy smokers was smaller than expected if there was no true difference between the two groups. So small that we only observe such a discrepancy in one out of (p-val=.9%) studies like this. We reject the hypothesis! Conclusion Children born by heavy-smokers have a statistically significant lower birth weight than children born by nonsmokers. Remember this depends on, the sample size, the design, the statistical analysis... Basic Biostatistics - Day 38 Non-Parametric test: Wilcoxon-Mann-Whitney test Some comments: There are two assumptions behind the test:. Independence between and within the groups.. Within each group: The observations come from the same distribution, e.g. they all have the same mean and variance. The test is designed to detect a shift in location in the two populations and not, for example, a difference in the variation in the two populations. You will only get a p-value the possible difference in location will is not quantified by an estimate with a confidence interval. As a test it is just as valid as the t-test! Basic Biostatistics - Day 39 Stata: Wilcoxon-Mann-Whitney test. use bwsmoking.dta,clear (Birth weight (kg) of 9 babies born to 4 heavy smokers and 5 non-smokers). ranksum bw, by(group) Two-sample Wilcoxon rank-sum (Mann-Whitney) test group obs rank sum expected Non-smoker Heavy smoker combined unadjusted variance 55. adjustment for ties adjusted variance Ho: bw(group==non-smoker) = bw(group==heavy smoker) z =.597 Prob > z =.94 Basic Biostatistics - Day 4 Basic Biostatistics - Day - September, 6

11 Type and type errors We will here return to the simple interpretation of a statistical test: We test a hypothesis: d = d We will make a Type error if we reject the hypothesis, if it is true. Type error if we accept the hypothesis, if it is false. If we use a specific significance level, a, (typically 5%) then we know: Pr reject d = d given it is true = Pr ( ) ( reject = given = ) = d d d d a The risk of a Type error = a Basic Biostatistics - Day 4 Type and type errors What about the risk of Type error: Pr( accept given it is not true) ( accept d = d given d d ) = b = d = d = Pr? This will depend on several things:. The statisticalmodel and test we will be using. What is the true value of d? 3. The precision of the estimate. What is the sample size and standard deviation? That is, the risk of Type error, b, is not constant. Often we consider the statistical power: ( reject given ) Pr d = d d d = -b Basic Biostatistics - Day 4 Statistical power planning a study - testing for no difference Suppose we are planning a new study of fish oil and its possible effect on diastolic blood pressure (DBP). Assume we want to make a randomized trial with two groups of equal size and we will test the hypothesis of no difference. We believe that the true difference between groups in DBP is 5mmHg. Furthermore we believe that the standard deviation in the increase in DBP is 9mmHg. We plan to include 4 women in each group and analyze using a t-test. What is the chance, that this study will lead to a statistically significant difference between the two groups, given the true difference is 5mmHg? Basic Biostatistics - Day 43 Statistical power, when the true difference is 5 and sd= 7,8,9 or and we test the hypothesis of no difference. Power in % n=4 power=69% True difference = 5 - Test for no difference sd= sd=9 sd=8 sd= Observations in each group Basic Biostatistics - Day 44 Basic Biostatistics - Day - September, 6

12 Statistical power planning a study We plan to include 4 women in each group and analyze using a t-test and the true difference is 5mmHg and sd=9mmhg Power = 69% That is, there is only 69% chance, that such a study will lead to a statistical significant result - given the assumptions are true. How may women should we include in each group if we want to have a power of 9%? Based on the plot we see that more than aprox. 69 women in each group will lead to a power of 9%. Statistical power, when the true difference is 5 and sd= 7,8,9 or and we test the hypothesis of no difference. Power in % power=9% n=69 True difference = 5 - Test for no difference sd= sd=9 sd=8 sd= Observations in each group Basic Biostatistics - Day 45 Basic Biostatistics - Day 46 The power increases as a function of the expected difference between the groups and decreases as a function of the variation, standard deviation, within the groups Power in % True difference = - Test for no difference sd= sd=9 sd=8 sd= Observations in each group Power two unpaired normal samples In general we have the five quantities in play: d = m -m The true difference between groups s = a = b = n= The standard deviation within each group The significance level (typically 5%) The risk of type error = -the power The sample size in each group If we know four of these, then we can determine the last. Typically, we know the first four and want to know the sample size. or we know d, s, a and n and then we want to know the power. Basic Biostatistics - Day 47 Basic Biostatistics - Day 48 Basic Biostatistics - Day - September, 6

13 Stata: Paired sample from a normal distribution Power calculations are done using the power command:. power twomeans 5, sd(9) sd(9) alpha(.5) power(.9) Performing iteration... Estimated sample sizes for a two-sample means test Satterthwaite's t test assuming unequal variances Ho: m = m versus Ha: m!= m Study parameters: alpha =.5 power =.9 delta = m =. m = 5. sd = 9. sd = 9. Estimated sample sizes: N = 4 N per group = 7 * Prior to Stata 3: * sampsi 5, sd(9) sd(9) alpha(.5) power(.9) Basic Biostatistics - Day 49 Comments on sample size calculations Most often done by computer (in Stata power) There are many different formulas see Kirkwood & Stern Table 35.. We will only look at a few in this course. It is in general more relevant to test that the difference is larger than a specified value. A so-called Superiority or Non-inferiority study. Or to plan the study so that your study is expected to yield a confidence interval with a certain width. You need to know the true difference and you must have an idea of the variation within the groups. The latter you might find based on hospital records or in the literature. Sample size calculations after the study has been carried out (post hoc) is nonsense!! The confidence interval will show how much information you have in the study. Basic Biostatistics - Day 5 Basic Biostatistics - Day - September, 6 3

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Chapter 7 Section 7.1: Inference for the Mean of a Population

Chapter 7 Section 7.1: Inference for the Mean of a Population Chapter 7 Section 7.1: Inference for the Mean of a Population Now let s look at a similar situation Take an SRS of size n Normal Population : N(, ). Both and are unknown parameters. Unlike what we used

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test Nonparametric Two-Sample Tests Sign test Mann-Whitney U-test (a.k.a. Wilcoxon two-sample test) Kolmogorov-Smirnov Test Wilcoxon Signed-Rank Test Tukey-Duckworth Test 1 Nonparametric Tests Recall, nonparametric

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

Dongfeng Li. Autumn 2010

Dongfeng Li. Autumn 2010 Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

How To Compare Birds To Other Birds

How To Compare Birds To Other Birds STT 430/630/ES 760 Lecture Notes: Chapter 7: Two-Sample Inference 1 February 27, 2009 Chapter 7: Two Sample Inference Chapter 6 introduced hypothesis testing in the one-sample setting: one sample is obtained

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

Statistics for Sports Medicine

Statistics for Sports Medicine Statistics for Sports Medicine Suzanne Hecht, MD University of Minnesota (suzanne.hecht@gmail.com) Fellow s Research Conference July 2012: Philadelphia GOALS Try not to bore you to death!! Try to teach

More information

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association

More information

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2 Title stata.com ttest t tests (mean-comparison tests) Syntax Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see One-sample t test ttest varname

More information

Difference of Means and ANOVA Problems

Difference of Means and ANOVA Problems Difference of Means and Problems Dr. Tom Ilvento FREC 408 Accounting Firm Study An accounting firm specializes in auditing the financial records of large firm It is interested in evaluating its fee structure,particularly

More information

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS

SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS SOLUTIONS TO BIOSTATISTICS PRACTICE PROBLEMS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION SOLUTIONS 1. a. To calculate the mean, we just add up all 7 values, and divide by 7. In Xi i= 1 fancy

More information

Basic Statistical and Modeling Procedures Using SAS

Basic Statistical and Modeling Procedures Using SAS Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom

More information

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217 Part 3 Comparing Groups Chapter 7 Comparing Paired Groups 189 Chapter 8 Comparing Two Independent Groups 217 Chapter 9 Comparing More Than Two Groups 257 188 Elementary Statistics Using SAS Chapter 7 Comparing

More information

PRACTICE PROBLEMS FOR BIOSTATISTICS

PRACTICE PROBLEMS FOR BIOSTATISTICS PRACTICE PROBLEMS FOR BIOSTATISTICS BIOSTATISTICS DESCRIBING DATA, THE NORMAL DISTRIBUTION 1. The duration of time from first exposure to HIV infection to AIDS diagnosis is called the incubation period.

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

More information

THE KRUSKAL WALLLIS TEST

THE KRUSKAL WALLLIS TEST THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics

More information

Comparing Means in Two Populations

Comparing Means in Two Populations Comparing Means in Two Populations Overview The previous section discussed hypothesis testing when sampling from a single population (either a single mean or two means from the same population). Now we

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Skewed Data and Non-parametric Methods

Skewed Data and Non-parametric Methods 0 2 4 6 8 10 12 14 Skewed Data and Non-parametric Methods Comparing two groups: t-test assumes data are: 1. Normally distributed, and 2. both samples have the same SD (i.e. one sample is simply shifted

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric

More information

Permutation & Non-Parametric Tests

Permutation & Non-Parametric Tests Permutation & Non-Parametric Tests Statistical tests Gather data to assess some hypothesis (e.g., does this treatment have an effect on this outcome?) Form a test statistic for which large values indicate

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Multinomial and Ordinal Logistic Regression

Multinomial and Ordinal Logistic Regression Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,

More information

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples

Statistics. One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples Statistics One-two sided test, Parametric and non-parametric test statistics: one group, two groups, and more than two groups samples February 3, 00 Jobayer Hossain, Ph.D. & Tim Bunnell, Ph.D. Nemours

More information

Analysis of Variance ANOVA

Analysis of Variance ANOVA Analysis of Variance ANOVA Overview We ve used the t -test to compare the means from two independent groups. Now we ve come to the final topic of the course: how to compare means from more than two populations.

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

t-test Statistics Overview of Statistical Tests Assumptions

t-test Statistics Overview of Statistical Tests Assumptions t-test Statistics Overview of Statistical Tests Assumption: Testing for Normality The Student s t-distribution Inference about one mean (one sample t-test) Inference about two means (two sample t-test)

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Difference tests (2): nonparametric

Difference tests (2): nonparametric NST 1B Experimental Psychology Statistics practical 3 Difference tests (): nonparametric Rudolf Cardinal & Mike Aitken 10 / 11 February 005; Department of Experimental Psychology University of Cambridge

More information

Parametric and non-parametric statistical methods for the life sciences - Session I

Parametric and non-parametric statistical methods for the life sciences - Session I Why nonparametric methods What test to use? Rank Tests Parametric and non-parametric statistical methods for the life sciences - Session I Liesbeth Bruckers Geert Molenberghs Interuniversity Institute

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

Using Stata for Categorical Data Analysis

Using Stata for Categorical Data Analysis Using Stata for Categorical Data Analysis NOTE: These problems make extensive use of Nick Cox s tab_chi, which is actually a collection of routines, and Adrian Mander s ipf command. From within Stata,

More information

Guide to Microsoft Excel for calculations, statistics, and plotting data

Guide to Microsoft Excel for calculations, statistics, and plotting data Page 1/47 Guide to Microsoft Excel for calculations, statistics, and plotting data Topic Page A. Writing equations and text 2 1. Writing equations with mathematical operations 2 2. Writing equations with

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

How to set the main menu of STATA to default factory settings standards

How to set the main menu of STATA to default factory settings standards University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be

More information

1 Nonparametric Statistics

1 Nonparametric Statistics 1 Nonparametric Statistics When finding confidence intervals or conducting tests so far, we always described the population with a model, which includes a set of parameters. Then we could make decisions

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Parametric and Nonparametric: Demystifying the Terms

Parametric and Nonparametric: Demystifying the Terms Parametric and Nonparametric: Demystifying the Terms By Tanya Hoskin, a statistician in the Mayo Clinic Department of Health Sciences Research who provides consultations through the Mayo Clinic CTSA BERD

More information

Analysis and Interpretation of Clinical Trials. How to conclude?

Analysis and Interpretation of Clinical Trials. How to conclude? www.eurordis.org Analysis and Interpretation of Clinical Trials How to conclude? Statistical Issues Dr Ferran Torres Unitat de Suport en Estadística i Metodología - USEM Statistics and Methodology Support

More information

StatCrunch and Nonparametric Statistics

StatCrunch and Nonparametric Statistics StatCrunch and Nonparametric Statistics You can use StatCrunch to calculate the values of nonparametric statistics. It may not be obvious how to enter the data in StatCrunch for various data sets that

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

Quick Stata Guide by Liz Foster

Quick Stata Guide by Liz Foster by Liz Foster Table of Contents Part 1: 1 describe 1 generate 1 regress 3 scatter 4 sort 5 summarize 5 table 6 tabulate 8 test 10 ttest 11 Part 2: Prefixes and Notes 14 by var: 14 capture 14 use of the

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Unit 27: Comparing Two Means

Unit 27: Comparing Two Means Unit 27: Comparing Two Means Prerequisites Students should have experience with one-sample t-procedures before they begin this unit. That material is covered in Unit 26, Small Sample Inference for One

More information

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)

More information

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 130) Spring Semester 011 Chapter 10- TWO-SAMPLE TESTS Practice

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Unit 26: Small Sample Inference for One Mean

Unit 26: Small Sample Inference for One Mean Unit 26: Small Sample Inference for One Mean Prerequisites Students need the background on confidence intervals and significance tests covered in Units 24 and 25. Additional Topic Coverage Additional coverage

More information

Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Opgaven Onderzoeksmethoden, Onderdeel Statistiek Opgaven Onderzoeksmethoden, Onderdeel Statistiek 1. What is the measurement scale of the following variables? a Shoe size b Religion c Car brand d Score in a tennis game e Number of work hours per week

More information

SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information