Power Analysis and Sample size estimation

Size: px
Start display at page:

Download "Power Analysis and Sample size estimation"

Transcription

1 Power Analysis and Sample size estimation The Power Analysis implements the techniques of statistical power analysis, sample size estimation, and advanced techniques for confidence interval estimation. The main goal of the first two techniques is to allow you to decide, while in the process of designing an experiment, (a) how large a sample is needed to allow statistical judgments that are accurate and reliable, and (b) how likely your statistical test will be to detect effects of a given size in a particular situation. The third technique is useful in implementing objectives (a) and (b) above, and in evaluating the size of experimental effects in practice. Performing power analysis and sample size estimation is an important aspect of all studies, because without these calculations, sample size may be too high or too low. If sample size is too low, the study will lack the precision to provide reliable answers to the questions it is investigating. If sample size is too large, time and resources will be wasted, often for minimal gain. The Power Analysis provides a number of graphical and analytical tools to enable precise evaluation of the factors affecting power and sample size in many of the most commonly encountered statistical analyses. This information can be crucial to the design of a study that is cost-effective and scientifically useful. Therefore, at the design stage of an investigation, you should aim to minimize the probability of failing to detect a real effect (type II error, false negative). The probability of type II error is equal to one minus the power of a study (probability of detecting a true effect). You must select a power level for your study along with the two sided significance level at which you intend to accept or reject null hypotheses in statistical tests. The significance level you choose (usually 5%) is the probability of type I error (incorrectly rejecting the null hypothesis, false positive). For further reading please see Armitage and Berry, 1994; Fleiss, 1981; Gardner and Altman, 1989; Dupont, 1990; Pearson and Hartley, G-Power estimates minimum sample sizes necessary to avoid given levels of type II error in the comparison of means, the comparison of proportions, Correlations and Regressions and variances. Remember that good design lies at the heart of good research and for important studies statistical advice should be sought at the planning stage. You should be familiar with the basic concepts of Statistics before you use this software. Please digest some introductory learning materials, such as Bland (2000) or selected web sites.

2 Errors, P values and Power The P value or calculated probability is the estimated probability of rejecting the null hypothesis (H0) of a study question when that hypothesis is true. The null hypothesis is usually an hypothesis of "no difference" e.g. no difference between blood pressures in group A and group B. Define a null hypothesis for each study question clearly before the start of your study. The only situation in which you should use a one sided P value is when a large change in an unexpected direction would have absolutely no relevance to your study. This situation is unusual; if you are in any doubt then use a two sided P value. The term significance level (alpha) is used to refer to a pre-chosen probability and the term "P value" is used to indicate a probability that you calculate after a given study. The alternative hypothesis (H1) is the opposite of the null hypothesis; in plain language terms this is usually the hypothesis you set out to investigate. For example, question is "is there a significant (not due to chance) difference in blood pressures between groups A and B if we give group A the test drug and group B a sugar pill?" and alternative hypothesis is " there is a difference in blood pressures between groups A and B if we give group A the test drug and group B a sugar pill". If your P value is less than the chosen significance level then you reject the null hypothesis i.e. accept that your sample gives reasonable evidence to support the alternative hypothesis. It does NOT imply a "meaningful" or "important" difference; that is for you to decide when considering the real-world relevance of your result. The choice of significance level at which you reject H0 is arbitrary. Conventionally the 5% (less than 1 in 20 chance of being wrong), 1% and 0.1% (P < 0.05, 0.01 and 0.001) levels have been used. These numbers can give a false sense of security. In the ideal world, we would be able to define a "perfectly" random sample, the most appropriate test and one definitive conclusion. We simply cannot. What we can do is try to optimise all stages of our research to minimise sources of uncertainty. When presenting P values some groups find it helpful to use the asterisk rating system as well as quoting the P value: P < 0.05 * P < 0.01 ** P < Most authors refer to statistically significant as P < 0.05 and statistically highly significant as P < (less than one in a thousand chance of being wrong). The asterisk system avoids the woolly term "significant". Please note, however, that many statisticians do not like the asterisk rating system when it is used without showing P values. As a rule of thumb, if you can quote an exact P value then do. You might also want to refer to a quoted exact P value as an asterisk in text narrative or tables of contrasts elsewhere in a report.

3 At this point, a word about error. Type I error is the false rejection of the null hypothesis and type II error is the false acceptance of the null hypothesis. As an aid memoir: think that our cynical society rejects before it accepts. The significance level (alpha) is the probability of type I error. The power of a test is one minus the probability of type II error (beta). Power should be maximised when selecting statistical methods. If you want to estimate sample sizes then you must understand all of the terms mentioned here. The following table shows the relationship between power and error in hypothesis testing: DECISION TRUTH Accept H 0 Reject H 0 H 0 is true correct decision P type I error P 1-alpha alpha (significance) H 0 is false type II error P correct decision P beta 1-beta (power) H 0 = null hypothesis P = probability If you are interested in further details of probability and sampling theory at this point then please refer to one of the general texts listed in the reference section. You must understand confidence intervals if you intend to quote P values in reports and papers. Statistical referees of scientific journals expect authors to quote confidence intervals with greater prominence than P values. Notes about Type I error: is the incorrect rejection of the null hypothesis maximum probability is set in advance as alpha is not affected by sample size as it is set in advance increases with the number of tests or end points (i.e. do 20 tests and 1 is likely to be wrongly significant) Notes about Type II error: is the incorrect acceptance of the null hypothesis probability is beta beta depends upon sample size and alpha can't be estimated except as a function of the true population effect beta gets smaller as the sample size gets larger beta gets smaller as the number of tests or end points increases

4 How to Work with G-Power by an Example

5

6

7

8

9

10

11 Sample Size Determination in G-Power by Examples: In this section we prepared the input quantities needed for each goal of sample size determination and output by GPower is presented. For more details on statistical and technical issues of the computations see Faul et. al. (2007): 1) Exact Tests Exact - Correlation: Bivariate normal model Options: exact distribution Correlation ρ H1 = 0.3 Correlation ρ H0 = 0 Output: Lower critical r = Upper critical r = Total sample size = 115 Actual power = Exact - Linear multiple regression: Random model Options: Exact distribution H1 ρ² = 0.3 H0 ρ² = 0 Number of predictors = 3 Output: Lower critical R² = Upper critical R² = Total sample size = 49 Actual power = Exact - Proportion: Difference from constant (binomial test, one sample case)

12 Effect size g = 0.3 Constant proportion = 0.5 Output: Lower critical N = Upper critical N = Total sample size = 28 Actual power = Actual α = Exact - Proportions: Inequality, two dependent groups (McNemar) Options: approximation Odds ratio = 1.5 Prop discordant pairs = 0.3 Output: Lower critical N = 148 Upper critical N = 148 Total sample size = 894 Actual power = Actual α = Proportion p12 = Proportion p21 = Exact - Proportions: Inequality, two independent groups (Fisher's exact test) Options: Exact distribution Proportion p1 = 0.5 Proportion p2 = 0.6 Allocation ratio N2/N1 = 1 Output: Sample size group 1 = 557 Sample size group 2 = 557 Total sample size = 1114 Actual power = Actual α = Exact - Proportions: Inequality, two independent groups (unconditional) Options: z-test pooled

13 Odds ratio = Proportion p2 = 0.5 Allocation ratio N2/N1 = 1 Output: Sample size group 1 = 233 Sample size group 2 = 233 Total sample size = 466 Actual α = Actual power = Exact - Proportions: Inequality (offset), two independent groups (unconditional) Options: proportion, z-test pooled Prop. p1 H1 = 0.65 Prop. p1 H0 = 0.55 Proportion p2 = 0.5 Allocation ratio N2/N1 = 1 Output: Sample size group 1 = 524 Sample size group 2 = 524 Total sample size = 1048 Actual α = Actual power = Exact - Proportion: Sign test (binomial test) Effect size g = 0.3 Output: Lower critical N = Upper critical N = Total sample size = 28 Actual power = Actual α = Exact - Generic binomial test Proportion p2 = 0.8

14 Proportion p1 = 0.5 Output: Lower critical N = Upper critical N = Total sample size = 28 Actual power = Actual α = ) F tests F tests - ANCOVA: Fixed effects, main effects and interactions Input: Effect size f = 0.25 Numerator df = 10 Number of groups = 5 Number of covariates = 1 Output: Noncentrality parameter λ = Critical F = Denominator df = 394 Total sample size = 400 Actual power = F tests - ANOVA: Fixed effects, omnibus, one-way Input: Effect size f = 0.25 Number of groups = 5 Output: Noncentrality parameter λ = Critical F = Numerator df = 4 Denominator df = 300 Total sample size = 305 Actual power = F tests - ANOVA: Fixed effects, special, main effects and interactions Input: Effect size f = 0.25

15 Numerator df = 10 Number of groups = 5 Output: Noncentrality parameter λ = Critical F = Denominator df = 395 Total sample size = 400 Actual power = F tests - ANOVA: Repeated measures, between factors Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0.5 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 130 Total sample size = 132 Actual power = F tests - ANOVA: Repeated measures, within factors Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0.5 Nonsphericity correction ε = 1 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 102 Total sample size = 36 Actual power = F tests - ANOVA: Repeated measures, within-between interaction Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0.5

16 Nonsphericity correction ε = 1 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 102 Total sample size = 36 Actual power = F tests - Hotellings T²: One group mean vector Input: Effect size Δ = 0.15 Response variables = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 688 Total sample size = 690 Actual power = F tests - Hotellings T²: Two group mean vectors Input: Effect size Δ = 0.15 Allocation ratio N2/N1 = 1 Response variables = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 2747 Sample size group 1 = 1375 Sample size group 2 = 1375 Actual power = F tests - MANOVA: Global effects Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f²(v) = 0.25 Number of groups = 3 Response variables = 2 Output: Noncentrality parameter λ = Critical F =

17 Numerator df = Denominator df = Total sample size = 42 Actual power = Pillai V = F tests - MANOVA: Special effects and interactions Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f²(v) = 0.25 Number of groups = 6 Number of predictors = 3 Response variables = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = Total sample size = 46 Actual power = Pillai V = F tests - MANOVA: Repeated measures, between factors Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = Total sample size = 54 Actual power = Pillai V = F tests - MANOVA: Repeated measures, within factors Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f = 0.25 Number of groups = 2

18 Number of measurements = 4 Corr among rep measures = 0 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = Total sample size = 74 Actual power = Pillai V = F tests - MANOVA: Repeated measures, within-between interaction Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f(v) = 0.25 Number of groups = 2 Number of measurements = 4 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 275 Total sample size = 279 Actual power = Pillai V = F tests - Linear multiple regression: Fixed model, R² deviation from zero Input: Effect size f² = 0.15 Number of predictors = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 104 Total sample size = 107 Actual power = F tests - Linear multiple regression: Fixed model, R² increase Input: Effect size f² = 0.15

19 Number of tested predictors = 2 Total number of predictors = 5 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 101 Total sample size = 107 Actual power = F tests - Variance: Test of equality (two sample case) Ratio var1/var0 = 1.5 Allocation ratio N2/N1 = 1 Output: Lower critical F = Upper critical F = Numerator df = 265 Denominator df = 265 Sample size group 1 = 266 Sample size group 2 = 266 Actual power = F tests - Generic F test Analysis: Compromise: Compute α and β Input: Noncentrality parameter λ = 30 β/α ratio = 1 Numerator df = 10 Denominator df = 20 Output: Critical F = α err prob = β err prob = Power (1-β err prob) =

20 3) t tests t tests - Correlation: Point biserial model Effect size ρ = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = 98 Total sample size = 100 Actual power = t tests - Linear bivariate regression: One group, size of slope Slope H1 = 0.15 α err prob = Power (1-β err prob) = Slope H0 = 0 Std dev σ_x = 1 Std dev σ_y = 1 Output: Noncentrality parameter δ = Critical t = Df = 98 Total sample size = 100 Actual power = t tests - Linear bivariate regression: Two groups, difference between intercepts Δ intercept = 2 Allocation ratio N2/N1 = 1 Std dev residual σ = 2 Std dev σ_x1 = 4 Std dev σ_x2 = 3 Mean μ_x1 = 5 Mean μ_x2 = 7 Output: Noncentrality parameter δ = Critical t = Df = 194

21 Sample size group 1 = 99 Sample size group 2 = 99 Total sample size = 198 Actual power = t tests - Linear bivariate regression: Two groups, difference between slopes Δ slope = α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Std dev residual σ = 0.5 Std dev σ_x1 = 5 Std dev σ_x2 = 10 Output: Noncentrality parameter δ = Critical t = Df = 98 Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = t tests - Linear multiple regression: Fixed model, single regression coefficient Effect size f² = 0.15 α err prob = Power (1-β err prob) = Number of predictors = 2 Output: Noncentrality parameter δ = Critical t = Df = 97 Total sample size = 100 Actual power = t tests - Means: Difference between two dependent means (matched pairs) Effect size dz = 0.5 α err prob = Power (1-β err prob) =

22 Output: Noncentrality parameter δ = Critical t = Df = 100 Total sample size = 101 Actual power = t tests - Means: Difference between two independent means (two groups) Effect size d = 0.5 α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Output: Noncentrality parameter δ = Critical t = Df = 100 Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = t tests - Means: Difference from constant (one sample case) Effect size d = 0.5 α err prob = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = 100 Total sample size = 101 Actual power = t tests - Means: Wilcoxon signed-rank test (matched pairs) Options: A.R.E. method Parent distribution = Normal Effect size dz = 0.5 α err prob = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = Total sample size = 101 Actual power =

23 t tests - Means: Wilcoxon signed-rank test (one sample case) Options: A.R.E. method Parent distribution = Normal Effect size d = 0.5 α err prob = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = Total sample size = 101 Actual power = t tests - Means: Wilcoxon-Mann-Whitney test (two groups) Options: A.R.E. method Parent distribution = Normal Effect size d = 0.5 α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Output: Noncentrality parameter δ = Critical t = Df = Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = t tests - Generic t test Analysis: Compromise: Compute α and β Noncentrality parameter δ = 10 β/α ratio = 1 Df = 1 Output: Critical t = α err prob = β err prob = Power (1-β err prob) =

24 4) χ² tests χ² tests - Goodness-of-fit tests: Contingency tables Input: Effect size w = 0.3 α err prob = Power (1-β err prob) = Df = 5 Output: Noncentrality parameter λ = Critical χ² = Total sample size = 100 Actual power = χ² tests - Variance: Difference from constant (one sample case) Ratio var1/var0 = 1.5 α err prob = Power (1-β err prob) = Output: Lower critical χ² = Upper critical χ² = Df = 99 Total sample size = 100 Actual power = χ² tests - Generic χ² test Analysis: Compromise: Compute α and β Input: Noncentrality parameter λ = 20 β/α ratio = 1 Df = 5 Output: Critical χ² = α err prob = β err prob =

25 5) z tests z tests - Correlation: Tetrachoric model Options: Exact r H1 corr ρ = 0.1 α err prob = Power (1-β err prob) = H0 corr ρ = -0.1 Marginal prop x = 0.6 Marginal prop y = 0.3 Output: Critical z = Total sample size = 100 Actual power = H1 corr ρ = H0 corr ρ = Critical r lwr = Critical r upr = Std err r = z tests - Correlations: Two dependent Pearson r's (common index) H1 corr ρ_ac = -0.1 α err prob = Power (1-β err prob) = H0 corr ρ_ab = 0.1 Corr ρ_bc = -0.1 Output: Critical z = Sample size = 101 Actual power = z tests - Correlations: Two dependent Pearson r's (no common index) H1 corr ρ_cd = 0.15 α err prob = Power (1-β err prob) = H0 corr ρ_ab = 0.1 Corr ρ_ac = 0.7 Corr ρ_ad = 0.6 Corr ρ_bc = 0.4 Corr ρ_bd = 0.41

26 Output: Critical z = Sample size = 100 Actual power = z tests - Correlations: Two independent Pearson r's Effect size q = 0.6 α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Output: Critical z = Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = z tests - Logistic regression Options: Large sample z-test, Demidenko (2007) with var corr Odds ratio = 1.3 Pr(Y=1 X=1) H0 = 0.2 α err prob = Power (1-β err prob) = R² other X = 0 X distribution = Normal X parm μ = 0 X parm σ = 1 Output: Critical z = Total sample size = 101 Actual power = z tests - Poisson regression Options: Large sample z-test, Demidenko (2007) with var corr Exp(β1) = 1.3 Base rate exp(β0) = 0.85 Mean exposure = 1 R² other X = 0 X distribution = Normal X parm μ = 0 X parm σ = 1 Output: Critical z =

27 Total sample size = 179 Actual power = z tests - Proportions: Difference between two independent proportions Proportion p2 = 0.6 Proportion p1 = 0.5 Allocation ratio N2/N1 = 1 Output: Critical z = Sample size group 1 = 533 Sample size group 2 = 533 Total sample size = 1066 Actual power = z tests - Generic z test Analysis: Compromise: Compute α and β Noncentrality parameter μ = 3 Noncentral dist. SD σ = 1 β/α ratio = 1 Output: Critical z = α err prob = β err prob = Power (1-β err prob) = Sample Size Determination in STATA by Examples: 1) Comparing survival in groups

28

29 . stpower logrank Estimated sample sizes for two-sample comparison of survivor functions Log-rank test, Freedman method Ho: S1(t) = S2(t) Input parameters: alpha = (two sided) hratio = power =

30 p1 = Estimated number of events and sample sizes: E = 72 N = 72 N1 = 36 N2 = 36 2) Cox Regression

31

32 . stpower cox, wdprob(.4) Estimated sample size for Cox PH regression Wald test, log-hazard metric Ho: [b1, b2,..., bp] = [0, b2,..., bp] Input parameters: alpha = (two sided) b1 = sd = power = withdrawal(%) = 40.00

33 Estimated number of events and sample size: E = 66 N = 109 Sample Size Determination in PASS by Examples: Some web based sample size calculators: Based on Power: alculators.aspx Based on CI

34 References Armitage P, Berry G, Matthews JNS. Statistical Methods in Medical Research (4th edition). Oxford: Blackwell Science Bland M. An Introduction to Medical Statistics (3rd edition). Oxford Medical Publications Casagrande JT, Pike MC, Smith PG. An improved approximate formula for calculating sample sizes for comparing two binomial distributions. Biometrics 1978;34: Donner A, Eliasziw M. A goodness of fit approach to inference procedures for the Kappa statistic: CI construction, significance testing and sample size estimation. Statistics in Medicine 1992;11: Dupont WD. Power calculations for matched case-control studies. Biometrics 1988;44: Dupont WD. Power and sample size calculations. Controlled Clinical Trials 1990;11: Fleiss JL. Statistical Methods for Rates and Proportions (2nd edition). New York: Wiley Gardner MJ, Altman DG. Statistics with Confidence - Confidence Intervals and Statistical Guidelines. British Medical Journal Pearson & Hartley. Biometrika tables for statisticians (Volumes I and II, 3rd edition). Cambridge University Press Franz, Erdfelder, Lang and Buchner. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods 2007, 39 (2),

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Statistics in Medicine Research Lecture Series CSMC Fall 2014 Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables. SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation

More information

Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses

Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses Behavior Research Methods 2009, 41 (4), 1149-1160 doi:10.3758/brm.41.4.1149 Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses Franz Faul Christian-Albrechts-Universität,

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Point Biserial Correlation Tests

Point Biserial Correlation Tests Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)

More information

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities. Consider a study in which How many subjects? The importance of sample size calculations Office of Research Protections Brown Bag Series KB Boomer, Ph.D. Director, boomer@stat.psu.edu A researcher conducts

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Basic Statistical and Modeling Procedures Using SAS

Basic Statistical and Modeling Procedures Using SAS Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Come scegliere un test statistico

Come scegliere un test statistico Come scegliere un test statistico Estratto dal Capitolo 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc. (disponibile in Iinternet) Table

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Tests for Two Proportions

Tests for Two Proportions Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics

More information

for the social, behavioral, and biomedical sciences. Behavior Research Methods, 39, 175-191.

for the social, behavioral, and biomedical sciences. Behavior Research Methods, 39, 175-191. Example of a Statistical Power Calculation (p. 206) Photocopiable This example uses the statistical power software package G*Power 3. I am grateful to the creators of the software for giving their permission

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Tests for Two Survival Curves Using Cox s Proportional Hazards Model Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles

Study Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles Study Design Sample Size Calculation & Power Analysis RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles Contents 1. Background 2. Common Designs 3. Examples 4. Computer

More information

Sample Size Planning, Calculation, and Justification

Sample Size Planning, Calculation, and Justification Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Pearson's Correlation Tests

Pearson's Correlation Tests Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

Ordinal Regression. Chapter

Ordinal Regression. Chapter Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

More information

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem) NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

SPSS Tests for Versions 9 to 13

SPSS Tests for Versions 9 to 13 SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list

More information

Guide to Microsoft Excel for calculations, statistics, and plotting data

Guide to Microsoft Excel for calculations, statistics, and plotting data Page 1/47 Guide to Microsoft Excel for calculations, statistics, and plotting data Topic Page A. Writing equations and text 2 1. Writing equations with mathematical operations 2 2. Writing equations with

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Non-Inferiority Tests for Two Means using Differences

Non-Inferiority Tests for Two Means using Differences Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Multivariate Logistic Regression

Multivariate Logistic Regression 1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Some Essential Statistics The Lure of Statistics

Some Essential Statistics The Lure of Statistics Some Essential Statistics The Lure of Statistics Data Mining Techniques, by M.J.A. Berry and G.S Linoff, 2004 Statistics vs. Data Mining..lie, damn lie, and statistics mining data to support preconceived

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015

Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Stata Example (See appendices for full example).. use http://www.nd.edu/~rwilliam/stats2/statafiles/multicoll.dta,

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application

More information

Biostatistics: Types of Data Analysis

Biostatistics: Types of Data Analysis Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS

More information

How to get accurate sample size and power with nquery Advisor R

How to get accurate sample size and power with nquery Advisor R How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007 Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover

More information

Non-Inferiority Tests for One Mean

Non-Inferiority Tests for One Mean Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

Non-Inferiority Tests for Two Proportions

Non-Inferiority Tests for Two Proportions Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

Statistical Functions in Excel

Statistical Functions in Excel Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.

More information

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) 1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Categorical Data Analysis

Categorical Data Analysis Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

Econometrics and Data Analysis I

Econometrics and Data Analysis I Econometrics and Data Analysis I Yale University ECON S131 (ONLINE) Summer Session A, 2014 June 2 July 4 Instructor: Doug McKee (douglas.mckee@yale.edu) Teaching Fellow: Yu Liu (dav.yu.liu@yale.edu) Classroom:

More information

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals. 1 BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine

More information

The Statistics Tutor s Quick Guide to

The Statistics Tutor s Quick Guide to statstutor community project encouraging academics to share statistics support resources All stcp resources are released under a Creative Commons licence The Statistics Tutor s Quick Guide to Stcp-marshallowen-7

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. 277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies

More information

Analysis of Variance. MINITAB User s Guide 2 3-1

Analysis of Variance. MINITAB User s Guide 2 3-1 3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced

More information

6 Variables: PD MF MA K IAH SBS

6 Variables: PD MF MA K IAH SBS options pageno=min nodate formdlim='-'; title 'Canonical Correlation, Journal of Interpersonal Violence, 10: 354-366.'; data SunitaPatel; infile 'C:\Users\Vati\Documents\StatData\Sunita.dat'; input Group

More information

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2 Title stata.com ttest t tests (mean-comparison tests) Syntax Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see One-sample t test ttest varname

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480 1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis. These models are appropriate when the response

More information