Power Analysis and Sample size estimation
|
|
- Regina Chapman
- 7 years ago
- Views:
Transcription
1 Power Analysis and Sample size estimation The Power Analysis implements the techniques of statistical power analysis, sample size estimation, and advanced techniques for confidence interval estimation. The main goal of the first two techniques is to allow you to decide, while in the process of designing an experiment, (a) how large a sample is needed to allow statistical judgments that are accurate and reliable, and (b) how likely your statistical test will be to detect effects of a given size in a particular situation. The third technique is useful in implementing objectives (a) and (b) above, and in evaluating the size of experimental effects in practice. Performing power analysis and sample size estimation is an important aspect of all studies, because without these calculations, sample size may be too high or too low. If sample size is too low, the study will lack the precision to provide reliable answers to the questions it is investigating. If sample size is too large, time and resources will be wasted, often for minimal gain. The Power Analysis provides a number of graphical and analytical tools to enable precise evaluation of the factors affecting power and sample size in many of the most commonly encountered statistical analyses. This information can be crucial to the design of a study that is cost-effective and scientifically useful. Therefore, at the design stage of an investigation, you should aim to minimize the probability of failing to detect a real effect (type II error, false negative). The probability of type II error is equal to one minus the power of a study (probability of detecting a true effect). You must select a power level for your study along with the two sided significance level at which you intend to accept or reject null hypotheses in statistical tests. The significance level you choose (usually 5%) is the probability of type I error (incorrectly rejecting the null hypothesis, false positive). For further reading please see Armitage and Berry, 1994; Fleiss, 1981; Gardner and Altman, 1989; Dupont, 1990; Pearson and Hartley, G-Power estimates minimum sample sizes necessary to avoid given levels of type II error in the comparison of means, the comparison of proportions, Correlations and Regressions and variances. Remember that good design lies at the heart of good research and for important studies statistical advice should be sought at the planning stage. You should be familiar with the basic concepts of Statistics before you use this software. Please digest some introductory learning materials, such as Bland (2000) or selected web sites.
2 Errors, P values and Power The P value or calculated probability is the estimated probability of rejecting the null hypothesis (H0) of a study question when that hypothesis is true. The null hypothesis is usually an hypothesis of "no difference" e.g. no difference between blood pressures in group A and group B. Define a null hypothesis for each study question clearly before the start of your study. The only situation in which you should use a one sided P value is when a large change in an unexpected direction would have absolutely no relevance to your study. This situation is unusual; if you are in any doubt then use a two sided P value. The term significance level (alpha) is used to refer to a pre-chosen probability and the term "P value" is used to indicate a probability that you calculate after a given study. The alternative hypothesis (H1) is the opposite of the null hypothesis; in plain language terms this is usually the hypothesis you set out to investigate. For example, question is "is there a significant (not due to chance) difference in blood pressures between groups A and B if we give group A the test drug and group B a sugar pill?" and alternative hypothesis is " there is a difference in blood pressures between groups A and B if we give group A the test drug and group B a sugar pill". If your P value is less than the chosen significance level then you reject the null hypothesis i.e. accept that your sample gives reasonable evidence to support the alternative hypothesis. It does NOT imply a "meaningful" or "important" difference; that is for you to decide when considering the real-world relevance of your result. The choice of significance level at which you reject H0 is arbitrary. Conventionally the 5% (less than 1 in 20 chance of being wrong), 1% and 0.1% (P < 0.05, 0.01 and 0.001) levels have been used. These numbers can give a false sense of security. In the ideal world, we would be able to define a "perfectly" random sample, the most appropriate test and one definitive conclusion. We simply cannot. What we can do is try to optimise all stages of our research to minimise sources of uncertainty. When presenting P values some groups find it helpful to use the asterisk rating system as well as quoting the P value: P < 0.05 * P < 0.01 ** P < Most authors refer to statistically significant as P < 0.05 and statistically highly significant as P < (less than one in a thousand chance of being wrong). The asterisk system avoids the woolly term "significant". Please note, however, that many statisticians do not like the asterisk rating system when it is used without showing P values. As a rule of thumb, if you can quote an exact P value then do. You might also want to refer to a quoted exact P value as an asterisk in text narrative or tables of contrasts elsewhere in a report.
3 At this point, a word about error. Type I error is the false rejection of the null hypothesis and type II error is the false acceptance of the null hypothesis. As an aid memoir: think that our cynical society rejects before it accepts. The significance level (alpha) is the probability of type I error. The power of a test is one minus the probability of type II error (beta). Power should be maximised when selecting statistical methods. If you want to estimate sample sizes then you must understand all of the terms mentioned here. The following table shows the relationship between power and error in hypothesis testing: DECISION TRUTH Accept H 0 Reject H 0 H 0 is true correct decision P type I error P 1-alpha alpha (significance) H 0 is false type II error P correct decision P beta 1-beta (power) H 0 = null hypothesis P = probability If you are interested in further details of probability and sampling theory at this point then please refer to one of the general texts listed in the reference section. You must understand confidence intervals if you intend to quote P values in reports and papers. Statistical referees of scientific journals expect authors to quote confidence intervals with greater prominence than P values. Notes about Type I error: is the incorrect rejection of the null hypothesis maximum probability is set in advance as alpha is not affected by sample size as it is set in advance increases with the number of tests or end points (i.e. do 20 tests and 1 is likely to be wrongly significant) Notes about Type II error: is the incorrect acceptance of the null hypothesis probability is beta beta depends upon sample size and alpha can't be estimated except as a function of the true population effect beta gets smaller as the sample size gets larger beta gets smaller as the number of tests or end points increases
4 How to Work with G-Power by an Example
5
6
7
8
9
10
11 Sample Size Determination in G-Power by Examples: In this section we prepared the input quantities needed for each goal of sample size determination and output by GPower is presented. For more details on statistical and technical issues of the computations see Faul et. al. (2007): 1) Exact Tests Exact - Correlation: Bivariate normal model Options: exact distribution Correlation ρ H1 = 0.3 Correlation ρ H0 = 0 Output: Lower critical r = Upper critical r = Total sample size = 115 Actual power = Exact - Linear multiple regression: Random model Options: Exact distribution H1 ρ² = 0.3 H0 ρ² = 0 Number of predictors = 3 Output: Lower critical R² = Upper critical R² = Total sample size = 49 Actual power = Exact - Proportion: Difference from constant (binomial test, one sample case)
12 Effect size g = 0.3 Constant proportion = 0.5 Output: Lower critical N = Upper critical N = Total sample size = 28 Actual power = Actual α = Exact - Proportions: Inequality, two dependent groups (McNemar) Options: approximation Odds ratio = 1.5 Prop discordant pairs = 0.3 Output: Lower critical N = 148 Upper critical N = 148 Total sample size = 894 Actual power = Actual α = Proportion p12 = Proportion p21 = Exact - Proportions: Inequality, two independent groups (Fisher's exact test) Options: Exact distribution Proportion p1 = 0.5 Proportion p2 = 0.6 Allocation ratio N2/N1 = 1 Output: Sample size group 1 = 557 Sample size group 2 = 557 Total sample size = 1114 Actual power = Actual α = Exact - Proportions: Inequality, two independent groups (unconditional) Options: z-test pooled
13 Odds ratio = Proportion p2 = 0.5 Allocation ratio N2/N1 = 1 Output: Sample size group 1 = 233 Sample size group 2 = 233 Total sample size = 466 Actual α = Actual power = Exact - Proportions: Inequality (offset), two independent groups (unconditional) Options: proportion, z-test pooled Prop. p1 H1 = 0.65 Prop. p1 H0 = 0.55 Proportion p2 = 0.5 Allocation ratio N2/N1 = 1 Output: Sample size group 1 = 524 Sample size group 2 = 524 Total sample size = 1048 Actual α = Actual power = Exact - Proportion: Sign test (binomial test) Effect size g = 0.3 Output: Lower critical N = Upper critical N = Total sample size = 28 Actual power = Actual α = Exact - Generic binomial test Proportion p2 = 0.8
14 Proportion p1 = 0.5 Output: Lower critical N = Upper critical N = Total sample size = 28 Actual power = Actual α = ) F tests F tests - ANCOVA: Fixed effects, main effects and interactions Input: Effect size f = 0.25 Numerator df = 10 Number of groups = 5 Number of covariates = 1 Output: Noncentrality parameter λ = Critical F = Denominator df = 394 Total sample size = 400 Actual power = F tests - ANOVA: Fixed effects, omnibus, one-way Input: Effect size f = 0.25 Number of groups = 5 Output: Noncentrality parameter λ = Critical F = Numerator df = 4 Denominator df = 300 Total sample size = 305 Actual power = F tests - ANOVA: Fixed effects, special, main effects and interactions Input: Effect size f = 0.25
15 Numerator df = 10 Number of groups = 5 Output: Noncentrality parameter λ = Critical F = Denominator df = 395 Total sample size = 400 Actual power = F tests - ANOVA: Repeated measures, between factors Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0.5 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 130 Total sample size = 132 Actual power = F tests - ANOVA: Repeated measures, within factors Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0.5 Nonsphericity correction ε = 1 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 102 Total sample size = 36 Actual power = F tests - ANOVA: Repeated measures, within-between interaction Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0.5
16 Nonsphericity correction ε = 1 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 102 Total sample size = 36 Actual power = F tests - Hotellings T²: One group mean vector Input: Effect size Δ = 0.15 Response variables = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 688 Total sample size = 690 Actual power = F tests - Hotellings T²: Two group mean vectors Input: Effect size Δ = 0.15 Allocation ratio N2/N1 = 1 Response variables = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 2747 Sample size group 1 = 1375 Sample size group 2 = 1375 Actual power = F tests - MANOVA: Global effects Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f²(v) = 0.25 Number of groups = 3 Response variables = 2 Output: Noncentrality parameter λ = Critical F =
17 Numerator df = Denominator df = Total sample size = 42 Actual power = Pillai V = F tests - MANOVA: Special effects and interactions Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f²(v) = 0.25 Number of groups = 6 Number of predictors = 3 Response variables = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = Total sample size = 46 Actual power = Pillai V = F tests - MANOVA: Repeated measures, between factors Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f = 0.25 Number of groups = 2 Number of measurements = 4 Corr among rep measures = 0 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = Total sample size = 54 Actual power = Pillai V = F tests - MANOVA: Repeated measures, within factors Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f = 0.25 Number of groups = 2
18 Number of measurements = 4 Corr among rep measures = 0 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = Total sample size = 74 Actual power = Pillai V = F tests - MANOVA: Repeated measures, within-between interaction Options: Pillai V, O'Brien-Shieh Algorithm Input: Effect size f(v) = 0.25 Number of groups = 2 Number of measurements = 4 Output: Noncentrality parameter λ = Critical F = Numerator df = Denominator df = 275 Total sample size = 279 Actual power = Pillai V = F tests - Linear multiple regression: Fixed model, R² deviation from zero Input: Effect size f² = 0.15 Number of predictors = 2 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 104 Total sample size = 107 Actual power = F tests - Linear multiple regression: Fixed model, R² increase Input: Effect size f² = 0.15
19 Number of tested predictors = 2 Total number of predictors = 5 Output: Noncentrality parameter λ = Critical F = Numerator df = 2 Denominator df = 101 Total sample size = 107 Actual power = F tests - Variance: Test of equality (two sample case) Ratio var1/var0 = 1.5 Allocation ratio N2/N1 = 1 Output: Lower critical F = Upper critical F = Numerator df = 265 Denominator df = 265 Sample size group 1 = 266 Sample size group 2 = 266 Actual power = F tests - Generic F test Analysis: Compromise: Compute α and β Input: Noncentrality parameter λ = 30 β/α ratio = 1 Numerator df = 10 Denominator df = 20 Output: Critical F = α err prob = β err prob = Power (1-β err prob) =
20 3) t tests t tests - Correlation: Point biserial model Effect size ρ = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = 98 Total sample size = 100 Actual power = t tests - Linear bivariate regression: One group, size of slope Slope H1 = 0.15 α err prob = Power (1-β err prob) = Slope H0 = 0 Std dev σ_x = 1 Std dev σ_y = 1 Output: Noncentrality parameter δ = Critical t = Df = 98 Total sample size = 100 Actual power = t tests - Linear bivariate regression: Two groups, difference between intercepts Δ intercept = 2 Allocation ratio N2/N1 = 1 Std dev residual σ = 2 Std dev σ_x1 = 4 Std dev σ_x2 = 3 Mean μ_x1 = 5 Mean μ_x2 = 7 Output: Noncentrality parameter δ = Critical t = Df = 194
21 Sample size group 1 = 99 Sample size group 2 = 99 Total sample size = 198 Actual power = t tests - Linear bivariate regression: Two groups, difference between slopes Δ slope = α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Std dev residual σ = 0.5 Std dev σ_x1 = 5 Std dev σ_x2 = 10 Output: Noncentrality parameter δ = Critical t = Df = 98 Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = t tests - Linear multiple regression: Fixed model, single regression coefficient Effect size f² = 0.15 α err prob = Power (1-β err prob) = Number of predictors = 2 Output: Noncentrality parameter δ = Critical t = Df = 97 Total sample size = 100 Actual power = t tests - Means: Difference between two dependent means (matched pairs) Effect size dz = 0.5 α err prob = Power (1-β err prob) =
22 Output: Noncentrality parameter δ = Critical t = Df = 100 Total sample size = 101 Actual power = t tests - Means: Difference between two independent means (two groups) Effect size d = 0.5 α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Output: Noncentrality parameter δ = Critical t = Df = 100 Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = t tests - Means: Difference from constant (one sample case) Effect size d = 0.5 α err prob = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = 100 Total sample size = 101 Actual power = t tests - Means: Wilcoxon signed-rank test (matched pairs) Options: A.R.E. method Parent distribution = Normal Effect size dz = 0.5 α err prob = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = Total sample size = 101 Actual power =
23 t tests - Means: Wilcoxon signed-rank test (one sample case) Options: A.R.E. method Parent distribution = Normal Effect size d = 0.5 α err prob = Power (1-β err prob) = Output: Noncentrality parameter δ = Critical t = Df = Total sample size = 101 Actual power = t tests - Means: Wilcoxon-Mann-Whitney test (two groups) Options: A.R.E. method Parent distribution = Normal Effect size d = 0.5 α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Output: Noncentrality parameter δ = Critical t = Df = Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = t tests - Generic t test Analysis: Compromise: Compute α and β Noncentrality parameter δ = 10 β/α ratio = 1 Df = 1 Output: Critical t = α err prob = β err prob = Power (1-β err prob) =
24 4) χ² tests χ² tests - Goodness-of-fit tests: Contingency tables Input: Effect size w = 0.3 α err prob = Power (1-β err prob) = Df = 5 Output: Noncentrality parameter λ = Critical χ² = Total sample size = 100 Actual power = χ² tests - Variance: Difference from constant (one sample case) Ratio var1/var0 = 1.5 α err prob = Power (1-β err prob) = Output: Lower critical χ² = Upper critical χ² = Df = 99 Total sample size = 100 Actual power = χ² tests - Generic χ² test Analysis: Compromise: Compute α and β Input: Noncentrality parameter λ = 20 β/α ratio = 1 Df = 5 Output: Critical χ² = α err prob = β err prob =
25 5) z tests z tests - Correlation: Tetrachoric model Options: Exact r H1 corr ρ = 0.1 α err prob = Power (1-β err prob) = H0 corr ρ = -0.1 Marginal prop x = 0.6 Marginal prop y = 0.3 Output: Critical z = Total sample size = 100 Actual power = H1 corr ρ = H0 corr ρ = Critical r lwr = Critical r upr = Std err r = z tests - Correlations: Two dependent Pearson r's (common index) H1 corr ρ_ac = -0.1 α err prob = Power (1-β err prob) = H0 corr ρ_ab = 0.1 Corr ρ_bc = -0.1 Output: Critical z = Sample size = 101 Actual power = z tests - Correlations: Two dependent Pearson r's (no common index) H1 corr ρ_cd = 0.15 α err prob = Power (1-β err prob) = H0 corr ρ_ab = 0.1 Corr ρ_ac = 0.7 Corr ρ_ad = 0.6 Corr ρ_bc = 0.4 Corr ρ_bd = 0.41
26 Output: Critical z = Sample size = 100 Actual power = z tests - Correlations: Two independent Pearson r's Effect size q = 0.6 α err prob = Power (1-β err prob) = Allocation ratio N2/N1 = 1 Output: Critical z = Sample size group 1 = 51 Sample size group 2 = 51 Total sample size = 102 Actual power = z tests - Logistic regression Options: Large sample z-test, Demidenko (2007) with var corr Odds ratio = 1.3 Pr(Y=1 X=1) H0 = 0.2 α err prob = Power (1-β err prob) = R² other X = 0 X distribution = Normal X parm μ = 0 X parm σ = 1 Output: Critical z = Total sample size = 101 Actual power = z tests - Poisson regression Options: Large sample z-test, Demidenko (2007) with var corr Exp(β1) = 1.3 Base rate exp(β0) = 0.85 Mean exposure = 1 R² other X = 0 X distribution = Normal X parm μ = 0 X parm σ = 1 Output: Critical z =
27 Total sample size = 179 Actual power = z tests - Proportions: Difference between two independent proportions Proportion p2 = 0.6 Proportion p1 = 0.5 Allocation ratio N2/N1 = 1 Output: Critical z = Sample size group 1 = 533 Sample size group 2 = 533 Total sample size = 1066 Actual power = z tests - Generic z test Analysis: Compromise: Compute α and β Noncentrality parameter μ = 3 Noncentral dist. SD σ = 1 β/α ratio = 1 Output: Critical z = α err prob = β err prob = Power (1-β err prob) = Sample Size Determination in STATA by Examples: 1) Comparing survival in groups
28
29 . stpower logrank Estimated sample sizes for two-sample comparison of survivor functions Log-rank test, Freedman method Ho: S1(t) = S2(t) Input parameters: alpha = (two sided) hratio = power =
30 p1 = Estimated number of events and sample sizes: E = 72 N = 72 N1 = 36 N2 = 36 2) Cox Regression
31
32 . stpower cox, wdprob(.4) Estimated sample size for Cox PH regression Wald test, log-hazard metric Ho: [b1, b2,..., bp] = [0, b2,..., bp] Input parameters: alpha = (two sided) b1 = sd = power = withdrawal(%) = 40.00
33 Estimated number of events and sample size: E = 66 N = 109 Sample Size Determination in PASS by Examples: Some web based sample size calculators: Based on Power: alculators.aspx Based on CI
34 References Armitage P, Berry G, Matthews JNS. Statistical Methods in Medical Research (4th edition). Oxford: Blackwell Science Bland M. An Introduction to Medical Statistics (3rd edition). Oxford Medical Publications Casagrande JT, Pike MC, Smith PG. An improved approximate formula for calculating sample sizes for comparing two binomial distributions. Biometrics 1978;34: Donner A, Eliasziw M. A goodness of fit approach to inference procedures for the Kappa statistic: CI construction, significance testing and sample size estimation. Statistics in Medicine 1992;11: Dupont WD. Power calculations for matched case-control studies. Biometrics 1988;44: Dupont WD. Power and sample size calculations. Controlled Clinical Trials 1990;11: Fleiss JL. Statistical Methods for Rates and Proportions (2nd edition). New York: Wiley Gardner MJ, Altman DG. Statistics with Confidence - Confidence Intervals and Statistical Guidelines. British Medical Journal Pearson & Hartley. Biometrika tables for statisticians (Volumes I and II, 3rd edition). Cambridge University Press Franz, Erdfelder, Lang and Buchner. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods 2007, 39 (2),
Statistics in Medicine Research Lecture Series CSMC Fall 2014
Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power
More informationIntroduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationSIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.
SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation
More informationStatistical power analyses using G*Power 3.1: Tests for correlation and regression analyses
Behavior Research Methods 2009, 41 (4), 1149-1160 doi:10.3758/brm.41.4.1149 Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses Franz Faul Christian-Albrechts-Universität,
More informationSPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
More informationPoint Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
More informationStudy Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationPrinciples of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions
More informationTwo-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
More informationUNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
More informationConsider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.
Consider a study in which How many subjects? The importance of sample size calculations Office of Research Protections Brown Bag Series KB Boomer, Ph.D. Director, boomer@stat.psu.edu A researcher conducts
More informationLesson 1: Comparison of Population Means Part c: Comparison of Two- Means
Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationBasic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
More informationIntroduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
More informationSAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More informationCome scegliere un test statistico
Come scegliere un test statistico Estratto dal Capitolo 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc. (disponibile in Iinternet) Table
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationTwo-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationTests for Two Proportions
Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics
More informationfor the social, behavioral, and biomedical sciences. Behavior Research Methods, 39, 175-191.
Example of a Statistical Power Calculation (p. 206) Photocopiable This example uses the statistical power software package G*Power 3. I am grateful to the creators of the software for giving their permission
More informationGeneralized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
More informationTests for Two Survival Curves Using Cox s Proportional Hazards Model
Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.
More informationQUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS
QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationSUMAN DUVVURU STAT 567 PROJECT REPORT
SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.
More informationOutline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
More informationStudy Design Sample Size Calculation & Power Analysis. RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles
Study Design Sample Size Calculation & Power Analysis RCMAR/CHIME April 21, 2014 Honghu Liu, PhD Professor University of California Los Angeles Contents 1. Background 2. Common Designs 3. Examples 4. Computer
More informationSample Size Planning, Calculation, and Justification
Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationPearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
More informationNonparametric Statistics
Nonparametric Statistics J. Lozano University of Goettingen Department of Genetic Epidemiology Interdisciplinary PhD Program in Applied Statistics & Empirical Methods Graduate Seminar in Applied Statistics
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationTwo Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
More informationData Analysis, Research Study Design and the IRB
Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB
More informationOrdinal Regression. Chapter
Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe
More informationNONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)
NONPARAMETRIC STATISTICS 1 PREVIOUSLY parametric statistics in estimation and hypothesis testing... construction of confidence intervals computing of p-values classical significance testing depend on assumptions
More informationX X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
More informationSPSS Tests for Versions 9 to 13
SPSS Tests for Versions 9 to 13 Chapter 2 Descriptive Statistic (including median) Choose Analyze Descriptive statistics Frequencies... Click on variable(s) then press to move to into Variable(s): list
More informationGuide to Microsoft Excel for calculations, statistics, and plotting data
Page 1/47 Guide to Microsoft Excel for calculations, statistics, and plotting data Topic Page A. Writing equations and text 2 1. Writing equations with mathematical operations 2 2. Writing equations with
More informationTHE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.
THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM
More informationHow To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
More informationNon-Inferiority Tests for Two Means using Differences
Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationMultivariate Logistic Regression
1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation
More informationLAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
More informationInternational Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationSample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
More informationIntroduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
More informationSome Essential Statistics The Lure of Statistics
Some Essential Statistics The Lure of Statistics Data Mining Techniques, by M.J.A. Berry and G.S Linoff, 2004 Statistics vs. Data Mining..lie, damn lie, and statistics mining data to support preconceived
More informationFinal Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
More informationInterpretation of Somers D under four simple models
Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms
More informationBinary Diagnostic Tests Two Independent Samples
Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary
More informationNCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
More informationMulticollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015
Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Stata Example (See appendices for full example).. use http://www.nd.edu/~rwilliam/stats2/statafiles/multicoll.dta,
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationBowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application
More informationBiostatistics: Types of Data Analysis
Biostatistics: Types of Data Analysis Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa A Scott, MS
More informationHow to get accurate sample size and power with nquery Advisor R
How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007 Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover
More informationNon-Inferiority Tests for One Mean
Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random
More informationIndependent t- Test (Comparing Two Means)
Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationHypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationNon-Inferiority Tests for Two Proportions
Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which
More informationResearch Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
More informationStatistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
More informationCalculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)
1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationCategorical Data Analysis
Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods
More informationA POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment
More informationEconometrics and Data Analysis I
Econometrics and Data Analysis I Yale University ECON S131 (ONLINE) Summer Session A, 2014 June 2 July 4 Instructor: Doug McKee (douglas.mckee@yale.edu) Teaching Fellow: Yu Liu (dav.yu.liu@yale.edu) Classroom:
More information"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1
BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine
More informationThe Statistics Tutor s Quick Guide to
statstutor community project encouraging academics to share statistics support resources All stcp resources are released under a Creative Commons licence The Statistics Tutor s Quick Guide to Stcp-marshallowen-7
More informationDATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More informationCOMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
More informationAnalysis of Variance. MINITAB User s Guide 2 3-1
3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced
More information6 Variables: PD MF MA K IAH SBS
options pageno=min nodate formdlim='-'; title 'Canonical Correlation, Journal of Interpersonal Violence, 10: 354-366.'; data SunitaPatel; infile 'C:\Users\Vati\Documents\StatData\Sunita.dat'; input Group
More informationSyntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see. level(#) , options2
Title stata.com ttest t tests (mean-comparison tests) Syntax Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see One-sample t test ttest varname
More informationService courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More informationPermutation Tests for Comparing Two Populations
Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of
More informationLecture Notes Module 1
Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific
More informationConfidence Intervals for the Difference Between Two Means
Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means
More informationWeek TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480
1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500
More informationLogit Models for Binary Data
Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis. These models are appropriate when the response
More information