Sample size and power analysis

Size: px
Start display at page:

Download "Sample size and power analysis"

Transcription

1 Sample size and power analysis You should have at least 80% power of your study (Anonymous) Sample size conversation Two scenarios A researcher conducted a study comparing the effect of an intervention vs. placebo on reducing body weight, and found 5 lbs reduction among the intervention group with P=0.01. Another researcher conducted a similar study comparing the effect of the same intervention vs. the same placebo on reducing body weight, and found the same 5 lbs reduction with the intervention group but could not claim that the intervention was effective because P=0.35. What do you think the crying researcher did differently from the smiling one? Source: Question (1) to Statistician Question: How can I make my P-value smaller? Enroll as many as you can. Answer (1) to Researcher You almost always need to estimate a required sample size or estimate analytical power given a sample size when you are planning a study. Only exception may be a pilot study (a smaller study to show feasibility, or to collect data to plan a larger study). Through this process, you can avoid wasting your efforts and resources conducting studies that are hopeless to begin with. 1

2 Question (2) to Statistician Question: Can I keep enrolling participants into my study until I observe P<0.05? Answer (2) Researcher: Absolutely NOT Question (3) to Statistician If only I had a cent for every time I was asked How many participants do I need for my study? Answer (3) to Researcher Answer (3) to Researcher The purpose of sample size formulae is not to give an exact number but rather to subject the study design to scrutiny, including an assessment of the validity and reliability of data collection, and to give an estimate to distinguish whether tens, hundreds, or thousands of participants are required It does not seem an easy question, like How much money should I take on my holidays? Williamson et al. (2000) JRSSA 163(1): 5-13 Statistics lecture There is no such thing as a sample size problem. Sample size is but one aspect of study design. When you are asked to help determine the sample size a lot of questions must be asked and answered before you get to that one.you may often end up never discussing sample size because there are other matters that override it in importance. Russell Lenth (2001) sample size is dependent upon not only on the desired power but also the true variability in the population and a specification of a practically significant effect size Question (3) to Statistician Question: How to play with these terms? 2

3 Session Start 1 hour session Ingradiants 1- Δ Type I error: The probability of erroneously rejecting the H0. (Conclude that there is an effect, when in fact there is no effect. Type II error: The probability of erroneously failing to reject the H0. (Conclude that there is no effect, when in fact there is an effect.) Power: The chance of correctly identify H1 (Conclude that there is an effect, when in fact there is an effect) Effect: Significant difference of body weight between intervention and placebo groups Significance level () First type of error : Conclude that there is an effect, when in fact there is no effect., Significance level The level of your test is the probability that you will falsely conclude that the program has an effect, when in fact it does not. So with a level of 5%, you can be 95% confident in the validity of your conclusion that the program had an effect For policy purpose, you want to be very confident in the answer you give: the level will be set fairly low. Common level of : 5%, 10%, 1%. Purpose of power analysis 1-, Power Power analyses need to be conducted to ensure adequate sample size to detect a meaningful effect of your intervention 3

4 Interpretation A power of 80% tells us that, in 80% of the experiments of this sample size conducted in this population, if there is indeed an effect in the population, we will be able to say in our sample that there is an effect with the level of confidence desired. The larger the sample, the larger the power. Common Power used: 80%, 90% Visual concept of power Null Distribution: difference=0. Power= chance of being in the rejection region if the Clinically relevant alternative is true=area to the alternative: right of this line (in yellow) difference=10%. Rejection region. Any value >= 6.5 (0+3.3*1.96) For 5% significance level, one-tail area=2.5% (Z /2 = 1.96) Visual concept of power Power here: P(Z > ) = 3.3 P(Z > 1.06) = 85% Rejection region. Any value >= 6.5 (0+3.3*1.96) Power= chance of being in the rejection region if the alternative is true=area to the right of this line (in yellow) Is power analysis always needed? Needed when: Designing a study Applying for grant Less needed when: Secondary data analysis Pilot study to assess effect A priori power anlysis You want to find how many cases you will need to have a specified amount of power given a specified effect size the criterion of significance to be employed A posteriori power analysis You want to find out what power would be for a specified effect size sample size the criterion of significance to be employed 4

5 Effect size, Effect sizes A descriptive metric that characterizes the standardized difference (in SD units) between the mean of a control group and the mean of a treatment group (intervention) Can also be calculated from correlational data derived from pre-experimental designs or from repeated measures designs Sources of finding effect size Unstandardized effect size On the basis of previous research Meta-Analysis: Reviewing the previous literature and calculating the previously observed effect size (in the same and/or similar situations) Pilot study When no prior studies exist for which one can extrapolate an ES, it is often appropriate to conduct a small study with participants in order to get an initial estimate of the effect size On the basis of theoretical importance Deciding whether a small, medium or large effect is required. USA = s D = 0.01 s GB = s Smallest size that would be clinically meaningful. Standardized effect size Zero effect size The standard deviation captures the variability in the outcome. The more variability, the higher the standard deviation is Control Group = 0.00 Intervention Group The Standardized effect size is the effect size divided by the standard deviation of the outcome Overlapping Distributions = effect size/standard deviation = 0.00 means that the average treatment participant outperformed 0% of the control participants 5

6 Moderate effect size Large effect size = 0.40 = 0.85 Control Group Treatment Group Control Group Intervention Condition = 0.40 means that the average treatment participant outperformed 65% of the control participants = 0.85 means that the average treatment participant outperformed 80% of the control participants Attrition rate Study design Measurement of outcome Attrition rate If study is longitudinal or intervention study need to adjust sample size by attrition rate Get attrition estimates from pilot studies or the literature of studies in the same population Default estimate would be 20% Do power calculation and then adjust sample size Final N=(N from Power estimate)/(1-attrition rate) Example: 20% attrition rate Power analysis yields total sample size of 100 Targeted N=??? Study design Different designs have different power distributions and considerations Regression type design different than 2 x 2 ANOVA Longitudinal vs. cross-sectional designs Some designs harder to find power programs than others - Longitudinal - Nested/clustered designs - Dichotomous and categorical outcomes Keep in mind aim of study and not just design Measurement of outcome Level of measurement of outcome can have some influence on power estimates Differences in means - Ex: Intervention study looking at differences in depression using CESD Differences in proportions - Ex: Intervention study looking at differences in depression dx Power done for primary outcome If several important outcomes, conduct power for all and select sample size so that power is at least.80 for all outcomes 6

7 Factors needed for sample sizes Inter-relationship Power Size of the effect - Study design - Measurement of outcome Significance level desired Attrition Inter-relatioship n Sample size Standard case Sampling P(T) distribution if H 0 is true alpha 0.05 if H A is true Power 1- Δ Effect size Significance level POWER = 1 - Effect Size Increased Decreased if HP(T) 0 is true alpha 0.1 if H A is true if HP(T) 0 is true alpha 0.01 if H A is true POWER = 1 - POWER = 1 - T T 7

8 PAPER SECTION Item Description TITLE & ABSTRACT 1 How participants were allocated to interventions (e.g., "random allocation", "randomized", or "randomly assigned"). INTRODUCTION Background 2 Scientific background and explanation of rationale. METHODS Participants 3 Eligibility criteria for participants and the settings and locations where the data were collected. Interventions 4 Precise details of the interventions intended for each group and how and when they were actually administered. Objectives 5 Specific objectives and hypotheses. Outcomes 6 Clearly defined primary and secondary outcome measures and, when applicable, any methods used to enhance the quality of measurements (e.g., multiple observations, training of assessors). Sample size 7 How sample size was determined and, when applicable, explanation of any interim analyses and stopping rules. Randomization -- Sequence generation Randomization -- Allocation concealment Randomization -- Implementation 8 Method used to generate the random allocation sequence, including details of any restriction (e.g., blocking, stratification). 9 Method used to implement the random allocation sequence (e.g., numbered containers or central telephone), clarifying whether the sequence was concealed until interventions were assigned. 10 Who generated the allocation sequence, who enrolled participants, and who assigned participants to their groups. Blinding (masking) 11 Whether or not participants, those administering the interventions, and those assessing the outcomes were blinded to group assignment. When relevant, how the success of blinding was evaluated. Statistical methods 12 Statistical methods used to compare groups for primary outcome(s); Methods for additional analyses, such as subgroup analyses and adjusted analyses. RESULTS Participant flow 13 Flow of participants through each stage (a diagram is strongly recommended). Specifically, for each group report the numbers of participants randomly assigned, receiving intended treatment, completing the study protocol, and analyzed for the primary outcome. Describe protocol deviations from study as planned, together with reasons. Recruitment 14 Dates defining the periods of recruitment and follow-up. Baseline data 15 Baseline demographic and clinical characteristics of each group. Numbers analyzed 16 Number of participants (denominator) in each group included in each analysis and whether the analysis was by "intention -to-treat". State the results in absolute numbers when feasible (e.g., 10/20, not 50%). Outcomes and estimation 17 For each primary and secondary outcome, a summary of results for each group, and the estimated effect size and its precision (e.g., 95% confidence interval). Ancillary analyses 18 Address multiplicity by reporting any other analyses performed, including subgroup analyses and adjusted analyses, indicating those pre-specified and those exploratory. Adverse events 19 All important adverse events or side effects in each intervention group. DISCUSSION Interpretation 20 Interpretation of the results, taking into account study hypotheses, sources of potential bias or imprecision and the dangers associated with multiplicity of analyses and outcomes. Generalizability 21 Generalizability (external validity) of the trial findings. Overall evidence 22 General interpretation of the results in the context of current evidence. 14/06/2012 Increased n Increased if HP(T) 0 is true alpha 0.05 if H A is true Sampling P(T) distribution if H 0 is true alpha 0.05 if H A is true POWER = 1 - POWER = 1 - T Effect Size T Key points The power of a statistical test is influenced by the Sample size (n) n power Significance level (α) α power Difference (effect) to be detected (Δ) Δ power Variation in the outcome (σ 2 ) σ 2 power Key points What we need Where we get it Significance level This is often conventionally set at 5%. The lower it is, the larger the sample size needed for a given power The mean and the variability of the outcome in the comparison group The effect size that we want to detect From previous surveys conducted in similar settings. The larger the variability is, the larger the sample for a given power What is the smallest effect that should prompt a policy response? The smaller the effect size, the larger a sample size we need for a given power CONSORT 22-point checklist Scientific rationale Patient population Reporting power and sample size Sample size Study designs & methods Patient flow Statistical analysis & results Interpretation 8

9 Reporting power Reality and scientific validity Reality vs. scientific validity Resources Reality: Resources n Scientific validity: Sample size formulae Number of available participants Laboratory resources Diagnostic tests, training program etc. if needed Time you have available Set by funding agency Set by your career trajectory Funds and personnel Example Estimation of sample size comparing 2 group means (independent sample t-test): Comparing post trial values Example and software A Clinician wants to conduct RCT to assess the effect of an intervention to reduce HbA1c level among patients with type 2 diabetes. A pilot data suggests that mean HbA1c level among patients without this intervention is 8.7% with standard deviation of 2.2%. We believe that the intervention will decrease patient s HbA1c level by 1%. A total of 154 patients (77 patients in each group) are needed to achieve 80% power at two-sided 5% significance level. 9

10 Example Select the appropriate statistical test, based on the types of outcome measures. Determine the minimum effect size. For continuous outcomes, estimate the standard deviation. For dichotomous outcomes, estimate the baseline risk or incidence/ prevalence of the event. Set limits for Type I (α) and Type II (β) error. Specify your null hypothesis and alternative hypothesis (1- tailed or 2-tailed). Parameters needed for sample size computation, Significance level = 5% (2 sided) 1-, Power = 80%, Effect size = 1, Variability = 2.2 m, sample size ratio between the two groups = 1 Software G*Power 3 G*Power: PS: WinPepi: Russ Lenth: Epi Info: PASS: Determine the effect size Determine sample size Click on Determine Select n1=n2 for equal sample size Calculate and transfer to main window 154 patients 77 in each group 10

11 Determine power Sample sizes vs. power Achieved 80% power for 154 patients Conclusion Crying researcher understood 80% (CI 70%-90%) what he needs for Better understanding Good knowledge of of study design outcome measure? Good statistical approach Smaller p-value Optimum sample size Greater power to detect a true difference! If in doubt Call Biostatisticians!!!! FCEB (Flinders Centre for Epidemiology and Biostatistics) Discipline of General Practice Level 3 Health Sciences Building Flinders Medical Centre Don t miss FCEB Launch!!!! 3:00 PM today Rooms , Health Science Lecture Theatre Complex 11

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text)

Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text) Biostat Methods STAT 5820/6910 Handout #6: Intro. to Clinical Trials (Matthews text) Key features of RCT (randomized controlled trial) One group (treatment) receives its treatment at the same time another

More information

Guidelines for AJO-DO submissions: Randomized Clinical Trials June 2015

Guidelines for AJO-DO submissions: Randomized Clinical Trials June 2015 Guidelines for AJO-DO submissions: Randomized Clinical Trials June 2015 Complete and transparent reporting allows for accurate assessment of the quality of trial and correct interpretation of the trial

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

Statistics in Medicine Research Lecture Series CSMC Fall 2014

Statistics in Medicine Research Lecture Series CSMC Fall 2014 Catherine Bresee, MS Senior Biostatistician Biostatistics & Bioinformatics Research Institute Statistics in Medicine Research Lecture Series CSMC Fall 2014 Overview Review concept of statistical power

More information

Sample Size Planning, Calculation, and Justification

Sample Size Planning, Calculation, and Justification Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott Theresa

More information

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Sample Practice problems - chapter 12-1 and 2 proportions for inference - Z Distributions Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide

More information

"Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1

Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals. 1 BASIC STATISTICAL THEORY / 3 CHAPTER ONE BASIC STATISTICAL THEORY "Statistical methods are objective methods by which group trends are abstracted from observations on many separate individuals." 1 Medicine

More information

Chapter 3. Sampling. Sampling Methods

Chapter 3. Sampling. Sampling Methods Oxford University Press Chapter 3 40 Sampling Resources are always limited. It is usually not possible nor necessary for the researcher to study an entire target population of subjects. Most medical research

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs Intervention or Policy Evaluation Questions Design Questions Elements Types Key Points Introduction What Is Evaluation Design? Connecting

More information

Inclusion and Exclusion Criteria

Inclusion and Exclusion Criteria Inclusion and Exclusion Criteria Inclusion criteria = attributes of subjects that are essential for their selection to participate. Inclusion criteria function remove the influence of specific confounding

More information

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form. General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power

Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power Power Analysis: Intermediate Course in the UCLA Statistical Consulting Series on Power By Jason C. Cole, PhD QualityMetric, Inc. Senior Consulting Scientist jcole@qualitymetric.com 310-539-2024 Consulting

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

Clinical Study Design and Methods Terminology

Clinical Study Design and Methods Terminology Home College of Veterinary Medicine Washington State University WSU Faculty &Staff Page Page 1 of 5 John Gay, DVM PhD DACVPM AAHP FDIU VCS Clinical Epidemiology & Evidence-Based Medicine Glossary: Clinical

More information

Sampling. COUN 695 Experimental Design

Sampling. COUN 695 Experimental Design Sampling COUN 695 Experimental Design Principles of Sampling Procedures are different for quantitative and qualitative research Sampling in quantitative research focuses on representativeness Sampling

More information

Organizing Your Approach to a Data Analysis

Organizing Your Approach to a Data Analysis Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize

More information

WHAT IS A JOURNAL CLUB?

WHAT IS A JOURNAL CLUB? WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club

More information

Descriptive Methods Ch. 6 and 7

Descriptive Methods Ch. 6 and 7 Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational

More information

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION

IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Current Topic IS 30 THE MAGIC NUMBER? ISSUES IN SAMPLE SIZE ESTIMATION Sitanshu Sekhar Kar 1, Archana Ramalingam 2 1Assistant Professor; 2 Post- graduate, Department of Preventive and Social Medicine,

More information

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL)

PEER REVIEW HISTORY ARTICLE DETAILS TITLE (PROVISIONAL) PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (http://bmjopen.bmj.com/site/about/resources/checklist.pdf)

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University

Survey Research: Choice of Instrument, Sample. Lynda Burton, ScD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities.

Consider a study in which. How many subjects? The importance of sample size calculations. An insignificant effect: two possibilities. Consider a study in which How many subjects? The importance of sample size calculations Office of Research Protections Brown Bag Series KB Boomer, Ph.D. Director, boomer@stat.psu.edu A researcher conducts

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Correlational Research

Correlational Research Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

More information

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1 BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Use advanced techniques for summary and visualization of complex data for exploratory analysis and presentation.

Use advanced techniques for summary and visualization of complex data for exploratory analysis and presentation. MS Biostatistics MS Biostatistics Competencies Study Development: Work collaboratively with biomedical or public health researchers and PhD biostatisticians, as necessary, to provide biostatistical expertise

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Study Designs. Simon Day, PhD Johns Hopkins University

Study Designs. Simon Day, PhD Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

Improving reporting in randomised trials: CONSORT statement and extensions Doug Altman

Improving reporting in randomised trials: CONSORT statement and extensions Doug Altman Improving reporting in randomised trials: CONSORT statement and extensions Doug Altman Centre for Statistics in Medicine University of Oxford Whatever the outcome of a study, it is really hard for the

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

Sample size estimation is an important concern

Sample size estimation is an important concern Sample size and power calculations made simple Evie McCrum-Gardner Background: Sample size estimation is an important concern for researchers as guidelines must be adhered to for ethics committees, grant

More information

1.0 Abstract. Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA. Keywords. Rationale and Background:

1.0 Abstract. Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA. Keywords. Rationale and Background: 1.0 Abstract Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA Keywords Rationale and Background: This abbreviated clinical study report is based on a clinical surveillance

More information

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1 Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there

More information

The Cross-Sectional Study:

The Cross-Sectional Study: The Cross-Sectional Study: Investigating Prevalence and Association Ronald A. Thisted Departments of Health Studies and Statistics The University of Chicago CRTP Track I Seminar, Autumn, 2006 Lecture Objectives

More information

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as... HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

More information

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses Introduction to Hypothesis Testing 1 Hypothesis Testing A hypothesis test is a statistical procedure that uses sample data to evaluate a hypothesis about a population Hypothesis is stated in terms of the

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Chapter Eight: Quantitative Methods

Chapter Eight: Quantitative Methods Chapter Eight: Quantitative Methods RESEARCH DESIGN Qualitative, Quantitative, and Mixed Methods Approaches Third Edition John W. Creswell Chapter Outline Defining Surveys and Experiments Components of

More information

Nursing Journal Toolkit: Critiquing a Quantitative Research Article

Nursing Journal Toolkit: Critiquing a Quantitative Research Article A Virtual World Consortium: Using Second Life to Facilitate Nursing Journal Clubs Nursing Journal Toolkit: Critiquing a Quantitative Research Article 1. Guidelines for Critiquing a Quantitative Research

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other 1 Hypothesis Testing Richard S. Balkin, Ph.D., LPC-S, NCC 2 Overview When we have questions about the effect of a treatment or intervention or wish to compare groups, we use hypothesis testing Parametric

More information

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Online 12 - Sections 9.1 and 9.2-Doug Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Fixed-Effect Versus Random-Effects Models

Fixed-Effect Versus Random-Effects Models CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Rapid Critical Appraisal of Controlled Trials

Rapid Critical Appraisal of Controlled Trials Rapid Critical Appraisal of Controlled Trials Carl Heneghan Dept of Primary Health Care University of Oxford November 23rd 2009 Five steps in EBM 1. Formulate an answerable question 2. Track down the best

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

In an experimental study there are two types of variables: Independent variable (I will abbreviate this as the IV)

In an experimental study there are two types of variables: Independent variable (I will abbreviate this as the IV) 1 Experimental Design Part I Richard S. Balkin, Ph. D, LPC-S, NCC 2 Overview Experimental design is the blueprint for quantitative research and serves as the foundation of what makes quantitative research

More information

Missing data in randomized controlled trials (RCTs) can

Missing data in randomized controlled trials (RCTs) can EVALUATION TECHNICAL ASSISTANCE BRIEF for OAH & ACYF Teenage Pregnancy Prevention Grantees May 2013 Brief 3 Coping with Missing Data in Randomized Controlled Trials Missing data in randomized controlled

More information

Mary B Codd. MD, MPH, PhD, FFPHMI UCD School of Public Health, Physiotherapy & Pop. Sciences

Mary B Codd. MD, MPH, PhD, FFPHMI UCD School of Public Health, Physiotherapy & Pop. Sciences HRB / CSTAR Grant Applications Training Day Convention Centre Dublin, 9 th September 2010 Key Elements of a Research Protocol Mary B Codd. MD, MPH, PhD, FFPHMI UCD School of Public Health, Physiotherapy

More information

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.

SAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population. SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations

More information

Planning sample size for randomized evaluations

Planning sample size for randomized evaluations TRANSLATING RESEARCH INTO ACTION Planning sample size for randomized evaluations Simone Schaner Dartmouth College povertyactionlab.org 1 Course Overview Why evaluate? What is evaluation? Outcomes, indicators

More information

Principles of Hypothesis Testing for Public Health

Principles of Hypothesis Testing for Public Health Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine johnslau@mail.nih.gov Fall 2011 Answers to Questions

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

www.rmsolutions.net R&M Solutons

www.rmsolutions.net R&M Solutons Ahmed Hassouna, MD Professor of cardiovascular surgery, Ain-Shams University, EGYPT. Diploma of medical statistics and clinical trial, Paris 6 university, Paris. 1A- Choose the best answer The duration

More information

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp. 394-398, 404-408, 410-420 1. Which of the following will increase the value of the power in a statistical test

More information

Hypothesis Testing --- One Mean

Hypothesis Testing --- One Mean Hypothesis Testing --- One Mean A hypothesis is simply a statement that something is true. Typically, there are two hypotheses in a hypothesis test: the null, and the alternative. Null Hypothesis The hypothesis

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

Introduction to Hypothesis Testing

Introduction to Hypothesis Testing I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

Course Overview Lean Six Sigma Green Belt

Course Overview Lean Six Sigma Green Belt Course Overview Lean Six Sigma Green Belt Summary and Objectives This Six Sigma Green Belt course is comprised of 11 separate sessions. Each session is a collection of related lessons and includes an interactive

More information

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA

Basic of Epidemiology in Ophthalmology Rajiv Khandekar. Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA Basic of Epidemiology in Ophthalmology Rajiv Khandekar Presented in the 2nd Series of the MEACO Live Scientific Lectures 11 August 2014 Riyadh, KSA Basics of Epidemiology in Ophthalmology Dr Rajiv Khandekar

More information

Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control

Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control Form B-1. Inclusion form for the effectiveness of different methods of toilet training for bowel and bladder control Form B-2. Assessment of methodology for non-randomized controlled trials for the effectiveness

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance. Session 1: ICH E9 and E10. PSI Conference, May 2011

TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance. Session 1: ICH E9 and E10. PSI Conference, May 2011 TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance Session 1: PSI Conference, May 2011 Kerry Gordon, Quintiles 1 E9, and how to locate it 2 ICH E9 Statistical Principles for Clinical Trials (Issued

More information

Lean Six Sigma Black Belt Body of Knowledge

Lean Six Sigma Black Belt Body of Knowledge General Lean Six Sigma Defined UN Describe Nature and purpose of Lean Six Sigma Integration of Lean and Six Sigma UN Compare and contrast focus and approaches (Process Velocity and Quality) Y=f(X) Input

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

Validation and Calibration. Definitions and Terminology

Validation and Calibration. Definitions and Terminology Validation and Calibration Definitions and Terminology ACCEPTANCE CRITERIA: The specifications and acceptance/rejection criteria, such as acceptable quality level and unacceptable quality level, with an

More information

Version 4.0. Statistics Guide. Statistical analyses for laboratory and clinical researchers. Harvey Motulsky

Version 4.0. Statistics Guide. Statistical analyses for laboratory and clinical researchers. Harvey Motulsky Version 4.0 Statistics Guide Statistical analyses for laboratory and clinical researchers Harvey Motulsky 1999-2005 GraphPad Software, Inc. All rights reserved. Third printing February 2005 GraphPad Prism

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

First question to ask yourself Do you really need to do a randomised controlled trial (RCT)?

First question to ask yourself Do you really need to do a randomised controlled trial (RCT)? So, you think you want to run a Randomised Controlled Trial? Do you know what is meant by: blinding, stratified randomisation, intention-to-treat analysis, surrogate outcomes, generalisability, confounding

More information

10:20 11:30 Roundtable discussion: the role of biostatistics in advancing medicine and improving health; career as a biostatistician Monday

10:20 11:30 Roundtable discussion: the role of biostatistics in advancing medicine and improving health; career as a biostatistician Monday SUMMER INSTITUTE FOR TRAINING IN BIOSTATISTICS (SIBS) Week 1: Epidemiology, Data, and Statistical Software 10:00 10:15 Welcome and Introduction 10:20 11:30 Roundtable discussion: the role of biostatistics

More information

2 Precision-based sample size calculations

2 Precision-based sample size calculations Statistics: An introduction to sample size calculations Rosie Cornish. 2006. 1 Introduction One crucial aspect of study design is deciding how big your sample should be. If you increase your sample size

More information

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule

More information

MRC Autism Research Forum Interventions in Autism

MRC Autism Research Forum Interventions in Autism MRC Autism Research Forum Interventions in Autism Date: 10 July 2003 Location: Aim: Commonwealth Institute, London The aim of the forum was to bring academics in relevant disciplines together, to discuss

More information

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS This booklet contains lecture notes for the nonparametric work in the QM course. This booklet may be online at http://users.ox.ac.uk/~grafen/qmnotes/index.html.

More information