Sample Size Determination
- Melina Griffith
- 7 years ago
Transcription
1 Sample Size Determination Bandit Thinkhamrop, PhD (Statistics) Dept. of Biostatistics & Demography Khon Kaen University
2 Essentials of sample size calculation
- No one accepts a magic number.
- Too large vs. too small.
- The sample size must be justified to the sponsor and the Ethics Committee.
- It must ensure:
  - adequate power to test a hypothesis
  - the desired precision of an estimate
3 Two main approaches
- Hypothesis-based sample size calculation
  - Involves power (beta error)
  - Ensures a statistically significant finding, which may not be clinically conclusive
  - Easy and widely available
- Confidence interval methods of sample size calculation
  - Involve the precision of the estimate
  - Ensure a clinically conclusive finding, because the method directly estimates the magnitude of effect
  - Difficult and not widely available
4 Overall steps
- Identify the primary outcome.
- Identify and review the magnitude of effect, and its variability, that will be the basis of the research conclusion.
- Identify the statistical method that will be used to obtain the main magnitude of effect.
- Calculate the sample size.
- Describe how the sample size was calculated in sufficient detail for it to be explained and reproduced.
5 Steps in the calculation
- Base sample size calculation
- Design effect (for correlated outcomes)
- Contingency (an increase to account for nonresponse or dropout)
- Rounding up to the nearest (and comfortable) number
- Evaluate whether this sample size would provide a precise and conclusive answer to the research question by analyzing the data as if the results turned out as expected.
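The adjustment steps listed on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the author's procedure: the function names are hypothetical, the design effect formula 1 + (m - 1) × ICC is the usual one for equal-sized clusters, and dividing by (1 - dropout rate) is one common convention for the contingency step.

```python
import math


def design_effect_from_icc(cluster_size, icc):
    """Usual design effect for equal-sized clusters: 1 + (m - 1) * ICC (assumed form)."""
    return 1.0 + (cluster_size - 1) * icc


def adjusted_sample_size(base_n, design_effect=1.0, dropout_rate=0.0, round_to=1):
    """Inflate a base sample size for correlated outcomes and dropout, then round up.

    base_n        -- sample size from the base formula
    design_effect -- multiplier for correlated (e.g. clustered) outcomes
    dropout_rate  -- expected proportion of nonresponse or dropout (0 <= rate < 1)
    round_to      -- round up to a multiple of this "comfortable" number
    """
    if base_n <= 0 or design_effect <= 0 or round_to < 1:
        raise ValueError("base_n and design_effect must be positive, round_to >= 1")
    if not 0.0 <= dropout_rate < 1.0:
        raise ValueError("dropout_rate must be in [0, 1)")
    n = base_n * design_effect      # step 2: design effect
    n = n / (1.0 - dropout_rate)    # step 3: contingency (assumed convention)
    return math.ceil(n / round_to) * round_to  # step 4: round up


# Example: base n of 385, design effect 1.5, 10% dropout, rounded up to a multiple of 10
print(adjusted_sample_size(385, design_effect=1.5, dropout_rate=0.10, round_to=10))
```

The final step on the slide, checking whether the resulting sample gives a conclusive answer, still has to be done separately by estimating the expected effect and its confidence interval.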
6 Suggested approaches
- For unknown parameters in the formula, try to find existing evidence or use your best guesstimate, a.k.a. educated guess.
- Do not base the calculation on only one scenario or only one reference. It is highly recommended that all key parameters be varied to see how they affect the sample size.
- Always evaluate the sufficiency of the sample size by estimating the main magnitude of effect and its 95% CI and checking whether it provides a conclusive finding.
- Consult a statistician early.
7 Common pitfalls
- An unjustified sample size based on a magic number
- Reliance on a simplified formula or a sample size table without understanding its limitations
- Never justify the sample size like this: "A previous study in this area recruited 50 subjects and found highly significant results (p=0.001), and therefore a similar sample size should be sufficient."
- Inconsistency with the protocol
- Relying too heavily on previous findings in the sample size calculation
8 Examples of common calculations
- Mean, one group
- Mean, two independent groups
- Proportion, one group
- Proportion, two independent groups
Get some ideas from these examples, then practice with your own research.
9 Mean one group: Formula
$n = \dfrac{Z_{\alpha/2}^{2}\, s^{2}}{d^{2}}$
Where:
- n = the sample size
- $Z_{\alpha/2}$ = the standard normal coefficient, typically 1.96 for a 95% CI
- s = the standard deviation
- d = the desired precision, expressed as half of the maximum acceptable confidence interval width
10 Mean one group: Calculations (fixed α = 0.05)
Expected standard deviation    Precision (half width)    n
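As a rough sketch of how such a table could be produced, the snippet below applies the normal-approximation formula for a single mean, n = (Z × s / d)², over a few scenarios. The function name and the scenario values are illustrative assumptions, not the slide's own figures. Software that uses a t-based or other adjustment can return slightly larger values, which may explain why the description on the next slide quotes 38 for a standard deviation of 30 and a precision of 10, where this formula gives 35.

```python
import math
from statistics import NormalDist


def n_mean_one_group(sd, d, alpha=0.05):
    """Sample size to estimate one mean with half-width d (normal approximation)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    return math.ceil((z * sd / d) ** 2)


# Vary the key parameters instead of relying on a single scenario
# (illustrative values only).
for sd in (20, 30, 40):
    for d in (5, 10):
        print(f"SD={sd:>2}  precision={d:>2}  n={n_mean_one_group(sd, d)}")
```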
11 Mean one group: Descriptions
A sample size of 38 would be able to estimate a mean with a precision of 10, assuming a standard deviation of 30 according to a study by <Reference>. That is, based on the expected mean of 55 <Reference>, the 95% confidence interval of the estimated mean would be from 45 to 65.
12 Mean two independent groups: Formula
The terms of the formula are:
- Sample size in each group (assumes equal-sized groups)
- A term representing the desired power (typically 0.84 for 80% power or 1.28 for 90% power)
- A measure of variability (the variance, i.e. the square of the standard deviation)
- A term representing the desired level of statistical significance (typically 1.96 for α = 0.05)
- The minimum meaningful difference, or effect size
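A commonly used formula that is consistent with the terms listed above is given below. The symbols are assumptions introduced here, since the slide's own notation is not shown: $n$ is the sample size per group, $Z_{\beta}$ the power term, $\sigma^{2}$ the common variance, $Z_{\alpha/2}$ the significance term, and $\Delta$ the minimum meaningful difference.

```latex
n = \frac{2\,\sigma^{2}\,\left(Z_{\alpha/2} + Z_{\beta}\right)^{2}}{\Delta^{2}}
```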
13 Mean two independent groups: Calculations (fixed α = 0.05)
H0: M1 − M2 = 0. H1: M1 − M2 = D1 ≠ 0.
Test statistic: Z test with pooled variance (SD1 = 20; SD2 = 25)
Power    Mean in control group    Minimum meaningful difference    n1    n2
14 Mean two independent groups: Descriptions
A sample size of 37 in group one and 37 in group two (74 in total) would have a power of 80% to detect a difference between groups of 15, assuming a mean of 35 in the control group and estimated group standard deviations of 20 and 25, respectively, according to a study by <Reference>. The test statistic used is the two-sided two-sample t-test. The significance level of the test was targeted at 0.05.
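The scenario described here can be checked approximately with the normal-approximation formula for two independent means with unequal standard deviations, n per group = (SD1² + SD2²)(Z_α/2 + Z_β)² / Δ². This is a sketch with a hypothetical function name, not the software calculation behind the slide. It returns 36 per group for this scenario, while the slide reports 37, which would be consistent with a t-test-based calculation that adds a small correction.

```python
import math
from statistics import NormalDist


def n_per_group_two_means(sd1, sd2, diff, power=0.80, alpha=0.05):
    """Per-group sample size for comparing two means (two-sided, normal approximation)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    z_beta = NormalDist().inv_cdf(power)           # about 0.84 for 80% power
    return math.ceil((sd1 ** 2 + sd2 ** 2) * (z_alpha + z_beta) ** 2 / diff ** 2)


# Scenario from the description: SDs of 20 and 25, difference of 15
for power in (0.80, 0.90):
    n = n_per_group_two_means(20, 25, 15, power=power)
    print(f"power={power:.0%}  n per group={n}")
```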
15 Proportion one group: Formula
$n = \dfrac{Z_{\alpha/2}^{2}\, p\,(1 - p)}{d^{2}}$
Where:
- n = the sample size
- $Z_{\alpha/2}$ = the standard normal coefficient, typically 1.96 for a 95% CI
- p = the expected proportion, expressed as a decimal (e.g., 0.45)
- d = the desired precision, expressed as half of the maximum acceptable confidence interval width
16 Proportion one group: Calculations (fixed α = 0.05)
Expected prevalence    Precision (half width)    n
15%                    2%                        1,225
20%                    2%                        1,537
15%                    4%                        307
20%                    4%                        385
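The one-group proportion formula, n = Z² × p(1 − p) / d², is easy to check in code. The sketch below uses a hypothetical function name and reproduces the values 1,225, 1,537 and 385 shown in the table.

```python
import math
from statistics import NormalDist


def n_proportion_one_group(p, d, alpha=0.05):
    """Sample size to estimate one proportion p with half-width d (normal approximation)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    return math.ceil(z ** 2 * p * (1 - p) / d ** 2)


# Expected prevalence and precision (half width), as in the table
for p, d in [(0.15, 0.02), (0.20, 0.02), (0.15, 0.04), (0.20, 0.04)]:
    print(f"prevalence={p:.0%}  precision={d:.0%}  n={n_proportion_one_group(p, d):,}")
```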
17 Proportion one group: Descriptions
A sample size of 400 would yield a 95% confidence interval of 16% to 24%, assuming a prevalence of 20% according to a study by <Reference>.
18 Proportion two independent groups: Formula
The terms of the formula are:
- Sample size in each group (assumes equal-sized groups)
- A term representing the desired power (typically 0.84 for 80% power or 1.28 for 90% power)
- A measure of variability (similar to the standard deviation)
- A term representing the desired level of statistical significance (typically 1.96 for α = 0.05)
- The minimum meaningful difference, or effect size
19 Proportion two independent groups: Calculations (fixed α = 0.05)
H0: P1 − P2 = 0. H1: P1 − P2 = D1 ≠ 0.
Test statistic: Z test with pooled variance
Power    Proportion in control group    Minimum meaningful difference    n1       n2
90%      40%                            5%                               2,053    2,053
80%      40%                            5%                               1,534    1,534
90%      50%                            5%                               2,095    2,095
80%      50%                            5%                               1,565    1,565
90%      40%                            10%
80%      40%                            10%
90%      50%                            10%
80%      50%                            10%
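The legible rows of this table can be reproduced with the standard sample size formula for a two-sided Z test of two proportions with pooled variance under the null hypothesis: n per group = [Z_α/2 √(2 p̄(1 − p̄)) + Z_β √(p1(1 − p1) + p2(1 − p2))]² / Δ², where p̄ is the average of the two proportions. The sketch below assumes the second proportion is the control proportion plus the difference; the function name is hypothetical, and the match with the table suggests, but does not prove, that this is the formula the original software used.

```python
import math
from statistics import NormalDist


def n_per_group_two_proportions(p1, diff, power=0.80, alpha=0.05):
    """Per-group sample size for a two-sided Z test of two proportions (pooled variance)."""
    p2 = p1 + diff                   # assumption: second group is higher by `diff`
    p_bar = (p1 + p2) / 2            # pooled proportion under H0
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    term_h0 = z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
    term_h1 = z_beta * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2))
    return math.ceil((term_h0 + term_h1) ** 2 / diff ** 2)


# Rows of the table: power, proportion in control group, minimum meaningful difference
for power, p1, diff in [(0.90, 0.40, 0.05), (0.80, 0.40, 0.05),
                        (0.90, 0.50, 0.05), (0.80, 0.50, 0.05),
                        (0.80, 0.50, 0.10)]:
    n = n_per_group_two_proportions(p1, diff, power=power)
    print(f"power={power:.0%}  p1={p1:.0%}  diff={diff:.0%}  n1=n2={n:,}")
```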
20 Proportion two independent groups: Descriptions
A sample size of 388 in group one and 388 in group two (776 in total) would have a power of 80% to detect a difference between groups of 10%, assuming a prevalence of 50% in the control group according to a study by <Reference>. The test statistic used is the two-sided Z test. The significance level of the test was targeted at 0.05.
21 Other considerations
- Sampling design affects the calculation of sample size:
  - Simple random sampling / assignment
  - Stratified random sampling / assignment
  - Clustered random sampling / assignment
- Complex study designs affect the calculation of sample size:
  - Matching
  - Multiple stages of sampling
  - Repeated measures
- Usually the sample size calculation is based on the method of analysis:
  - Correlation, agreement, diagnostic performance
  - Z-test
  - Regression: multiple linear, logistic
  - Multivariate analyses such as principal component or factor analysis
  - Survival analyses
  - Multilevel models
22 Other considerations
- Demonstrating superiority
  - The sample size must be sufficient to detect a difference between treatments.
  - Requires specifying the minimum meaningful difference.
- Demonstrating non-inferiority or equivalence
  - The sample size required to demonstrate equivalence is larger than that required to demonstrate superiority.
  - Requires specifying the non-inferiority margin or equivalence range.
23 Precision or power estimation
- Equivalent to a sample size calculation; do it in the planning phase of the study.
- Do it when the number of available subjects is known.
- Wrong: "There are around 50 patients per year, of whom 10% may refuse to take part in the study. Therefore over the 2 years of the study, the sample size will be 90 patients."
- Correct: "It is estimated that 90 patients will be available in the clinic. Assuming a prevalence of 65%, this will give a 95% confidence interval for the prevalence about 20 percentage points wide (roughly ±10%)."
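When the number of available subjects is fixed, the one-group proportion formula can be turned around to give the achievable precision, d = Z × √(p(1 − p) / n). The sketch below uses a hypothetical function name and the normal approximation. For 90 patients and an assumed prevalence of 65% it gives a half-width of about 10 percentage points, that is, a 95% confidence interval roughly 20 percentage points wide.

```python
import math
from statistics import NormalDist


def half_width_proportion(p, n, alpha=0.05):
    """Half-width of the confidence interval for a proportion p with n subjects."""
    z = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    return z * math.sqrt(p * (1 - p) / n)


# Scenario from the slide: 90 available patients, assumed prevalence of 65%
d = half_width_proportion(0.65, 90)
print(f"half-width = {d:.1%}, 95% CI approx. {0.65 - d:.1%} to {0.65 + d:.1%}")
```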
24 Suggested learning resources
- WWW: Statistics Guide for Research Grant Applicants at St George's, University of London (maintained by Martin Bland)
- Software: PASS, nQuery, EpiTable, SeqTrial, PS, etc.
25 Q & A