Effect Size Calculation for experimental & quasiexperimental
|
|
|
- Rosamond Wiggins
- 9 years ago
- Views:
Transcription
1 Effect Size Calculation for experimental & quasiexperimental methods Jorge Garcia Hombrados & Hugh Waddington
2 Our goal is to get from here... Source: Baird et al Worms at work
3 ... to here Our goal is to get from here Source: Petrosino et al. (2012) Interventions in Developing Nations for Improving Primary and Secondary School Enrollment of Children: A Systematic Review Campbell Systematic Reviews
4 Working in small groups, you will learn: To calculate effect sizes: odds ratios, risk ratios, response ratios, standardized mean differences (SMDs) Effect sizes for different types of data based on data reported as group comparisons from raw and adjusted data based on data reported from statistical inference (eg regression analysis) For different types of rigorous evaluation design: randomized control trials (RCTs), cross-section design with Propensity Score Matching (PSM), longitudinal design with Difference-in-Differences (DID) analysis
5 Concepts we will use liberally Mean Standard deviation Standard error Confidence interval Average treatment effect on the treated (ATET)
6 What is an effect size? Impact evaluation tells us what difference an intervention makes to outcomes, measured by the treatment effect (eg regression coefficient on treatment variable) The effect size scales the treatment effect in units which tell us the magnitude of this difference and its statistical significance (as indicated by 95% confidence interval) There are different units and measures in which we can quantify this difference but it needs to be consistent and comparable across studies. Effect sizes are the unit of analysis in meta-analysis but they are also important to indicate policy relevance of findings
7 Example: Height-for-age Mean Z-score Percentage below z= Bangla-DHS % 2011 Bangla-DHS %
8 Examples of different effect sizes Standardized mean difference Group contrast Continuous outcome variable e.g. test scores Odds ratio/ risk ratio Group contrast Dichotomous outcome variable e.g. mortality Correlation coefficient (pearson s r) Association between 2 variables e.g. value of micro-credit loan and income Proportion Disease prevalence rates e.g. proportion suffering diarrhoea
9 Effect sizes measures For continuous outcomes: Standardised Mean Differences (SMD) Response Ratios (RR) d-based Regression coefficients, t-stat effect sizes R-based effect sizes (correlation coefficients) For dichotomous outcomes: Odds ratios Risk ratios
10 Standardized mean difference Uses the pooled standard deviation (some cases use control group standard deviation)
11 Exercise: calculate absolute and SMD Income per capita Treatment mean Comparis on mean Treatment SD Comparis on SD Sample size t Sample size c Bangladesh Nepal
12 Odds ratio and risk ratio Frequencies Success Failure Treatment Group a b Control Group c d OR a / b ad Ratio of odds of success in the c / d bc of success in the comparison treatment group relative to odds RR a c /( a b) /( c d) Ratio of probability of success in the treatment group relative to probability of success in the comparison group
13 Exercise: calculate and interpret OR and RR Poor Non-Poor Treatment Comparison
14 Selecting the effect size measure in practice We need to estimate the same effect size measure across all the included studies pooled in the metaanalysis However, very often not all the studies report enough information to compute all the effect size measures and contacting authors does not always work The selection of the effect size measure in practice takes into account: Nature of outcome (dichotomous vs continuous) Minimising the number of studies lost
15 Option 1: SMD Measures the impact of the programme in standard deviations of the outcome variable Can be computed consistently for both experimental and non-experimental studies Is the less problematic methodology However, its interpretation is not straightforward and the data required for its computation is not always available
16 Option 2: Response Ratio Measures the impact of the programme in percentage change Based on the Risk Ratio effect size used for dichotomous outcomes Data required for its computation is minimum Can be computed consistently for both experimental and non-experimental studies It is appropriate for both continuous and dichotomous outcomes (risk ratio) providing the outcome measure has a natural scale unit and natural zero points (but is not likely to equal zero) Synthesis uses logarithmic scales for both RR and SE(RR)
17 Other measures are problematic T-statistics are noisy and when applied to regression has some important shortcomings (Becker & Wu, 2007) Regression coefficients: data on covariance matrix (hardly reported) are required for appropriate synthesis R-based effect sizes do not seem to work properly for multivariate regression
18 Computation of effect sizes: SMD Standardised Mean Difference (SMD): How to estimate SMD from different study designs? An easy way of dealing with it is thinking separately in the numerator and denominator.
19 Computation of effect size: SMD The numerator Y t -Y c represents the causal raw impact of the programme in the outcome: In a regression analysis is the coefficient of interest (Beta). In a matching-based study this is the causal impact (ATT) or the difference in outcomes between groups after matching.
20 Computation of effect size: SMD The denominator S p is a measure of the standard deviation of the outcome. It makes the effect size comparable across studies In regression studies, we can use the standard deviation of the regression errors. Alternatively, we can use the sample standard deviation or the treatment and control standard deviation to calculate S p in matched-based studies or approximate it in regression based studies:
21 Computation of effect size: SMD Diff-in-Diff models In Diff-in-Diff model, the dependent variable is the change in the dependent variable of reference, or in semi log models, the growth rate of the variable of reference. In these cases, Sp should measure the pool standard deviation of the change (or the growth rate) in the dependent variable of reference. Unfortunately, this information is not reported very often and most of the times the calculation of SMDs requires assumptions.
22 Computation of effect size: SMD Tobit, Logit and Probit models Where for: Probit models Δ= Φ (x i β+γ) - Φ(x i β) Censored Tobit models Δ=((Φ ((x i β+γ)/σ))*(( x i β+γ)+ σ φ(x i β+γ))) - ((Φ ((x i β)/σ))*(( x i β)+ σ φ(x i β)) Logit models
23 SMD correction for small sample bias When the sample size is small, a correction in the effect size and its variance is needed. Although the correction is going to be almost imperceptible, we recommend to apply to all SMD calculations. For regression studies, there is a more efficient alternative (see Keef and Roberts, 2004)
24 Computation of effect size: Response Ratios For matched-based studies (e.g., PSM), Y c = Y t ATT where Y t is the outcome level in the treatment, ATT is the average treatment effect on the treated and Y c is the outcome level in the control group after matching. For regression-based studies, Y t = Y c + β where Y c is the outcome level in the total sample and Y t is the ceteris paribus average predicted outcome if the sample received the treatment.
25 Computation of Response Ratio Standard Error When the standard deviation of the dependent variable or the necessary information to calculate SD is not reported, we can approximate the SE for response ratios using the t statistics/p-value of the regression coefficient or of the results of the t test for equality of means between groups after matching:
26 Computation of effect size: Response Ratios Semi-log difference-in-differences (DID): RR= e β If Sp is computed to calculate the SE (RR), Sp should measure the pooled standard deviation of the change (or the growth rate) in the variable of reference.
27 Computation of effect size: Response Ratios Logit, Probit and Tobit model Where for: Probit models Δ= Φ (x i β+γ) - Φ(x i β) Censored Tobit models Δ=((Φ ((x i β+γ)/σ))*(( x i β+γ)+ σ φ(x i β+γ))) - ((Φ ((x i β)/σ))*(( x i β)+ σ φ(x i β)) Logit models
28 Computation of effect size: Response Ratios For logit model we can also estimate RR as:
29 Unit of analysis error correction Unit of analysis error (UoA) arises in impact evaluation studies in which programme placement and analysis are conducted at a different unit level and the researcher does not account for this within-cluster dependency. E.g. Programme placement at cluster level and outcomes analysed at household level. The consequences of UoA are false smaller variances and false narrower confidence intervals. If the study conduct analysis and programme placement at different levels and do not use cluster robust standard errors, we need to apply a correction to the standard errors to avoid potential Type II error: Where m is the cluster size and ICC is the intra-cluster correlation coefficient.
30 Formulae Table for ES computation Effect size measure Formulae for matched-based studies: Information needed to be reported in matched-based studies: Formulae for regression-based studies: Information needed to be reported in regression-based studies: Standardizes Mean Differences (SMD) SE SMD = S p = S p = SMD = Y t Y c S p n t + n c n t n c + OR SE SMD = SMD t SMD 2 2 (n t + n c ) n t 1 S t 2 + n c 1 S c 2 OR n t + n c 2 SD y 2 n t + n c 1 β2 n t n c n t + n c n t + n c SMD -Sample mean outcome for the treated and control group after matching OR -Sample mean outcome for the treatment group AND Average Treatment Effect on the Treated. -Sample standard deviation for treatment and control group AND sample size for treatment and control group OR -Sample standard deviation for the total sample AND sample mean outcome for treatment and control group OR sample mean outcome for treatment group and ATT AND sample size for treatment and control group. SE (SMD) -Sample size of the treated and control group OR t statistics of the treatment effect. S p = SMD = β S p SD y 2 (n t + n c 1) β2 (n t n c ) n t + n c n t + n c OR S p = SD of te regression residuals. SE SMD = SE SMD = SMD t OR SMD 2 v 2 v t 2 + v c(v) 2 v + 2 SMD -Regression coefficient -Sample standard deviation of the dependent variable AND sample size for treatment and control group OR -Standard deviation of the error term in the regression. SE(SMD) -t statistics for the regression coefficient OR t statistics for the regression coefficient AND Number of covariates AND sample size for the total sample.
31 Formulae Table for ES computation Effect size measure Formulae for matched-based studies: Information needed to be reported in matched-based studies: Formulae for regressionbased studies: Information needed to be reported in regressionbased studies: Response Ratio (RR) S p = RR = Y t Y c SE RR = Exp Ln(RR) t SE RR = S p 2 S p = OR 1 2 n t Y + 1 t n c Y 2 c n t 1 S t 2 + n c 1 S c 2 n t + n c 2 OR SD y2 n t + n c 1 n t + n c β 2 n t n c n t + n c RR -Sample mean outcome for the treated and control group after matching OR -Sample mean outcome for the treatment group AND Average Treatment Effect on the Treated. SE(RR) -t statistics of the treatment effec t. OR -Sample mean outcome for the treated and control group after matching OR Sample mean outcome for the treatment group AND Average Treatment Effect on the Treated AND Sample size for the treatment and control group AND sample standard deviation for the treatment and control group. OR sample standard deviation for all the sample. RR = Y s + β Y s SE RR = Exp Ln(RR ) t RR -Mean outcome for the total sample. -Beta of the regression coefficient. SE(RR) -t statistics of the regression coefficient.
32 Definitions S p is the pool standard deviation β is the coefficient or impact effect of interest. t is the t statistics of the regression coefficient or of the relevant treatment impact (t-test for equality of means). Exp is the exponential function (e f(x) ) Y t, Y c, Y s, n t, n c and n s are the mean outcome in the treatment group, control group and total sample and the sample size for the treatment group, control group and total sample. SD t, SD c and SD Y are the standard deviation for the treatment group, control group and total sample. v is the degrees of freedom of the regression equation. ATT Average treatment effect on the treated. m is the cluster size ICC is the intra-cluster correlation coefficient, an estimate of the relative variability within clusters.
33 Definitions In a logit, probit or tobit regression: Δ is the impact effect xiβ is the mean predicted outcomes for without participating in the programme. γ is the coefficient of interest. Ф is the cumulative distribution function. φ is the probability distribution function.
34 Exercises Compute effect sizes for the following studies: For Banerjee et al. 2009: estimate RR and SMD of the impact of microfinance access on per capita expenditure Estimate RR of the impact of microfinance access on non-food decision making by women For Chen et al. 2012, estimate RR and SMD of the impact of tuition relief on the change in test scores (use the Diff-in-Diff multivariate regression specification)
35 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 RCT with regression analysis.
36 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 RCT with regression analysis Estimate Standardised Mean Difference (SMD): 1. Compute t: 2. Compute SD y: : 3. Compute S p :
37 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t=0.809 SD y =928.1 S p =927.8 RCT with regression analysis Estimate Standardised Mean Difference (SMD): 4. Compute SMD: 5. Compute SE(SMD): 6. Interpret the results 7. For yourself: correct for sample bias
38 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t=0.809 SD y =928.1 S p =927.8 RCT with regression analysis Estimate Response Ratio (RR): 1. Compute RR: 2. Compute SE(RR): 3. Interpret the results
39 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t= RCT with regression analysis
40 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t= RCT with regression analysis Estimate Standardised Mean Difference (SMD): 1. Compute t statistics: 2. Compute SD y : 3. Compute S p :
41 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t= SD y =0.284 S p =0.284 RCT with regression analysis Estimate Standardised Mean Difference (SMD): 4. Compute SMD: 5. Compute SE(SMD): 6. Interpret the results:
42 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t= SMD= SE(SMD)=0.207 RCT with regression analysis Estimate Odds ratio (OR): 1. Compute OR: 2. Compute SE(OR): 3. Interpret the results:
43 Banerjee et al Information coded: In an RCT, at the baseline and assuming n t =n c : Y s Y t Y c SD t SD c Beta= SE(Beta)= n s = 6821; we assumed n c = 3410 and n t = 3411 t= RCT with regression analysis Estimate Risk Ratio (RR): 1. Compute RR: 2. Compute SE(RR): 3. Interpret the results
44 Chen et al Information coded: Y s = Beta=2.85 t= 1.82 n t = 555 n c = 1709 SD y 2010 =16.61 SD y 2009 =16.85 Diff-in Diff regression study
45 Chen et al Information coded: Y s = Beta=2.85 t= 1.82 n t = 555 n c = 1709 SD y 2010 =16.61 SD y 2009 =16.85 Diff-in Diff regression study Estimate Standardised Mean Difference (SMD): 1. Compute SD y: on the change in the outcome: 3. Compute S p :
46 Chen et al Information coded: Y s = Beta=2.85 SE(Beta)= 1.82 n t = 555 n c = 1709 SD y 2010 =16.61 SD y 2009 =16.85 SD y =10.58 t=1.82 S p =10.51 Diff-in Diff regression study Estimate Standardised Mean Difference (SMD): 4. Compute SMD: 5. Compute SE(SMD): 6. Interpret the results 7. For yourself: correct for sample bias
47 Chen et al Information coded: Y s = Beta=2.85 SE(Beta)= 1.82 n t = 555 n c = 1709 SD y 2010 =16.61 SD y 2009 =16.85 SD y =10.58 t=1.82 S p =10.51 Diff-in Diff regression study Estimate Response Ratios (RR): 1. Compute RR: 2. Compute SE(RR): 3. Interpret the results
48 Practical tips David Wilson effect size calculator available here: /index.html but remember to apply corrections for sample bias! Code all the relevant information in a spreadsheet for all relevant studies before starting ES calculations. Decision on the selection of ES measure would depend on which ES measure would lead to the smaller study loss.
49 Main Bibliography Used Banerjee, A., Duflo, E., Glennerster, R., Kinnan, C. 2009, The Miracle of Microfinance? Evidence from a Randomized Evaluation, JPAL Becker, B. & Wu, M. 2007, The Synthesis of Regression Slopes in Meta-Analysis, Statistical Science, Vol. 22, No 3, Borenstein, M., Hedges, L., Higgins, J., Rothstein,H. 2009, Introduction to Meta- Analysis. WILEY Chen, X., Shi, Y., Yi, H., Zhang, L., Mo, D., Chu, J., Rozelle, S. 2012, The Impact of a Senior High School Tuition Relief Program on Poor Junior High School Students in Rural China, REAP Working Paper 239, University of Stanford. Hansen, H., Klejntrup, N.R., Andersen, O.W. 2011, A comparison of Model-based and Design-based Impact Evaluations of Interventions in Developing Countries, FOI Working Paper n o.16, University of Copenhagen. Hasler, S. Forthcoming, The Effect of Multi-grade Teaching on Education Quality in Cameroon. A Propensity Score Matching Model, Journal of Development Studies. Hedges, L.V. 1981, Distribution Theory for Glass s Estimator of Effect Size and Related Estimators, Journal of Educational Statistics, vol. 6, pp Keef, S.P. & Roberts, L.A The meta-analysis of partial effect sizes, British J. Math. Statist. Psych, Vol 57, pp Lipsey, M.W. & Wilson, D.B. 2001, Practical Meta-Analysis, Applied Social Research Methods Series, Volume 49. SAGE Publications.
50 Thank you very much Visit: tional_development/index.php
51 Annex: standard errors and 95% CI for OR / RR d c b a SE OR % confidence interval = exp[ln(or))±1.96*ln(se OR )] d c c b a a SE RR % confidence interval = exp[ln(rr))±1.96*ln(se RR )]
Calculating Effect-Sizes
Calculating Effect-Sizes David B. Wilson, PhD George Mason University August 2011 The Heart and Soul of Meta-analysis: The Effect Size Meta-analysis shifts focus from statistical significance to the direction
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Study Guide for the Final Exam
Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
CLUSTER SAMPLE SIZE CALCULATOR USER MANUAL
1 4 9 5 CLUSTER SAMPLE SIZE CALCULATOR USER MANUAL Health Services Research Unit University of Aberdeen Polwarth Building Foresterhill ABERDEEN AB25 2ZD UK Tel: +44 (0)1224 663123 extn 53909 May 1999 1
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Data Analysis, Research Study Design and the IRB
Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB
A Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD
REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing
Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters
Simple Linear Regression
STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze
Ordinal Regression. Chapter
Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
Missing data in randomized controlled trials (RCTs) can
EVALUATION TECHNICAL ASSISTANCE BRIEF for OAH & ACYF Teenage Pregnancy Prevention Grantees May 2013 Brief 3 Coping with Missing Data in Randomized Controlled Trials Missing data in randomized controlled
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.
Guided Reading Educational Research: Competencies for Analysis and Applications 9th Edition EDFS 635: Educational Research Chapter 1: Introduction to Educational Research 1. List and briefly describe the
Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)
1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting
Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct 16 2015
Stat 411/511 THE RANDOMIZATION TEST Oct 16 2015 Charlotte Wickham stat511.cwick.co.nz Today Review randomization model Conduct randomization test What about CIs? Using a t-distribution as an approximation
Fixed-Effect Versus Random-Effects Models
CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval
Understanding and Quantifying EFFECT SIZES
Understanding and Quantifying EFFECT SIZES Karabi Nandy, Ph.d. Assistant Adjunct Professor Translational Sciences Section, School of Nursing Department of Biostatistics, School of Public Health, University
Statistical Rules of Thumb
Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN
individualdifferences
1 Simple ANalysis Of Variance (ANOVA) Oftentimes we have more than two groups that we want to compare. The purpose of ANOVA is to allow us to compare group means from several independent samples. In general,
STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
Simple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
Illustration (and the use of HLM)
Illustration (and the use of HLM) Chapter 4 1 Measurement Incorporated HLM Workshop The Illustration Data Now we cover the example. In doing so we does the use of the software HLM. In addition, we will
Chapter Eight: Quantitative Methods
Chapter Eight: Quantitative Methods RESEARCH DESIGN Qualitative, Quantitative, and Mixed Methods Approaches Third Edition John W. Creswell Chapter Outline Defining Surveys and Experiments Components of
Specifications for this HLM2 run
One way ANOVA model 1. How much do U.S. high schools vary in their mean mathematics achievement? 2. What is the reliability of each school s sample mean as an estimate of its true population mean? 3. Do
Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)
Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and
Lesson 14 14 Outline Outline
Lesson 14 Confidence Intervals of Odds Ratio and Relative Risk Lesson 14 Outline Lesson 14 covers Confidence Interval of an Odds Ratio Review of Odds Ratio Sampling distribution of OR on natural log scale
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
Multinomial and Ordinal Logistic Regression
Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,
Introduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
When to Use a Particular Statistical Test
When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
Two Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
Trial Sequential Analysis (TSA)
User manual for Trial Sequential Analysis (TSA) Kristian Thorlund, Janus Engstrøm, Jørn Wetterslev, Jesper Brok, Georgina Imberger, and Christian Gluud Copenhagen Trial Unit Centre for Clinical Intervention
WHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
Tests for Two Proportions
Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics
Multiple Choice: 2 points each
MID TERM MSF 503 Modeling 1 Name: Answers go here! NEATNESS COUNTS!!! Multiple Choice: 2 points each 1. In Excel, the VLOOKUP function does what? Searches the first row of a range of cells, and then returns
Multivariate Logistic Regression
1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation
IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs
IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs Intervention or Policy Evaluation Questions Design Questions Elements Types Key Points Introduction What Is Evaluation Design? Connecting
Probability Calculator
Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that
Simple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
DATA COLLECTION AND ANALYSIS
DATA COLLECTION AND ANALYSIS Quality Education for Minorities (QEM) Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. August 23, 2013 Objectives of the Discussion 2 Discuss
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
SUMAN DUVVURU STAT 567 PROJECT REPORT
SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.
Introduction to mixed model and missing data issues in longitudinal studies
Introduction to mixed model and missing data issues in longitudinal studies Hélène Jacqmin-Gadda INSERM, U897, Bordeaux, France Inserm workshop, St Raphael Outline of the talk I Introduction Mixed models
Non-Inferiority Tests for Two Proportions
Chapter 0 Non-Inferiority Tests for Two Proportions Introduction This module provides power analysis and sample size calculation for non-inferiority and superiority tests in twosample designs in which
LOGIT AND PROBIT ANALYSIS
LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 [email protected] In dummy regression variable models, it is assumed implicitly that the dependent variable Y
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
Multiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
Calculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: [email protected]) Varna University of Economics, Bulgaria ABSTRACT The
Regression with a Binary Dependent Variable
Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22 Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel,
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Univariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
Longitudinal Meta-analysis
Quality & Quantity 38: 381 389, 2004. 2004 Kluwer Academic Publishers. Printed in the Netherlands. 381 Longitudinal Meta-analysis CORA J. M. MAAS, JOOP J. HOX and GERTY J. L. M. LENSVELT-MULDERS Department
Week 4: Standard Error and Confidence Intervals
Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.
Crash Course on Basic Statistics
Crash Course on Basic Statistics Marina Wahl, [email protected] University of New York at Stony Brook November 6, 2013 2 Contents 1 Basic Probability 5 1.1 Basic Definitions...........................................
SAMPLE SIZE CONSIDERATIONS
SAMPLE SIZE CONSIDERATIONS Learning Objectives Understand the critical role having the right sample size has on an analysis or study. Know how to determine the correct sample size for a specific study.
Approaches for Analyzing Survey Data: a Discussion
Approaches for Analyzing Survey Data: a Discussion David Binder 1, Georgia Roberts 1 Statistics Canada 1 Abstract In recent years, an increasing number of researchers have been able to access survey microdata
Module 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
10. Analysis of Longitudinal Studies Repeat-measures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
Interpretation of Somers D under four simple models
Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms
The Impact of Retail Payment Innovations on Cash Usage
1/23 The Impact of Retail Payment Innovations on Cash Usage Ben S.C. Fung a Kim P. Huynh a Leonard Sabetti b a Bank of Canada b George Mason University ECB-MNB Joint Conference Cost and efficiency of retail
CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there
CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there is a relationship between variables, To find out the
Supplementary PROCESS Documentation
Supplementary PROCESS Documentation This document is an addendum to Appendix A of Introduction to Mediation, Moderation, and Conditional Process Analysis that describes options and output added to PROCESS
Chapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
Principles of Hypothesis Testing for Public Health
Principles of Hypothesis Testing for Public Health Laura Lee Johnson, Ph.D. Statistician National Center for Complementary and Alternative Medicine [email protected] Fall 2011 Answers to Questions
Planning sample size for randomized evaluations
TRANSLATING RESEARCH INTO ACTION Planning sample size for randomized evaluations Simone Schaner Dartmouth College povertyactionlab.org 1 Course Overview Why evaluate? What is evaluation? Outcomes, indicators
Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses
Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses G. Gordon Brown, Celia R. Eicheldinger, and James R. Chromy RTI International, Research Triangle Park, NC 27709 Abstract
Standard errors of marginal effects in the heteroskedastic probit model
Standard errors of marginal effects in the heteroskedastic probit model Thomas Cornelißen Discussion Paper No. 320 August 2005 ISSN: 0949 9962 Abstract In non-linear regression models, such as the heteroskedastic
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
Multiple Imputation for Missing Data: A Cautionary Tale
Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust
Introduction to Fixed Effects Methods
Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed
Methods for Meta-analysis in Medical Research
Methods for Meta-analysis in Medical Research Alex J. Sutton University of Leicester, UK Keith R. Abrams University of Leicester, UK David R. Jones University of Leicester, UK Trevor A. Sheldon University
Statistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1
BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
Logit and Probit. Brad Jones 1. April 21, 2009. University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science
Logit and Probit Brad 1 1 Department of Political Science University of California, Davis April 21, 2009 Logit, redux Logit resolves the functional form problem (in terms of the response function in the
Statistical modelling with missing data using multiple imputation. Session 4: Sensitivity Analysis after Multiple Imputation
Statistical modelling with missing data using multiple imputation Session 4: Sensitivity Analysis after Multiple Imputation James Carpenter London School of Hygiene & Tropical Medicine Email: [email protected]
Meta-Analytic Synthesis of Studies Conducted at Marzano Research Laboratory on Instructional Strategies
Meta-Analytic Synthesis of Studies Conducted at Marzano Research Laboratory on Instructional Strategies By Mark W. Haystead & Dr. Robert J. Marzano Marzano Research Laboratory Englewood, CO August, 2009
Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.
An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected]
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected] Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
Binary Diagnostic Tests Two Independent Samples
Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary
Structural Equation Modelling (SEM)
(SEM) Aims and Objectives By the end of this seminar you should: Have a working knowledge of the principles behind causality. Understand the basic steps to building a Model of the phenomenon of interest.
