On Knowing What We Don't Know: An Empirical Comparison of Methods to Detect Publication Bias in Meta-Analysis

Size: px
Start display at page:

Download "On Knowing What We Don't Know: An Empirical Comparison of Methods to Detect Publication Bias in Meta-Analysis"

Transcription

1 Publication Bias 1 On Knowing What We Don't Know: An Empirical Comparison of Methods to Detect Publication Bias in Meta-Analysis Gianna Rendina-Gobioff Jeffrey D. Kromrey University of South Florida Paper presented at the annual meeting of the American Educational Research Association, San Diego, April 12 16, 2004

2 Publication Bias 2 On Knowing What We Don't Know: An Empirical Comparison of Methods to Detect Publication Bias in Meta-Analysis Abstract The performance of methods for detecting publication bias in meta-analysis was evaluated using Monte Carlo methods. Four methods of bias detection were investigated: Begg s Tau, Egger s Regression, Funnel Plot Regression and Trim and Fill. Five factors were included in the simulation design: number of primary studies in each meta-analysis, sample sizes of primary studies, population variances in primary studies, magnitude of population effect size and magnitude of selection bias. Results were evaluated in terms of Type I error control and statistical power. Results suggest poor Type I error control in many conditions for all of the methods examined except Begg s Tau using sample size rather than the estimated variance. Statistical power was typically very low for conditions in which Type I error rates were adequately controlled, although power increased with larger sample sizes in the primary studies and larger numbers of studies in the meta-analysis.

3 Publication Bias 3 On Knowing What We Don't Know: An Empirical Comparison of Methods to Detect Publication Bias in Meta-Analysis As researchers we rely heavily on the literature available through professional publications to inspire and guide future studies. Meta-analytic syntheses of research provide important summaries of collections of empirical studies and indications of directions that future inquiry should take. However, due to publication bias we cannot assume that the resources available are complete and accurate. One type of publication bias is selection bias that occurs when the decision to publish, either by the researcher or the editorial staff of a journal, is influenced by research results (often the attainment of statistical significance). Essentially, when researchers do not submit studies or publishers reject submissions with small treatment effects or non-significant results, the resource of literature on the topic of interest becomes biased (Thornton & Lee, 2000). When the decision to publish is based on statistical significance there is a direct relationship between publication probability and sample size. Essentially, the greater the sample size the greater the chances of a statistically significant result and thus a greater chance of being published. Research with small samples and large treatment effects are more likely to be published than small sample studies with smaller treatment effects. This results in a relationship between the effect sizes and sample sizes in the published literature. Other study design factors that may influence publication bias are the failure to use randomization, the absence of a control group and the heterogeneity of treatment effects related to moderating variables (Begg, 2001; Macaskill, Walter, & Irwig, 2001). The implications of publication bias for meta-analysis research methods are very important. Due to practical issues the selection of articles for meta-analysis is usually restricted to published articles. Although non-published research may be included, it is impossible to collect all the potential research endeavors for a topic when one does not know that they exist. Essentially, the file drawers of all researchers are not made available to the meta-analyst (Rosenthal, 1979). The investigation and accurate

4 Publication Bias 4 identification of publication bias is extremely important because the results from metaanalytic studies provide a summary of effect sizes for a pool of research on a topic. In other words the conclusions drawn from meta-analysis research have a greater impact than one research study on a topic. Although there are statistical methods that have been suggested for correcting for publication bias (Hedges, 1992; Hedges & Vevea, 1996), before one corrects for bias one must detect its presence. Typically researchers have examined a funnel plot to detect publication bias, yet recent literature provides formal significance tests. However, because these methods often provide conflicting results, research that provides a comparison across the various methods is needed (Sutton, Abrams, Jones, Sheldon, & Song, 2000). Therefore, the focus of this research was to examine methods for detecting publication bias. Specifically, this research empirically compared the performance of four methods for identifying publication bias: Begg's Tau, Egger's regression, funnel plot regression and the trim and fill method. Analysis Options for Detecting Publication Bias As an aid for the presentation of methods for detecting publication bias, a hypothetical sample of 10 studies was simulated with an average sample size of five observations in each of two groups. For each study, the sample effect size was calculated. These data (presented in Table 1) will be used to illustrate each method. Begg s Tau. Begg's Tau method (Begg & Mazumdar, 1994) examines the relationship between the standardized treatment effect and the variance of the treatment effect using Kendall's Tau. The standardized treatment effects are estimated as: d d = d B i i i * ν i Where d i is the i th observed study effect size d i is the weighted average effect size = wd i w i i

5 Publication Bias 5 Where w i is the weight or inverse of the effect size variance (vi) * 1 vi is the standardized variance of the treatment effect = v i ( v i ) The relationship between the standardized treatment effect and the variance is illustrated in Figure 1. Ranks are assigned for the observed standardized treatment effects and the variances of those treatment effects (alternatively, the sample sizes may be ranked rather than the estimated variances). The correlation between these ranked values (Kendall s tau) leads to a statistical test for the presence of publication bias. The standardized treatment effects ( d ) are provided in Table 1. In this sample of effect sizes the weighted mean effect size is and the sum of the reciprocals of the sampling variances is The standardized treatment effect for the first study is B i 1 d B 1 d d ( ) 1 i = = = * ν Cliff and Charlin (1991) derived the estimated sampling variance of Kendall s tau as 2 2 ( ) ( ) 2 tih 12 n n 1 t12 2 tih12 n n 1 t12 i h i h var( t12) = n n n n ( 1)( 2)( 3) where tih12 = a concordance indicator (equal to 1 if observations i and h are ranked in the same order on variables 1 and 2, equal to -1 if they have opposite ranks, and equal to zero if they are tied), and t12 = the sample estimate of tau. t12 The ratio z = is compared to a critical value from a standard normal var( t ) 12 distribution to provide a test of the null hypothesis that tau = 0. For these data,

6 Publication Bias z = = , p >.05, and the null hypothesis is not rejected. Thus, these data do not provide sufficient evidence that publication bias exists in this sample of studies. Egger's Regression Method. The Egger regression method (Egger, Davey Smith, Schneider & Minder, 1997) treats the standardized treatment effect as the criterion and the precision of effect size estimation (the inverse of its standard error) as the predictor in a regression model (estimated by either OLS or WLS, with observations weighted by the inverse of their variances). The standardized treatment effects are estimated as: d E i Where di is the observed study effect size for study i vi is the variance of the effect size for study i E For example, for study 1 in Table 1, d 1 = = For Egger s Regression Method, the precision of the effect size is estimated as: = Examining the example data, an OLS regression with the standardized treatment effect as the criterion and the precision as the predictor resulted in a slope of and intercept of (presented in Figure 2). Although not conducted for these sample data, this regression model can also be estimated with WLS regression. The slope of this regression equation provides an estimate of the true effect and the intercept is expected to have the value of zero when no publication bias exists. Thus, a test of the null hypothesis that the regression intercept equals zero provides a test of publication bias. In the example, the null hypothesis would not be rejected (t = , p >.05, df=8), thus these data do not suggest a relationship between standardized treatment effects and precision of estimation. Therefore, these data do not provide sufficient evidence that publication bias exists in this sample of studies. Funnel Plot Regression Method. The funnel plot regression method, suggested by Macaskill, Walter, and Irwig (2001), uses a regression model with the criterion variable d i ν i 1 v. 1

7 Publication Bias 7 being the treatment effect and study size being the predictor variable. Estimation by WLS is recommended for such a model, using as weights the inverse of the estimation 1 variance (e.g., ν ). For example, for study 1 in Table 1, the weight is: i v = = 1 A WLS regression with the treatment effect as the criterion and the study size as the predictor resulted in a slope of and intercept of for the example data (presented in Figure 3). In contrast to the Egger method, in this regression equation the slope will indicate no publication bias when it has the value of zero and the intercept in this regression equation will indicate the true effect. Thus, a test of the null hypothesis that the regression slope equals zero provides a test of publication bias. In the example, the null hypothesis would not be rejected (t = , p >.05, df = 8). Therefore, these data do not suggest a relationship between the observed effect sizes and sample size, and these data do not provide sufficient evidence that publication bias exists in this sample of studies. Trim and Fill Method. The Trim and Fill method, introduced by Duval and Tweedie (2000a, 2000b), is a nonparametric approach which is based on the funnel plot. Using symmetry assumptions the observed studies are ranked using the absolute values of the effect sizes; positive ranks for studies with effect sizes greater than the mean effect size, negative ranks for studies with effect sizes less than the mean effect size. The ranks are estimated as: r = rank d d * i i i ( ) ( ) ( ) * For example, for study 1 in Table 1, ( ) r1 = rank = rank = 8. A negative algebraic sign is assigned to ranks where the d i is less than the d i. For example, the rank for study 4 would be assigned a negative algebraic sign: ( ( )) ( ) * r rank rank 4 = = = 10

8 Publication Bias 8 Using these ranks the number of research studies missing from the funnel plot due to publication bias is estimated by: * R0 = γ 1= 0 1= 1 where R0 is the estimated number of studies concealed due to publication bias, * * γ = = = 0, k r h where k is the number of studies included in the meta-analysis, and * r h is the largest negative rank Publication bias is evidenced when R0 > 3, with power greater than 0.80 and =.05 (Duval & Tweedie, 2000a). In this example, the R0 is not greater than 3 and thus would suggest that these data do not provide sufficient evidence that publication bias exists in this sample of studies. Purpose of the Study Although the investigation of publication bias has received an abundance of attention in the physical sciences, the problem has been somewhat neglected in the social sciences. The need for research that examines publication bias under conditions that are applicable to the social sciences has driven this research. While Macaskill, Walter, and Irwig (2001) provided the framework for this study, the conditions chosen were selected to represent those commonly reported in the fields of psychology and education. The researchers hope that this research will bring attention in the social sciences to the issues and concerns surrounding publication bias in meta-analysis. Method This research was a Monte Carlo study designed to simulate meta-analyses. The use of simulation methods allows the control and manipulation of research design factors and the incorporation of sampling error into the analyses. Observations in primary studies were generated under known population conditions, and primary studies were combined to simulate meta-analyses.

9 Publication Bias 9 This simulation was modeled after that reported by Macaskill, Walter and Irwig (2001), but extends the conditions examined to more adequately represent the data structures evident in meta-analyses conducted in psychology and education. Each primary study consisted of two groups of observations (e.g., a treatment and control group). For each primary study, the Hedges g effect size (Hedges & Olkin, 1985) was calculated based on the simulated data. The Monte Carlo study included five factors in the design. These factors were (a) the number of primary studies in each meta-analysis (10, 20, 50, and 100), (b) the sample sizes of the two groups in each primary study (with mean sample sizes ranging from 5 in each group to 100 in each group, as well as unbalanced conditions), (c) group variances in the primary studies (variance ratios of 1:2, 1:4, and 1:8, as well as a homogeneous variance condition), (d) the magnitude of the population effect size ( = 0.00, 0.20, 0.50, 0.80), and (e) the magnitude of the selection bias (no bias, moderate bias, and strong bias). The selection bias in the meta-analyses was manipulated by generating a weight function to establish the probability of a simulated study being included in the metaanalysis, with the probability of inclusion being inversely related to the p-value obtained in the simulated study (Begg & Mazumdar, 1994). That is, samples that yield statistically significant results are more likely to be published than those that do not yield statistically significant results. The weight functions are illustrated in Figure 4. Table 2 provides the proportion of simulated studies that were available for meta-analysis based on the selection bias described above. As would be expected the proportion available/selected for meta-analysis was greater for the smaller publication bias than the larger publication bias. The overall range for the proportion selected across all conditions was from (few studies being selected) to (almost all studies being selected). In each simulated meta-analysis, publication bias was assessed using Begg s Tau method (using both the estimated variance and the sample size), Egger s regression method, the funnel plot regression method suggested by Macaskill, Walter and Irwig (2001) and the trim and fill method suggested by Duval and Tweedie (2000a, 2000b). For

10 Publication Bias 10 the Egger Regression, OLS estimation was employed because preliminary analyses suggested that WLS estimates were substantially biased. The proportion of simulated meta-analyses for which publication bias was indicated by each detection method was calculated. For simulation conditions with no publication bias, these proportions represent estimates of Type I error rates. Conversely, for conditions that included publication bias, these proportions estimate the statistical power. This research was conducted using SAS/IML version 8.2. Conditions for the study were run under Windows and Unix platforms. Normally distributed random variables were generated using the RANNOR random number generator in SAS. A different seed value for the random number generator was used in each execution of the program. The program code was verified by hand-checking results from benchmark datasets. For each condition examined in the Monte Carlo study, meta-analyses were simulated. The use of samples provides a maximum 95% confidence interval width around an observed proportion that is ± (Robey & Barcikowski, 1992). The relative effectiveness of the publication bias detection methods was evaluated in terms of Type I error control and statistical power. Results Type I Error Control The distributions of estimated Type I error rates across the conditions examined in this study are presented as box and whisker plots in Figure 5. In general, the use of Egger s Regression or Kendall s Tau based on sample estimated effect size variance, Tau (V), resulted in liberal Type I error control (with the estimated Type I error rates exceeding.90 in some conditions). Conversely, the funnel plot and the trim and fill methods evidenced conservative Type I error control in most conditions. Only Kendall s Tau based on sample sizes, Tau (N), resulted in Type I error control that was very close to the nominal alpha level across the majority of conditions examined.

11 Publication Bias 11 To determine the influence of the Monte Carlo study design factors on the Type I error control of these five tests, the proportions of simulation conditions that resulted in adequate Type I error control according to Bradley s (1975) liberal criterion were calculated. These proportions, disaggregated by the study design factors, are presented in Table 3 and the corresponding average estimated Type I error rates are presented in Table 4. For Egger s Regression, as the number of studies in each meta-analysis (k) increased, the proportion of conditions with adequate Type I error control decreased from 0.55 (with k = 10) to 0.17 (with k = 100). Table 4 indicates that this test became increasingly liberal with larger number of studies, yielding an average Type I error rate of 0.29 with k = 100. Similar results were evident with the use of Kendall s Tau (V), with the average estimated Type I error rate reaching as high as 0.22 with k = 100. Both the funnel plot and trim and fill methods evidenced the opposite relationship with k: with larger numbers of studies in the meta-analysis, the proportions of conditions with adequate Type I error control increased. The use of Kendall s Tau (N) maintained the average Type I error rate near the nominal alpha across values of K, although the proportion of conditions with adequate Type I error control decreased slightly as k increased, reaching a low of 0.94 with k = 100. The proportions of conditions with adequate Type I error control and the average estimated Type I error rates are graphed in Figures 6 and 7, respectively. With small sample sizes in the primary studies, Egger s Regression, Kendall s Tau (V) and the trim and fill approach evidenced adequate Type I error control in fewer than one quarter of the conditions (Figures 8-11). These proportions increased with larger samples, with Kendall s Tau (V) performing adequately in a larger proportion of conditions than either Egger s Regression or trim and fill (reaching as high as 0.89 with sample sizes of 40 and 60 in the two groups). With small samples, the funnel plot regression showed better Type I error control than did Egger s Regression, Kendall s Tau (V) or the trim and fill (except for conditions with positive pairing of sample size with variance), but the best Type I error control was evidenced by Kendall s Tau (N).

12 Publication Bias 12 For the latter test, the worst performance in Type I error control was for conditions with sample sizes of 40 and 60, in which only 91% of the conditions evidenced adequate control. All methods except the trim and fill approach evidenced decreasing performance as the population effect size increased (Figures 12 and 13). For example, Kendall s Tau (V) showed adequate control in 94% of the conditions with a population effect size of zero, but in only 17% of the conditions with a population effect size of 0.8. With larger population effect sizes, the Egger Regression, Kendall s Tau (V), and Trim and Fill became excessively liberal in Type I error control. The trim and fill approach evidenced its best performance with small population effects (41% of the conditions with an effect size of 0.2), but showed decreasing Type I error control in conditions with larger population effect sizes and in conditions with a population effect size of zero. The Kendall s Tau (N) test also became liberal with larger population effect sizes, but it still maintained adequate Type I error control in 93% of the conditions with a population effect size of 0.8. As expected, all of the methods showed decreasing performance in Type I error control under conditions of variance heterogeneity in the primary studies, with the average Type I error rates increasing with greater degrees of heterogeneity (Figures 14 and 15). With homogeneous variance conditions or conditions with a 1:2 variance ratio, all of the methods except the trim and fill maintained adequate Type I error control in more than half of the conditions examined. With the largest degree of heterogeneity examined (variance ratio of 1:8), the funnel plot method still maintained adequate control in 56% of the conditions, while the Kendall s Tau (N) test maintained control in 95% of the conditions. Statistical Power The statistical power of each test was estimated only for the conditions in which Type I error was adequately controlled (using Bradley s, 1975, liberal robustness criterion). An initial overview of these power results is provided in Table 5, which

13 Publication Bias 13 presents the maximum power estimates that were observed, disaggregated by the design factors of the Monte Carlo study. These results underscore the limited statistical power of these tests. For example, the highest power observed for Kendall s Tau (V) was only (a slightly higher maximum power, 0.671, was observed for Kendall s Tau (N)). The power estimates for Egger s Regression rarely exceeded 0.5 (a value reached only for conditions with k = 100, and n1 + n2 = 100). Even lower estimates were observed for the funnel plot and the trim and fill methods (neither of which evidenced power exceeding 0.40). As expected, the maximum power increased with larger numbers of studies in the meta-analyses, with larger sample sizes in the primary studies, and with larger magnitudes of publication bias. The funnel plot method and Kendall s Tau (N) appeared to be more influenced by unbalanced designs in the primary studies than were the other procedures. The funnel plot, Kendall s Tau (N), and the trim and fill methods evidenced increasing power as the population effect size increased. In contrast, the Egger Regression and Kendall s Tau (V) showed a decrease in maximum power with the largest population effect size examined. Similarly, higher levels of power were evident with heterogeneous population variances for Kendall s Tau (N) and the funnel plot methods, but lower power was evidenced for the other procedures. More detailed power estimates are provided for a subset of conditions in Table 6 (for balanced designs in primary studies, with nj = 10 and nj = 50). This table illustrates the abysmal power for all five procedures with small numbers of studies in the metaanalysis (regardless of the sample sizes in these studies) and with larger meta-analyses conducted from small sample primary studies. For conditions in which Type I error is adequately controlled, the Kendall Tau (V) procedure evidences distinct power advantages relative to Kendall s Tau (N). For example, with homogeneous variances, k = 100, n1 = n2 = 50, a large magnitude of publication bias, and population effect size of 0.50, the power of Kendall s Tau (V) was estimated to be 0.623, in contrast to a power estimate of for Kendall s Tau (N). Similar power advantages are occasionally

14 Publication Bias 14 evident for Egger s Regression (see Figure 16), but the number of conditions in which this procedure provides adequate Type I error control is very small. Discussion and Conclusions The results for conditions in which no publication bias was present suggests that only the Kendall s Tau (N) procedure maintained adequate Type I error control across the majority of conditions examined. The Egger Regression and Kendall s Tau (V) methods suggested liberal Type I error control in most conditions, while the Funnel Plot Regression and the trim and fill methods evidenced conservative control with small numbers of studies in the meta-analysis. The contrast in Type I error control between the two approaches to Kendall s Tau is likely related to the fact that the estimation variance of Cohen s d is a function of as well as sample size: σˆ ( d ) 2 n + n = + nn 2( n + n ) That is, even in the absence of publication bias, a non-zero correlation is expected between the sample value of Cohen s d and the estimated variance of d. In contrast, no relationship is expected between the sample value of Cohen s d and the sample size if publication bias is not present. Despite the overall satisfactory Type I error control exhibited by Kendall s Tau (N), statistical power was wanting in all conditions examined. The increase in power with larger numbers of studies in a meta-analysis and with larger sample sizes in the primary studies was anticipated, but power remained substantially below frequently used target power of 0.80 even under conditions with reasonably large meta-analyses simulated (i.e., with k = 100 and n1 + n2 = 100). Clearly, more research is needed to develop methods for adequately screening meta-analytic data sets for publication bias. Meta-analysis has become increasingly important for the synthesis of research results in a variety of fields, including education, the behavioral sciences and medicine. The accuracy of inferences derived from meta-analysis depends upon the appropriate application of statistical tools. As the use of meta-analytic methods becomes more

15 Publication Bias 15 commonplace, researchers must remain mindful of the limitations of certain estimates. The results of this study underscore the need to exercise caution in the interpretation of the results obtained from meta-analyses. This research furnishes valuable cautionary information about the poor Type I error control of several methods advocated for detecting publication bias, and the generally impoverished levels of statistical power.

16 Publication Bias 16 References Begg, C. B. (2001). Publication bias. In H. Cooper & L. V. Hedges (Eds.), The handbook of research synthesis (pp ). New York: Russell Sage Foundation. Begg, C. B., & Mazumdar, M. (1994). Operating characteristics of a rank correlation test for publication bias. Biometrics, 50(4), Bradley, J. V. (1978). Robustness? British Journal of Mathematical and Statistical Psychology, 31, Cliff, N. & Charlin, V. (1991). Variances and covariances of Kendall s tau and their estimation. Multivariate Behavioral Research, 26, Duval, S. & Tweedie, R. (2000a). Trim and fill: A simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics, 56, Duval, S. & Tweedie, R. (2000b). A nonparametric Trim and Fill method of accounting for publication bias in meta-analysis. Journal of the American Statistical Association, 95, Egger, M., Davey Smith, G., Schnedier, M. & Minder, C. Bias in meta-analysis detected by a simple graphical test. British Medical Journal, 315, Hedges, L. (1992). Modeling publication selection effects in meta-analysis. Statistical Science, 7(2), Hedges & Olkin (1985). Statistical Methods for Meta-Analysis. San Diego, CA: Academic Press. Hedges, L. & Vevea, J. (1996). Estimating effect size under publication bias: Small sample properties and robustness of a random effects selection model. Journal of Educational and Behavioral Statistics, 21(4),

17 Publication Bias 17 Macaskill, P., Walter, S., & Irwig, L. (2001). A comparison of methods to detect publication bias in meta-analysis. Statistics in Medicine, 20, Robey, R. R. & Barcikowski, R. S. (1992). Type I error and the number of iterations in Monte Carlo studies of robustness. British Journal of Mathematical and Statistical Psychology, 45, Rosenthal, R. (1979). The "file-drawer problem" and tolerance for null results. Psychological Bulletin, 86, Sutton, S.J., Abrams, K.R., Jones, D.R., Sheldon, T.A., & Song, F. (2000). Methods for Meta-Analysis in Medical Research. Chichester, NY: John Wiley & Sons, LTD. Thornton, A. & Lee, P. (2000). Publication bias in meta-analysis: Its causes and consequences. Journal of Clinical Epidemiology, 53,

18 Publication Bias 18 Table 1: Hypothetical sample of 10 effect sizes. Study n1 n2 1 di B * E d i ν i ν i ν i d i ν i d i Sum Mean Effect Size =

19 Publication Bias 19 Table 2 Proportion of Simulated Studies Available for Meta-Analysis. Number of Studies (K) Mean Min Max Primary Study Sample Size (n1 n2) 4, , , , , , , , , Population Effect Size Population Variance Ratio 1: : : : Magnitude of Publication Bias Small Large

20 Publication Bias 20 Table 3 Estimated Proportion of Conditions with Adequate Type I Error Control by Monte Carlo Design Factors. Trim and Egger Kendall (V) Kendall (N) Funnel Fill Number of Studies (K) Primary Study Sample Size (n 1 n 2) 4/ / / / / / / / / Population Effect Size Population Variance Ratio 1: : : : Note. Type I error estimates were based on 10,000 samples of each condition.

21 Publication Bias 21 Table 4 Mean Estimated Type I Error Rates by Monte Carlo Design Factors. Egger Kendall (V) Kendall (N) Funnel Number of Studies (K) Trim and Fill Primary Study Sample Size (n 1 n 2) 4/ / / / / / / / / Population Effect Size Population Variance Ratio 1: : : : Note. Type I error estimates were based on 10,000 samples of each condition.

22 Publication Bias 22 Table 5 Maximum Power Estimates by Monte Carlo Design Factors. Number of Studies (K) Egger Ken (V) Ken (N) Funnel Trim and Fill Primary Study Sample Size (n 1 n 2) 4, , , , , , , , , Population Effect Size Population Variance Ratio 1: : : : Magnitude of Publication Bias Small Large Note. Statistical power estimates were computed only for conditions in which Type I error was adequately controlled. Power estimates were based on 10,000 samples of each condition.

23 Publication Bias 23 Table 6 Estimated Power for Selected Conditions. Homogeneous Variances Population Effect Size = 0.0 Population Effect Size = 0.2 Population Effect Size = 0.5 Population Effect Size = 0.8 k n j Bias Egger Ken V Ken N Funnl T & F Egger Ken V Ken N Funnl T & F Egger Ken V Ken N Funnl T & F Egger Ken V Ken N Funnl T & F Sml Lrg Sml Lrg Sml Lrg Sml Lrg Sml Lrg Sml Lrg Heterogeneous Variances (1:4) Population Effect Size = 0.0 Population Effect Size = 0.2 Population Effect Size = 0.5 Population Effect Size = 0.8 k n j Bias Egger Ken V Ken N Funnl T & F Egger Ken V Ken N Funnl T & F Egger Ken V Ken N Funnl T & F Egger Ken V Ken N Funnl T & F Sml Lrg Sml Lrg Sml Lrg Sml Lrg Sml Lrg Sml Lrg Note. Statistical power estimates were computed only for conditions in which Type I error was adequately controlled. Power estimates were based on 10,000 samples of each condition.

24 Publication Bias 24 3 Begg's Standardized Effect Size Standardized Variance Figure 1: Begg's Tau plot with standardized effect size by standardized variance

25 Publication Bias 25 3 Egger's Standardized Effect Size y = X Precision Figure 2: Egger's regression plot with standardized effect size by precision

26 Publication Bias Effect Size y = X Sample Size Figure 3: Funnel plot regression method with effect size by sample size

27 Publication Bias Moderate Bias: exp(-2p**1.5) Strong Bias: exp(-4p**1.5) Weight p-value Figure 4. Weight functions to establish probability of inclusion in a meta-analysis as a function of the p-value.

28 Publication Bias Rate Egger Kendal (V) Kendal (N) Funnel Trim and Fill Method Figure 5: Distribution of Type I Error Rate Estimates Across Conditions Simulated

29 Publication Bias Proportion of Samples Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method Figure 6: Proportions of Conditions with Adequate Type I Error Control by Number of Studies in the Meta-analysis

30 Publication Bias Average Type I Error Rate Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method Figure 7: Mean Type I Error Rate by Number of Studies in the Meta-analysis

31 Publication Bias Proportion of Samples Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method 5/5 10/10 50/50 Figure 8: Proportions of Conditions with Adequate Type I Error Control by Sample Size in Primary Studies (Equal N)

32 Publication Bias Proportion of Samples Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method 4/6 6/4 8/12 12/8 40/60 60/40 Figure 9: Proportions of Conditions with Adequate Type I Error Control by Sample Size in Primary Studies (Unequal N)

33 Publication Bias Average Type I Error Rate Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method 5/5 10/10 50/50 Figure 10: Mean Type I Error Rate by Sample Size in Primary Studies (Equal N)

34 Publication Bias Average Type I Error Rate Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis M ethod 4/6 6/4 8/12 12/8 40/60 60/40 Figure 11: Mean Type I Error Rate by Sample Size in Primary Studies (Unequal N)

35 Publication Bias Proportion of Samples Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method Figure 12: Proportions of Conditions with Adequate Type I Error Control by Population Effect Size

36 Publication Bias Average Type I Error Rate Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method Figure 13: Mean Type I Error Rate by Population Effect Size

37 Publication Bias Proportion of Samples Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method 1:1 1:2 1:4 1:8 Figure 14: Proportions of Conditions with Adequate Type I Error Control by Population Variance Ratio

38 Publication Bias Average Type I Error Rate Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method 1:1 1:2 1:4 1:8 Figure 15: Mean Type I Error Rate by Population Variance Ratio

39 Publication Bias Estimated Power Egger Kendall (V) Kendall (N) Funnel Trim and Fill Analysis Method k=20, nj=10 k=20, nj=50 k=50, nj=10 k=50, nj=50 k=100, nj=10 k=100, nj=50 Figure 16: Statistical Power Estimates for Large Publication Bias, Homogeneous Variances, and Small Population Effect Size.

Methods for Meta-analysis in Medical Research

Methods for Meta-analysis in Medical Research Methods for Meta-analysis in Medical Research Alex J. Sutton University of Leicester, UK Keith R. Abrams University of Leicester, UK David R. Jones University of Leicester, UK Trevor A. Sheldon University

More information

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) 1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting

More information

Confidence Intervals for One Standard Deviation Using Standard Deviation

Confidence Intervals for One Standard Deviation Using Standard Deviation Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

More information

Nonrandomly Missing Data in Multiple Regression Analysis: An Empirical Comparison of Ten Missing Data Treatments

Nonrandomly Missing Data in Multiple Regression Analysis: An Empirical Comparison of Ten Missing Data Treatments Brockmeier, Kromrey, & Hogarty Nonrandomly Missing Data in Multiple Regression Analysis: An Empirical Comparison of Ten s Lantry L. Brockmeier Jeffrey D. Kromrey Kristine Y. Hogarty Florida A & M University

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

Confidence Intervals for the Difference Between Two Means

Confidence Intervals for the Difference Between Two Means Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

More information

Confidence Intervals for Spearman s Rank Correlation

Confidence Intervals for Spearman s Rank Correlation Chapter 808 Confidence Intervals for Spearman s Rank Correlation Introduction This routine calculates the sample size needed to obtain a specified width of Spearman s rank correlation coefficient confidence

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one? Simulations for properties of estimators Simulations for properties

More information

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

More information

Multivariate Analysis of Variance (MANOVA)

Multivariate Analysis of Variance (MANOVA) Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction

More information

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)

More information

Chapter 1 Introduction. 1.1 Introduction

Chapter 1 Introduction. 1.1 Introduction Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations

More information

Variables Control Charts

Variables Control Charts MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

More information

Software for Publication Bias

Software for Publication Bias CHAPTER 11 Software for Publication Bias Michael Borenstein Biostat, Inc., USA KEY POINTS Various procedures for addressing publication bias are discussed elsewhere in this volume. The goal of this chapter

More information

Inequalities in Self-Reported Health: A Meta-Regression Analysis

Inequalities in Self-Reported Health: A Meta-Regression Analysis WP 13/05 Inequalities in Self-Reported Health: A Meta-Regression Analysis Joan Costa-Font and Cristina Hernandez-Quevedo February 2013 york.ac.uk/res/herc/hedgwp Inequalities in Self-Reported Health: A

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

WHAT IS A JOURNAL CLUB?

WHAT IS A JOURNAL CLUB? WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club

More information

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA) UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Testing for Granger causality between stock prices and economic growth

Testing for Granger causality between stock prices and economic growth MPRA Munich Personal RePEc Archive Testing for Granger causality between stock prices and economic growth Pasquale Foresti 2006 Online at http://mpra.ub.uni-muenchen.de/2962/ MPRA Paper No. 2962, posted

More information

Schools Value-added Information System Technical Manual

Schools Value-added Information System Technical Manual Schools Value-added Information System Technical Manual Quality Assurance & School-based Support Division Education Bureau 2015 Contents Unit 1 Overview... 1 Unit 2 The Concept of VA... 2 Unit 3 Control

More information

MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST

MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST MODIFIED PARAMETRIC BOOTSTRAP: A ROBUST ALTERNATIVE TO CLASSICAL TEST Zahayu Md Yusof, Nurul Hanis Harun, Sharipah Sooad Syed Yahaya & Suhaida Abdullah School of Quantitative Sciences College of Arts and

More information

A Power Primer. Jacob Cohen New York University ABSTRACT

A Power Primer. Jacob Cohen New York University ABSTRACT Psychological Bulletin July 1992 Vol. 112, No. 1, 155-159 1992 by the American Psychological Association For personal use only--not for distribution. A Power Primer Jacob Cohen New York University ABSTRACT

More information

SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION

SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION STATISTICS IN MEDICINE, VOL. 8, 795-802 (1989) SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION F. Y. HSIEH* Department of Epidemiology and Social Medicine, Albert Einstein College of Medicine, Bronx, N Y 10461,

More information

Multiple Imputation for Missing Data: A Cautionary Tale

Multiple Imputation for Missing Data: A Cautionary Tale Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust

More information

Systematic Reviews and Meta-analyses

Systematic Reviews and Meta-analyses Systematic Reviews and Meta-analyses Introduction A systematic review (also called an overview) attempts to summarize the scientific evidence related to treatment, causation, diagnosis, or prognosis of

More information

An introduction to Value-at-Risk Learning Curve September 2003

An introduction to Value-at-Risk Learning Curve September 2003 An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month

More information

The operating characteristics of the nonparametric Levene test for equal variances with assessment and evaluation data

The operating characteristics of the nonparametric Levene test for equal variances with assessment and evaluation data A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

Sample Size Planning for the Squared Multiple Correlation Coefficient: Accuracy in Parameter Estimation via Narrow Confidence Intervals

Sample Size Planning for the Squared Multiple Correlation Coefficient: Accuracy in Parameter Estimation via Narrow Confidence Intervals Multivariate Behavioral Research, 43:524 555, 2008 Copyright Taylor & Francis Group, LLC ISSN: 0027-3171 print/1532-7906 online DOI: 10.1080/00273170802490632 Sample Size Planning for the Squared Multiple

More information

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association

More information

Longitudinal Meta-analysis

Longitudinal Meta-analysis Quality & Quantity 38: 381 389, 2004. 2004 Kluwer Academic Publishers. Printed in the Netherlands. 381 Longitudinal Meta-analysis CORA J. M. MAAS, JOOP J. HOX and GERTY J. L. M. LENSVELT-MULDERS Department

More information

Fixed-Effect Versus Random-Effects Models

Fixed-Effect Versus Random-Effects Models CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval

More information

PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA

PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA ABSTRACT The decision of whether to use PLS instead of a covariance

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

T-test & factor analysis

T-test & factor analysis Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

From the help desk: Bootstrapped standard errors

From the help desk: Bootstrapped standard errors The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

A ew onparametric Levene Test for Equal Variances

A ew onparametric Levene Test for Equal Variances Psicológica (200), 3, 40-430. A ew onparametric Test for Equal Variances David W. Nordstokke * () & Bruno D. Zumbo (2) () University of Calgary, Canada (2) University of British Columbia, Canada Tests

More information

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING THE TWO-WAY ANOVA UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions A Significance Test for Time Series Analysis Author(s): W. Allen Wallis and Geoffrey H. Moore Reviewed work(s): Source: Journal of the American Statistical Association, Vol. 36, No. 215 (Sep., 1941), pp.

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Extending the debate between Spearman and Wilson 1929: When do single variables optimally reproduce the common part of the observed covariances?

Extending the debate between Spearman and Wilson 1929: When do single variables optimally reproduce the common part of the observed covariances? 1 Extending the debate between Spearman and Wilson 1929: When do single variables optimally reproduce the common part of the observed covariances? André Beauducel 1 & Norbert Hilger University of Bonn,

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

UNIVERSITY OF NAIROBI

UNIVERSITY OF NAIROBI UNIVERSITY OF NAIROBI MASTERS IN PROJECT PLANNING AND MANAGEMENT NAME: SARU CAROLYNN ELIZABETH REGISTRATION NO: L50/61646/2013 COURSE CODE: LDP 603 COURSE TITLE: RESEARCH METHODS LECTURER: GAKUU CHRISTOPHER

More information

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building

More information

COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore*

COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore* COMPARING DATA ANALYSIS TECHNIQUES FOR EVALUATION DESIGNS WITH NON -NORMAL POFULP_TIOKS Elaine S. Jeffers, University of Maryland, Eastern Shore* The data collection phases for evaluation designs may involve

More information

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine 2 - Manova 4.3.05 25 Multivariate Analysis of Variance What Multivariate Analysis of Variance is The general purpose of multivariate analysis of variance (MANOVA) is to determine whether multiple levels

More information

A C T R esearcli R e p o rt S eries 2 0 0 5. Using ACT Assessment Scores to Set Benchmarks for College Readiness. IJeff Allen.

A C T R esearcli R e p o rt S eries 2 0 0 5. Using ACT Assessment Scores to Set Benchmarks for College Readiness. IJeff Allen. A C T R esearcli R e p o rt S eries 2 0 0 5 Using ACT Assessment Scores to Set Benchmarks for College Readiness IJeff Allen Jim Sconing ACT August 2005 For additional copies write: ACT Research Report

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic Report prepared for Brandon Slama Department of Health Management and Informatics University of Missouri, Columbia

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Basic Concepts in Research and Data Analysis

Basic Concepts in Research and Data Analysis Basic Concepts in Research and Data Analysis Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...3 The Research Question... 3 The Hypothesis... 4 Defining the

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

Understanding and Quantifying EFFECT SIZES

Understanding and Quantifying EFFECT SIZES Understanding and Quantifying EFFECT SIZES Karabi Nandy, Ph.d. Assistant Adjunct Professor Translational Sciences Section, School of Nursing Department of Biostatistics, School of Public Health, University

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Effect-Size Indices for Dichotomized Outcomes in Meta-Analysis

Effect-Size Indices for Dichotomized Outcomes in Meta-Analysis Psychological Methods Copyright 003 by the American Psychological Association, Inc. 003, Vol. 8, No. 4, 448 467 108-989X/03/$1.00 DOI: 10.1037/108-989X.8.4.448 Effect-Size Indices for Dichotomized Outcomes

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

More information

Chapter G08 Nonparametric Statistics

Chapter G08 Nonparametric Statistics G08 Nonparametric Statistics Chapter G08 Nonparametric Statistics Contents 1 Scope of the Chapter 2 2 Background to the Problems 2 2.1 Parametric and Nonparametric Hypothesis Testing......................

More information

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

A Bayesian hierarchical surrogate outcome model for multiple sclerosis A Bayesian hierarchical surrogate outcome model for multiple sclerosis 3 rd Annual ASA New Jersey Chapter / Bayer Statistics Workshop David Ohlssen (Novartis), Luca Pozzi and Heinz Schmidli (Novartis)

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

NAG C Library Chapter Introduction. g08 Nonparametric Statistics

NAG C Library Chapter Introduction. g08 Nonparametric Statistics g08 Nonparametric Statistics Introduction g08 NAG C Library Chapter Introduction g08 Nonparametric Statistics Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Parametric and Nonparametric

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Transferability of Economic Evaluations in Clinical Trials

Transferability of Economic Evaluations in Clinical Trials Transferability of Economic Evaluations in Clinical Trials Henry Glick Institutt for helseledelse og helseøkonomi November 25, 2008 The Problem Multicenter and multinational studies are the norm for the

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Statistics: Correlation Richard Buxton. 2008. 1 Introduction We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Do

More information

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION Chin-Diew Lai, Department of Statistics, Massey University, New Zealand John C W Rayner, School of Mathematics and Applied Statistics,

More information

Descriptive Analysis

Descriptive Analysis Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,

More information

Confidence Intervals for Exponential Reliability

Confidence Intervals for Exponential Reliability Chapter 408 Confidence Intervals for Exponential Reliability Introduction This routine calculates the number of events needed to obtain a specified width of a confidence interval for the reliability (proportion

More information

Non Parametric Inference

Non Parametric Inference Maura Department of Economics and Finance Università Tor Vergata Outline 1 2 3 Inverse distribution function Theorem: Let U be a uniform random variable on (0, 1). Let X be a continuous random variable

More information

Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research

Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research 612 HUMAN COMMUNICATION RESEARCH /October 2002 Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research TIMOTHY R. LEVINE Michigan State University CRAIG R. HULLETT University

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information