HANDLING MISSING DATA IN CLINICAL TRIALS: AN OVERVIEW
|
|
|
- Angelica George
- 9 years ago
- Views:
Transcription
1 Drug Information Journal, Vol. 34, pp , /2000 Printed in the USA. All rights reserved. Copyright 2000 Drug Information Association Inc. HANDLING MISSING DATA IN CLINICAL TRIALS: AN OVERVIEW WILLIAM R. MYERS, PHD Senior Statistician, Department of Biometrics and Statistical Sciences, Procter and Gamble Pharmaceuticals, Cincinnati, Ohio A major problem in the analysis of clinical trials is missing data caused by patients dropping out of the study before completion. This problem can result in biased treatment comparisons and also impact the overall statistical power of the study. This paper discusses some basic issues about missing data as well as potential watch outs. The topic of missing data is often not a major concern until it is time for data collection and data analysis. This paper provides potential design considerations that should be considered in order to mitigate patients from dropping out of a clinical study. In addition, the concept of the missing-data mechanism is discussed. Five general strategies of handling missing data are presented: 1. Complete-case analysis, 2. Weighting methods, 3. Imputation methods, 4. Analyzing data as incomplete, and 5. Other methods. Within each strategy, several methods are presented along with advantages and disadvantages. Also briefly discussed is how the International Conference on Harmonization (ICH) addresses the issue of missing data. Finally, several of the methods that are illustrated in the paper are compared using a simulated data set. Key Words: Clinical trials; Missing data; Dropouts; Imputation methods; Missing-data mechanism INTRODUCTION trials can produce biased treatment comparisons and reduce the overall statistical power. A PRIMARY CONCERN when conducting This paper will focus on the case where missa clinical trial is that patients will drop out ing data occur as a result of patients dropping (or withdraw) before study completion. The out of the study. More specifically, it focuses reason for withdrawal may be study-related on the case in which a patient s missing re- (eg, adverse event, death, unpleasant study sponse at assessment time t implies it will procedures, lack of improvement) or unrebe missing at all subsequent times. This is lated to the study (eg, moving away, unretermed a monotone pattern of unit-level misslated disease). This problem is especially ing data (1). An example where missing data prevalent in clinical trials where a slow-actdeviate from the aforementioned pattern is ing treatment or a drug that may be intolerant in the case of health related quality-of-life is being investigated. Dropouts in clinical research, where a patient does not answer an item (or question) within a questionnaire, but does not necessarily drop out of the clinical Presented at the DIA 35 th Annual Meeting, June 27 trial. July 1, 1999, Baltimore, Maryland. There are numerous issues one must con- Reprint address: William R. Myers, PhD, Departsider when confronted with a data set where ment of Biometrics and Statistical Sciences, Procter and Gamble Pharmaceuticals, 8700 Mason Montgomery patients have dropped out of the clinical Road, P.O. Box 47, Mason, OH study. First, compliant patients often have a 525
2 526 William R. Myers not be practical depending on the particular scenario of the clinical study. One is to care- fully consider eligibility criteria in order to exclude patients with characteristics that might prevent them from completing the study. This could obviously have an impact on product labeling in the case of a pivotal Phase III trial being considered for FDA ap- proval. Eligibility criteria may be more appropriate in an explanatory trial to determine if a drug has potential therapeutic effect. Another design consideration is to have a non- randomized trial period where all patients receive active treatment. Then those who are free of side effects are randomized to either active treatment or placebo. This can be done in order to account for early toxicity of a drug. It will most likely provide a selected subset of the original patient sample and potentially impact the results with respect to the patient population that was studied. In many cases this will not be a practical option. There are ways to potentially mitigate the impact of patients dropping out of a clinical study. A very important consideration at the design stage of a clinical trial is to collect data on all baseline variables that could potentially impact the likelihood of a patient dropping out. How collecting these variables will help assess the potential missing-data mechanism will be discussed later. These particular variables may be incorporated into the statistical analysis, as well as help identify the type of patients who are intolerable to a particular treatment. better response to treatment than noncompliant patients, even in the placebo group. Therefore, the fully compliant subgroup is not a random subsample of the original sample. In addition, the pattern of selection might differ between the placebo group and the active treatment group. It can be problematic if the rate, time to, and reason for withdrawal differ widely among treatment groups. Also, the last observed response for an early dropout often does not reflect the potential benefit of a drug with a slow onset of action. Furthermore, there are situations where a patient may drop out of a clinical study because of early recovery. He/she may then suffer a relapse. Many of these situations will result in biased treatment comparisons. Unstated reasons for dropping out of a clinical trial may be associated with the last observed response or other study-related reasons. This will be a primary concern for those methods that incorporate the reason for dropout in the statistical analysis. It is imperative to have accurate documentation of the cause of dropout. The method for handling dropouts in clinical studies will often depend on the objective of the study. On the one hand, the goal may be more explanatory in nature, as in the case of a Phase I or early Phase II study, where the true pharmacological properties of a drug are being investigated. On the other hand, it may have a more pragmatic objective, where a pivotal Phase III study is providing an overall evaluation of treatment policy in clinical practice. CLINICAL DESIGN THE MISSING-DATA MECHANISM CONSIDERATIONS A concept that is often discussed when missing Most of the literature on handling dropouts data occur is the missing-data mecha- (or missing data) in clinical trials involves nism. Little (2) has also used the term dropout the statistical analysis. Those involved with mechanism when it relates to patients conducting clinical trials, however, should be dropping out of a clinical study prematurely. cognizant of the issue at the design stage. These two terms will be used synonymously There are potential ways to minimize patient throughout the paper. Little and Rubin (3) withdrawal. One obvious method is through classify the missing-data mechanism into patient and investigator retention programs three basic categories. They define missing including education and information sharing completely at random (MCAR) as the pro- regarding the research program. There are cess in which the probability of dropout is other possible options which may or may independent of both observed measurements
3 Handling Missing Data in Clinical Trials 527 (eg, baseline covariates, observed responses) that the MAR assumption is inherently untestable. and unobserved measurements (those that Therefore, one can never truly would have been observed if the patient had achieve complete certainty that conditioning stayed in the study). Under MCAR the observed on observed variables achieves ignorable responses form a random subsample dropout. of the sampled responses. When data are If the dropout mechanism is neither MCAR there is no impact on bias and there- MCAR or MAR then it is nonignorable. In fore most standard approaches of analysis this case, the dropout mechanism needs to are valid. A loss of statistical power, how- be incorporated into the analysis. This paper ever, can still occur. The MCAR assumption will briefly discuss, in the following section, can be assessed by comparing the distribu- available methods in the case of nonignorable tion of observed variables between dropouts dropout. Diggle (9) proposed a method and nondropouts (4). If no significant differ- for testing for random dropouts in repeated ences are found with respect to the variables, measures data. Diggle uses the term random then there is no apparent evidence that the dropout in a more restrictive sense than Rubin s data from the clinical trial are representative missing at random. In addition, Park only of completers. The MCAR assumption and Davis (10) proposed a method for testing is often not plausible in clinical trials (1). the missing-data mechanism for repeated cat- The second classification, which is more egorical data. restrictive than MCAR, is missing at random (MAR). Under MAR, the probability of GENERAL STRATEGIES FOR dropout depends on the observed data, but STATISTICAL ANALYSIS not the unobserved data. When the response is MAR the observed responses are a random Much of the literature involving missing data subsample of the sampled values within a (or dropouts) in clinical trials pertain to the subclass defined by the observed data (eg, various methods developed to handle the age). In this case, the missing-data mechanism problem. This paper will divide those meth- can be considered ignorable (5,6). Laird ods into the following five basic classifica- (6) points out that ignorable missingness is tions: plausible in longitudinal studies, when patient withdrawal is related to a previous re- 1. Complete-case analysis, sponse. In addition, Murray and Findlay (7) 2. Weighting methods, point out that if a protocol removes patients 3. Imputation methods, from a study because they reach a threshold 4. Analyzing data as incomplete, and for the response (eg, diastolic blood pressure 5. Other methods. exceeding 110 mmhg) then the data are MAR. The rationale for the MAR assump- The category defined as other methods includes tion here is traced to the fact that the lack of those methods that do not logically response can be deduced from the recorded fit into the other four classifications. value. Little (2) also discusses a special case Complete-case methods use only those of MAR dropout which is not discussed ex- patients with complete data. For example, in tensively in the literature. For this case the a longitudinal study, a complete-case analy- dropout mechanism depends only on the co- sis will use only those patients who have variates X and is classified as covariate-dependent observed responses at each scheduled time dropout. For this missing-data point. Another example is the case of a sin- mechanism Diggle and Kenward (8) used the gle-endpoint study where a regression analy- term completely random dropout. Murray sis is used. Only those patients who have the and Findlay (7) state that an observation is observed endpoint and observed values for MAR if the fact that it is missing is not in all relevant covariates are involved in the itself informative. Lavori et al. (1) point out analysis. An obvious advantage of this type
4 528 William R. Myers which allows generalized estimating equation (GEE) analyses to be correct under the MAR assumption. A third classification of statistical analy- ses to handle missing data is imputation methods. Imputation is any method whereby missing values in the data set are filled with plausible estimates. The choice of plausible estimates is what differentiates the various imputation methods. The objective of any imputation method is to produce a complete data set which can then be analyzed using standard statistical methods. The Last Observation Carried Forward (LOCF) method is a commonly used imputation procedure. This method is implemented when longitudinal measurements are observed for each patient. LOCF takes the last available response and substitutes the value into all subsequent missing values. The LOCF can be problematic if early dropouts occur and if the response variable has ex- pected changes over time. It can provide biased treatment comparisons if there are dif- ferent rates of dropout or different time to dropout between the treatment groups. For example, LOCF can provide conservative results with respect to active treatment if pla- cebo patients drop out early because of lack of efficacy. In this case, the mean placebo response is biased upward. On the other hand, when the active treatment is slowing down the severity of an illness and if those patients in the active treatment group are dropping out early due to intolerability, the LOCF method can render anti-conservative results with respect to active treatment. Another type of imputation method is a worst case analysis. This analysis imputes the worst response observed among the ac- tive treatment group for those missing values within the active treatment group. For the placebo group, the method imputes the best observed response among the placebo group for those missing values within the placebo group. This particular analysis can be viewed as a type of sensitivity analysis (14). From a purely statistical perspective this type of method can increase the overall variability, bias the active treatment mean downward, and of analysis is ease in implementation. In addition, it provides valid results in the case of MCAR. There are numerous disadvantages, however, to excluding patients with incomplete data. First, the complete-case method provides inefficient estimates, that is, loss of statistical power. If the dropout mechanism is not MCAR, then the analysis can produce biased treatment comparisons. In the case of a clinical trial with longitudinal measurements, it is typically not good practice to throw out data. Maybe the most important concern with this type of analysis is that it does not follow the Intention-to-Treat paradigm. The following simple example demonstrates the limitations of the complete-case analysis. Suppose Treatment X is modestly effective for patients regardless of the baseline severity of their condition. On the other hand, Treatment Y provides a benefit for the less severe patients, while providing no improvement for the more severely ill patients. If the patients have a tendency to drop out before completion because of lack of efficacy, then the complete-case analysis may unduly favor Treatment Y. A second strategy to handle missing data is weighting methods. Some may actually consider this a form of imputation. The general idea is to construct weights for complete cases in order to reduce or remove bias. Little and Schenker (11) discuss the basic concept of weighting adjustments in the sample survey setting. Heyting et al. (12) describe the heuristic appeal of this method as follows. Each patient belongs to a subgroup in the patient population in which all patients have a similar baseline and response profile. A proportion within each subgroup are destined to complete the clinical trial, while the remainder are destined to drop out early. Those completer patients with a very low probability of completing can certainly have an overly strong influence on the results. Heyting et al. (12) provide a particular weighting method where the evaluation of the mean treatment differences at the end of the study is of primary interest. The authors primary objective was explanatory in nature. Robins et al. (13) introduced a weighting method
5 Handling Missing Data in Clinical Trials 529 bias the placebo mean upward. This could (18). Multiple imputation can be implemented potentially limit a promising treatment from for either longitudinal measurements demonstrating efficacy. In many cases, how- or a single response. ever, this method is usually not the planned The general strategy for multiple imputa- primary analysis, but rather a secondary anal- tion is to replace each missing value with ysis. It can be used to assess the robustness two or more values from an appropriate dis- of the results and provide a so called lower tribution for the missing values. This produces bound on treatment efficacy. If the worst two or more complete data sets. Rebound case analysis demonstrates a treatment ben- peated draws are made from the posterior efit it can be a very powerful result. For predictive distribution of the missing values, example, one could state either that the treat- Y miss. As Rubin and Schenker (18) point out, ment efficacy is so strong that even imputing in practice, implicit models can be used in the worst case scenario does not alter the place of explicit models. Lavori et al. (1) positive results or the missing response rate discuss a propensity-based imputation where is so low that it does not alter the conclusions. one models the probability of remaining in Brown (15) proposed a slightly different the study given a vector of observed covarimethod than that discussed above. A prede- ates. A logistic regression model is typically termined percentile (eg, median) of the placebo used. This method stratifies patients into group is assigned to all those patients groups based on propensity scores, that is, who dropped out from either the placebo the propensity to drop out of the study. Impu- group or the active treatment group. This pre- tations are made by the approximate Bayesian determined score is assigned to all patients bootstrap. One first draws a potential set in both groups with values worse than the of observed responses at random with replacement assigned score. A Mann-Whitney statistic is from the observed responses in used to test the equality of the distribution the propensity quintile. Then the imputed of the two groups and thus provides a bound values are chosen at random from the poten- on the test of the efficacy for the treatment. tial observed sample. This process is performed There are other single imputation methods m times in order to produce m com- such as mean imputation, conditional mean plete data sets. Analyses for each complete imputation, and the Hot deck method. Little data set are then combined in a way that and Rubin (3) discuss these methods as well reflects the extra variability. The total variability as other single imputation methods. Paik (16) consists of both within and between proposed a mean imputation method as well imputation variability. Multiple imputation as a multiple imputation method for handling methodology relies on the MAR assumption. missing data in GEE analysis. Each method Little and Schenker (11) and Rubin (19) provides valid estimates when data are MAR. indicate that typically only a few imputations One particular imputation method that has (m = 3 5) are necessary for a modest amount received a significant amount of attention in of missing information (eg, < 30%). The pre- the literature recently is multiple imputation. vious authors do point out, however, that as The idea of multiple imputation, first proposed the percentage of missing data increases by Rubin (17), is to impute more than more imputations will be necessary. The soft- one value for the missing item. The advantage ware SOLAS for Missing Data Analysis (20) of multiple imputation is that it repre- can implement the previously discussed mulware sents the uncertainty about which value to tiple imputation methods as well as other impute. This is as opposed to imputing the single imputation methods. A good list of mean response which does not incorporate references on this topic include Lavori et al. the degree of uncertainty about which value (1), Little and Schenker (11), Rubin and to impute. Therefore, the analyses that treat Schenker (18), and Rubin (19). singly imputed values just like observed val- A fourth strategy to handle missing data ues generally underestimate the variability is to analyze data as incomplete. This is typi-
6 530 William R. Myers cally performed in longitudinal studies. One fication of Gould s method by taking into option is to use summary statistics. In longitudinal account the time to dropout in order to pro- studies a slope estimate could be used vide a finer measure for the desirability out- to summarize each patient s response. Early come. For example, patients who drop out dropouts could obviously be problematic very early in the study due to intolerability with this type of analysis. Likelihood methods would be assigned a lower desirability out- that ignore the drop-out mechanism are come than those patients who dropped out also a popular analysis of choice. Most software later in the study due to intolerability. packages (eg, PROC MIXED in SAS ) Shih and Quan (23) proposed a method assume MAR and ignore the missing-data that they defined as a composite approach. mechanism. In the case of nonignorable They consider the situation where the outcome dropout, inferences based on likelihood measure is a continuous response varidropout, methods that ignore the dropout mechanism able and the final outcome is of main interest. may produce biased results. Little and Rubin Shih and Quan (23) indicate that the relevant (2) and Little (3) discuss nonignorable miss- clinical question when dropouts occur is ing-data models. Implementing a nonignorable what is the chance that a patient completes missing-data model is not trivial. Little the prescribed treatment course and if he/ (3) indicates that results can be sensitive to she does complete the therapy, what is the misspecification of the missing-data mecha- expected response? This results in two nism and if little knowledge is known about hypotheses. The first is the probability of the mechanism then sensitivity analysis dropping out of the clinical study and the should be performed. In longitudinal studies second is the expected response for those where non-gaussian responses are measured patients who completed the clinical study. A (eg, binary or count data) GEE is often used. logistic regression model could be used to GEE generally requires MCAR or covariate- model the probability of dropping out of the dependent dropout in order to yield consistent study, while an analysis of variance could be estimates (2). implemented for the expected response of The fifth and final category of analyses is completers. Shih and Quan (23) point out defined as other methods. As stated earlier, that the alternative hypotheses need to be in these are methods that do not fall into any of the same direction. For example, the placebo the four previously discussed classifications. has a greater percent of patients dropping out One of the methods, which was proposed by for unfavorable reasons compared to active Gould (21), converts all information on an treatment and the mean response for completers outcome variable into ranking of patients in is favorable for active treatment. The terms of desirability outcome. These desir- proposed method first tests the joint hypothesis. ability outcomes are ordered from the least If that is rejected the individual hypotheability desirable (eg, early withdrawal due to lack of ses are subsequently tested. A closed testing efficacy or intolerance) to the most desirable procedure is used in order to control the over- (eg, early withdrawal due to efficacy). Be- all type I error rate. This particular method tween the two ends of the desirability spectrum makes no MAR type assumption. The fol- are the ranks of the scores for those lowing are a few papers that discuss some patients who complete the study. A standard additional methods: Dawson and Lagakos two-sample test based on ranks could then (24) and Shih and Quan (25). be implemented. This particular method excludes those patients whose reason for dropout is unrelated to the study. The utility of ICH GUIDELINES this approach certainly requires a clear un- The International Conference for Harmonization derstanding of the reason for dropout. This (ICH) guidelines E9 Statistical Prinderstanding method does not directly address the issue ciples for Clinical Trials address the issue of estimation. Cornell (22) proposed a modi- of missing data. The guidelines indicate that
7 Handling Missing Data in Clinical Trials 531 methods for dealing with missing data should (20%). Side effects were classified into three be predefined in the protocol. The guidelines categories (mild, moderate, severe). The also point out that methods for dealing with high dropout group was those with severe missing data can be refined in the statistical side effects and a response 0.5 standard devi- analysis plan during the blind review of the ations below the mean response for the active data. This is a very important step to consider, treatment group or those patients with mod- given that it can be difficult to anticipate all erate side effects and a response 1.0 standard potential missing-data problems that could occur. deviation below the mean response for the Probably the most important suggestion active treatment group. The second scenario the ICH guidelines make, however, is to investigate is where low responders, irrespective of the sensitivity of the results of the analy- treatment, have a higher dropout rate (60%) sis to the method of handling missing data, than the rest of the patients (20%). For this that is, sensitivity analysis. case, the high dropout group was those patients who have a response that is 1.0 standard deviation below the mean response of EXAMPLES BASED ON A the placebo group. SIMULATED DATA SET Of the four possible treatment/dropout The following examples are not intended to scenarios, this paper focuses on two of the be an extensive simulation study, but rather more interesting cases. The first is where the some simple scenarios based on simulated active treatment is known not to be signifi- data sets. They are presented to demonstrate cantly superior to placebo and where there is the potential advantages or disadvantages of a higher dropout rate for the active treatment some of the methods under certain dropout group (Scenario 1). The second is where the scenarios. The examples will consider a active treatment is known to be superior to study where a single endpoint is of primary interest. The following two treatment scenarios are considered: placebo and there is a higher dropout rate among the placebo group (Scenario 2). The three methods that were applied were the complete-case analysis, multiple imputation method, and the composite approach. 1. The active treatment is known to be significantly superior to placebo [µ T = 15 units The first row of Table 1 provides the results and µ P = 10.5 units], and when analyzing the complete data set. 2. The active treatment is known not to be The other rows provide the results for the superior to placebo [µ T = 16 units and µ P three methods. For Scenario 1, the complete- = 15 units]. case analysis, as expected, provided a treatment mean that was biased upward and Three hundred patients were initially randomized thereby erroneously demonstrated efficacy to each treatment group with a of a nonefficacious treatment. On the other known standard deviation of 19 units. The hand, the multiple imputation method pro- first scenario provides approximately 80% vided mean values and results that more statistical power, while the second scenario closely resembled the truth. For the compos- provides approximately 10% statistical ite approach the differences for each hypothesis power. were in different directions (the acpower. After the complete data sets were created, tive treatment group has a higher dropout patient observations were deleted based on rate for unfavorable reasons, while the mean the following two dropout scenarios. First, response for the completers favors the active patients in the active treatment group who treatment group), therefore, it was not rea- were poor responders and who had signifi- sonable to combine the tests and perform the cant side effects have a higher dropout rate analysis. For Scenario 2, the complete-case (60%) than the rest of those in either the analysis, as expected, provided a placebo active treatment group or the placebo group mean that was biased upward and thereby
8 532 William R. Myers TABLE 1 Simulated Data Set Analysis Scenario 1 Scenario 2 Complete Data Set X T = 16.7 p = 0.23 X T = 15.3 p = X P = 14.9 X P = 10.5 Complete-Case X T = 18.7 p = 0.02 X T = 14.7 p = 0.15 X P = 14.6 X P = 12.4 Multiple Imputation X T = 16.2 p = 0.74 X T = 16.6 p = X P = 15.6 X P = 13.6 Composite Approach Joint hypothesis p = Scenario 1 active treatment is not significantly superior to placebo and the active treatment group has a higher dropout rate due to intolerability. Scenario 2 active treatment is significantly superior to placebo and the placebo group has a higher dropout rate due to lack of efficacy. that in order to detect whether this level of missing data has been reached one can per- form what was earlier called the worst case analysis. A major part of the paper was dedicated to the method of multiple imputation. It appears that regulatory agencies are still uncertain about the degree of usefulness of multiple imputation methods. Shih and Quan (23) provide several examples by which making inferences on the complete-data parameter may not always be of practical interest. For example, when a patient dies before the end of the study, the outcome measure for the end of the study simply does not exist. Another ex- ample is when the glomerular filtration rate in a renal disease study is not legitimate for patients who drop out due to renal failure and who are referred for kidney dialysis. They continued to point out that one should not estimate their nonexisting/missing values that would have been observed if the censoring event had not occurred. As Lavori et al. (1) point out, further work should be performed to better understand how well multi- ple imputation works under various types of dropout mechanisms, most specifically the nonignorable case. Obviously, it is very important to clearly understand the limitations of the various methods. This paper attempted to outline many of these limitations. This directly leads to the utility of performing some form of sensitivity analysis and how necessary and valuable it is. In conclusion, was unable to demonstrate that an efficacious treatment was effective. The multiple imputation method provided results that more closely mimicked the complete data set, even though it was not significant at the 0.05 significance level. The composite approach was unable to demonstrate a statistically significant result with the joint hypothesis, thus, the individual hypotheses were not tested. SUMMARY/DISCUSSION The objective of the paper was to present an overview of issues, concerns, and available methodology in the case of missing data as a result of patients dropping out of a clinical trial. It should be emphasized that sophisticated statistical analysis is no substitute for a good clinical plan in order to mitigate patients dropping out of a study. It is important to continue following patients even after they have dropped out of a clinical study. In addition, understanding both the disease and the therapy being studied can be helpful in selecting an appropriate statistical method. The choice of a particular method for handling missing data depends on whether one is considering a more pragmatic or a more explanatory perspective. There is often the question of whether there are too many missing data. Spriet and Dupin-Spriet (14) point out that the tolerable amount of missing data is that which would not conceal an effect in the opposite direction. They go on to add
9 Handling Missing Data in Clinical Trials 533 it is important not to necessarily consider the 13. Robins JM, Rotnitzky A, Zhao LP. Analysis of semi- various methods that handle dropouts (missin the presence of missing data. J Am Stat Assoc. parametric regression models for repeated outcomes ing data) as rivals but rather consider them 1995;90: as methods that can compliment one another. 14. Spriet A, Dupin-Spriet T. Imperfect data analysis. Drug Inf J. 1993;27: Brown B. A test for the difference between two REFERENCES treatments in a continuous measure of outcome when 1. Lavori PW, Dawson R, Shera D. A multiple imputation there are dropouts. Controll Clini Trials. 1992;13: strategy for clinical trials with truncation of patient data. Stat Med. 1995;14: Paik MC. The generalized estimating equation ap- 2. Little RJA. Modeling the drop-out mechanism in proach when data are not missing completely at random. repeated-measures studies. J Am Stat Assoc. 1995; J Am Stat Assoc. 1997;92: : Rubin DB. Multiple imputations in sample surveys A 3. Little RJA, Rubin DB. Statistical Analysis with Missing phenomenological Bayesian approach to Data. New York: John Wiley & Sons; nonresponse. Imputation and Editing of Faulty or 4. Little RJA. A test of missing completely at random Missing Survey Data. U.S. Department of Comfor multivariate data with missing values. J Am Stat merce. 1978:1 23. Assoc. 1988;83: Rubin DB, Schenker N. Multiple imputation in 5 Rubin DB. Inference and missing data. Biometrika. health-care databases: An overview and some applications. 1976;63: Stat Med. 1991;10: Laird NM. Missing data in longitudinal studies. Stat 19. Rubin DB. Multiple imputation after 18+ years. J Med. 1988;7: Am Stat Assoc. 1996;91; Murray GD, Findlay JG. Correcting for the bias 20. SOLAS for Missing Data Analysis 1.0. Statistical caused by drop-outs in hypertension trials. Stat Med. Solutions, Ltd ;7: Gould AL. A new approach to the analysis of clinical 8. Diggle P, Kenward MG. Informative dropout in longitudinal drug trials with withdrawals. Biometrics. 1980;36: data analysis (with discussion). Appl Stat ;43: Cornell RG. Handling dropouts and related issues 9. Diggle P. Testing for random dropouts in repeated Statistical methodology in the pharmaceutical sciences. measurement data. Biometrics. 1989;45: Marcel Dekker, Inc. 1990; Park T, Davis CS. A test of the missing data mecha- 23. Shih WJ, Quan H. Testing for treatment differences nism for repeated categorical data. Biometrics. 1993; with dropouts present in clinical trials A compos- 49: ite approach. Stat Med. 1997;16: Little RJA, Schenker N. Handbook of Statistical 24. Dawson JD, Lagakos SW. Size and power of two- Methodology Missing Data. New York: Plenum sample tests of repeated measures data. Biometrics. Press; ;49: Heyting A, Tolboom J, Essers J. Statistical handling 25. Shih WJ, Quan H. Stratified testing for treatment of drop-outs in longitudinal clinical trials. Stat Med. 1992;11: effects with missing data. Biometrics. 1998;54:
A Mixed Model Approach for Intent-to-Treat Analysis in Longitudinal Clinical Trials with Missing Values
Methods Report A Mixed Model Approach for Intent-to-Treat Analysis in Longitudinal Clinical Trials with Missing Values Hrishikesh Chakraborty and Hong Gu March 9 RTI Press About the Author Hrishikesh Chakraborty,
Missing Data. A Typology Of Missing Data. Missing At Random Or Not Missing At Random
[Leeuw, Edith D. de, and Joop Hox. (2008). Missing Data. Encyclopedia of Survey Research Methods. Retrieved from http://sage-ereference.com/survey/article_n298.html] Missing Data An important indicator
Problem of Missing Data
VASA Mission of VA Statisticians Association (VASA) Promote & disseminate statistical methodological research relevant to VA studies; Facilitate communication & collaboration among VA-affiliated statisticians;
Multiple Imputation for Missing Data: A Cautionary Tale
Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust
A Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
Review of the Methods for Handling Missing Data in. Longitudinal Data Analysis
Int. Journal of Math. Analysis, Vol. 5, 2011, no. 1, 1-13 Review of the Methods for Handling Missing Data in Longitudinal Data Analysis Michikazu Nakai and Weiming Ke Department of Mathematics and Statistics
Missing Data Sensitivity Analysis of a Continuous Endpoint An Example from a Recent Submission
Missing Data Sensitivity Analysis of a Continuous Endpoint An Example from a Recent Submission Arno Fritsch Clinical Statistics Europe, Bayer November 21, 2014 ASA NJ Chapter / Bayer Workshop, Whippany
Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University
Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University 1 Outline Missing data definitions Longitudinal data specific issues Methods Simple methods Multiple
Handling missing data in Stata a whirlwind tour
Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional
Analysis Issues II. Mary Foulkes, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Dealing with Missing Data
Res. Lett. Inf. Math. Sci. (2002) 3, 153-160 Available online at http://www.massey.ac.nz/~wwiims/research/letters/ Dealing with Missing Data Judi Scheffer I.I.M.S. Quad A, Massey University, P.O. Box 102904
The Consequences of Missing Data in the ATLAS ACS 2-TIMI 51 Trial
The Consequences of Missing Data in the ATLAS ACS 2-TIMI 51 Trial In this white paper, we will explore the consequences of missing data in the ATLAS ACS 2-TIMI 51 Trial and consider if an alternative approach
TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance. Session 1: ICH E9 and E10. PSI Conference, May 2011
TUTORIAL on ICH E9 and Other Statistical Regulatory Guidance Session 1: PSI Conference, May 2011 Kerry Gordon, Quintiles 1 E9, and how to locate it 2 ICH E9 Statistical Principles for Clinical Trials (Issued
Challenges in Longitudinal Data Analysis: Baseline Adjustment, Missing Data, and Drop-out
Challenges in Longitudinal Data Analysis: Baseline Adjustment, Missing Data, and Drop-out Sandra Taylor, Ph.D. IDDRC BBRD Core 23 April 2014 Objectives Baseline Adjustment Introduce approaches Guidance
A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA
123 Kwantitatieve Methoden (1999), 62, 123-138. A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA Joop J. Hox 1 ABSTRACT. When we deal with a large data set with missing data, we have to undertake
Imputing Missing Data using SAS
ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are
Introduction to mixed model and missing data issues in longitudinal studies
Introduction to mixed model and missing data issues in longitudinal studies Hélène Jacqmin-Gadda INSERM, U897, Bordeaux, France Inserm workshop, St Raphael Outline of the talk I Introduction Mixed models
Sensitivity Analysis in Multiple Imputation for Missing Data
Paper SAS270-2014 Sensitivity Analysis in Multiple Imputation for Missing Data Yang Yuan, SAS Institute Inc. ABSTRACT Multiple imputation, a popular strategy for dealing with missing values, usually assumes
PATTERN MIXTURE MODELS FOR MISSING DATA. Mike Kenward. London School of Hygiene and Tropical Medicine. Talk at the University of Turku,
PATTERN MIXTURE MODELS FOR MISSING DATA Mike Kenward London School of Hygiene and Tropical Medicine Talk at the University of Turku, April 10th 2012 1 / 90 CONTENTS 1 Examples 2 Modelling Incomplete Data
How to choose an analysis to handle missing data in longitudinal observational studies
How to choose an analysis to handle missing data in longitudinal observational studies ICH, 25 th February 2015 Ian White MRC Biostatistics Unit, Cambridge, UK Plan Why are missing data a problem? Methods:
Guideline on missing data in confirmatory clinical trials
2 July 2010 EMA/CPMP/EWP/1776/99 Rev. 1 Committee for Medicinal Products for Human Use (CHMP) Guideline on missing data in confirmatory clinical trials Discussion in the Efficacy Working Party June 1999/
Handling missing data in large data sets. Agostino Di Ciaccio Dept. of Statistics University of Rome La Sapienza
Handling missing data in large data sets Agostino Di Ciaccio Dept. of Statistics University of Rome La Sapienza The problem Often in official statistics we have large data sets with many variables and
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
MISSING DATA: THE POINT OF VIEW OF ETHICAL COMMITTEES
I CONGRESSO NAZIONALE BIAS 2009 29/30 APRILE 2009 ELI LILLY SESTO FIORENTINO (FI) MISSING DATA: THE POINT OF VIEW OF ETHICAL COMMITTEES Anna Chiara Frigo Department of Environmental Medicine and Public
Overview. Longitudinal Data Variation and Correlation Different Approaches. Linear Mixed Models Generalized Linear Mixed Models
Overview 1 Introduction Longitudinal Data Variation and Correlation Different Approaches 2 Mixed Models Linear Mixed Models Generalized Linear Mixed Models 3 Marginal Models Linear Models Generalized Linear
MISSING DATA IMPUTATION IN CARDIAC DATA SET (SURVIVAL PROGNOSIS)
MISSING DATA IMPUTATION IN CARDIAC DATA SET (SURVIVAL PROGNOSIS) R.KAVITHA KUMAR Department of Computer Science and Engineering Pondicherry Engineering College, Pudhucherry, India DR. R.M.CHADRASEKAR Professor,
Dealing with Missing Data
Dealing with Missing Data Roch Giorgi email: [email protected] UMR 912 SESSTIM, Aix Marseille Université / INSERM / IRD, Marseille, France BioSTIC, APHM, Hôpital Timone, Marseille, France January
The Prevention and Treatment of Missing Data in Clinical Trials: An FDA Perspective on the Importance of Dealing With It
nature publishing group The Prevention and Treatment of Missing Data in Clinical Trials: An FDA Perspective on the Importance of Dealing With It RT O Neill 1 and R Temple 2 At the request of the Food and
Imputation and Analysis. Peter Fayers
Missing Data in Palliative Care Research Imputation and Analysis Peter Fayers Department of Public Health University of Aberdeen NTNU Det medisinske fakultet Missing data Missing data is a major problem
Dr James Roger. GlaxoSmithKline & London School of Hygiene and Tropical Medicine.
American Statistical Association Biopharm Section Monthly Webinar Series: Sensitivity analyses that address missing data issues in Longitudinal studies for regulatory submission. Dr James Roger. GlaxoSmithKline
1.0 Abstract. Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA. Keywords. Rationale and Background:
1.0 Abstract Title: Real Life Evaluation of Rheumatoid Arthritis in Canadians taking HUMIRA Keywords Rationale and Background: This abbreviated clinical study report is based on a clinical surveillance
Missing data in randomized controlled trials (RCTs) can
EVALUATION TECHNICAL ASSISTANCE BRIEF for OAH & ACYF Teenage Pregnancy Prevention Grantees May 2013 Brief 3 Coping with Missing Data in Randomized Controlled Trials Missing data in randomized controlled
AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS
AVOIDING BIAS AND RANDOM ERROR IN DATA ANALYSIS Susan Ellenberg, Ph.D. Perelman School of Medicine University of Pennsylvania School of Medicine FDA Clinical Investigator Course White Oak, MD November
CHOOSING APPROPRIATE METHODS FOR MISSING DATA IN MEDICAL RESEARCH: A DECISION ALGORITHM ON METHODS FOR MISSING DATA
CHOOSING APPROPRIATE METHODS FOR MISSING DATA IN MEDICAL RESEARCH: A DECISION ALGORITHM ON METHODS FOR MISSING DATA Hatice UENAL Institute of Epidemiology and Medical Biometry, Ulm University, Germany
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
Missing data and net survival analysis Bernard Rachet
Workshop on Flexible Models for Longitudinal and Survival Data with Applications in Biostatistics Warwick, 27-29 July 2015 Missing data and net survival analysis Bernard Rachet General context Population-based,
SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Applied Missing Data Analysis in the Health Sciences. Statistics in Practice
Brochure More information from http://www.researchandmarkets.com/reports/2741464/ Applied Missing Data Analysis in the Health Sciences. Statistics in Practice Description: A modern and practical guide
2. Making example missing-value datasets: MCAR, MAR, and MNAR
Lecture 20 1. Types of missing values 2. Making example missing-value datasets: MCAR, MAR, and MNAR 3. Common methods for missing data 4. Compare results on example MCAR, MAR, MNAR data 1 Missing Data
Bayesian Statistical Analysis in Medical Research
Bayesian Statistical Analysis in Medical Research David Draper Department of Applied Mathematics and Statistics University of California, Santa Cruz [email protected] www.ams.ucsc.edu/ draper ROLE Steering
Study Design and Statistical Analysis
Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing
Comparison/Control Groups. Mary Foulkes, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Analyzing Structural Equation Models With Missing Data
Analyzing Structural Equation Models With Missing Data Craig Enders* Arizona State University [email protected] based on Enders, C. K. (006). Analyzing structural equation models with missing data. In G.
Workpackage 11 Imputation and Non-Response. Deliverable 11.2
Workpackage 11 Imputation and Non-Response Deliverable 11.2 2004 II List of contributors: Seppo Laaksonen, Statistics Finland; Ueli Oetliker, Swiss Federal Statistical Office; Susanne Rässler, University
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
Intent-to-treat Analysis of Randomized Clinical Trials
Intent-to-treat Analysis of Randomized Clinical Trials Michael P. LaValley Boston University http://people.bu.edu/mlava/ ACR/ARHP Annual Scientific Meeting Orlando 10/27/2003 Outline Origin of Randomization
A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic
A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic Report prepared for Brandon Slama Department of Health Management and Informatics University of Missouri, Columbia
Missing data are ubiquitous in clinical research.
Advanced Statistics: Missing Data in Clinical Research Part 1: An Introduction and Conceptual Framework Jason S. Haukoos, MD, MS, Craig D. Newgard, MD, MPH Abstract Missing data are commonly encountered
Analysis of Longitudinal Data with Missing Values.
Analysis of Longitudinal Data with Missing Values. Methods and Applications in Medical Statistics. Ingrid Garli Dragset Master of Science in Physics and Mathematics Submission date: June 2009 Supervisor:
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Implementation of Pattern-Mixture Models Using Standard SAS/STAT Procedures
PharmaSUG2011 - Paper SP04 Implementation of Pattern-Mixture Models Using Standard SAS/STAT Procedures Bohdana Ratitch, Quintiles, Montreal, Quebec, Canada Michael O Kelly, Quintiles, Dublin, Ireland ABSTRACT
STATISTICAL ANALYSIS OF SAFETY DATA IN LONG-TERM CLINICAL TRIALS
STATISTICAL ANALYSIS OF SAFETY DATA IN LONG-TERM CLINICAL TRIALS Tailiang Xie, Ping Zhao and Joel Waksman, Wyeth Consumer Healthcare Five Giralda Farms, Madison, NJ 794 KEY WORDS: Safety Data, Adverse
Personalized Predictive Medicine and Genomic Clinical Trials
Personalized Predictive Medicine and Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute http://brb.nci.nih.gov brb.nci.nih.gov Powerpoint presentations
Bridging Statistical Analysis Plan and ADaM Datasets and Metadata for Submission
, October 24-26, 2012, San Francisco, USA Bridging Statistical Analysis Plan and ADaM Datasets and Metadata for Submission Abstract In this article, the relationship between the Statistical Analysis Plan
Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS)
Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS) April 30, 2008 Abstract A randomized Mode Experiment of 27,229 discharges from 45 hospitals was used to develop adjustments for the
Discussion. Seppo Laaksonen 1. 1. Introduction
Journal of Official Statistics, Vol. 23, No. 4, 2007, pp. 467 475 Discussion Seppo Laaksonen 1 1. Introduction Bjørnstad s article is a welcome contribution to the discussion on multiple imputation (MI)
This clinical study synopsis is provided in line with Boehringer Ingelheim s Policy on Transparency and Publication of Clinical Study Data.
abcd Clinical Study for Public Disclosure This clinical study synopsis is provided in line with s Policy on Transparency and Publication of Clinical Study Data. The synopsis which is part of the clinical
What are confidence intervals and p-values?
What is...? series Second edition Statistics Supported by sanofi-aventis What are confidence intervals and p-values? Huw TO Davies PhD Professor of Health Care Policy and Management, University of St Andrews
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing [email protected]
SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing [email protected] IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way
Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
Introduction to Fixed Effects Methods
Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed
The Product Review Life Cycle A Brief Overview
Stat & Quant Mthds Pharm Reg (Spring 2, 2014) Lecture 2,Week 1 1 The review process developed over a 40 year period and has been influenced by 5 Prescription User Fee Act renewals Time frames for review
Missing data: the hidden problem
white paper Missing data: the hidden problem Draw more valid conclusions with SPSS Missing Data Analysis white paper Missing data: the hidden problem 2 Just about everyone doing analysis has some missing
Comparison of Imputation Methods in the Survey of Income and Program Participation
Comparison of Imputation Methods in the Survey of Income and Program Participation Sarah McMillan U.S. Census Bureau, 4600 Silver Hill Rd, Washington, DC 20233 Any views expressed are those of the author
A Pseudo Nearest-Neighbor Approach for Missing Data Recovery on Gaussian Random Data Sets
University of Nebraska at Omaha DigitalCommons@UNO Computer Science Faculty Publications Department of Computer Science -2002 A Pseudo Nearest-Neighbor Approach for Missing Data Recovery on Gaussian Random
A Review of Methods. for Dealing with Missing Data. Angela L. Cool. Texas A&M University 77843-4225
Missing Data 1 Running head: DEALING WITH MISSING DATA A Review of Methods for Dealing with Missing Data Angela L. Cool Texas A&M University 77843-4225 Paper presented at the annual meeting of the Southwest
Application in Predictive Analytics. FirstName LastName. Northwestern University
Application in Predictive Analytics FirstName LastName Northwestern University Prepared for: Dr. Nethra Sambamoorthi, Ph.D. Author Note: Final Assignment PRED 402 Sec 55 Page 1 of 18 Contents Introduction...
Design and Analysis of Phase III Clinical Trials
Cancer Biostatistics Center, Biostatistics Shared Resource, Vanderbilt University School of Medicine June 19, 2008 Outline 1 Phases of Clinical Trials 2 3 4 5 6 Phase I Trials: Safety, Dosage Range, and
Paper PO06. Randomization in Clinical Trial Studies
Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection
UNTIED WORST-RANK SCORE ANALYSIS
Paper PO16 UNTIED WORST-RANK SCORE ANALYSIS William F. McCarthy, Maryland Medical Research Institute, Baltime, MD Nan Guo, Maryland Medical Research Institute, Baltime, MD ABSTRACT When the non-fatal outcome
Data Cleaning and Missing Data Analysis
Data Cleaning and Missing Data Analysis Dan Merson [email protected] India McHale [email protected] April 13, 2010 Overview Introduction to SACS What do we mean by Data Cleaning and why do we do it? The SACS
Statistical modelling with missing data using multiple imputation. Session 4: Sensitivity Analysis after Multiple Imputation
Statistical modelling with missing data using multiple imputation Session 4: Sensitivity Analysis after Multiple Imputation James Carpenter London School of Hygiene & Tropical Medicine Email: [email protected]
Likelihood Approaches for Trial Designs in Early Phase Oncology
Likelihood Approaches for Trial Designs in Early Phase Oncology Clinical Trials Elizabeth Garrett-Mayer, PhD Cody Chiuzan, PhD Hollings Cancer Center Department of Public Health Sciences Medical University
Randomized trials versus observational studies
Randomized trials versus observational studies The case of postmenopausal hormone therapy and heart disease Miguel Hernán Harvard School of Public Health www.hsph.harvard.edu/causal Joint work with James
Proof-of-Concept Studies and the End of Phase IIa Meeting with the FDA
Medpace Discovery Series presents Proof-of-Concept Studies and the End of Phase IIa Meeting with the FDA DR. JIM WEI: Today my topic is going to be Proof-of-Concept Studies and FDA End of Phase 2a Meetings
If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.
General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary
NON-PROBABILITY SAMPLING TECHNIQUES
NON-PROBABILITY SAMPLING TECHNIQUES PRESENTED BY Name: WINNIE MUGERA Reg No: L50/62004/2013 RESEARCH METHODS LDP 603 UNIVERSITY OF NAIROBI Date: APRIL 2013 SAMPLING Sampling is the use of a subset of the
THE SELECTION OF RETURNS FOR AUDIT BY THE IRS. John P. Hiniker, Internal Revenue Service
THE SELECTION OF RETURNS FOR AUDIT BY THE IRS John P. Hiniker, Internal Revenue Service BACKGROUND The Internal Revenue Service, hereafter referred to as the IRS, is responsible for administering the Internal
Descriptive Methods Ch. 6 and 7
Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational
UNIVERSITY OF NAIROBI
UNIVERSITY OF NAIROBI MASTERS IN PROJECT PLANNING AND MANAGEMENT NAME: SARU CAROLYNN ELIZABETH REGISTRATION NO: L50/61646/2013 COURSE CODE: LDP 603 COURSE TITLE: RESEARCH METHODS LECTURER: GAKUU CHRISTOPHER
Chapter 3. Sampling. Sampling Methods
Oxford University Press Chapter 3 40 Sampling Resources are always limited. It is usually not possible nor necessary for the researcher to study an entire target population of subjects. Most medical research
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
Sample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago
What is a P-value? Ronald A. Thisted, PhD Departments of Statistics and Health Studies The University of Chicago 8 June 1998, Corrections 14 February 2010 Abstract Results favoring one treatment over another
Guidance for Industry
Guidance for Industry E9 Statistical Principles for Clinical Trials U.S. Department of Health and Human Services Food and Drug Administration Center for Drug Evaluation and Research (CDER) Center for Biologics
Mortality Assessment Technology: A New Tool for Life Insurance Underwriting
Mortality Assessment Technology: A New Tool for Life Insurance Underwriting Guizhou Hu, MD, PhD BioSignia, Inc, Durham, North Carolina Abstract The ability to more accurately predict chronic disease morbidity
Fixed-Effect Versus Random-Effects Models
CHAPTER 13 Fixed-Effect Versus Random-Effects Models Introduction Definition of a summary effect Estimating the summary effect Extreme effect size in a large study or a small study Confidence interval
Causality Assessment in Practice Pharmaceutical Industry Perspective. Lachlan MacGregor Senior Safety Scientist F. Hoffmann-La Roche Ltd.
Causality Assessment in Practice Pharmaceutical Industry Perspective Lachlan MacGregor Senior Safety Scientist F. Hoffmann-La Roche Ltd., Basel Disclaimer: The opinions expressed in this presentation are
U.S. Food and Drug Administration
U.S. Food and Drug Administration Notice: Archived Document The content in this document is provided on the FDA s website for reference purposes only. It was current when produced, but is no longer maintained
