Survival Analysis, Software

Size: px
Start display at page:

Download "Survival Analysis, Software"

Transcription

1 Survival Analysis, Software As used here, survival analysis refers to the analysis of data where the response variable is the time until the occurrence of some event (e.g. death), where some of the observations may not yet have experienced the event at the time of the analysis (these cases are censored). Analysis of this type of data occurs in many fields, and the general form of analysis has several different names, including reliability analysis (engineering), event history analysis (sociology), and duration analysis (economics). Although the biostatistical terminology is used here, many of the current statistical methods originated in other fields, and some excellent software (some of which is discussed below) is aimed at practitioners in other disciplines. We provide some guidance on jargon, but it cannot be regarded as definitive. This article has several purposes: 1. It provides a categorization scheme for guiding our discussion of computer packages for survival analysis. Examples of packages in each category are provided. 2. It briefly discusses some special features of some software packages. 3. It provides some guidance on assessing such software and finding information comparing the packages. A Categorization Scheme for Survival Analysis Software The range of software currently available makes any categorization scheme somewhat problematic. The categorization presented here is limited to two dimensions: the purpose aimed at and the method(s) used in achieving the purpose(s): 1. tests comparing time to survival for different groups (e.g. different treatments; (see Hypothesis Testing). Most of the packages mentioned in this article have at least one such test. However, they will not be discussed in detail here for the following reasons: (i) There are numerous such tests and, unfortunately, some lack of consistency in naming them [4, 5]. There is also some controversy regarding whether they should be used at all. There are two general objections: (a) Some statisticians are opposed to the use of tests and favor point and interval estimation instead. (b) Many of the tests are just special cases of information available from a more general model (e.g. the log rank test is a score test from a Cox proportional hazards regression) 2. modeling the data for purposes of description and/or estimation/testing; in general, models can be: (i) (iii) nonparametric (Kaplan Meier, Nelson Aalen); most packages that have any claim to analyzing survival data include the Kaplan Meier estimate in some form semi-parametric (Cox proportional hazards (CPH); see Cox Regression Model); Cox s 1972 article initiated this entire area, and many packages have at least the most simple version of this. Many packages allow time-dependent (defined) and/or time-varying (internal or ancillary) regressors, which allows one to have Coxtype models without proportional hazards parametric (Weibull, exponential, lognormal, etc., including accelerated failure time models for the Weibull and exponential distributions); this might also include more traditional censored linear regression models such as Buckley James (biostatistics), Schmee Hahn (engineering), or Tobit and its generalizations (economics). A comparison of Buckley James and Tobit, showing Buckley James to be generally superior, can be found in Moon [21]. Several other models have been suggested in the biostatistical literature; a comparison by Miller & Halpern [20] again showed the Buckley James procedure to be generally superior. There are also a number of modeling issues that can be used to distinguish among software packages, including: 1. discrete time vs. continuous time; most of the

2 2 Survival Analysis, Software above assumes that time is measured continuously. If time is clearly discrete (e.g. the issue is granting of tenure, which can only happen once per year), then logistic regression can handle the problem more easily and with easily interpreted results (e.g. [26]) 2. use of a counting process formulation. Although this was introduced in Aalen s 1975 dissertation [1], this has been slow to catch on. Although few packages use this formulation, it has several practical advantages over the traditional formulation (see, for example, [2]) 3. estimation via loglinear models, especially logistic and Poisson regression [19]. A few packages note these relationships 4. multiple occurrence models (e.g. number of heart attacks and time between them; see Repeated Events). In the literature, there are several ways to model this, but commercial software seems to be limited to the use of the so-called Huber robust sandwich [12, 17, 18, 24, 25] estimator of variance after assuming a working independence model 5. competing risks: (i) Independent risks can be modeled easily by just classifying failure from some other cause as a censored case. However, dependent competing risks cannot be modeled in this way. 6. cure models: not everyone will eventually fail via the event of interest. While, especially in medical contexts, it may be reasonable to treat this as a standard, right-censored observation, there are some contexts where it is important to distinguish those who are cured from those still at risk 7. data issues: (i) left-truncated data and delayed entry: (a) Left-truncated data refers to the absence, in the data set, of those who failed before a certain time. Usually there is no information, not even a count, on these observations, other than that they exist (e.g. studying the survival of those with heart disease who are referred to a central clinic for treatment; the left-truncated group are those who do not survive long enough to either get a referral or to arrive at the central clinic). The (b) current standard form of analysis is to condition on arrival at the clinic. However, this provides no help for some interesting questions (e.g. is it worthwhile to expend resources so that people are referred to the central site earlier?). Numerous other examples exist in epidemiology and various social sciences, and there is some literature, including [22]. There is also literature regarding righttruncated data (e.g. products with no problems during the warranty period) and in standard linear regression. The theory is the same in these two cases, changing the direction of time (see [13 16] and [7]). Delayed entry refers to observations that do not enter the data set until some time after what is defined as time 0. This yields the same statistical problem as left truncation. types of censoring: in many data sets, all one really knows about the timing of the event is that it happened between time a and time b; this is interval censoring and is somewhat more complicated than standard right censoring (some people have not had the event when the analysis is performed). If all data are either interval- or rightcensored, then this can easily be treated as discrete-time data. However, if the failure time is known for some cases but only known within an interval for other cases, then there are no clearly superior methods of analysis known, at least when covariates are present. Several variants, with a comparative assessment, are discussed in [3]. 8. whether the software is special purpose or is part of an integrated package. The mention, above, of discrete-time methods is one of the reasons we prefer an integrated package: a full analysis of the data may require many different types of analysis in addition to survival analysis. The performance of all the analyses, which may jump between survival and nonsurvival several times, is, at the least, more convenient with an integrated package. Note that special purpose as used here refers to the lack of standard descriptive statistics routines; a package that is primarily oriented to

3 Survival Analysis, Software 3 a specific discipline might still be considered general-purpose here. Capabilities of Existing Software Packages For each package mentioned below, we include contact information (US information is supplied where available; many have numerous contact sites outside the US also) as well as a table showing which of the above modeling capabilities it has (much of this is from Harrell & Goldstein [10]), although some is from Goldstein et al. [5]. The list is not complete; emphasis was placed on packages that include modeling procedures, especially Cox s proportional hazards model. The packages are in alphabetical order. 1. BMDP, SPSS Inc., 444 N. Michigan Ave., Chicago, IL 60611, USA; (312) Egret, Cytel Software Corp., 675 Massachusetts Avenue, Cambridge, MA 02139, USA; (617) Epicure, HiroSoft International Corp., 1463 E. Republican Ave., Suite 103, Seattle, WA 98112, USA; (206) Epilog Plus, Epicenter Software, PO Box 90073, Pasadena, CA 91109, USA; (818) Limdep, Econometric Software, Inc., 15 Gloria Place, Plainview, NY 11803, USA; (516) NCSS, 329 North 1000 East, Kaysville, UT 84037, USA; (801) SAS, SAS Institute, Inc., Box 8000, Cary, NC 27511, USA; (919) Spida, The Statistical Laboratory, Macquarie University, NSW 2109 Australia; S-PLUS, MathSoft, Inc., Data Analysis Products Division, 1700 Westlake Avenue North, Suite 500, Seattle, WA 98109, USA; (800) SPSS, SPSS Inc., 444 N. Michigan Ave., Chicago, IL 60611, USA; (312) Stata, Stata Corp., 702 University Drive East, College Station, TX 77849, USA; (800) Statistica, StatSoft, 2325 East 13th Street, Tulsa, OK 74104, USA; (918) Survival, Salford Systems, 8880 Rio San Diego Dr., Suite 1045, San Diego, CA 92108, USA; (619) True Epistat, Epistat Services, 2011 Cap Rock Circle, Richardson, TX , USA; (214) Table 1 shows some of the capabilities of each of the packages. Some of the issues previously discussed are not included in the Table since so few packages have the capability, including: 1. use of the robust sandwich variance estimator: only S-PLUS and Stata have this 2. cure models: only Epilog Plus and Limdep have this 3. censored linear regression: most packages have something to handle this, but the options are wide and several packages have more than one relevant technique 4. BMDP version 7 has an option for competing risk models using parametric techniques. The following is a guide to the abbreviations used in Table 1 1. Under Nonparametric : K-M is the Kaplan Meier product limit estimator; N-A is the Nelson Aalen estimator. 2. Under Semiparametric : CPH is Cox s proportional hazards model; t-d means that one can use time-dependent variables in CPH; t-v means that one can use time-varying (or, step function) variables in CPH; H-H means the Han Hausman estimator [8]. 3. Under Types of Residuals (discussion of the various residuals for CPH models can be found in [6], [23], and [3]); note that the martingale and Schoenfeld residuals appear to be the most important types as they are particularly useful for checking major regression assumptions; the score residuals are useful in checking for influential observations, as well as for computing sandwich covariance matrix estimates: C-S is Cox Snell; Std means standardized residuals; M is martingale residuals (these are useful for assessing the adequacy of an hypothesized transformation of a predictor variable); D is deviance residuals; Score is score residuals; Sch is Schoenfeld residuals, which are useful for testing the PH assumption.

4 4 Survival Analysis, Software Table 1 Attributes of various packages Package Nonparametric Semiparametric Parametric Type of residuals BMDP K-M CPH, t-d, t-v Several C-S, Std Egret K-M CPH, t-d, t-v Several None Epicure K-M, N-A CPH, t-d, t-v Several M, D, Score, Sch Epilog Plus K-M CPH, t-d, t-v None None Limdep K-M CPH, t-d, t-v, H-H Several None NCSS K-M CPH Some (but no None covariates) SAS K-M CPH, t-d, t-v Several M, D, Score, Sch Spida K-M CPH, t-d, t-v None None S-PLUS K-M, N-A CPH, t-v Several M, D, Score, Sch SPSS K-M CPH, t-d, t-v None C-S, M, Sch Stata K-M CPH, t-d, t-v Exp, Weib. M, Score Statistica K-M CPH, t-d Several None Survival K-M CPH, t-v Several None True Epistat K-M CPH, t-v None None Doc. for checking Counting Left-trunc./ Package assumptions process delayed entry Type of package BMDP Yes No DE Complete, general Egret PH No DE Special purpose Epicure PH Yes DE Special purpose Epilog Plus PH No Neither Complete, biostat Limdep PH No LT (parametric Complete, econometric models only) NCSS No No Neither Complete, general SAS PH Yes DE Complete, general Spida PH No DE Complete, general S-PLUS Yes Yes DE Complete, general SPSS PH No neither Complete, general Stata PH Yes DE Complete, general Statistica PH No Neither Complete, general Survival PH No Neither Special purpose True Epistat No No Neither Complete, biostat 4. Under Doc[umentation] for checking assumptions, the possible entries are: Yes, meaning that the testing of both the proportional hazards (PH) assumption and the loglinear assumption are discussed in the manual; PH, meaning that only the testing of the PH assumption is discussed; No, meaning there is no discussion of testing assumptions in the manual (the user may still be able to test assumptions, but the manual provides no help or guidance). 5. Under Left-trunc[ation]-delayed entry : DE means that the package can handle observations whose entry into the risk set is delayed (software packages that have this capability have it for the Cox model only, not for parametric models); LT means that the package can handle left-truncated data; neither means that the package can t deal with either delayed entry or left-truncation. 6. While there are no abbreviations under Type of package, it is worthwhile to note that packages that are extendable, and have active user communities making extensions, may well have

5 Survival Analysis, Software 5 additional techniques; two of the included packages have such user communities: (i) S-PLUS: not only was the original routine written by a user (Terry Therneau), but the second author of this article has written a coordinated set of functions for in-depth survival modeling which includes: (a) relaxing and testing linearity assumptions using cubic splines (b) automatic pooled Wald tests (see Likelihood) (c) bootstrap model validation (d) presentation graphics for describing models (see [1] and [9]) Stata: there were many additions to the prior version, including Bailey Makeham models, which will also work with the current version; the first author has added cubic splines to Stata s toolbox. Where to Go for More Information There are no good general sources of information on what software is available. Eventually, there will probably be a source on the Internet which can be added to frequently. A recent review [10] cites several other reviews of this software. It is clear that software for survival analysis is still in a state of flux, with packages having widely varying capabilities and using inconsistent language. Also, several important issues are either not dealt with by any package or are not dealt with to a sufficient degree. However, the software is changing fairly rapidly. References [1] Aalen, O.O. (1975). Statistical Inference for a Family of Counting Processes, PhD Thesis. University of California, Berkeley. [2] Andersen, P.K., Borgan, O., Gill, R.D. & Keiding, N. (1993). Statistical Models Based on Counting Processes. Springer-Verlag, New York. [3] Collett, D. (1994). Modeling Survival Data in Medical Research. Chapman & Hall, London. [4] Dain, B.J., Freeman, D.H. & Vredenburgh, J.J. (1989). Comparison of different packages survival test results, in Proceedings of the Statistical Computing Section. American Statistical Association, Alexandria, pp [5] Goldstein, R., Anderson, J., Ash, A., Craig, B., Harrington, D. & Pagano, M. (1989). Survival analysis software on MS/PC-DOS computers, Journal of Applied Econometrics 4, [6] Grambsch, P. & Therneau, T. (1994). Proportional hazards tests and diagnostics based on weighted residuals, Biometrika 81, ; amendments and corrections in Biometrika 82 (1995), p [7] Gross, S.T. & Lai, T.L. (1996). Nonparametric estimation and regression analysis with left-truncated and right-censored data, Journal of the American Statistical Association 91, [8] Han, A. & Hausman, J. (1988). Semiparametric Estimation of Duration and Competing Risk Models. Department of Economics, MIT, Cambridge, Mass. [9] Harrell, F.E. (1996). Design: S-Plus functions for biostatistical/epidemiological modeling, testing, estimation, validation, graphics, prediction, and typesetting by storing enhanced model design attributes in the fit. Available from Statlib, [10] Harrell, F.E., Jr. & Goldstein, R. (1997). A survey of microcomputer survival analysis software: the need for an integrated framework, American Statistician, to appear. [11] Harrell, F.E., Lee, K.L. & Mark, D.B. (1996). Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors, Statistics in Medicine 15, [12] Huber, P.J. (1967). The behavior of maximum likelihood estimates under nonstandard conditions, in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, Berkeley, pp [13] Kalbfleisch, J.D. & Lawless, J.F. (1988). Estimation of reliability in field-performance studies (with discussion), Technometrics 30, [14] Kalbfleisch, J.D. & Lawless, J.F. (1991). Regression methods for right truncated data with applications to Aids incubation times and reporting lags, Statistica Sinica 1, [15] Kalbfleisch, J.D. & Lawless, J.F. (1992). Some useful statistical methods for truncated data, Journal of Quality Technology 24, [16] Kalbfleisch, J.K., Lawless, J.F. & Robinson, J.A. (1991). Methods for the analysis and prediction of warranty claims, Technometrics 33, [17] Lin, D.Y. (1994). Cox regression analysis of multivariate failure time data: the martingale approach, Statistics in Medicine 13, [18] Lin, D.Y. & Wei, L.J. (1989). The robust inference for the Cox proportional hazards model, Journal of the American Statistical Association 84, [19] Lindsey, J.K. (1995). Fitting parametric counting processes by using log-linear models, Applied Statistics 44, [20] Miller, R. & Halpern J. (1982). Regression with censored data, Biometrika 69,

6 6 Survival Analysis, Software [21] Moon, C.-G. (1989). A Monte Carlo comparison of semiparametric Tobit estimators, Journal of Applied Econometrics 4, [22] Nelson, W. (1990). Hazard plotting of left truncated life data, Journal of Quality Technology 22, [23] Therneau, T.M., Grambsch, P.M. & Fleming, T.R. (1990). Martingale-based residuals for survival models, Biometrika 77, [24] White, H. (1980). A heteroskedasticity-consistent variance matrix estimator and a direct test for heteroskedasticity, Econometrica 48, [25] White, H. (1982). Maximum likelihood estimation of misspecified models, Econometrica 50, [26] Wu, M. & Ware, J.H. (1979). On the use of repeated measurements in regression analysis with dichotomous responses, Biometrics 35, (See also Software, Biostatistical) RICHARD GOLDSTEIN &FRANK HARRELL

200609 - ATV - Lifetime Data Analysis

200609 - ATV - Lifetime Data Analysis Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 715 - EIO - Department of Statistics and Operations Research 1004 - UB - (ENG)Universitat

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

Survival analysis methods in Insurance Applications in car insurance contracts

Survival analysis methods in Insurance Applications in car insurance contracts Survival analysis methods in Insurance Applications in car insurance contracts Abder OULIDI 1-2 Jean-Marie MARION 1 Hérvé GANACHAUD 3 1 Institut de Mathématiques Appliquées (IMA) Angers France 2 Institut

More information

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics

More information

Introduction to Event History Analysis DUSTIN BROWN POPULATION RESEARCH CENTER

Introduction to Event History Analysis DUSTIN BROWN POPULATION RESEARCH CENTER Introduction to Event History Analysis DUSTIN BROWN POPULATION RESEARCH CENTER Objectives Introduce event history analysis Describe some common survival (hazard) distributions Introduce some useful Stata

More information

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

Checking proportionality for Cox s regression model

Checking proportionality for Cox s regression model Checking proportionality for Cox s regression model by Hui Hong Zhang Thesis for the degree of Master of Science (Master i Modellering og dataanalyse) Department of Mathematics Faculty of Mathematics and

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD Tips for surviving the analysis of survival data Philip Twumasi-Ankrah, PhD Big picture In medical research and many other areas of research, we often confront continuous, ordinal or dichotomous outcomes

More information

Duration Analysis. Econometric Analysis. Dr. Keshab Bhattarai. April 4, 2011. Hull Univ. Business School

Duration Analysis. Econometric Analysis. Dr. Keshab Bhattarai. April 4, 2011. Hull Univ. Business School Duration Analysis Econometric Analysis Dr. Keshab Bhattarai Hull Univ. Business School April 4, 2011 Dr. Bhattarai (Hull Univ. Business School) Duration April 4, 2011 1 / 27 What is Duration Analysis?

More information

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne Applied Statistics J. Blanchet and J. Wadsworth Institute of Mathematics, Analysis, and Applications EPF Lausanne An MSc Course for Applied Mathematicians, Fall 2012 Outline 1 Model Comparison 2 Model

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

UNDERGRADUATE DEGREE DETAILS : BACHELOR OF SCIENCE WITH

UNDERGRADUATE DEGREE DETAILS : BACHELOR OF SCIENCE WITH QATAR UNIVERSITY COLLEGE OF ARTS & SCIENCES Department of Mathematics, Statistics, & Physics UNDERGRADUATE DEGREE DETAILS : Program Requirements and Descriptions BACHELOR OF SCIENCE WITH A MAJOR IN STATISTICS

More information

Introduction. Survival Analysis. Censoring. Plan of Talk

Introduction. Survival Analysis. Censoring. Plan of Talk Survival Analysis Mark Lunt Arthritis Research UK Centre for Excellence in Epidemiology University of Manchester 01/12/2015 Survival Analysis is concerned with the length of time before an event occurs.

More information

Department of Epidemiology and Public Health Miller School of Medicine University of Miami

Department of Epidemiology and Public Health Miller School of Medicine University of Miami Department of Epidemiology and Public Health Miller School of Medicine University of Miami BST 630 (3 Credit Hours) Longitudinal and Multilevel Data Wednesday-Friday 9:00 10:15PM Course Location: CRB 995

More information

BayesX - Software for Bayesian Inference in Structured Additive Regression

BayesX - Software for Bayesian Inference in Structured Additive Regression BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich

More information

Curriculum - Doctor of Philosophy

Curriculum - Doctor of Philosophy Curriculum - Doctor of Philosophy CORE COURSES Pharm 545-546.Pharmacoeconomics, Healthcare Systems Review. (3, 3) Exploration of the cultural foundations of pharmacy. Development of the present state of

More information

How To Understand The Theory Of Probability

How To Understand The Theory Of Probability Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL

More information

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way

More information

Teaching Biostatistics to Postgraduate Students in Public Health

Teaching Biostatistics to Postgraduate Students in Public Health Teaching Biostatistics to Postgraduate Students in Public Health Peter A Lachenbruch - h s hgeles, California, USA 1. Introduction This paper describes how biostatistics is taught in US Schools of Public

More information

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015

Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 1 Advanced Quantitative Methods for Health Care Professionals PUBH 742 Spring 2015 Instructor: Joanne M. Garrett, PhD e-mail: joanne_garrett@med.unc.edu Class Notes: Copies of the class lecture slides

More information

Package depend.truncation

Package depend.truncation Type Package Package depend.truncation May 28, 2015 Title Statistical Inference for Parametric and Semiparametric Models Based on Dependently Truncated Data Version 2.4 Date 2015-05-28 Author Takeshi Emura

More information

From the help desk: Bootstrapped standard errors

From the help desk: Bootstrapped standard errors The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

More information

EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester

EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester Instructor Dr. Jeffrey Harring 1230E Benjamin Building Phone: (301) 405-3630 Email: harring@umd.edu Office Hours Tuesday 2:00-3:00pm, or by appointment Course Objectives, Description and Prerequisites

More information

STATISTICAL ANALYSIS OF SAFETY DATA IN LONG-TERM CLINICAL TRIALS

STATISTICAL ANALYSIS OF SAFETY DATA IN LONG-TERM CLINICAL TRIALS STATISTICAL ANALYSIS OF SAFETY DATA IN LONG-TERM CLINICAL TRIALS Tailiang Xie, Ping Zhao and Joel Waksman, Wyeth Consumer Healthcare Five Giralda Farms, Madison, NJ 794 KEY WORDS: Safety Data, Adverse

More information

Predicting Customer Churn in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS

Predicting Customer Churn in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS Paper 114-27 Predicting Customer in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS Junxiang Lu, Ph.D. Sprint Communications Company Overland Park, Kansas ABSTRACT

More information

The University of North Carolina at Chapel Hill School of Social Work. SOWO 919 Longitudinal and Multilevel Analysis Spring Semester, 2009

The University of North Carolina at Chapel Hill School of Social Work. SOWO 919 Longitudinal and Multilevel Analysis Spring Semester, 2009 The University of North Carolina at Chapel Hill School of Social Work Instructor SOWO 919 Longitudinal and Multilevel Analysis Spring Semester, 2009 Shenyang Guo, Ph.D., Room 524j, Tate Turner Kuralt Building

More information

Statistical Methods for research in International Relations and Comparative Politics

Statistical Methods for research in International Relations and Comparative Politics James Raymond Vreeland Dept. of Political Science Assistant Professor Yale University E-Mail: james.vreeland@yale.edu Room 300 Tel: 203-432-5252 124 Prospect Avenue Office hours: Wed. 10am to 12pm New

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

Health Policy and Administration PhD Track in Health Services and Policy Research

Health Policy and Administration PhD Track in Health Services and Policy Research Health Policy and Administration PhD Track in Health Services and Policy INTRODUCTION The Health Policy and Administration (HPA) Division of the UIC School of Public Health offers a PhD track in Health

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

Master programme in Statistics

Master programme in Statistics Master programme in Statistics Björn Holmquist 1 1 Department of Statistics Lund University Cramérsällskapets årskonferens, 2010-03-25 Master programme Vad är ett Master programme? Breddmaster vs Djupmaster

More information

Name of the module: Multivariate biostatistics and SPSS Number of module: 471-8-4081

Name of the module: Multivariate biostatistics and SPSS Number of module: 471-8-4081 Name of the module: Multivariate biostatistics and SPSS Number of module: 471-8-4081 BGU Credits: 1.5 ECTS credits: Academic year: 4 th Semester: 15 days during fall semester Hours of instruction: 8:00-17:00

More information

Comparison of resampling method applied to censored data

Comparison of resampling method applied to censored data International Journal of Advanced Statistics and Probability, 2 (2) (2014) 48-55 c Science Publishing Corporation www.sciencepubco.com/index.php/ijasp doi: 10.14419/ijasp.v2i2.2291 Research Paper Comparison

More information

Publication List. Chen Zehua Department of Statistics & Applied Probability National University of Singapore

Publication List. Chen Zehua Department of Statistics & Applied Probability National University of Singapore Publication List Chen Zehua Department of Statistics & Applied Probability National University of Singapore Publications Journal Papers 1. Y. He and Z. Chen (2014). A sequential procedure for feature selection

More information

Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry

Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry Paper 12028 Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry Junxiang Lu, Ph.D. Overland Park, Kansas ABSTRACT Increasingly, companies are viewing

More information

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department of Mathematics and Statistics Degree Level Expectations, Learning Outcomes, Indicators of

More information

The Probit Link Function in Generalized Linear Models for Data Mining Applications

The Probit Link Function in Generalized Linear Models for Data Mining Applications Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications

More information

Handling attrition and non-response in longitudinal data

Handling attrition and non-response in longitudinal data Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein

More information

7.1 The Hazard and Survival Functions

7.1 The Hazard and Survival Functions Chapter 7 Survival Models Our final chapter concerns models for the analysis of data which have three main characteristics: (1) the dependent variable or response is the waiting time until the occurrence

More information

Statistical Methods for the Analysis of Alcohol and Drug Uses for Young Adults

Statistical Methods for the Analysis of Alcohol and Drug Uses for Young Adults Journal of Data Science 7(2009), 469-485 Statistical Methods for the Analysis of Alcohol and Drug Uses for Young Adults Liang Zhu, Jianguo Sun and Phillip Wood University of Missouri Abstract: Alcohol

More information

Overview of Factor Analysis

Overview of Factor Analysis Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

More information

Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models.

Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models. Dr. Jon Starkweather, Research and Statistical Support consultant This month

More information

Distance to Event vs. Propensity of Event A Survival Analysis vs. Logistic Regression Approach

Distance to Event vs. Propensity of Event A Survival Analysis vs. Logistic Regression Approach Distance to Event vs. Propensity of Event A Survival Analysis vs. Logistic Regression Approach Abhijit Kanjilal Fractal Analytics Ltd. Abstract: In the analytics industry today, logistic regression is

More information

Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University

Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University Missing Data in Longitudinal Studies: To Impute or not to Impute? Robert Platt, PhD McGill University 1 Outline Missing data definitions Longitudinal data specific issues Methods Simple methods Multiple

More information

240ST014 - Data Analysis of Transport and Logistics

240ST014 - Data Analysis of Transport and Logistics Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 240 - ETSEIB - Barcelona School of Industrial Engineering 715 - EIO - Department of Statistics and Operations Research MASTER'S

More information

A random point process model for the score in sport matches

A random point process model for the score in sport matches IMA Journal of Management Mathematics (2009) 20, 121 131 doi:10.1093/imaman/dpn027 Advance Access publication on October 30, 2008 A random point process model for the score in sport matches PETR VOLF Institute

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

Survival Analysis of Dental Implants. Abstracts

Survival Analysis of Dental Implants. Abstracts Survival Analysis of Dental Implants Andrew Kai-Ming Kwan 1,4, Dr. Fu Lee Wang 2, and Dr. Tak-Kun Chow 3 1 Census and Statistics Department, Hong Kong, China 2 Caritas Institute of Higher Education, Hong

More information

Learning Objectives for Selected Programs Offering Degrees at Two Academic Levels

Learning Objectives for Selected Programs Offering Degrees at Two Academic Levels Learning Objectives for Selected Programs Offering Degrees at Two Academic Levels Discipline Degree Learning Objectives Accounting 1. Students graduating with a in Accounting should be able to understand

More information

Statistics for Biology and Health

Statistics for Biology and Health Statistics for Biology and Health Series Editors M. Gail, K. Krickeberg, J.M. Samet, A. Tsiatis, W. Wong For further volumes: http://www.springer.com/series/2848 David G. Kleinbaum Mitchel Klein Survival

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

STATISTICS COURSES UNDERGRADUATE CERTIFICATE FACULTY. Explanation of Course Numbers. Bachelor's program. Master's programs.

STATISTICS COURSES UNDERGRADUATE CERTIFICATE FACULTY. Explanation of Course Numbers. Bachelor's program. Master's programs. STATISTICS Statistics is one of the natural, mathematical, and biomedical sciences programs in the Columbian College of Arts and Sciences. The curriculum emphasizes the important role of statistics as

More information

Pharmacoeconomic, Epidemiology, and Pharmaceutical Policy and Outcomes Research (PEPPOR) Graduate Program

Pharmacoeconomic, Epidemiology, and Pharmaceutical Policy and Outcomes Research (PEPPOR) Graduate Program Pharmacoeconomic, Epidemiology, and Pharmaceutical Policy and Outcomes Research (PEPPOR) Graduate Program Front from left: 2010 Graduates Rupali Nail, PhD & Pallavi Jaiswal, MS; Back from left: PEPPOR

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Implementing Panel-Corrected Standard Errors in R: The pcse Package

Implementing Panel-Corrected Standard Errors in R: The pcse Package Implementing Panel-Corrected Standard Errors in R: The pcse Package Delia Bailey YouGov Polimetrix Jonathan N. Katz California Institute of Technology Abstract This introduction to the R package pcse is

More information

Statistical Analysis of Life Insurance Policy Termination and Survivorship

Statistical Analysis of Life Insurance Policy Termination and Survivorship Statistical Analysis of Life Insurance Policy Termination and Survivorship Emiliano A. Valdez, PhD, FSA Michigan State University joint work with J. Vadiveloo and U. Dias Session ES82 (Statistics in Actuarial

More information

REPORT DOCUMENTATION PAGE

REPORT DOCUMENTATION PAGE REPORT DOCUMENTATION PAGE Form Approved OMB NO. 0704-0188 Public Reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instructions,

More information

Lecture 15 Introduction to Survival Analysis

Lecture 15 Introduction to Survival Analysis Lecture 15 Introduction to Survival Analysis BIOST 515 February 26, 2004 BIOST 515, Lecture 15 Background In logistic regression, we were interested in studying how risk factors were associated with presence

More information

Use Data Mining Techniques to Assist Institutions in Achieving Enrollment Goals: A Case Study

Use Data Mining Techniques to Assist Institutions in Achieving Enrollment Goals: A Case Study Use Data Mining Techniques to Assist Institutions in Achieving Enrollment Goals: A Case Study Tongshan Chang The University of California Office of the President CAIR Conference in Pasadena 11/13/2008

More information

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components

An Application of Weibull Analysis to Determine Failure Rates in Automotive Components An Application of Weibull Analysis to Determine Failure Rates in Automotive Components Jingshu Wu, PhD, PE, Stephen McHenry, Jeffrey Quandt National Highway Traffic Safety Administration (NHTSA) U.S. Department

More information

An Overview and Evaluation of Decision Tree Methodology

An Overview and Evaluation of Decision Tree Methodology An Overview and Evaluation of Decision Tree Methodology ASA Quality and Productivity Conference Terri Moore Motorola Austin, TX terri.moore@motorola.com Carole Jesse Cargill, Inc. Wayzata, MN carole_jesse@cargill.com

More information

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Tests for Two Survival Curves Using Cox s Proportional Hazards Model Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.

More information

Comparison of Survival Curves

Comparison of Survival Curves Comparison of Survival Curves We spent the last class looking at some nonparametric approaches for estimating the survival function, Ŝ(t), over time for a single sample of individuals. Now we want to compare

More information

Predicting Customer Default Times using Survival Analysis Methods in SAS

Predicting Customer Default Times using Survival Analysis Methods in SAS Predicting Customer Default Times using Survival Analysis Methods in SAS Bart Baesens Bart.Baesens@econ.kuleuven.ac.be Overview The credit scoring survival analysis problem Statistical methods for Survival

More information

Statistics in Applications III. Distribution Theory and Inference

Statistics in Applications III. Distribution Theory and Inference 2.2 Master of Science Degrees The Department of Statistics at FSU offers three different options for an MS degree. 1. The applied statistics degree is for a student preparing for a career as an applied

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

BIOM611 Biological Data Analysis

BIOM611 Biological Data Analysis BIOM611 Biological Data Analysis Spring, 2015 Tentative Syllabus Introduction BIOMED611 is a ½ unit course required for all 1 st year BGS students (except GCB students). It will provide an introduction

More information

Karyn Ruiz-Cordell, MA, PhD Shunda Irons-Brown, PhD, MBA, CHCP Tamar Sapir, PhD

Karyn Ruiz-Cordell, MA, PhD Shunda Irons-Brown, PhD, MBA, CHCP Tamar Sapir, PhD Advanced Methodologies in Outcomes & Insights Research Study Design Measuring Knowledge vs. Impact vs. Performance vs. Quality of Care and Everything In Between Karyn Ruiz-Cordell, MA, PhD Shunda Irons-Brown,

More information

COURSE PLAN BDA: Biomedical Data Analysis Master in Bioinformatics for Health Sciences. 2015-2016 Academic Year Qualification.

COURSE PLAN BDA: Biomedical Data Analysis Master in Bioinformatics for Health Sciences. 2015-2016 Academic Year Qualification. COURSE PLAN BDA: Biomedical Data Analysis Master in Bioinformatics for Health Sciences 2015-2016 Academic Year Qualification. Master's Degree 1. Description of the subject Subject name: Biomedical Data

More information

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science

More information

A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA

A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA 123 Kwantitatieve Methoden (1999), 62, 123-138. A REVIEW OF CURRENT SOFTWARE FOR HANDLING MISSING DATA Joop J. Hox 1 ABSTRACT. When we deal with a large data set with missing data, we have to undertake

More information

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards Week 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution of a

More information

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study Kerala School of Mathematics Course in Statistics for Scientists Introduction to Data Analysis T.Krishnan Strand Life Sciences, Bangalore What is Data Analysis Statistics is a body of methods how to use

More information

Ph.D. Biostatistics 2014-2015 Note: All curriculum revisions will be updated immediately on the website http://www.publichealth.gwu.

Ph.D. Biostatistics 2014-2015 Note: All curriculum revisions will be updated immediately on the website http://www.publichealth.gwu. Columbian College of Arts and Sciences and Milken Institute School of Public Health Ph.D. Biostatistics 2014-2015 Note: All curriculum revisions will be updated immediately on the website http://www.publichealth.gwu.edu

More information

THE RISK DISTRIBUTION CURVE AND ITS DERIVATIVES. Ralph Stern Cardiovascular Medicine University of Michigan Ann Arbor, Michigan. stern@umich.

THE RISK DISTRIBUTION CURVE AND ITS DERIVATIVES. Ralph Stern Cardiovascular Medicine University of Michigan Ann Arbor, Michigan. stern@umich. THE RISK DISTRIBUTION CURVE AND ITS DERIVATIVES Ralph Stern Cardiovascular Medicine University of Michigan Ann Arbor, Michigan stern@umich.edu ABSTRACT Risk stratification is most directly and informatively

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Master of Science (MS) in Biostatistics 2014-2015. Program Director and Academic Advisor:

Master of Science (MS) in Biostatistics 2014-2015. Program Director and Academic Advisor: Master of Science (MS) in Biostatistics 014-015 Note: All curriculum revisions will be updated immediately on the website http://publichealh.gwu.edu Program Director and Academic Advisor: Dante A. Verme,

More information

It is important to bear in mind that one of the first three subscripts is redundant since k = i -j +3.

It is important to bear in mind that one of the first three subscripts is redundant since k = i -j +3. IDENTIFICATION AND ESTIMATION OF AGE, PERIOD AND COHORT EFFECTS IN THE ANALYSIS OF DISCRETE ARCHIVAL DATA Stephen E. Fienberg, University of Minnesota William M. Mason, University of Michigan 1. INTRODUCTION

More information

PROBABILITY AND STATISTICS. Ma 527. 1. To teach a knowledge of combinatorial reasoning.

PROBABILITY AND STATISTICS. Ma 527. 1. To teach a knowledge of combinatorial reasoning. PROBABILITY AND STATISTICS Ma 527 Course Description Prefaced by a study of the foundations of probability and statistics, this course is an extension of the elements of probability and statistics introduced

More information

Programme du parcours Clinical Epidemiology 2014-2015. UMR 1. Methods in therapeutic evaluation A Dechartres/A Flahault

Programme du parcours Clinical Epidemiology 2014-2015. UMR 1. Methods in therapeutic evaluation A Dechartres/A Flahault Programme du parcours Clinical Epidemiology 2014-2015 UR 1. ethods in therapeutic evaluation A /A Date cours Horaires 15/10/2014 14-17h General principal of therapeutic evaluation (1) 22/10/2014 14-17h

More information

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical

More information

Applications of R Software in Bayesian Data Analysis

Applications of R Software in Bayesian Data Analysis Article International Journal of Information Science and System, 2012, 1(1): 7-23 International Journal of Information Science and System Journal homepage: www.modernscientificpress.com/journals/ijinfosci.aspx

More information

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided

More information

TEACHING STATISTICS THROUGH DATA ANALYSIS

TEACHING STATISTICS THROUGH DATA ANALYSIS TEACHING STATISTICS THROUGH DATA ANALYSIS Thomas Piazza Survey Research Center and Department of Sociology University of California Berkeley, California The question of how best to teach statistics is

More information

Introduction to Biostatistics PhD programs

Introduction to Biostatistics PhD programs Introduction to Biostatistics PhD programs Takumi Saegusa University of Washington Department of Biostatistics April 17 2013 Takumi Saegusa (UW Biostat) Biostat PhD April 17 2013 1 / 35 Outline What is

More information

ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE. School of Mathematical Sciences

ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE. School of Mathematical Sciences ! ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE School of Mathematical Sciences New Revised COURSE: COS-MATH-252 Probability and Statistics II 1.0 Course designations and approvals:

More information

The Variability of P-Values. Summary

The Variability of P-Values. Summary The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 boos@stat.ncsu.edu August 15, 2009 NC State Statistics Departement Tech Report

More information

Curriculum for to the PhD Program in Pharmacy Administration

Curriculum for to the PhD Program in Pharmacy Administration Curriculum for to the PhD Program in Pharmacy Administration Course Hours Course Title BSE 5113 3 Principles of Epidemiology BSE 5163 3 Biostatistics Methods I* BSE 5173 3 Biostatistics Methods II* BSE

More information

Department of Economics

Department of Economics Department of Economics On Testing for Diagonality of Large Dimensional Covariance Matrices George Kapetanios Working Paper No. 526 October 2004 ISSN 1473-0278 On Testing for Diagonality of Large Dimensional

More information

Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models

Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models Prepared by Jim Gaetjens Presented to the Institute of Actuaries of Australia

More information