A piecewise Markov process for analysing survival from breast cancer in dierent risk groups

Size: px
Start display at page:

Download "A piecewise Markov process for analysing survival from breast cancer in dierent risk groups"

Transcription

1 STATISTICS IN MEDICINE Statist. Med. 2001; 20: A piecewise Markov process for analysing survival from breast cancer in dierent risk groups Rafael Perez-Ocon ;, Juan Eloy Ruiz-Castro and M. Luz Gamiz-Perez Departamento de Estadstica e Investigacion Operativa; Universidad de Granada; Granada; Spain SUMMARY A study of the relapse and survival times for 300 breast cancer patients submitted to post-surgical treatments is presented. After surgery, these patients were given three treatments: chemotherapy; radiotherapy; hormonal therapy and a combination of them. From the data set, a non-homogeneous Markov model is selected as suitable for the evolution of the disease. The model is applied considering two time periods during the observation of the cohort where the disease is well dierentiated with respect to death and relapse. The eect of the treatments on the patients is introduced into the model via the transition intensity functions. A piecewise Markov process is applied, the likelihood function is built and the parameters are estimated, following a parametric methodological procedure. As a consequence, a survival table for dierent treatments is given, and survival functions for dierent treatments are plotted and compared with the corresponding empirical survival function. The t of the dierent curves is good, and predictions can be made on the survival probabilities to post-surgical treatments for dierent risk groups. Copyright? 2001 John Wiley & Sons, Ltd. 1. INTRODUCTION The introduction of dynamic models in survival studies presents certain advantages that are useful in this eld of application since the disease evolves over time and therefore the subjects must be followed up within a certain period. It is characteristic of survival studies that data are frequently censored; however, sometimes dierent factors that aect the lifetime evolution of the subjects must be introduced into the model when they are known. Stochastic processes are helpful when subjects in a population can spend times in dierent states, with sojourn times in the states being of interest. This is the case for cancer patients under treatment; they can be in dierent states, and the sojourn times in these states, as well as the changes among states that occur are of interest. When the present state of the disease summarizes all the previous information, the Markov model is appropriate. Moreover, this model has the advantage of exibility in the incorporation Correspondence to: Rafael Perez-Ocon, Departamento de Estadstica e Investigacion Operativa, Universidad de Granada, Granada, Spain rperezo@ugr.es Contract=grant sponsor: DGES; contract=grant number: PB Received March 1998 Copyright? 2001 John Wiley & Sons, Ltd. Accepted January 2000

2 110 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ of censored data and in the introduction of risk factors. In survival studies, homogeneous Markov models in continuous time have been used by various authors to analyse dierent diseases [1 4]. These works and the references therein are used as a basis to demonstrate that such models represent a good approach in this eld and provide useful insights. The hypothesis of homogeneity is unrealistic in some cases, since as time goes on the disease evolves. Several approaches to statistical inference in non-homogeneous Markov processes have been followed using non-parametric methods [5 7]. In this paper we propose a continuous-time non-homogeneous Markov process to study the evolution of breast cancer in a cohort of 300 patients. The non-homogeneous approach considered is a piecewise Markov process, with the transition intensity functions being step functions. We are interested in the survival evolution of dierent groups of patients that are determined by the treatment. These treatments are incorporated as covariates via the transition intensity functions of the Markov process. There are three treatments: chemotherapy; radiotherapy; hormonal therapy, and a combination of these three. Several researchers have studied Markov processes with covariates, and a procedure to obtain the parameters in a model with covariates has been reported [1; 2]. The methodology we review in this work follows a parametric procedure [1] dierent from other nonand semi-parametric methods [7]. In our model, we consider a three-dimensional covariate vector. Parametric methods are applied, the likelihood function is constructed and the transition intensity functions estimated. These functions are dependent on time due to their non-homogeneity, and the eect of the risk factors considered are included in these functions in a similar way to the Cox model [8]. The risk groups considered are the dierent classes of treatments that the patients received after surgery. We examine the inuence of post-surgical treatments in these groups, particularly on their lifetimes and relapse times. The paper is organized as follows. In Section 2 we provide a description of the data and the model. In Section 3 the likelihood function is built, parameters are estimated and survival functions for dierent treatments are given and compared with the empirical survival graphs. A test of the goodness-of-t between theoretical and empirical curves is applied, showing the goodness-of-t of the model. In Section 4 we draw several conclusions from the work. 2. SUBJECTS AND METHODS Data on a cohort of subjects submitted to the surgical treatment of breast cancer were obtained from the Department of Radiology in the Hospital Clnico of Granada. The follow-up dates from 1973 and all non-censored subjects were seen longitudinally until December A total of 518 subjects submitted to surgical treatment have been monitored. Entry into the study began at the time of surgery. Then, dierent treatments were given to the patients. Three states were considered in the evolution of the disease: no relapse (state 1); relapse (state 2), and death (state 3), which is an absorbent state. A test for Markov assumption, using a competing-risks Cox model, was performed [9; 10]. The p-value calculated from the data was , so that there is no empirical evidence for rejecting the null hypothesis of the Markov assumption. From this cohort, a subcohort of 300 patients with breast cancer was considered. All of these 300 patients were selected because they received the same chemotherapy treatment, when applied, and had at least one axillary node aected. The observation period for subjects in this cohort was at least ve years. This cohort was studied

3 PIECEWISE MARKOV PROCESS FOR ANALYSING SURVIVAL FROM BREAST CANCER 111 because it was deemed informative by the group of oncologists with whom we are working. Breast cancer is the most common cancer in the female population, not only in Spain but in the entire world. The usual treatments include the ones considered here, and relapse and mortality occur more frequently in patients with some axillary nodes aected. Transition rates to relapse and death for dierent groups of risk are estimated, and these are quantities with which epidemiologists are familiar [1; 2; 18]. With the multi-state model we use, it is possible to learn more about how the dierent treatments aect the evolution of the disease over time. The mean age for our cohort was 52.5 years, and the total censored patients was 122 (40.7 per cent of the total). In the rst four years, 195 survived (170 in state 1 and 25 in state 2), 105 died (67 in state 1 and 38 in state 2), with no censored data in this period. From four to ten years there were 70 deaths (45 in state 1 and 25 in state 2), 25 censored (24 in state 1 and 1 in state 2), 12 relapses (4 per cent of total data), and 100 patients still alive after ten years. After ten years, there were 3 deaths (2 in state 1 and 1 in state 2), 97 censored (86 in state 1 and 11 in state 2), and only one woman relapsed in this period. It is thus evident that the deaths and relapses occurred essentially in the rst ten years. Most censored data are in state 1 (25 in the rst ten years and 97 thereafter). After surgery, three specic treatments were applied to these patients: chemotherapy; (CT); radiotherapy (RT), and hormonal therapy (HT), and combinations of these three. No treatment was given to patients transferred to the hospital from other places. The treatment RT-HT-CT was applied to 39 patients, RT-CT to 110, RT to only 50, while 47 had no treatment, and the remaining possible treatments were applied to less than 22 patients The Markov model Let {X (t); t 0} be a Markov process with a state space E, where X (t) denotes the state of the disease at time t 0. Homogeneous Markov processes have frequently been used in the study of the evolution of diseases. For this model, the transition probabilities p ij (t) are dened p ij (t)=p{x (t + s)=j X (s)=i} As can be seen, this expression does not depend on s. This means that the transition i j on an interval of length t has the same probability at any time. This model has been studied previously by the authors [3]. Under this model, in our case, the relapse probability (transition 1 2) in one year was the same when a patient entered state 1 recently and when a patient had been in state 1 for some time (for example, two years). We also followed another approach to the evolution of breast cancer, introducing non-homogeneous Markov processes. A non-homogeneous Markov process with a state space E was considered. Transition probability functions in this model are denoted by p ij (s; t), where 06s6t: p ij (s; t) = Prob{X (t)=j X (s)=i}; ij E where p ij (s; s)= ij (Kronecker delta). The transition matrix function is P(s; t)=(p ij (s; t)). Transition intensity functions are dened by p ij (t; t + h) q ij (t) = lim ; i;j E; i j h 0 h 1 p ii (t; t + h) q i (t) = lim ; i E h 0 h

4 112 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ Figure 1. Transitions among states in the Markov model. Transition intensity functions q ij (t) can be interpreted as the change rate from state i to state j in time t, and q i (t)= q ii (t) as the exit rate from state i in time t. The q-matrix of the process is Q(t)=(q ij (t)), which we assume to be conservative, that is, the sum of its rows is zero, so q ii (t)= q i (t)= j i q ij(t) for all t. The staying time T i in state i for this Markov process, if the initial state is i, is [11] ( t ) P{T i t} = exp q i (u)du The transitions among states when the model is applied to our data set are given in Figure Model selection The eect of the treatments is incorporated into the model via the transition intensity functions [12]. A column vector z of observed covariates is considered (z is the transpose vector of z). Each component of this vector indicates the application or not of treatments. In this way, the data set is partitioned into dierent subcohorts, with particular attention paid to the evolution of these subcohorts. Therefore, in the model, we considered a covariate vector z, and the expression of these functions in the non-homogeneous case between the states i; j is 0 q ij (t; z)= ij (t) exp(z ij (t)); ij (t) 0 In this expression, ij (t) is the parameter column vector associated with the covariate vector z in the transition between states i and j in time t, and q ij (t; z) is the transitional rate at t for the subjects characterized by the vector z. The scalar product of vectors is denoted by z ij (t). These vectors are interpreted as in the Cox model [8]: ij (t) represents the regression coecients of the covariates on transitional rates, and ij (t) is the baseline transition intensity function between states i and j in time t, the transitional rate when z is the null vector. The transition probability function from state i in time s to state j in time t is denoted by p ij (s; t; z). This is the non-homogeneous model with covariates. Our aim is to calculate the transition probability functions and the survival functions for groups of patients submitted to the same treatment, each group characterized by a certain value of the covariate vector. This was done by estimating the parameters using the maximum-likelihood method. An approach to non-homogeneity in a Markov process is the stepwise method. The model we present assumes that transition rates are constant in dierent time intervals, so that covariate eects are included and parameters are allowed to change over xed intervals of the time domain of the

5 PIECEWISE MARKOV PROCESS FOR ANALYSING SURVIVAL FROM BREAST CANCER 113 Table I. Deaths and censored patients for each period. 06t648 months t 48 months Survivors in state Survivors in state Deaths from state Deaths from state Censored in state Censored in state Relapse study. If we consider a piecewise q-matrix in the disjointed intervals determined by the points that approach non-homogeneity, a 1 ;a 2 ;:::;a k with a 0 = 0 and a k =, the q-matrix of the model can be expressed in a general form by Q(t; z)= { Q1 (z); 0=a 0 6t6a 1 ; Q l (z); a l 1 t6a l ; l =2; 3;:::;k 3. RESULTS The general model was introduced in Section 2, where we applied the homogeneous Markov process to our data set [3]. Our aim is to construct a more accurate model for representing the survival probabilities for the dierent groups of patients. A test for data homogeneity in Markov processes is applied, considering all the transitions among states, thus extending a previous work [13]. The experimental value derived from the data for the statistics in the asymptotic distribution of the score vector from the likelihood function was 18:16, with a p-value 0:0001, showing that the hypothesis of non-homogeneity cannot be rejected. Sojourn times in states play an important role in Markov processes, and thus the behaviour of these quantities is relevant in these studies. In our case, these times are time before relapse or death, and time after relapse until death, and our interest was to establish the inuence of treatments on these times. We studied the cumulative hazard functions of sojourn times in states and observed the relapse times from state 1, and death times from states 1 and 2. From these empirical functions, two periods can be established in the evolution of breast cancer: up to 48 months and after 48 months. After 120 months there was no information because there were few transitions. Consequently, two periods were considered in the evolution of each patient: from the entry to observation (t =0)uptot = 48 months and after 48 months. Table I presents a summary of the data by periods Likelihood function and parameter estimation The likelihood function we calculate is an adaptation of previous work to the piecewise model [6; 14]. We assume that n items are observed, all beginning in state 1, and that item i has m i changes of states, the last time being death or censorship, and thus we have a sequence of times for each item i: 0=t i; 1 t i; 2 t i;mi. In these times, the item successively occupied the states 1=x1 i ;:::;xi m i. The value of the covariate vector for item i is known and denoted by z i. Each patient introduces to the likelihood function dierent factors in accordance with her sample function.

6 114 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ Therefore, if the observed transition interval for a patient is between two cutpoints, the contribution to the likelihood is the transition probability with the corresponding q-matrix. If the observed transition interval has one cutpoint, the contribution to the likelihood is the product of two factors: the transition probability in the interval between the instant of jump and the cutpoint, and from this point to the next jump or censoring, with the corresponding q-matrices in each period. If the observed transition interval is between k cutpoints, the contribution is the product of (k + 1) transition probabilities with the corresponding q-matrices. The last time observed could be a death time or a censoring time; in the rst case, the last product is a transition probability to the absorbent state, and, in the second case, a survival probability in the last state visited. The likelihood function for this data set is expressed as L = n m i p x i (t r 1 ;xi i; r 1;t r i; r ; z i ) i=1 r=2 This model can now be applied to our data set. We will assume that the q-matrix changes in time t = 48 (four years) and that it is constant in the intervals determined by this partition [0,48] and ]48; [. The evolution of the patients is progressive; subjects go from state 1 to states 2 or 3, and from state 2 to state 3, but do not go back to a previous state, so that the only observed transitions are 1 2; 1 3; 2 3. The covariate vector is a three-vector z =(z 1 ;z 2 ;z 3 ) = (RT; HT; CT), with z h a dichotomous variable for h =1; 2; 3; taking the value 1 if the corresponding treatment was not applied and 0 if it was. The eect of treatment h on the transition i j is measured by the coecient ij h ;h=1; 2; 3, so we have ij(t)=(ij; 1 ij; 2 ij 3 ) in the intervals [0,48] and ]48; [. The interpretation of the parameters and is as in the homogeneous case [3] in each period. The transition intensity matrix for this model is ( 12 (t)e z 12(t) + 13 (t)e z 13(t) ) 12 (t)e z 12(t) 13 (t)e z 13(t) Q(t; z)= 0 23 (t)e z 23(t) 23 (t)e z 23(t) The entries of this matrix can be estimated by applying the maximum-likelihood method and taking into account the fact that this matrix is constant in each of the two intervals of the partition determined by the time 48. The computational program we have used is the MATLAB program which involves the algorithm of Nelder Mead [15]. The dierent factors appearing in the maximum-likelihood function are given in the Appendix. The estimated parameters following this procedure, with their standard errors, are given in Tables II and III. The transitional rates among states for dierent subcohorts determined by the applied treatments are calculated from these tables and are given in Table IV. Some of the estimated parameters are close to zero, and it is possible to improve the model specication Survival probabilities The forward Kolmogorov equations in this case are P (s; t; z)=p(s; t; z)q(t; z). We can derive the transition functions from these probability equations if we know the q-matrix Q(t; z). In the Appendix the mathematical expression of the estimated survival probabilities is given for patients in state 1. We represent the survival curves for dierent treatments and compare these curves with the corresponding empirical survival curves and test the t.

7 PIECEWISE MARKOV PROCESS FOR ANALYSING SURVIVAL FROM BREAST CANCER 115 Table II. Maximum-likelihood estimates and standard errors of baseline transition rates among states. Parameters 06t648 months t 48 months 12(t) (SE) (0.0017) (0.0005) 13(t) (SE) (0.0022) (0.0014) 23(t) (SE) (0.0427) (0.0105) Table III. Maximum-likelihood estimates and standard errors of regression coecients. Parameters 06t648 months t 48 months Components Components ij1 ij2 ij3 ij1 ij2 ij3 12(t) (SE) (0.3020) (0.3183) (0.2747) (0.6150) (0.6670) (0.5760) 13(t) 0:1657 0: : :2380 (SE) (1.1846) (0.8241) (1.0707) (0.7739) (0.3591) (0.4017) 23(t) 0:9038 0:2090 0: :9415 (SE) (0.3289) (0.3940) (0.3205) (0.0002) (0.6050) (0.6111) Table IV. Maximum-likelihood estimates of transition rates among states. Treatments ˆq 12 (t) ˆq 13 (t) ˆq 23 (t) 06t648 t 48 06t648 t 48 06t648 t 48 months months months months months months CT RT-CT HT-CT RT-HT-CT No treatment RT HT RT-HT In Figures 2 to 5, we give the survival function for treatments RT-CT-HT, RT-CT, RT, CT and no treatment, and the corresponding empirical survival curves using the product-limit estimate [16]. The graphs cover a period of ten years. For each graph a test to study the t has been drawn using the likelihood rate test [17]. When data are censored, one way to perform this test is to apply the usual likelihood test in each interval determined by a lifetable associated with the data. To test the equality of the two graphs in the

8 116 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ Figure 2. Empirical and theoretical curves for RT-HT-CT. Figure 3. Empirical and theoretical curves for RT-CT. general case, a lifetable with k intervals is calculated and compared with the theoretical curve obtained from the model. The null hypothesis H 0 is that the survival probabilities in both curves are identical in each interval of the lifetable. The statistical test is =2 k d j log d j +2 k d j0 j=1 j=1 (n j d j ) log n j d j n j d j0

9 PIECEWISE MARKOV PROCESS FOR ANALYSING SURVIVAL FROM BREAST CANCER 117 Figure 4. Empirical and theoretical curves for RT. Figure 5. Empirical and theoretical curves for no treatment. n j being the number of patients at risk at the beginning of interval I j ; d j the number of deaths in I j ;d j0 = n j (1 S j0 ), and S j0 the probability to survive interval I j given survival to interval I j 1 in the model. It is known that the limit distribution of the estimate, under the null hypothesis H 0, is bounded by a distribution 2 with k d.f. and a distribution 2 with k s d.f., k being the number of intervals considered and s the number of parameters. In this study s = 36, so we take

10 118 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ Table V. Values of the statistics of the likelihood ratio test for the survival t to treatments to the empirical survival functions. Treatment k p-value with 2 k p-value with 2 k s RT-HT-CT RT-CT :0001 RT :0001 CT No treatment Table VI. Survival probabilities for treatments to 4, 8 and 10 years following the piecewise Markov model. Survival probability 4 years 8 years 10 years 12 years RT,HT,CT RT-CT HT-CT RT-HT RT CT HT No treatment the lifetable in intervals of three months for all curves and the number k is greater than 36 in all cases. Moreover, three months is a common period of observation in medical studies. The critical values for the treatments are given in Table V. Table V shows that the t of the curves is good, except perhaps the CT-RT treatment, and thus the model provides a good approach to the survival for treatments listed in Figures 2 5. Table VI gives the survival probabilities for all treatments in dierent times. 4. CONCLUDING REMARKS In previous studies of the data set that we present here, the Cox model was applied and the hazard rate functions were determined for dierent risk groups. A more detailed study is presented in this paper. The application of the Markov model to the data set takes into account several specic considerations about the disease process. First, the Markov hypothesis is tested, and we have the empirical conrmation that the transition rates in states are not aected by the previous sojourn time. Therefore, we assume that the Markov model is appropriate. The homogeneous Markov process has been applied by the authors [3]. We considered that the t of the survival curves to the empirical ones were not very good, and therefore we have considered the homogeneity of the model. A test of homogeneity for a Markov process has been applied, and we have no empirical evidence against the non-homogeneity of the data. Therefore, we consider the transition rates depending on time. Many models can be applied

11 PIECEWISE MARKOV PROCESS FOR ANALYSING SURVIVAL FROM BREAST CANCER 119 hereafter. We have considered several models, tting dierent distributions to these times. The data set is censored and the t was generally not adequate. Overall, the graphic t of the survival curves is better when the two-piece Markov model is considered. These previous studies and the simplicity of the calculations with the stepwise model have led us to consider a model with one cutpoint (t = 48). The non-homogeneous model we rst applied had no covariates. However, it seemed too restrictive to consider the dierent groups of patients to be homogeneous with respect to lifetime, it being more realistic to assume that the treatments introduce some non-homogeneity within the risk groups. Therefore, dierent groups of patients have been considered, and covariates have been introduced. The Markov process framework developed by Kay [9], continued by Gentlemann et al. [1] and applied by Andersen et al. [18] for studying the eect of disease indicators on survival has been used in this paper for constructing survival probabilities for dierent groups of risk determined by the treatments for breast cancer, introducing non-homogeneity in time. The present paper is also indebted to the work of Lu et al. [19], where a Markov model is applied for the evaluation of prognostic factors inuencing the course of HIV. We have performed specic computational implementations of the model. Some of the parameters estimated are close to zero. A hypothesis test has been applied testing the null hypothesis of being zero for eight parameters, which cannot be rejected. Consequently, the model can be simplied, but the parameter values and the plot of the graphic curves are not signicantly dierent from those in the unsimplied model. We have considered the most general case. As a result, we have obtained the eect of treatments in the evolution of survival probabilities and relapse times to breast cancer in a cohort of patients. The initial cohort of 518 subjects can be considered random, as it included all the patients that arrived at the hospital. None the less, the treatments are not applied randomly, depending instead on the patients. For example, HT is applied when the patient has hormonal receptors. The contribution of our study is to establish relapse and survival curves for patients submitted to treatments after surgery. The interest of the group of doctors with whom we are working is to know the relapse and survival curves of dierent risk groups. This study can be extended by taking into account not only treatments, but age, number of axillary nodes aected, and size of tumour. This would provide an indicator on the patients that arrive at the hospital relative to the relapse and survival times in terms of the subcohort to which they belong. The computational calculations have been performed using the MATHEMATICA and MATLAB programs, adapting the software to a general homogeneous model with m states to an extension to the non-homogeneous model. The parameters are estimated following the algorithm of Nelder Mead [14], and a survival table results for dierent periods of time and for dierent treatments. Survival functions for dierent treatments are established and compared with the corresponding empirical survival function [16], using a test of the goodness-of-t between the two curves to show that the model is well tted. APPENDIX Let {X (t);t 0} be a Markov process that describes the evolution of a disease. State space E is nite, with one state being absorbent (death) and the rest of the states transient. We assume the notation given in Section 3 for change or censored times and occupied states for the process. The

12 120 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ likelihood function, with the vector covariates incorporated to the model, is L = n m i p x i (t r 1 ;xi i; r 1;t r i; r ; z i ) i=1 r=1 An approach to the non-homogeneous case can be made assuming piecewise constant transition intensity functions. Our estimate is reduced to the tting of homogeneous continuous models to a series of k intervals, each bounded by points through the observation interval. The q-matrix is allowed to change at certain times but is constant between these times, so we consider Q(t; z)=q l (z); a l 1 6t a l ; l=1; 2;:::;k a 1 ;a 2 ;:::;a k k being cutpoints with a 0 = 0 and a k =. For calculations we dene the intervals I j =[a j 1 ;a j [; J q =]a q 1 ;a q ]; j;q=1; 2;:::;k. Q(t; z) Let pij (l; z) be the transition probability function calculated using the q-matrix Q l (z). Then, the factors in the likelihood function above have dierent expressions: (a) If t i; r 1 I j ; t i; r J j (b) If t i; r 1 I j ; t i; r J j+1 p x i r 1 ;xi r (t i; r 1;t i; r ; z i )=p Qj(zi) x i r 1 ;xi r(t i; r t i; r 1 ; z i ) p x i (t r 1 ;xi i; r 1;t r i; r ; z i )=p Qj(zi) (a x i j t i; r 1 ; z i )p Qj+1(zi) (t r 1 ;xi x i i; r a j ; z i ) r 1 r 1 ;xi r (c) If t i; r 1 I j ; t i; r J q ; q j 2 p x i (t r 1 ;xi i; r 1;t r i; r ; z i )=p Qj(zi) x i r 1 r 1(a j t i; r 1 ; z i ) ;xi q 2 u=j p Qu+1(zi) (a x i u+1 a u ; z i )p Qq(zi) r 1 ;xi x i r 1 r 1 r(t i; r a q 1 ; z i ) ;xi The state of the items in the cutpoints is known. For parameter estimation, we must apply the Chapman Kolmogorov equation in the cutpoints to obtain the transition probability functions. Q(t; z) If P is the transition matrix calculated with the estimated q-matrix Q(t; z), the matricial expression for these probabilities is: For s I j ; t J j P(s; t; z)=p Qj(z) (t s; z) For s I j ; t J j+1 P(s; t; z)=p Qj(z) (a j s; z)p Qj+1(z) (a j t; z) For s I j ; t J q ; q j 2 P(s; t; z)=p Qj(z) q 2 (a j s; z) P Qu+1(z) (a u+1 a u ; z)p Qq(z) (t a q 1 ; z) u=j

13 PIECEWISE MARKOV PROCESS FOR ANALYSING SURVIVAL FROM BREAST CANCER 121 The survival probability to time t; S(t; z) for breast cancer patients with vector covariate z, is S(t; z)=p 11 (0;t; z)+p 12 (0;t; z) As the model has a good t up to ten years, taking as cutpoints a 0 = 0 and a 1 = 48, the survival function in the rst ten years after surgical treatment is obtained: If t [0; 48] If t ]48; [ S(t; z)=p Q1(z) 11 (t; z)+p Q1(z) 12 (t; z) S(t; z)=p Q1(z) 11 (48; z)[p Q2(z) 11 (t 48; z)+p Q2(z) 12 (t 48; z)] + p Q1(z) 12 (48; z)p Q2(z) 22 (t 48; z) Some of these functions are represented in Figures 2 5 for the dierent values of the covariate vector z indicated in Section 3. ACKNOWLEDGEMENTS The authors wish to thank the two anonymous referees for many helpful comments and valuable suggestions which improved the presentation and content of the paper. We are also very grateful to Dr Pedraza, head of the Department of Radiology at the University of Granada, for kindly supplying the data. The rst author gratefully acknowledges the nancial support by DGES, Proyecto PB , Ministerio de Educacion y Cultura, España. REFERENCES 1. Gentlemann RC, Lawless JF, Lindsey JC, Yan P. Multi-state Markov models for analysing incomplete disease history data with illustrations for HIV disease. Statistics in Medicine 1985; 13: Marshall G, Jones RH. Multi-state models and diabetic retinopathy. Statistics in Medicine 1995; 14: Perez-Ocon R, Ruiz-Castro JE, Gamiz-Perez ML. A multivariate model to measure the eect of treatments in survival to breast cancer. Biometrical Journal 1998; 40(6): Aalen OO, Vernon TF, De Angelis D, Day NE, Gill ON. A Markov model for HIV disease progression including the eect of HIV diagnosis and treatment: Application to AIDS prediction in England and Wales. Statistics in Medicine 1997; 16: Aalen OO, Johansen S. An empirical transition matrix for non-homogeneous Markov chains based on censored observations. Scandinavian Journal of Statistics 1978; 5: Keiding N, Andersen PK. Nonparametric estimation of transition intensities and transition probabilities: a case study of a two-state Markov process. Applied Statistics 1989; 38(2): Andersen PK, Hansen LS, Keiding N. Non- and semiparametric estimation of transition probabilities from censored observations of a non-homogeneous Markov process. Scandinavian Journal of Statistics 1991; 18: Cox DR. Regression models and life tables (with discussion). Journal of the Royal Statistic Society; Series B 1972; 34: Kay R. A Markov model for analysing cancer markers and disease states in survival studies. Biometrics 1986; 42: Kalbeish JD, Prentice RL. The Statistical Analysis of Failure Time Data. Wiley: Gutierrez R, Perez-Ocon R. Remarks of the paper stochastic analysis of Q-matrix (Discussion on the paper Stochastic analysis of Q-matrix by K. L. Chung). In Selected Topics on Stochastic Modelling, Gutierrez R, Valderrama M (eds). World Scientic, Singapore, 1994; Perez-Ocon R, Gutierrez Jaimez R, Garcia Leal J, Ollero Hinojosa J. A Markovian model with exponential transition intensities: application to the two sample problem. In Applied Stochastic Models and Data Analysis, Volume 2, Janssen J, Skiadas CH (eds). World Scientic, Singapore, 1993; de Stavola BL. Testing departures from time homogeneity in multistate Markov processes. Applied Statistics 1988; 37:

14 122 R. P EREZ-OC ON, J. E. RUIZ-CASTRO AND M. L. G AMIZ-P EREZ 14. Andersen PK, Borgan, Gill RD, Keiding N. Statistical Models Based on Counting Processes. Springer-Verlag, NY, Nelder JA, Mead R. A simplex method for function minimization. Computer Journal 1985; 7: Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. Journal of the American Statistical Association 1958; 53: Lawless JF. Statistical Models and Methods for Lifetime Data. Wiley, Andersen PK, Hansen LS, Keiding N. Assessing the inuence of reversible disease indicators on survival. Statistics in Medicine 1991; 10: Lu Y, Stitt FW. Using Markov processes to describe the prognosis of HIV-1 infection. Medical Decision Making 1994; 14:

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

MODEL CHECK AND GOODNESS-OF-FIT FOR NESTED CASE-CONTROL STUDIES

MODEL CHECK AND GOODNESS-OF-FIT FOR NESTED CASE-CONTROL STUDIES MODEL CHECK AND GOODNESS-OF-FIT FOR NESTED CASE-CONTROL STUDIES by YING ZHANG THESIS for the degree of MASTER OF SCIENCE (Master i Modellering og dataanalyse) Faculty of Mathematics and Natural Sciences

More information

Introduction. Survival Analysis. Censoring. Plan of Talk

Introduction. Survival Analysis. Censoring. Plan of Talk Survival Analysis Mark Lunt Arthritis Research UK Centre for Excellence in Epidemiology University of Manchester 01/12/2015 Survival Analysis is concerned with the length of time before an event occurs.

More information

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD

Tips for surviving the analysis of survival data. Philip Twumasi-Ankrah, PhD Tips for surviving the analysis of survival data Philip Twumasi-Ankrah, PhD Big picture In medical research and many other areas of research, we often confront continuous, ordinal or dichotomous outcomes

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

5.1 CHI-SQUARE TEST OF INDEPENDENCE

5.1 CHI-SQUARE TEST OF INDEPENDENCE C H A P T E R 5 Inferential Statistics and Predictive Analytics Inferential statistics draws valid inferences about a population based on an analysis of a representative sample of that population. The

More information

Checking proportionality for Cox s regression model

Checking proportionality for Cox s regression model Checking proportionality for Cox s regression model by Hui Hong Zhang Thesis for the degree of Master of Science (Master i Modellering og dataanalyse) Department of Mathematics Faculty of Mathematics and

More information

More details on the inputs, functionality, and output can be found below.

More details on the inputs, functionality, and output can be found below. Overview: The SMEEACT (Software for More Efficient, Ethical, and Affordable Clinical Trials) web interface (http://research.mdacc.tmc.edu/smeeactweb) implements a single analysis of a two-armed trial comparing

More information

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne Applied Statistics J. Blanchet and J. Wadsworth Institute of Mathematics, Analysis, and Applications EPF Lausanne An MSc Course for Applied Mathematicians, Fall 2012 Outline 1 Model Comparison 2 Model

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Life Table Analysis using Weighted Survey Data

Life Table Analysis using Weighted Survey Data Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using

More information

Survival Analysis of Dental Implants. Abstracts

Survival Analysis of Dental Implants. Abstracts Survival Analysis of Dental Implants Andrew Kai-Ming Kwan 1,4, Dr. Fu Lee Wang 2, and Dr. Tak-Kun Chow 3 1 Census and Statistics Department, Hong Kong, China 2 Caritas Institute of Higher Education, Hong

More information

200609 - ATV - Lifetime Data Analysis

200609 - ATV - Lifetime Data Analysis Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 715 - EIO - Department of Statistics and Operations Research 1004 - UB - (ENG)Universitat

More information

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way

More information

Revenue Management with Correlated Demand Forecasting

Revenue Management with Correlated Demand Forecasting Revenue Management with Correlated Demand Forecasting Catalina Stefanescu Victor DeMiguel Kristin Fridgeirsdottir Stefanos Zenios 1 Introduction Many airlines are struggling to survive in today's economy.

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

Missing data and net survival analysis Bernard Rachet

Missing data and net survival analysis Bernard Rachet Workshop on Flexible Models for Longitudinal and Survival Data with Applications in Biostatistics Warwick, 27-29 July 2015 Missing data and net survival analysis Bernard Rachet General context Population-based,

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards Week 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution of a

More information

7.1 The Hazard and Survival Functions

7.1 The Hazard and Survival Functions Chapter 7 Survival Models Our final chapter concerns models for the analysis of data which have three main characteristics: (1) the dependent variable or response is the waiting time until the occurrence

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

TABLE OF CONTENTS. GENERAL AND HISTORICAL PREFACE iii SIXTH EDITION PREFACE v PART ONE: REVIEW AND BACKGROUND MATERIAL

TABLE OF CONTENTS. GENERAL AND HISTORICAL PREFACE iii SIXTH EDITION PREFACE v PART ONE: REVIEW AND BACKGROUND MATERIAL TABLE OF CONTENTS GENERAL AND HISTORICAL PREFACE iii SIXTH EDITION PREFACE v PART ONE: REVIEW AND BACKGROUND MATERIAL CHAPTER ONE: REVIEW OF INTEREST THEORY 3 1.1 Interest Measures 3 1.2 Level Annuity

More information

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY Ramon Alemany Montserrat Guillén Xavier Piulachs Lozada Riskcenter - IREA Universitat de Barcelona http://www.ub.edu/riskcenter

More information

Survival analysis methods in Insurance Applications in car insurance contracts

Survival analysis methods in Insurance Applications in car insurance contracts Survival analysis methods in Insurance Applications in car insurance contracts Abder OULIDI 1-2 Jean-Marie MARION 1 Hérvé GANACHAUD 3 1 Institut de Mathématiques Appliquées (IMA) Angers France 2 Institut

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

Andrew Charles Titman

Andrew Charles Titman Model diagnostics in multi-state models of biological systems A dissertation submitted to the University of Cambridge Department of Pure Mathematics and Mathematical Statistics for the degree of Doctor

More information

Portfolio Using Queuing Theory

Portfolio Using Queuing Theory Modeling the Number of Insured Households in an Insurance Portfolio Using Queuing Theory Jean-Philippe Boucher and Guillaume Couture-Piché December 8, 2015 Quantact / Département de mathématiques, UQAM.

More information

A random point process model for the score in sport matches

A random point process model for the score in sport matches IMA Journal of Management Mathematics (2009) 20, 121 131 doi:10.1093/imaman/dpn027 Advance Access publication on October 30, 2008 A random point process model for the score in sport matches PETR VOLF Institute

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Competing-risks regression

Competing-risks regression Competing-risks regression Roberto G. Gutierrez Director of Statistics StataCorp LP Stata Conference Boston 2010 R. Gutierrez (StataCorp) Competing-risks regression July 15-16, 2010 1 / 26 Outline 1. Overview

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

Personalized Predictive Medicine and Genomic Clinical Trials

Personalized Predictive Medicine and Genomic Clinical Trials Personalized Predictive Medicine and Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute http://brb.nci.nih.gov brb.nci.nih.gov Powerpoint presentations

More information

Lecture 15 Introduction to Survival Analysis

Lecture 15 Introduction to Survival Analysis Lecture 15 Introduction to Survival Analysis BIOST 515 February 26, 2004 BIOST 515, Lecture 15 Background In logistic regression, we were interested in studying how risk factors were associated with presence

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

Application of Markov chain analysis to trend prediction of stock indices Milan Svoboda 1, Ladislav Lukáš 2

Application of Markov chain analysis to trend prediction of stock indices Milan Svoboda 1, Ladislav Lukáš 2 Proceedings of 3th International Conference Mathematical Methods in Economics 1 Introduction Application of Markov chain analysis to trend prediction of stock indices Milan Svoboda 1, Ladislav Lukáš 2

More information

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income

More information

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA REVSTAT Statistical Journal Volume 4, Number 2, June 2006, 131 142 A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA Authors: Daiane Aparecida Zuanetti Departamento de Estatística, Universidade Federal de São

More information

Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group

Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group Komorbide brystkræftpatienter kan de tåle behandling? Et registerstudie baseret på Danish Breast Cancer Cooperative Group Lotte Holm Land MD, ph.d. Onkologisk Afd. R. OUH Kræft og komorbiditet - alle skal

More information

Tests for Two Survival Curves Using Cox s Proportional Hazards Model

Tests for Two Survival Curves Using Cox s Proportional Hazards Model Chapter 730 Tests for Two Survival Curves Using Cox s Proportional Hazards Model Introduction A clinical trial is often employed to test the equality of survival distributions of two treatment groups.

More information

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal

CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:

More information

Safety margins for unsystematic biometric risk in life and health insurance

Safety margins for unsystematic biometric risk in life and health insurance Safety margins for unsystematic biometric risk in life and health insurance Marcus C. Christiansen June 1, 2012 7th Conference in Actuarial Science & Finance on Samos Seite 2 Safety margins for unsystematic

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Methods for Meta-analysis in Medical Research

Methods for Meta-analysis in Medical Research Methods for Meta-analysis in Medical Research Alex J. Sutton University of Leicester, UK Keith R. Abrams University of Leicester, UK David R. Jones University of Leicester, UK Trevor A. Sheldon University

More information

Privacy-Preserving Models for Comparing Survival Curves Using the Logrank Test

Privacy-Preserving Models for Comparing Survival Curves Using the Logrank Test Privacy-Preserving Models for Comparing Survival Curves Using the Logrank Test Tingting Chen Sheng Zhong Computer Science and Engineering Department State University of New york at Buffalo Amherst, NY

More information

Conditional Tail Expectations for Multivariate Phase Type Distributions

Conditional Tail Expectations for Multivariate Phase Type Distributions Conditional Tail Expectations for Multivariate Phase Type Distributions Jun Cai Department of Statistics and Actuarial Science University of Waterloo Waterloo, ON N2L 3G1, Canada jcai@math.uwaterloo.ca

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression

Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression Saikat Maitra and Jun Yan Abstract: Dimension reduction is one of the major tasks for multivariate

More information

R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models

R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models Faculty of Health Sciences R 2 -type Curves for Dynamic Predictions from Joint Longitudinal-Survival Models Inference & application to prediction of kidney graft failure Paul Blanche joint work with M-C.

More information

Monitoring the Behaviour of Credit Card Holders with Graphical Chain Models

Monitoring the Behaviour of Credit Card Holders with Graphical Chain Models Journal of Business Finance & Accounting, 30(9) & (10), Nov./Dec. 2003, 0306-686X Monitoring the Behaviour of Credit Card Holders with Graphical Chain Models ELENA STANGHELLINI* 1. INTRODUCTION Consumer

More information

3. ARTÍCULOS DE APLICACIÓN

3. ARTÍCULOS DE APLICACIÓN 3. ARTÍCULOS DE APLICACIÓN SOME PROBABILITY APPLICATIONS TO THE RISK ANALYSIS IN INSURANCE THEORY Jinzhi Li College of Science Central University for Nationalities, China Shixia Ma School of Sciences Hebei

More information

A Markovian Model for Investment Analysis in Advertising

A Markovian Model for Investment Analysis in Advertising Proceedings of the Seventh Young Statisticians Meeting Andrej Mrvar (Editor) Metodološki zvezki, 21, Ljubljana: FDV, 2003 A Markovian Model for Investment Analysis in Advertising Eugenio Novelli 1 Abstract

More information

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

LECTURE 4. Last time: Lecture outline

LECTURE 4. Last time: Lecture outline LECTURE 4 Last time: Types of convergence Weak Law of Large Numbers Strong Law of Large Numbers Asymptotic Equipartition Property Lecture outline Stochastic processes Markov chains Entropy rate Random

More information

Modelling the short-term dependence between two remaining lifetimes

Modelling the short-term dependence between two remaining lifetimes Modelling the short-term dependence between two remaining lifetimes Jaap Spreeuw and Xu Wang March 27, 2008 Abstract Dependence between remaining lifetimes in a couple is mainly of a short-term type if

More information

Data analysis in supersaturated designs

Data analysis in supersaturated designs Statistics & Probability Letters 59 (2002) 35 44 Data analysis in supersaturated designs Runze Li a;b;, Dennis K.J. Lin a;b a Department of Statistics, The Pennsylvania State University, University Park,

More information

People have thought about, and defined, probability in different ways. important to note the consequences of the definition:

People have thought about, and defined, probability in different ways. important to note the consequences of the definition: PROBABILITY AND LIKELIHOOD, A BRIEF INTRODUCTION IN SUPPORT OF A COURSE ON MOLECULAR EVOLUTION (BIOL 3046) Probability The subject of PROBABILITY is a branch of mathematics dedicated to building models

More information

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form.

If several different trials are mentioned in one publication, the data of each should be extracted in a separate data extraction form. General Remarks This template of a data extraction form is intended to help you to start developing your own data extraction form, it certainly has to be adapted to your specific question. Delete unnecessary

More information

Testing Simple Markov Structures for Credit Rating Transitions

Testing Simple Markov Structures for Credit Rating Transitions Testing Simple Markov Structures for Credit Rating Transitions Nicholas M. Kiefer and C. Erik Larson OCC Economics Working Paper 2004-3 Abstract Models abound that analyze changes in credit quality. These

More information

Life Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD

Life Tables. Marie Diener-West, PhD Sukon Kanchanaraksa, PhD This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

The Landmark Approach: An Introduction and Application to Dynamic Prediction in Competing Risks

The Landmark Approach: An Introduction and Application to Dynamic Prediction in Competing Risks Landmarking and landmarking Landmarking and competing risks Discussion The Landmark Approach: An Introduction and Application to Dynamic Prediction in Competing Risks Department of Medical Statistics and

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION

SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION STATISTICS IN MEDICINE, VOL. 8, 795-802 (1989) SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION F. Y. HSIEH* Department of Epidemiology and Social Medicine, Albert Einstein College of Medicine, Bronx, N Y 10461,

More information

Product Pricing and Solvency Capital Requirements for Long-Term Care Insurance

Product Pricing and Solvency Capital Requirements for Long-Term Care Insurance Product Pricing and Solvency Capital Requirements for Long-Term Care Insurance Adam W. Shao, Michael Sherris, Joelle H. Fong ARC Centre of Excellence in Population Ageing Research (CEPAR) UNSW Australia

More information

Joint models for classification and comparison of mortality in different countries.

Joint models for classification and comparison of mortality in different countries. Joint models for classification and comparison of mortality in different countries. Viani D. Biatat 1 and Iain D. Currie 1 1 Department of Actuarial Mathematics and Statistics, and the Maxwell Institute

More information

Firm and Product Life Cycles and Firm Survival

Firm and Product Life Cycles and Firm Survival TECHNOLOGICAL CHANGE Firm and Product Life Cycles and Firm Survival By RAJSHREE AGARWAL AND MICHAEL GORT* On average, roughly 5 10 percent of the firms in a given market leave that market over the span

More information

Survival Analysis, Software

Survival Analysis, Software Survival Analysis, Software As used here, survival analysis refers to the analysis of data where the response variable is the time until the occurrence of some event (e.g. death), where some of the observations

More information

Effect of Risk and Prognosis Factors on Breast Cancer Survival: Study of a Large Dataset with a Long Term Follow-up

Effect of Risk and Prognosis Factors on Breast Cancer Survival: Study of a Large Dataset with a Long Term Follow-up Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 7-28-2012 Effect of Risk and Prognosis Factors on Breast Cancer Survival: Study

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Model-Based Cluster Analysis for Web Users Sessions

Model-Based Cluster Analysis for Web Users Sessions Model-Based Cluster Analysis for Web Users Sessions George Pallis, Lefteris Angelis, and Athena Vakali Department of Informatics, Aristotle University of Thessaloniki, 54124, Thessaloniki, Greece gpallis@ccf.auth.gr

More information

Yield Curve Modeling

Yield Curve Modeling Eötvös Loránd University Faculty of Science Yield Curve Modeling Thesis Balázs Márton Süli Actuarial and Financial Mathematics MSc Quantitative Finances Major Supervisors: Dr. András Zempléni associate

More information

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc. An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao

More information

How big is large? a study of the limit for large insurance claims in case reserves. Karl-Sebastian Lindblad

How big is large? a study of the limit for large insurance claims in case reserves. Karl-Sebastian Lindblad How big is large? a study of the limit for large insurance claims in case reserves. Karl-Sebastian Lindblad February 15, 2011 Abstract A company issuing an insurance will provide, in return for a monetary

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Dealing with Missing Data

Dealing with Missing Data Dealing with Missing Data Roch Giorgi email: roch.giorgi@univ-amu.fr UMR 912 SESSTIM, Aix Marseille Université / INSERM / IRD, Marseille, France BioSTIC, APHM, Hôpital Timone, Marseille, France January

More information

The cost for the default of a loan Linking theory and practice

The cost for the default of a loan Linking theory and practice The cost for the default of a loan Linking theory and practice By Rafael Weissbach & Philipp Sibbertsen Department of Statistics University of Dortmund May 17, 2004 address for correspondence: Rafael Weissbach,

More information

2 Right Censoring and Kaplan-Meier Estimator

2 Right Censoring and Kaplan-Meier Estimator 2 Right Censoring and Kaplan-Meier Estimator In biomedical applications, especially in clinical trials, two important issues arise when studying time to event data (we will assume the event to be death.

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

BayesX - Software for Bayesian Inference in Structured Additive Regression

BayesX - Software for Bayesian Inference in Structured Additive Regression BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich

More information

Hormones and cardiovascular disease, what the Danish Nurse Cohort learned us

Hormones and cardiovascular disease, what the Danish Nurse Cohort learned us Hormones and cardiovascular disease, what the Danish Nurse Cohort learned us Ellen Løkkegaard, Clinical Associate Professor, Ph.d. Dept. Obstetrics and Gynecology. Hillerød Hospital, University of Copenhagen

More information

TABLE OF CONTENTS. 4. Daniel Markov 1 173

TABLE OF CONTENTS. 4. Daniel Markov 1 173 TABLE OF CONTENTS 1. Survival A. Time of Death for a Person Aged x 1 B. Force of Mortality 7 C. Life Tables and the Deterministic Survivorship Group 19 D. Life Table Characteristics: Expectation of Life

More information

Systematic Reviews and Meta-analyses

Systematic Reviews and Meta-analyses Systematic Reviews and Meta-analyses Introduction A systematic review (also called an overview) attempts to summarize the scientific evidence related to treatment, causation, diagnosis, or prognosis of

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Randomized trials versus observational studies

Randomized trials versus observational studies Randomized trials versus observational studies The case of postmenopausal hormone therapy and heart disease Miguel Hernán Harvard School of Public Health www.hsph.harvard.edu/causal Joint work with James

More information

A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH g.schindlmayr@enbw.com

A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH g.schindlmayr@enbw.com A Regime-Switching Model for Electricity Spot Prices Gero Schindlmayr EnBW Trading GmbH g.schindlmayr@enbw.com May 31, 25 A Regime-Switching Model for Electricity Spot Prices Abstract Electricity markets

More information

Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1

Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1 Paper 11682-2016 Survival Analysis of the Patients Diagnosed with Non-Small Cell Lung Cancer Using SAS Enterprise Miner 13.1 Raja Rajeswari Veggalam, Akansha Gupta; SAS and OSU Data Mining Certificate

More information

Analysis of Financial Time Series

Analysis of Financial Time Series Analysis of Financial Time Series Analysis of Financial Time Series Financial Econometrics RUEY S. TSAY University of Chicago A Wiley-Interscience Publication JOHN WILEY & SONS, INC. This book is printed

More information

Introduction to mixed model and missing data issues in longitudinal studies

Introduction to mixed model and missing data issues in longitudinal studies Introduction to mixed model and missing data issues in longitudinal studies Hélène Jacqmin-Gadda INSERM, U897, Bordeaux, France Inserm workshop, St Raphael Outline of the talk I Introduction Mixed models

More information

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

A Bayesian hierarchical surrogate outcome model for multiple sclerosis A Bayesian hierarchical surrogate outcome model for multiple sclerosis 3 rd Annual ASA New Jersey Chapter / Bayer Statistics Workshop David Ohlssen (Novartis), Luca Pozzi and Heinz Schmidli (Novartis)

More information

Predicting Customer Default Times using Survival Analysis Methods in SAS

Predicting Customer Default Times using Survival Analysis Methods in SAS Predicting Customer Default Times using Survival Analysis Methods in SAS Bart Baesens Bart.Baesens@econ.kuleuven.ac.be Overview The credit scoring survival analysis problem Statistical methods for Survival

More information

Is the Basis of the Stock Index Futures Markets Nonlinear?

Is the Basis of the Stock Index Futures Markets Nonlinear? University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2011 Is the Basis of the Stock

More information

1170 M. M. SANCHEZ AND X. CHEN

1170 M. M. SANCHEZ AND X. CHEN STATISTICS IN MEDICINE Statist. Med. 2006; 25:1169 1181 Published online 5 January 2006 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2244 Choosing the analysis population in non-inferiority

More information

These axioms must hold for all vectors ū, v, and w in V and all scalars c and d.

These axioms must hold for all vectors ū, v, and w in V and all scalars c and d. DEFINITION: A vector space is a nonempty set V of objects, called vectors, on which are defined two operations, called addition and multiplication by scalars (real numbers), subject to the following axioms

More information

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time

Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Vignette for survrm2 package: Comparing two survival curves using the restricted mean survival time Hajime Uno Dana-Farber Cancer Institute March 16, 2015 1 Introduction In a comparative, longitudinal

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

Chenfeng Xiong (corresponding), University of Maryland, College Park (cxiong@umd.edu)

Chenfeng Xiong (corresponding), University of Maryland, College Park (cxiong@umd.edu) Paper Author (s) Chenfeng Xiong (corresponding), University of Maryland, College Park (cxiong@umd.edu) Lei Zhang, University of Maryland, College Park (lei@umd.edu) Paper Title & Number Dynamic Travel

More information

A new score predicting the survival of patients with spinal cord compression from myeloma

A new score predicting the survival of patients with spinal cord compression from myeloma A new score predicting the survival of patients with spinal cord compression from myeloma (1) Sarah Douglas, Department of Radiation Oncology, University of Lubeck, Germany; sarah_douglas@gmx.de (2) Steven

More information