JOINT MODELLING FOR ACTUARIAL GRADUATION AND DUPLICATE POLICIES. BY A. E. RENSHAW, B.Sc., Ph.D. (of The City University, London)

Size: px
Start display at page:

Download "JOINT MODELLING FOR ACTUARIAL GRADUATION AND DUPLICATE POLICIES. BY A. E. RENSHAW, B.Sc., Ph.D. (of The City University, London)"

Transcription

1 J.I.A. 119, I, JOINT MODELLING FOR ACTUARIAL GRADUATION AND DUPLICATE POLICIES BY A. E. RENSHAW, B.Sc., Ph.D. (of The City University, London) ABSTRACT In this paper it is demonstrated how recently developed statistical techniques designed to facilitate the joint modelling of the mean and dispersion are well suited to model the presence of duplicate policies in graduation. KEYWORDS Graduation; Duplicate Policies; Models; Life Tables 1. INTRODUCTION The purpose of this paper is to suggest how the possible effects of duplicate policies can be jointly modelled as part of the wider graduation modelling process. It is well known that graduations for the mortality of assured lives are based on the number of policies and not on the number of lives. Consequently, the death of a policyholder carrying more than one policy will appear as more than one death in the raw data, thereby clouding the resulting graduations unless due allowance is made for this eventuality. Various aspects of the effects of such duplicate policies on the distribution of deaths and the implications for graduations have been investigated by a number of researchers, including Seal (1945), Daw(1946), Beard & Perks (1949), Daw (1951), C.M.I. Committee (l957, 1986), and Forfar, McCutcheon & Wilkie (1988). With the exception of Forfar et al. (1988), perhaps one interesting historical side commentary on these developments has been the emphasis placed on adjustments to goodness-of-fit tests for graduations rather than on marginal adjustments to the graduations themselves. The underlying message of these early developments, in particular that of Beard & Perks (1949), is to prescribe an over-dispersed modelling distribution for the number of recorded deaths (more accurately identified as the number of recorded claims). To allow for such over-dispersion during the construction of graduations, Forfar et al. (1988) have advanced a method whereby the data are first transformed before modelling. The method relies on the availability of estimates at each age of the degree of over-dispersion present in the data, viz. a knowledge of the so-called variance ratios. The aims of this paper are to describe and explore the feasibility of modelling for graduations with allowance for over-dispersion attributed to duplicate policies using two-stage generalised linear (or non-linear) models. One clear advantage of such a method accrues from its universality, rendering a detailed 69

2 70 Joint Modelling for Actuarial Graduation and Duplicate Policies collateral knowledge of the specific variance ratios unnecessary. In any event, since these are data based and liable, as a consequence, to contain irregularities, it seems only natural to the author that these, as well as the irregular empirical death rates, should be smoothed by the graduation modelling process. The methodology owes much to recent advances in the joint modelling of mean and dispersion motivated by a certain class of industrial quality improvement experiments, as described in a discussion paper by Nelder & Lee (1991). It has been noted by Renshaw (1990, 1991) that the graduation models currently used by British actuaries are either generalised linear or generalised non-linear models, so that the first stage of the envisaged joint modelling process is already in place. All that is needed is for this to be augmented by a second-stage dispersion modelling process along the lines first proposed by Pregibon (1984) and developed by Nelder & Pregibon (1987); while Chapter 10 of the monograph by McCullagh & Nelder (1989) is also highly relevant here. Further, it is logical that the second stage of the joint modelling process should be formulated in such a way as to reflect the known properties of the empirical variance ratios. Indeed, without such knowledge, the formulation of the predictor-link combination for the second stage generalised linear model would be somewhat speculative. In order to set the issues in context at the outset, it is important to stress that, as Cox (1983) has pointed out, the modelling of excess variation, brought about in this context by the presence of duplicate policies, has only a marginal effect on the estimation of the regression coefficients of primary concern giving rise to graduations, but that statistical tests and confidence intervals may be seriously in error unless its effect is taken into account. Indeed intuitively one would not expect the presence of duplicate policies in the data to have a major effect on any resulting graduation. The nature of the over-dispersion present in graduations due to duplicate policies is discussed in Section 2, and the joint modelling for graduations and duplicate policies described in Section 3. Applications are presented in Section DUPLICATE POLICIES AND OVER-DISPERSION It is necessary to examine the nature of the over-dispersion and, in particular, the empirical evidence for this, if a plausible second stage is to be established for the joint mean and dispersion modelling process. Focus on the graduation target, qx the probability that a policyholder aged x, dies before age (x+ 1). Suppose there are some duplicate policies present, and let A, denote the total number of policies giving rise to claims, or simply, the number of claim, from among the policies held by the Nx policyholders aged x, present at the beginning of a duration period of 1 year. Clearly, if there are no duplicates present, Ax and Nx are synonymous with the actual number of deaths and the initial exposure to risk of death, respectively, but not otherwise. In common with modern actuarial graduation practice, A, is modelled as a

3 Joint Modelling for Actuarial Graduation and Duplicate Policies 71 random variable. Further, adapting the argument presented in Beard & Perks (1949), it is possible to write: where for policyholders k, are independent and identically distributed random variables defined by: (2.1) representing the number of policies giving rise to claim per policyholder, that is, the number of claims per policyholder. Here px = 1 - qx and is the probability that a policyholder, aged x, holds i policies; i = 1, 2, 3,... with: (2.2) Trivially, it follows from equation (2.2) that: so that these expressions, together with equation (2.1), imply that: where: are the initial exposures based on policies; and: (2.3) Note, that when there are no duplicate policies present, so that otherwise; Øx =: 1 and A, has the binomial distribution Ax ~ Bin(Rx, qx). Obviously Rx = Nx also for this case. When there are duplicates present, however, so that n(i) x > 0 for at least one value of i = 2, 3, 4,...; equation (2.3) approximates to: (2.4)

4 72 Joint Modelling for Actuarial Graduation and Duplicate Policies since qx is small. Consequently A, has an over-dispersed binomial distribution. The approximation is a good one because of the relative smallness of qx for all but the very oldest ages x; while the C.M.I. Committee (1957) memorandum contains impressive empirical evidence in support of this. Use of approximation (2.4) is important in-so-far as it renders the over-dispersion parameter X independent of the primary target qx. Defining fx(i) to be the proportion of policyholders, aged x, who have i policies in any empirical study group, the so-called data based variance ratios : estimate the φ xs. Two empirical studies of the number of policies held by policyholders aged x, in which variance ratios have been computed, are available to shed light on a likely structure for the φ xs. Roth studies are based on data culled from the death certificates of policyholders whose deaths occurred during the specific year of investigation. The first of these, relating to duplicate policies held by assured lives in 1954, is reported in the C.M.I. Committee (1957) memorandum; while different aspects of the second study, relating to duplicate Figure 2.1 Variance Ratios Against Age (quinary groups) Various assured lives experiences

5 Variance Ratios Against Age Assured lives experience Figure 2.2 (b) Variance Ratios Against Exposed to Risk Assured lives experience Exposed to Risk

6 74 Joint Modelling for Actuarial Graduation and Duplicate Policies policies held by assured lives in 1981 and 1982, are reported by the C.M.I. Committee (1986) and in Section 17 of Forfar et al. (1988). The variance ratios from each study, based on quinquennial age groups, are plotted against age in Figure 2.1. For the former study, two sets of variance ratios are displayed; one based on data from all offices participating in the study and the other based on non-industrial offices reporting not less than 300 policy claims; the latter being interpreted as a realisation of an upper estimate for the φ xs. For the more recent study and in the interests of greater clarity, only variance ratios based on 1981 and 1982 combined are displayed. In any event, values for the 2 years taken separately lie fairly closely on either side of these plotted values. Also available from the more recent study are variance ratios for individual ages. These are plotted both against age and against exposure to risk in Figures 2.2(a) and (b). It is remarked that Figure 2.2(b) is substantially the same as Figure 2.2(a) folded in half, since the exposed to risk rises to a peak at age 36 and then falls away again. The patterns observed in all three figures form the basis for the choice of structure imparted to the dispersion modelling process introduced in the next section. 3. JOINT MODELLING FOR GRADUATIONS AND DUPLICATE POLICIES The first stage graduation modelling process is a generalised linear or nonlinear model in which the responses, Ax, are modelled either as over-dispersed binomial variables with initial exposures Rx in the case of qx-graduations, so that: E(Ax) = Rxqx, Var(Ax) = φ xrxqx (1-qx) or as over-dispersed Poisson variables with central exposures Rx in the case of µx-graduations, so that: Current graduation practice, as comprehensively described by Forfar et al. (1988), is to use the Gompertz-Makeham predictor: subject to the convention that r = 0 implies the exponentiated polynomial term only, and s = 0 implies the polynomial term only; in conjunction with either the odds-link: with inverse in the case of qx-graduations, or the identity link nx=µx for µx-graduations. Renshaw (1990, 1991) has described how this practice falls within the generalised linear and non-linear modelling framework and, in particular, how the scope of current actuarial graduation practice can be appreciably enhanced by the use of

7 Joint Modelling for Actuarial Graduation and Duplicate Policies 75 alternative and somewhat more conventional specific linkages of the general type x = g(qx) or x = g(µx), as the case may be. Here g is a monotonic differentiable function. Graduated values are computed using the inverse linkage and predictor combination once the parameters in the predictor (equation (3.1)) have been estimated. If duplicate policies are either non-existent or simply ignored, so that φ x = 1 for all ages X, the AxS are modelled as conventional independent binomial or Poisson variables, and the parameter estimates follow by maximum likelihood using the likelihood for the binomial or Poisson response variables as the case may be. The question now to be addressed is what to do when the effects of duplicate policies are to be incorporated into the graduation modelling process, so that the over-dispersion paramters, φ x 1, are not all equal to unity and are unknown. It is proposed to model the unknown dispersion paramters φ x a secondary interrelating generalised linear model in much the same way as the unknown parameters qx or µx in the mean, mx = E(Ax) = Rxqx or Rxµx+½, are modelled as the primary target. There are two possible candidates to act as dispersion statistics dx = dx(ax; x) for the φ xs based on: either the Pearson squared residual, where: dx = dx (Ax; mx) = or the deviance squared residual, where: (Ax mx)2 V(mx) (3.2) (3.3) Here, in each expression, V(e) is the variance function of the first stage graduation modelling process, namely: and: V(mx) = mx(l mx/rx) for qx-graduations, V (mx) = mx for µx-graduations. Expressions (3.2) and (3.3) are identical if the variance function is a constant, corresponding to a normal modelling distribution, but not otherwise. Since Var(Ax) = fxv(mx), it follows trivially from expression (3.2), that when evaluated at the true value of for the Pearson dispersion statistic; while for the deviance dispersion statistic. It is further necessary to specify the variance function VD( ) for the second stage generalised linear model so that with scale factor t. Note that when the Axs are normally distributed, dx has the distribution so that Var(dx)=2& and the gamma model would be chosen for the dxs with Vn(+,) = 4:. The choice of the predictorlink structure for the dispersion modelling generalised linear model is guided by

8 76 Joint Modelling for Actuarial Graduation and Duplicate Policies the known properties of the variance ratios presented graphically in Figures 2.1 and 2.2. As a consequence, it is proposed to use either a quadratic or possibly higher order polynomial age predictor or a straight line exposure-to-risk predictor in combination with a monotonic increasing differentiable link function h mapping the interval [1, 1+ k], K > 0 to the whole of the real line. Further, Figures 2.1 and 2.2 would seem to support that K = 1, which is assumed throughout with one exception. More precise detail is presented in Section 4. It is appropriate to summarise the foregoing model structure in outline as follows, before proceeeding to discuss how to fit both stages of the model: (1) Model the Axs as independent response variables with: and predictor-link E(Ax) = mx, Var(Ax) = φ xv(mx) (2) Model the unknown dispersion parameters φ x using an appropriate dispersion statistic dx with: and predictor-link Here it is assumed that the first stage predictor x as well as the second stage predictor lx is linear, although it is possible to relax this condition, as discussed previously. The u,,5 and v,,5 are known covariate components, while it is necessary to estimate the β js and γ js in order to fit the model. To do this an optimisation or estimating function is needed to play the role of the likelihood or log-likelihood function in the estimation of parameters for simpler, probably more familiar, model structures. There are two cases to consider depending on the specific form of d,. When d, is based on Poisson squared residuals, the estimation of both sets of regression parameters is based on the optimisation of the so-called pseudolikelihood (strictly the pseudo-log-likelihood) P defined by: (3.4) where d, is defined by equation (3.2). The reader will immediately associate this with the log-likelihood based on assumed independent normally distributed Poisson residuals. The properties and applications of the pseudo-likelihood Pare discussed in papers by Carroll & Ruppert (1982), Davidian & Carroll (1987, 1988) and Breslow (1990). When d, is based on deviance squared residuals, the estimation of both sets of

9 Joint Modeling for Actuarial Graduation and Duplicate Policies 77 regression parameters is based on the optimisation of the Nelder & Pregibon (1987) extended quasi-likelihood (strictly the extended quasi-log-likelihood) Q defined by: (3.5) where this time dx is defined by equation (3.3). The properties of the extended quasi-likelihood function are also discussed in McCullagh & Nelder (1989). The β is enter into expressions (3.4) and (3.5) via the mean mx, in the corresponding expressions for dx, viz. equations (3.2) and (3.3) respectively. Using the chain rule for first order partial derivatives, it follows that the optimum values for the β is satisfy the following system of equations: irrespective of which version of dx is selected. These are the standard Wedderburn quasi-likelihood estimation equations with weights 1/ φ x see Wedderburn (1974) or McCullagh & Nelder (1989), which may be solved using the GLIM software package when the weights 1/ φ x are known. It is instructive to tie up loose ends at this stage by noting that if the weights 1/ φ x are replaced by their estimates 1/rx based on the variance ratios of Section 2; then it is easy to see that the resulting parameter estimating equations (for a linear predictor) based on the optimisation of the likelihood (and also chi-square) expressions (6.2.23) and (6.3.17) of Forfar et al. (1988) are special cases of the quasi-likelihood estimation equations (3.6) and would, therefore, lead to the same graduations. Forfar et al. (1988) do not, however, appear to have adopted this method of allowing for duplicate policies but have opted instead to transform the data before using the conventional Poisson likelihood to obtain predictor estimates for µx-graduations in Section 17 of their paper. The γ is enter into expressions (3.4) and (3.5) via the over-dispersion parameters φ x. Again using the chain-rule for first-order partial derivatives, it follows this time that the optimum values for the γ is satisfy the following system of equations: where dx is defined by the appropriate expression (3.2) or (3.3). This time these are the Wedderburn quasi-likelihood estimation equations based on independent responses dx, with variance function VD( φ x) = φ &corresponding 2x to a second stage generalised linear model based on gamma response variates. The parameter estimating equations (3.6) and (3.7) point the way forward. The two stages of the modelling process are clearly inter-dependent in-so-far as fitted values are needed from the second-stage generalised linear model to provide estimates for the weights in order to fit the first-stage generalised linear (3.6) (3.7)

10 78 Joint Modelling for Actuarial Graduation and Duplicate Policies model; while fitted values are needed from the first-stage generalised linear model in order to compute realisations of the responses dx, for the second-stage generalised linear model. Thus, the full model structures are fitted by alternating between the two separate stages until convergence is achieved. An effective starting model to assume for this iterative process is that based on no duplicate policies, so that initially for all x. 4. APPLICATIONS To investigate the potential of the method, the joint modelling process outlined in Section 3 has been applied to a number of data sets including the Assured Lives Experience with Duration 5 and over, reported by the C.M.I. Committee in J.I.A. (1974). The model used by the C.M.I. Committee to produce the A qx-graduation reported as Table 4 on pages 160-l63 of C.M.I. Committee J.I.A. (1974) using the data for ages 10.5 years to 89.5 years comprises a generalised non-linear model with binomial responses, odds link and non-linear predictor GMx(2,2) defined by equation (3.1). Both the formal statistical tests applied by the C.M.I. Committee together with an examination of various residual plots, which are not reproduced here, indicate a highly satisfactory fit. Since, however, the raw data contain duplicate policies, it is desirable to make fine adjustments to this graduation through the introduction of an over-dispersed binomial modelling distribution and the estimation of its parameters through the introduction of an associated dispersion modelling stage. To proceed, it is necessary to declare the precise form for the dispersion link function h where Since this is designed to map the interval [1,2] to the whole of the real line, three choices of links immediately come to mind based on the appropriate translation of either the complementary log-log link, the logit link or the probit link; all of which map the interval [0,1] to the whole of the real line, while it is not too difficult to suggest other possibilities. The detail needed to implement these links using the GLIM computer software package is as follows: (1) The translated complementary log-log link with inverse: and derivative (2) The translated logit link with inverse: and derivative (3) The translated probit link with inverse: and derivative where Q) is the distribution function of the standard normal deviate.

11 Joint Modelling for Actuarial Graduation and Duplicate Policies 79 X qx x X 9x 4, X e( 4x I x- age, qx- graduated, ax- estimated over-dispersion Table 4.1. Assured lives : Duration 5 and over

12 80 Joint Modelling for Actuarial Graduation and Duplicate Policies Implementation of the full joint modelling of the mean and dispersion is possible using the GLIM computer software package and the interactive facility to program user defined models as GLIM macros. Notably, empirical studies based on this, and other data sets, indicate that both choices of dispersion statistics defined by equations (3.2) and (3.3) in combination with all three of the above link functions produce near identical results when applied to the-same data set. The resulting qxgraduation and dispersion parameter estimates φ x, are reproduced in Table 4.1. These involve the use of a two-stage modelling process comprising a primary graduation stage based on over-dispersed binomial responses, with odds link and non-linear predictor GMx(2,2); and a secondary dispersion stage based on the Pearson statistic defined by equation (3.2), the translated probit link and quadratic predictor It is instructive to compare these results with those contained in Table 4 of the CMI Committee (1974) presentation. There the variance ratios for quinquennial age groups are not based on the actual data, but quoted rather from the earlier CMI Committee (1957) study. One possible small reservation concerning the smooth nature of the over-dispersion parameter estimates at ages 60 and 65 and over the years between these ages is that, in reality, it is quite likely that multiple policies drop rather sharply as multiple endowment policies mature and policyholders are left with, perhaps, a single whole of life policy. Figure 2.2(a) gives evidence for this discontinuity. It is suggested that it may be possible to cater for this effect by experimenting with different second-stage predictor types, perhaps involving the use of splines. Convergence of the alternating meandispersion modelling stages was achieved to appreciable accuracy after 10 iterations; while the qx-graduation of Table 4 of the C.M.I. Committee (1974) has been verified using the GLIM macros with φ x set equal to 1 for all ages x, and the dispersion modelling stage suppressed. One further, perhaps not surprising feature revealed by these empirical studies, is the apparent failure of the joint modelling process to converge when the structure of the primary graduation modelling stage is known to be inadequate. One such case in point concerns the µx-graduation of the Assured Lives l Experience with Duration 5 and over, based on the generalised non-linear model with Poisson responses, identity link and non-linear predictor GMx(2,2); which is discussed at length in Section 17 of Forfar et al. (1988). The lack of fit, identified by Forfar et al. (1988) through the application of their standard battery of statistical tests, is graphically illustrated in Figure 4.1(a) by the developing cyclical pattern in the residuals with increasing age. To combat this, Forfar et al. (1988) resort to the higher order nonlinear predictor GMx(3,6) which would appear to be GLIM unfriendly. As a viable alternative, it is possible to resort to the natural cubic spline predictor: with n fixed knots {xj}, such that xj<xj+1 cases, in combination with the

13 Joint Modelling for Actuarial Graduation and Duplicate Policies Residuals I Figure 4.1 (a) Age (years) Residuals Against Age. GM(2,2) predictor Assured lives experience canonical log-link and with offsets log (Rx). To set this in the context of existing actuarial graduation practice, it is equivalent to the use of the identity link in combination with the exponentiated predictor GMx(0,s) defined by equation (3.1) in which the polynomial expression β jxj has been replaced by the cubic spline expression defined immediately above. To accord with current actuarial usage, see for example, Chapter 16 of Benjamin & Pollard (1980), it is possible to use the following alternative form for the predictor: where: Further, implementation of this predictor in the current actuarial graduation context of generalised liner models does not require the determination of specific weights, constructed either by reference to a similar standard mortality table as described in Benjamin & Pollard (1980) or by resorting to the McCutcheon (1981) iterative process.

14 82 Joint Modelling for Actuarial Graduation and Duplicate Policies x µx ø x x µx Øx x µx øx x- age, µx- graduated, x,- estimated over-dispersion Table 4.2. Assured lives : Duration 5 and over

15 4 Residuals Joint Modelling for Actuarial Graduation and Duplicate Policies Figure 4.1 (b) Residuals Against Age. Cubic Spline Predictor Assured lives experience 90 Age (years) The Assured Lives Experience with Duration 5 and over has been regraduated using a two-stage generalised linear model comprising a primary graduation stage based on over-dispersed Poisson responses A,, with canonical long-link, natural cubic spline predictor with offsets log(&); and a secondary dispersion stage based on the Pearson dispersion statistic dx, defined by equation (3.2), the scaled and translated probit link with inverse and straight-line predictor The resulting µx-graduation and dispersion parameter estimates, x, for eight approximately equally spaced knots l8, 28, 38, 47, 56, 65, 74, 83 with K = 0.87 corresponding to the maximum observed variance ratios for these data--are reproduced in Table 4.2. Convergence is achieved after six iterations. The goodness-of-fit of the primary graduation stage improves significantly as the number of knots is increased from seven to eight. The standard battery of statistical tests, including the run test and serial correlation tests at lags 1 to 3 inclusive all indicate a highly satisfactory fit, as may be judged from Figure 4.1 (b) in which residuals are once again plotted against age. Unlike the Assured Lives Experience, for the Assured Lives , the dispersion parameter estimates attain the upper bound set by the dispersion link; a feature which is non-attributable to the difference in predictors. It is conjectured that one possible explanation as to why this should occur may well lie in the quality of the fit achieved in the primary graduation stage.

16 84 Joint Modelling for Actuarial Graduation and Duplicate Policies 5. SUMMARY The probability based argument developed in Section 2, in which the effects of duplicate policies on the graduation process are modelled as over-dispersion parameters Øx, differs conceptually from the modelling of duplicate policies used hitherto by actuaries through their use of variance ratios rx, in the sense that variance ratios are here perceived as data-based estimates of the over-dispersion model parameters. One way to allow for the over-dispersion attributable to duplicate policies, that suggested by Forfar et al. (1988), is to transform the data by dividing both the actual numbers of deaths and the exposures to risk of death by the variance ratios before modelling; using either familiar binomial or Poisson models as the case may be. The approach described here differs inasmuch as the untransformed data are modelled directly, while the effects of overdispersion are built into the model structure. The joint modelling process then generates smoothed estimates for the over-dispersion parameters, different from the alternative irregular variance ratio estimates which are not always available, but nevertheless constructed in such a way as to preserve the essential patterns observed in variance ratios. These are then used as weights in the primary graduation modelling stage. The method has clear advantages when variance ratio estimates are unavailable for the data in question, while, as a viable alternative to the data transformation method suggested by Forfar et al. (1988) when variance ratio estimates are available, the variance ratios can be used as alternative weights and the untransformed data modelled directly. REFERENCES BEARD, R. E. & PERKS, W. (1949). The Relation between the Distribution of Sickness and the Effect of Duplicates on the Distribution of Deaths. J.I.A., 75, 75. BENJAMIN, B. & POLLARD, J. H. (1980). The Analysis of Mortality and other Actuarial Statistics. Heinemann, London. BRESLOW, N. (1990). Tests of Hypotheses in Overdispersed Poisson Regression and Other Quasi- Likelihood Models. J. Amer. Statist. Ass., 85, 565. CARROLL, R. J. & RUPPERT, D. (1982). Robust Estimation of Heteroscedastic Linear Models. Ann. Stat., 10, CMI COMMITTEE (1957). Continuous Investigation into the Mortality of Assured Lives, Memorandum on a Special Inquiry into the Distribution of Duplicate Policies. J.I.A., 83,34 and T.F.A., 24,94. CMI COMMITTEE: (1974). Considerations Affecting the Preparation of Standard Tables of Mortality. J.I.A., 101, 133. CMI COMMITTEE (1986). An Investigation into the Distribution of Policies per Life Assured in the Cause of Death Investigation Data. CMIR, 8,49. COX, D. R. (1983). Some Remarks on Over-Dispersion. Biometrika. 70, 269. DAVIDIAN, M. & CARROLL., R. J. (1987). Variance Function Estimation. J. Amer. Statist. Ass., 82, DAVIDIAN, M.& CARROLL, R. J. (1988). A Note on Extended Quasi-Likelihood. J.R. Statist. Soc. B, 50, 74. DAW, R. H. (l946). On the Validity of Statistical Tests of the Graduation of Mortality a Table. J.I.A., 72, 174.

17 Joint Modelling for Actuarial Graduation and Duplicate Policies 85 DAW, R. H. (1951). Duplicate Policies in Mortality Data. J.I.A., 77, 261. FORFAR, D. O., McCUTCHEON, J. J. & WILKIE, A. D. (1988). On Graduation by Mathematical Formula. J.I.A., 115, 1 and T.F.A., 41, 97. McCULLAGH, P. & NELDER, J. A. (1989). Generalized Linear Models. Chapman & Hall, London. McCUTCHEON, J. J. (1981). Some Remarks on Splines. T.F.A., 37, 421. NELDER, J. A. & LEE, Y. (1991). Generalized Linear Models for the Analysis of Taguchi-type Experiments. Applied Stoch. Mod. & Data Anal. 7, 107. NELDER, J. A. & PREGIBON, D. (1987). An Extended Quasi-Likelihood Function. Biometrika, 74,221. PREGIBON, D. (1984). Review of Generalized Linear Models. Ann. Statist. 12, RENSHAW, A. E. (1990). Graduation by Generalized Linear Modelling Techniques. XXII ASTIN Colloquium, Montreux, Switzerland. RENSHAW, A. E. (1991). Actuarial Graduation Practice and Generalised Linear and Non-Linear Models. J.I.A., 118, 295. SEAL, H. (1945). Tests of a Mortality Table Graduation. J.I.A., 71, 3. WEDDERBURN, R. W. M. (1974). Quasi-Likelihood Functions, Generalized Linear Models and the Gauss - Newton Method. Biometrika, 61, 439.

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models

Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models Comparing return to work outcomes between vocational rehabilitation providers after adjusting for case mix using statistical models Prepared by Jim Gaetjens Presented to the Institute of Actuaries of Australia

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Joint models for classification and comparison of mortality in different countries.

Joint models for classification and comparison of mortality in different countries. Joint models for classification and comparison of mortality in different countries. Viani D. Biatat 1 and Iain D. Currie 1 1 Department of Actuarial Mathematics and Statistics, and the Maxwell Institute

More information

Continuous Mortality Investigation. Income Protection Committee WORKING PAPER 48

Continuous Mortality Investigation. Income Protection Committee WORKING PAPER 48 C ONTINUOUS M ORTALITY I NVESTIGATION INSTITUTE OF ACTUARIES FACULTY OF ACTUARIES ISSN 2044-3145 Continuous Mortality Investigation Income Protection Committee WORKING PAPER 48 An overview of the Graduations

More information

Automated Biosurveillance Data from England and Wales, 1991 2011

Automated Biosurveillance Data from England and Wales, 1991 2011 Article DOI: http://dx.doi.org/10.3201/eid1901.120493 Automated Biosurveillance Data from England and Wales, 1991 2011 Technical Appendix This online appendix provides technical details of statistical

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn A Handbook of Statistical Analyses Using R Brian S. Everitt and Torsten Hothorn CHAPTER 6 Logistic Regression and Generalised Linear Models: Blood Screening, Women s Role in Society, and Colonic Polyps

More information

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Corporate Defaults and Large Macroeconomic Shocks

Corporate Defaults and Large Macroeconomic Shocks Corporate Defaults and Large Macroeconomic Shocks Mathias Drehmann Bank of England Andrew Patton London School of Economics and Bank of England Steffen Sorensen Bank of England The presentation expresses

More information

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions A Significance Test for Time Series Analysis Author(s): W. Allen Wallis and Geoffrey H. Moore Reviewed work(s): Source: Journal of the American Statistical Association, Vol. 36, No. 215 (Sep., 1941), pp.

More information

Overview... 2. Accounting for Business (MCD1010)... 3. Introductory Mathematics for Business (MCD1550)... 4. Introductory Economics (MCD1690)...

Overview... 2. Accounting for Business (MCD1010)... 3. Introductory Mathematics for Business (MCD1550)... 4. Introductory Economics (MCD1690)... Unit Guide Diploma of Business Contents Overview... 2 Accounting for Business (MCD1010)... 3 Introductory Mathematics for Business (MCD1550)... 4 Introductory Economics (MCD1690)... 5 Introduction to Management

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives

More information

GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE

GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE ACTA UNIVERSITATIS AGRICULTURAE ET SILVICULTURAE MENDELIANAE BRUNENSIS Volume 62 41 Number 2, 2014 http://dx.doi.org/10.11118/actaun201462020383 GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE Silvie Kafková

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE BY P.D. ENGLAND AND R.J. VERRALL ABSTRACT This paper extends the methods introduced in England & Verrall (00), and shows how predictive

More information

The Probit Link Function in Generalized Linear Models for Data Mining Applications

The Probit Link Function in Generalized Linear Models for Data Mining Applications Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications

More information

The zero-adjusted Inverse Gaussian distribution as a model for insurance claims

The zero-adjusted Inverse Gaussian distribution as a model for insurance claims The zero-adjusted Inverse Gaussian distribution as a model for insurance claims Gillian Heller 1, Mikis Stasinopoulos 2 and Bob Rigby 2 1 Dept of Statistics, Macquarie University, Sydney, Australia. email:

More information

Underwriting risk control in non-life insurance via generalized linear models and stochastic programming

Underwriting risk control in non-life insurance via generalized linear models and stochastic programming Underwriting risk control in non-life insurance via generalized linear models and stochastic programming 1 Introduction Martin Branda 1 Abstract. We focus on rating of non-life insurance contracts. We

More information

Ordinal Regression. Chapter

Ordinal Regression. Chapter Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Example G Cost of construction of nuclear power plants

Example G Cost of construction of nuclear power plants 1 Example G Cost of construction of nuclear power plants Description of data Table G.1 gives data, reproduced by permission of the Rand Corporation, from a report (Mooz, 1978) on 32 light water reactor

More information

Elements of statistics (MATH0487-1)

Elements of statistics (MATH0487-1) Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -

More information

Probit Analysis By: Kim Vincent

Probit Analysis By: Kim Vincent Probit Analysis By: Kim Vincent Quick Overview Probit analysis is a type of regression used to analyze binomial response variables. It transforms the sigmoid dose-response curve to a straight line that

More information

How To Model The Fate Of An Animal

How To Model The Fate Of An Animal Models Where the Fate of Every Individual is Known This class of models is important because they provide a theory for estimation of survival probability and other parameters from radio-tagged animals.

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE

ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE ANNEX 2: Assessment of the 7 points agreed by WATCH as meriting attention (cover paper, paragraph 9, bullet points) by Andy Darnton, HSE The 7 issues to be addressed outlined in paragraph 9 of the cover

More information

Handling attrition and non-response in longitudinal data

Handling attrition and non-response in longitudinal data Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

A Primer on Forecasting Business Performance

A Primer on Forecasting Business Performance A Primer on Forecasting Business Performance There are two common approaches to forecasting: qualitative and quantitative. Qualitative forecasting methods are important when historical data is not available.

More information

Model Selection and Claim Frequency for Workers Compensation Insurance

Model Selection and Claim Frequency for Workers Compensation Insurance Model Selection and Claim Frequency for Workers Compensation Insurance Jisheng Cui, David Pitt and Guoqi Qian Abstract We consider a set of workers compensation insurance claim data where the aggregate

More information

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships

More information

Predictive Modeling for Life Insurers

Predictive Modeling for Life Insurers Predictive Modeling for Life Insurers Application of Predictive Modeling Techniques in Measuring Policyholder Behavior in Variable Annuity Contracts April 30, 2010 Guillaume Briere-Giroux, FSA, MAAA, CFA

More information

13. Poisson Regression Analysis

13. Poisson Regression Analysis 136 Poisson Regression Analysis 13. Poisson Regression Analysis We have so far considered situations where the outcome variable is numeric and Normally distributed, or binary. In clinical work one often

More information

The Fair Valuation of Life Insurance Participating Policies: The Mortality Risk Role

The Fair Valuation of Life Insurance Participating Policies: The Mortality Risk Role The Fair Valuation of Life Insurance Participating Policies: The Mortality Risk Role Massimiliano Politano Department of Mathematics and Statistics University of Naples Federico II Via Cinthia, Monte S.Angelo

More information

Regression III: Advanced Methods

Regression III: Advanced Methods Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models

More information

Time series Forecasting using Holt-Winters Exponential Smoothing

Time series Forecasting using Holt-Winters Exponential Smoothing Time series Forecasting using Holt-Winters Exponential Smoothing Prajakta S. Kalekar(04329008) Kanwal Rekhi School of Information Technology Under the guidance of Prof. Bernard December 6, 2004 Abstract

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis. These models are appropriate when the response

More information

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1. Page 1 of 11. EduPristine CMA - Part I

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1. Page 1 of 11. EduPristine CMA - Part I Index Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1 EduPristine CMA - Part I Page 1 of 11 Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators... MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

More information

INTERPOLATION. Interpolation is a process of finding a formula (often a polynomial) whose graph will pass through a given set of points (x, y).

INTERPOLATION. Interpolation is a process of finding a formula (often a polynomial) whose graph will pass through a given set of points (x, y). INTERPOLATION Interpolation is a process of finding a formula (often a polynomial) whose graph will pass through a given set of points (x, y). As an example, consider defining and x 0 =0, x 1 = π 4, x

More information

Time Series Analysis. 1) smoothing/trend assessment

Time Series Analysis. 1) smoothing/trend assessment Time Series Analysis This (not surprisingly) concerns the analysis of data collected over time... weekly values, monthly values, quarterly values, yearly values, etc. Usually the intent is to discern whether

More information

Chapter 29 The GENMOD Procedure. Chapter Table of Contents

Chapter 29 The GENMOD Procedure. Chapter Table of Contents Chapter 29 The GENMOD Procedure Chapter Table of Contents OVERVIEW...1365 WhatisaGeneralizedLinearModel?...1366 ExamplesofGeneralizedLinearModels...1367 TheGENMODProcedure...1368 GETTING STARTED...1370

More information

APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING

APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING Sulaimon Mutiu O. Department of Statistics & Mathematics Moshood Abiola Polytechnic, Abeokuta, Ogun State, Nigeria. Abstract

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Gender Effects in the Alaska Juvenile Justice System

Gender Effects in the Alaska Juvenile Justice System Gender Effects in the Alaska Juvenile Justice System Report to the Justice and Statistics Research Association by André Rosay Justice Center University of Alaska Anchorage JC 0306.05 October 2003 Gender

More information

Diploma of Business Unit Guide 2015

Diploma of Business Unit Guide 2015 Unit Guide 2015 OVERVIEW... 2 ACCOUNTING FOR BUSINESS (MCD1010)... 3 INTRODUCTORY MATHEMATICS FOR BUSINESS (MCD1550)... 4 INTRODUCTORY ECONOMICS (MCD1690)... 5 INTRODUCTION TO MANAGEMENT (MCD1020)... 6

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090 Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

Introduction to time series analysis

Introduction to time series analysis Introduction to time series analysis Margherita Gerolimetto November 3, 2010 1 What is a time series? A time series is a collection of observations ordered following a parameter that for us is time. Examples

More information

STOCHASTIC CLAIMS RESERVING IN GENERAL INSURANCE. By P. D. England and R. J. Verrall. abstract. keywords. contact address. ".

STOCHASTIC CLAIMS RESERVING IN GENERAL INSURANCE. By P. D. England and R. J. Verrall. abstract. keywords. contact address. . B.A.J. 8, III, 443-544 (2002) STOCHASTIC CLAIMS RESERVING IN GENERAL INSURANCE By P. D. England and R. J. Verrall [Presented to the Institute of Actuaries, 28 January 2002] abstract This paper considers

More information

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association

More information

Teaching Multivariate Analysis to Business-Major Students

Teaching Multivariate Analysis to Business-Major Students Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis

More information

Simple Linear Regression

Simple Linear Regression STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze

More information

15.062 Data Mining: Algorithms and Applications Matrix Math Review

15.062 Data Mining: Algorithms and Applications Matrix Math Review .6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

Logistic Regression (a type of Generalized Linear Model)

Logistic Regression (a type of Generalized Linear Model) Logistic Regression (a type of Generalized Linear Model) 1/36 Today Review of GLMs Logistic Regression 2/36 How do we find patterns in data? We begin with a model of how the world works We use our knowledge

More information

Some Observations on Variance and Risk

Some Observations on Variance and Risk Some Observations on Variance and Risk 1 Introduction By K.K.Dharni Pradip Kumar 1.1 In most actuarial contexts some or all of the cash flows in a contract are uncertain and depend on the death or survival

More information

15.1 The Structure of Generalized Linear Models

15.1 The Structure of Generalized Linear Models 15 Generalized Linear Models Due originally to Nelder and Wedderburn (1972), generalized linear models are a remarkable synthesis and extension of familiar regression models such as the linear models described

More information

Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009

Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009 Content Area: Mathematics Grade Level Expectations: High School Standard: Number Sense, Properties, and Operations Understand the structure and properties of our number system. At their most basic level

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

3.3. Solving Polynomial Equations. Introduction. Prerequisites. Learning Outcomes

3.3. Solving Polynomial Equations. Introduction. Prerequisites. Learning Outcomes Solving Polynomial Equations 3.3 Introduction Linear and quadratic equations, dealt within Sections 3.1 and 3.2, are members of a class of equations, called polynomial equations. These have the general

More information

A Short review of steel demand forecasting methods

A Short review of steel demand forecasting methods A Short review of steel demand forecasting methods Fujio John M. Tanaka This paper undertakes the present and past review of steel demand forecasting to study what methods should be used in any future

More information

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as LOGISTIC REGRESSION Nitin R Patel Logistic regression extends the ideas of multiple linear regression to the situation where the dependent variable, y, is binary (for convenience we often code these values

More information

Time Series Analysis

Time Series Analysis Time Series Analysis Identifying possible ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos

More information

You Are What You Bet: Eliciting Risk Attitudes from Horse Races

You Are What You Bet: Eliciting Risk Attitudes from Horse Races You Are What You Bet: Eliciting Risk Attitudes from Horse Races Pierre-André Chiappori, Amit Gandhi, Bernard Salanié and Francois Salanié March 14, 2008 What Do We Know About Risk Preferences? Not that

More information

Forecasting in supply chains

Forecasting in supply chains 1 Forecasting in supply chains Role of demand forecasting Effective transportation system or supply chain design is predicated on the availability of accurate inputs to the modeling process. One of the

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

From the help desk: Bootstrapped standard errors

From the help desk: Bootstrapped standard errors The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution

More information

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page Errata for ASM Exam C/4 Study Manual (Sixteenth Edition) Sorted by Page 1 Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page Practice exam 1:9, 1:22, 1:29, 9:5, and 10:8

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

Computer exercise 4 Poisson Regression

Computer exercise 4 Poisson Regression Chalmers-University of Gothenburg Department of Mathematical Sciences Probability, Statistics and Risk MVE300 Computer exercise 4 Poisson Regression When dealing with two or more variables, the functional

More information

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS * Students who scored a Level 3 or above on the Florida Assessment Test Math Florida Standards (FSA-MAFS) are strongly encouraged to make Advanced Placement and/or dual enrollment courses their first choices

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Analysis of Bayesian Dynamic Linear Models

Analysis of Bayesian Dynamic Linear Models Analysis of Bayesian Dynamic Linear Models Emily M. Casleton December 17, 2010 1 Introduction The main purpose of this project is to explore the Bayesian analysis of Dynamic Linear Models (DLMs). The main

More information

CREATING A CORPORATE BOND SPOT YIELD CURVE FOR PENSION DISCOUNTING DEPARTMENT OF THE TREASURY OFFICE OF ECONOMIC POLICY WHITE PAPER FEBRUARY 7, 2005

CREATING A CORPORATE BOND SPOT YIELD CURVE FOR PENSION DISCOUNTING DEPARTMENT OF THE TREASURY OFFICE OF ECONOMIC POLICY WHITE PAPER FEBRUARY 7, 2005 CREATING A CORPORATE BOND SPOT YIELD CURVE FOR PENSION DISCOUNTING I. Introduction DEPARTMENT OF THE TREASURY OFFICE OF ECONOMIC POLICY WHITE PAPER FEBRUARY 7, 2005 Plan sponsors, plan participants and

More information

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Xavier Conort xavier.conort@gear-analytics.com Motivation Location matters! Observed value at one location is

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

Equations, Inequalities & Partial Fractions

Equations, Inequalities & Partial Fractions Contents Equations, Inequalities & Partial Fractions.1 Solving Linear Equations 2.2 Solving Quadratic Equations 1. Solving Polynomial Equations 1.4 Solving Simultaneous Linear Equations 42.5 Solving Inequalities

More information

College Readiness LINKING STUDY

College Readiness LINKING STUDY College Readiness LINKING STUDY A Study of the Alignment of the RIT Scales of NWEA s MAP Assessments with the College Readiness Benchmarks of EXPLORE, PLAN, and ACT December 2011 (updated January 17, 2012)

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

REINSURANCE PROFIT SHARE

REINSURANCE PROFIT SHARE REINSURANCE PROFIT SHARE Prepared by Damian Thornley Presented to the Institute of Actuaries of Australia Biennial Convention 23-26 September 2007 Christchurch, New Zealand This paper has been prepared

More information

THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS

THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS KEITH CONRAD 1. Introduction The Fundamental Theorem of Algebra says every nonconstant polynomial with complex coefficients can be factored into linear

More information

Multivariate Logistic Regression

Multivariate Logistic Regression 1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation

More information

Motor and Household Insurance: Pricing to Maximise Profit in a Competitive Market

Motor and Household Insurance: Pricing to Maximise Profit in a Competitive Market Motor and Household Insurance: Pricing to Maximise Profit in a Competitive Market by Tom Wright, Partner, English Wright & Brockman 1. Introduction This paper describes one way in which statistical modelling

More information

TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics

TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science & Statistics BA (Mod) Enter Course Title Trinity Term 2013 Junior/Senior Sophister ST7002

More information