A Space Time Extension of the Lee Carter Model in a Hierarchical Bayesian Framework: Modelling and Forecasting Provincial Mortality in Italy

Size: px
Start display at page:

Download "A Space Time Extension of the Lee Carter Model in a Hierarchical Bayesian Framework: Modelling and Forecasting Provincial Mortality in Italy"

Transcription

1 WP October 2013 UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE STATISTICAL OFFICE OF THE EUROPEAN UNION (EUROSTAT) Joint Eurostat/UNECE Work Session on Demographic Projections organised in cooperation with Istat (29-31 October 2013, Rome, Italy) Item 14 Multiregional Projections A Space Time Extension of the Lee Carter Model in a Hierarchical Bayesian Framework: Modelling and Forecasting Provincial Mortality in Italy Fedele Greco, University of Bologna Francesco Scalone, University of Bologna

2 Joint Eurostat UNECE ISTAT Work Session on Demographic Projections Rome, October 2013 A Space Time Extension of the Lee Carter Model in a Hierarchical Bayesian Framework: Modelling and Forecasting Provincial Mortality in Italy Fedele Greco Francesco Scalone Department of Statistical Sciences University of Bologna, Italy 1. Introduction The main purpose of this paper is to define a statistical method to model and forecast mortality rates by age and sex for provincial areas in Italy. In this endeavor, we will combine mortality modeling techniques using a Bayesian approach. Standard mortality poorly forecasting performs at a small area level. In fact, when working on single small areas, forecasters have to deal with large variances of direct estimates of the mortality rates, i.e. estimates obtained by exploiting data which refer to each small area separately. We propose a statistical model that allows area level estimates to borrow strength from each other by exploiting spatial association of provincial mortality rates and taking into account temporal correlation. The need for a spatiotemporal model is motivated by an explorative analysis. This approach has become very popular in the disease mapping literature (Kim and Lim 2010, Knorr Held 2000) but, to our knowledge, it has not been employed for modeling and forecasting age sex mortality rates in a demographic framework. In these terms, we redefine an extension of the Lee Carter method as a statistical model accounting for sub populations at provincial geographical level. Adopting a Bayesian approach, Markov Chain Monte Carlo methods (Girosi and King 2008) will be used to fit the model and to sample from the posterior predictive distribution. The outline of this paper is as follows. We first review the most used mortality modeling techniques such as the classical Lee Carter method, its log linear Poisson formulation, the extensions for multiple sub populations and the more recent Bayesian hierarchical models. Then we present the used data providing a brief description of the twentieth century provincial mortality trends in Italy. Afterwards, adopting a Bayesian approach, we define a spatial temporal model, giving details on implementation and prediction. In the final part, some concluding remarks are provided, also presenting possible applications for mortality forecasting.

3 2. Literature review In this section we briefly look at the most recent techniques of modeling forecasting mortality. We don t consider the ones based on pure deterministic and extrapolative methods (see Pitacco et al. 2009, ), focusing on methods that include stochastic components. For the purposes of this paper, we also need to look at methods that forecast mortality in multiple sub group populations. From this point of view, Bayesian hierarchical formulations can be ideal to control for all possible sources of variability (concerning both model parameters and predictions) and to forecast mortality rates on different geographical sub national units, as in the case of the province. The Lee Carter Method Lee and Carter (1992) proposed a method (henceforth the LC method) for modeling long term change in mortality as a function of a unique temporal index. Based on standard time series procedures, the LC method takes into account historical trends and projects distributions of age specific death rates. In order to represent the age specific mortality, Lee and Carter (1992) model the central death rate by using the following formulation. Let denote the central death rate for age x at time t, so we assume the following log linear form:, As Lee and Carter proposed, the s describe the age pattern of mortality averaged over time, whereas the s describe the deviations from the averaged pattern when varies. The change in the level of mortality over time is described by the (univariate) mortality index. The quantity,, denotes the error term, with mean 0 and variance, reflecting particular age specific historical influence that are not captured by the model. However, the LC methodology works as a mere extrapolation based on the fact that the future will be someway like the past, but without taking into account the effects of sudden improvements in survivorship, for example related to new medical discoveries and treatments. In order to predict future age specific mortality rates, Lee and Carter assume that the temporal component can be modeled as a random walk with drift, following an ARIMA(0,1,0) process. As a matter of fact, this stochastic process remains the only source of variability taken into account. Log Linear Poisson Models However, the estimation error of the parameters cannot be taken into account and the proposed model only incorporates uncertainty related to the index forecast (Lee and Carter, 1992). So the prediction intervals may result too narrow. In order to take into account all variability sources, a bootstrap procedure for a Poisson logbilinear formulation of the LC model was implemented (Brouhns et al. 2005). As suggested

4 by Alho (2000), switching from a classical linear model to a generalized linear model is possible. So the original LC method was embedded in a Poisson regression model, which is perfectly suited for age sex specific mortality rates the Poisson distribution arises naturally when data set takes the form of counts (Brouhns et al. 2002). Extensions for multiple sub populations Using the LC method to forecast mortality rates for the two sexes or other sub groups of a population can, however, be difficult. As a matter of fact, it seems improper to forecast mortality for provincial or other sub national populations in isolation from one another. Indeed, closely related populations have similar mortality patterns that do not diverge in the long run. In order to improve the mortality forecasts for individual provinces or other subnational populations, the LC model can be modified by taking into account their membership in a group. While forecasting provincial mortality for Canada, Lee and Nault (1993) propose making use of the same x and k t for each province, only in the case the historical x s do not vary significantly by province. In another application, the LC model is applied to a group of populations, allowing each its own age pattern and level of mortality but imposing shared rates of change by age (Lii and Lee 2005). Bayesian Hierarchical Spatio Temporal Frameworks Turning to the Bayesian approach Czado et al. (2005) proposed a Bayesian formulation of the log linear Poisson to consider all possible sources of variability that affect prediction values and intervals. A further Bayesian formulation of the LC method is given in Pedroza (2006), by using simulation methods. So Markov Chain Monte Carlo (henceforth MCMC) methods are proposed to draw samples from the joint posterior distribution of the parameters and to shape the posterior predictive distribution of the log mortality rates. In particular, the model is fitted and the rates are forecasted making use of the Gibbs sampler (Pedroza 2006). In a Bayesian framework, predicted intervals appear to be more correctly wider than those calculated applying the LC method, since they appropriately incorporate all known sources of variability. As a consequence, forecast variability turns out to be better suited to the observed data. During the last two decades, further hierarchical Bayesian estimates of mortality rates were proposed, based on Binomial or Poisson sampling and taking into account spatial correlation procedure. These hierarchical models are generally based on a Poisson model for the first stage and incorporate covariates by various modeling of the Poisson parameters at the second stage (Kim and Lim 2010). Spatial effects are typically included as random with some distributions where their parameters must be estimated. Maximum likelihood methods are generally used to estimate hyper parameters whereas Bayesian or empirical Bayesian methods are applied to obtain Poisson parameters and rates. In particular, Knorr Held (2000) propose spatio temporal interaction models where the spatial effects are nested within time so that it is possible to take into account how spatial heterogeneity and patterns evolved over time.

5 3. Descriptive analyses First of all, it is worth defining the geo unit of our analysis. In Italy, a province is an administrative division of intermediate level between a municipality and a region. In 2009, on average they count 561,168 persons, whereas the median is equal to 385,729, the first and third quartiles are 232,540 and 586,020, respectively. However, some provinces can be quite different in terms of population dimension. As a matter of fact, ten provinces have more than 1 million inhabitants, whereas 18 have less than 200,000. The two most populated provinces of Rome and Milan count respectively 4,1 and 3,4 million, whereas the least populated ones of Isernia and Ogliastra count 88 and 57 thousands. From this point of view, we want to take into account this variation in population size when we model and predict mortality levels by province. In order to model age specific mortality rates at provincial level, we take into account number of deaths and population amounts by 5 year age groups and provinces in each year from 1993 to We used data collected by the Italian Institute of National Statistics (Istat) referring to 103 Italian provinces in the period between 1993 and The deaths and population series were preliminary reconstructed in order to have time constant provincial borders. Measuring Age Specific Mortality Levels In order to highlight spatio temporal trends of mortality in Italy, we take into account age group specific mortality rates. Let Djtp denote the frequencies of deaths for the t th time period within the j th age group of the p th province. Direct estimates of the age sex specific central provincial death rates are obtained as: The jth age group is a 5 years age class comprising all single one year ages from x to x+4. jtp is the average population in the j th age group at time t in province p: 2 where is the population in the j th age group in December 31 in year t for a given p th province. Thus exposure to risk is therefore given by the population in the j th age group at time t 1 plus the population at time t divided by 2.

6 Mortality Tendencies in Italy at Provincial Level For the sake of brevity, all the figures discussed in this section are referred to the female population. In figure 1, provinces are classified by direct mortality rates level in age group between 0 and 4 for six given years (1992, 1995, 1998, 2001, 2004 and 2007). In each map, data is presented according to a seven intervals classification based on the septiles (7 quantiles). In order to have the same classification in each year, septiles are calculated by taking into account all the observed death rates in each province between 1992 and In these terms, figure 1 represents the general declining trend which occurred in the 0 4 age group during the period in question. Indeed, we can easily see that the first two 1992 and 1995 maps in figure 1 are dominated by the darkest highest mortality categories, since almost all the provinces fell in the highest quantiles. On the other hand, in 2005 and 2007, we can see the clearest lowest mortality categories prevalence. In figure 2, mortality rates in the 0 4 age group are still considered. Nevertheless, category intervals are not constant, since septiles are calculated on death rates ranking of a given single year which the map refers to. In these terms, we can observe the permanence of mortality clustering during the whole period in question, right up to As a matter of fact, higher death rates are concentrated in some visible cluster of provinces, providing evidence of spatial autocorrelation of mortality. In these terms, we demonstrate the importance of spatial structure components when modeling mortality rates for provincial sub populations. It is also important to say that the two tendencies (general mortality decline and clustering persistence) we have just observed for the age group between 0 and 4 are common to the other age classes. To give a further example, in figure 3 and 4, we report direct mortality rates in the age group, creating the interval categories in the same way we did for figure 1 and 2, respectively.

7 Figure 1. Mortality rates in age class 0 4 by province. Colored intervals = Seven quantiles based on the distribution of all mortality rates observed from 1992 to Figure 2. Mortality rates in age class 0 4 by province. Colored intervals = Seven quantiles based on distributions of mortality rates observed in each single given year

8 Figure 3. Mortality rates in age class by province. Colored intervals = Seven quantiles based on the distribution of all mortality rates observed from 1992 to Figure 4. Mortality rates in age class by province. Colored intervals = Seven quantiles based on distributions of mortality rates observed in each single given year

9 Figure 5. APLE Index based on provincial mortality rates by years and 5 age groups for males and females, Males Females Year Year j th Age age class j th Age age class To better demonstrate spatial mortality clustering, figure 5 displays male and female Approximate Profile Likelihood Estimator (APLE) Indexes on direct provincial death rates for different years and age groups. We prefer the APLE index to the more commonly used Moran s I because the former index has shown better behaviors when describing spatial correlation (Li et al., 2007). Stronger spatial autocorrelation is observed for both male and female populations in the first 0 4 age groups on each year from 1992 to The persistence of mortality clustering for infant ages can be due to differences in health care systems and environmental conditions. As a matter of fact, neonatology and pediatric care is organized at district and regional levels and so changing from area to area. Spatial clustering in mortality evidently reduces in younger age classes for both sexes (almost disappearing for males). Direct mortality rates in those ages are very low and thus are more affected by random variations. As a consequence, it is not possible to register spatial autocorrelation. In addition, stronger mortality clustering is evident for males in adult ages (with higher APLE Index levels before 2000), whereas almost no spatial autocorrelation is observed for the oldest ages between 80 and 99. On the contrary, when looking at females, we observe weaker mortality clustering for central adult ages and stronger spatial correlation for oldest age groups. Given the differences in survivorship between the two sexes, higher mortality levels in adult ages significantly and selectively reduce the number of male survivors in oldest age groups. On the one hand, oldest male survivors should be more robust, making biological components prevail on environmental factors and health systems and thus cancelling mortality clustering. In these terms, women in oldest age groups should be less selected and numerous than men and so more affected by geo spatial differences. On the

10 other hand, direct mortality rates for men in oldest age classes are based on very small numbers both at the numerator and denominator, making them extremely sensitive to random fluctuations and thus annulling spatial autocorrelation. 4. Hierarchical Bayesian Models for Space Time Variation In this section, we illustrate a Bayesian hierarchical model in order to take into account the data features described in the previous section. More precisely, the model is intended to capture both temporal and spatial features of the mortality trends, along with spatiotemporal interactions. The model which follows was first proposed by Knorr Held (2000) and has been adopted, with some modification, in several disease mapping applications (see for example: Kim and Lim 2010). As a starting point, we propose separate models for each age class; thus, in order to simplify notation, we drop the subscript j indexing age groups as in previous formulae. Let P tp and D tp denote respectively the exposure to risk and the death counts at year t and province p, t 1,...,T ; p 1,...,P. At the first level of the hierarchy, conditionally on model parameters involved in higher levels, we assume that mortality counts parameter P, i.e.: tp tp Dtp follow independent Poisson distributions with D ~Poisson P tp tp tp where tp denotes the mortality rate which is modeled as a logarithm function of three sets of random effect as follows: ln. tp t p tp Random terms t, p, tp are temporal, spatial and spatiotemporal random effects respectively. Model hierarchy is completed by assuming prior distributions for each random vector α 1,..., t,..., T, φ 1,..., p,..., P, δ 11,..., tp,..., TP. For all these set of parameters, an Intrinsic Conditional AutoRegressive (ICAR) model is assumed, that can be expressed as multivariate Normal distributions parameterized in terms of mean vectors and precision matrices as follows: where matrices W and T 2 2 α ~MVN 0, M W φ ~MVN 0, M W P δ ~MVN 0, M W M W TP 2 W are adjacency matrices whose generic ij entry elements i and j are considered as neighbors, while wij 0 otherwise. Matrices M and wij 1 if M

11 are diagonal matrices with diagonal entries equal to the number of neighbors. As regards the temporal structure of the model, we consider only adjacent times as neighbors, i.e. w 1 iff tt' 1: this delivers a first order Random Walk for capturing temporal dynamics. As regards spatial random effects, provinces are considered as neighbors if they share a common boundary. The precision matrix of the spatiotemporal interaction term delta is specified as the Kronecker product of M W as suggested by Clayton (1996). As regards precision parameters 2, M W and 2 and 2, we assume independent small parameter Gamma as prior distribution both to preserve non informativeness and to take advantage of the conjugacy between the Normal and the Gamma distribution. In order to allow model identifiability, constraints are imposed on spatial ( φ ) and spatiotemporal ( δ ) random effects, while the unconstrained temporal random effects ( α ) capture the average log mortality rate at each year t Let θ α, φ, δ,,, Bayesian model can be written as follows: denote the set of model parameters. Eventually, the full θ D D θ α φ δ p p p p p p p p Where pθ D denotes the joint posterior distribution of model parameters. Since such distribution is not available in closed form, fully Bayesian inference proceeds by means of a MCMC algorithm which allows us to obtain samples from the posterior distribution. The MCMC algorithm consists in block Metropolis steps for sampling random effects 2 2 α, φ, δ. Gibbs steps can be adopted for sampling precision parameters,, 2, since their full conditional distributions are available in closed form. The algorithm we built performs an automatic tuning of the proposal distribution in order to achieve an acceptance rate of around 40%. With regard to forecasting, coherently with a Bayesian approach, predictions proceed on the basis of the posterior predictive distribution: a merit of the Bayesian approach is to properly propagate the uncertainty related to all model parameters. tt' 5. Applications Results shown in this section are based on posterior samples from the posterior distribution. High autocorrelation can be observed in the MCMC samples and convergence is quite slow. For this reason, after a 30,000 burn in period, we perform heavy thinning of the chains, building a final sample of 10,000 post convergence samples.

12 For the sake of brevity, only results referred to the female population are shown. In Figure 6, posterior means of the temporal random effects α are reported for each year and age class. The expected shape of the log mortality rate along ages is recovered from the model even if the proposed model does not explicitly take into account dependence between mortality rates by age. The downward shifting of the year specific curves testifies the decreasing trend in the mortality rates. Figure 6. Posterior means of the temporal random effects α vs age class. Each line corresponds to a year. j j-th age class Figure 7. Posterior means of the temporal random effects α (green lines) and provincespecific log rates in years 1992 and Year 1992 Year 2009 j j j-th age class j-th age class

13 As can be seen from Figure 7, while average levels of the log morality rates show a smooth trend over age classes, Province specific curves show quite a noisy behavior that can be ascribed to the lack of dependence between age classes in the estimated models. Figure 8. Posterior means of the spatial random effects φ for age classes 0 4 and Age class 0-4 Age class Figure 8 shows the average spatial structure at provincial level which was incorporated by the φ components. Figure 9. Model based estimates (black dots) and Bayesian credibility intervals (black lines). Direct estimates (red dots) and confidence intervals (red dashed lines). Year 1992 age class 0 4. Estimated Rate Province In Figure 9, model based estimates are compared with direct estimates: it can be seen that model based estimates are less variable than direct estimates. In fact, the confidence intervals which refer to direct estimates are on average 20% wider than the model based

14 credibility intervals. Such reduction in the uncertainty of the estimates can be attributed to the borrowing strength process. 6. Concluding Remarks In this preliminary paper, we discuss a spatio temporal Bayesian hierarchical model to estimate small area mortality rates at provincial level. Since provincial sub populations in Italy can widely vary, direct estimates of mortality rates can show huge uncertainty when considering specific age groups in some smaller provinces. In a preliminary descriptive analysis, we clearly demonstrated the existence of specific spatiotemporal patterns which can be easily incorporated in a comprhensive Bayesian hierarchical model. By exploiting the observed spatial association and temporal correlation trends, we proposed a model where provincial level mortality rates estimates can borrow strength from each other in order to reduce uncertainty related to the estimates. Model estimation is performed, under a Bayesian framework, by adopting an MCMC algorithm to obtain samples from the posterior distribution. As a result, it appears that model based estimates are less variable than direct estimates. Even if the dependence between mortality rates by age is not included in the model, the estimated log mortality rates assume a typical J shape curve which is totally coherent with the expected results. The ability of the proposed model in forecasting mortality rates needs to be evaluated; technically, predictions proceed on the basis of the posterior predictive distribution: prediction will be the next step of the present work, along with extensions that will allow to take account of age dependence. References Alho, J.M., Discussion of Lee (2000). North American Actuarial Journal 4, Brouhns N., Denuit M., Vermunt J. K., A Poisson log bilinear regression approach to the construction of projected lifetables. Insurance: Mathematics and Economics 31 (2002) Brouhns N., Denuit M., Van Keilegom I., Bootstrapping the Poisson log bilinear model for mortality forecasting. Scandinavian Actuarial Journal Clayton D., Generalized linear mixed models. In Markov Chain Monte Carlo in Practice, ed. by W.R. Gilks, S. Richardson, and D.J. Spiegelhalter. Chapman and Hall,

15 Czado C., Delwarde A., Denuit M., Bayesian Poisson log bilinear mortality projections. Insurance: Mathematics & Economics 36, Girosi F. and King G., Demographic Forecasting, Princeton University Press. Kim H. and Lim H., Comparison of Bayesian Spatio Temporal Models for Chronic Diseases. Journal of Data Science 8(2010), Knorr Held L Bayesian modelling of inseparable space time variation in disease risk. Statistics in Medicine. 19: Lee R. D. and Carter L., Modeling and Forecasting the Time Series of U.S. Mortality. Journal of the American Statistical Association. 87: Lee, R. D. and F. Nault Modeling and Forecasting Provincial Mortality in Canada. Paper presented at World Congress of the IUSSP, Montreal, Canada. Li H., Calder C. A. and Cressie N. A. C Beyond Moran s I: testing for spatial dependence based on the spatial autoregressive model. Geographical Analysis 39, Li N. and Lee R. D, Coherent mortality forecasts for a group of populations: An extension of the Lee Carter method. Demography. 42(3): Pedroza C., A Bayesian forecasting model: predicting U.S. male mortality. Biostatistics. 7(4): Pitacco E., Denuit M., Haberman S., Olivieri A., Modelling Longevity Dynamics for Pensions and Annuity Business, Oxford University Press.

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

LONGEVITY IMPACT ON THE LIFE ANNUITIES ON ROMANIA BY COMPARATIVE ANALYSIS WITH BULGARIA AND HUNGARY

LONGEVITY IMPACT ON THE LIFE ANNUITIES ON ROMANIA BY COMPARATIVE ANALYSIS WITH BULGARIA AND HUNGARY LONGEVITY IMPACT ON THE LIFE ANNUITIES ON ROMANIA BY COMPARATIVE ANALYSIS WITH BULGARIA AND HUNGARY Lucian Claudiu ANGHEL, PhD * Cristian Ioan SOLOMON ** Abstract People are living longer worldwide than

More information

11. Time series and dynamic linear models

11. Time series and dynamic linear models 11. Time series and dynamic linear models Objective To introduce the Bayesian approach to the modeling and forecasting of time series. Recommended reading West, M. and Harrison, J. (1997). models, (2 nd

More information

Prospective Life Tables

Prospective Life Tables An introduction to time dependent mortality models by Julien Antunes Mendes and Christophe Pochet TRENDS OF MORTALITY Life expectancy at birth among early humans was likely to be about 20 to 30 years.

More information

Dynamic and Stochastic Survival Models

Dynamic and Stochastic Survival Models Dynamic and Stochastic Survival Models A Comparison of Europe by Omer Saddiqi THESIS for the degree of MASTER OF SCIENCE (Modelling and Data Analysis) Faculty of Mathematics and Natural Sciences University

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct

More information

A Practical Approach of Natural Hedging for Insurance Companies. Hong-Chih Huang 1 Chou-Wen Wang 2. Proposal draft 2015/04/30 ABSTRACT

A Practical Approach of Natural Hedging for Insurance Companies. Hong-Chih Huang 1 Chou-Wen Wang 2. Proposal draft 2015/04/30 ABSTRACT A Practical Approach of Natural Hedging for Insurance Companies Hong-Chih Huang 1 Chou-Wen Wang 2 Proposal draft 2015/04/30 ABSTRACT This research investigates a natural hedging strategy and attempts to

More information

The Fair Valuation of Life Insurance Participating Policies: The Mortality Risk Role

The Fair Valuation of Life Insurance Participating Policies: The Mortality Risk Role The Fair Valuation of Life Insurance Participating Policies: The Mortality Risk Role Massimiliano Politano Department of Mathematics and Statistics University of Naples Federico II Via Cinthia, Monte S.Angelo

More information

Tutorial on Markov Chain Monte Carlo

Tutorial on Markov Chain Monte Carlo Tutorial on Markov Chain Monte Carlo Kenneth M. Hanson Los Alamos National Laboratory Presented at the 29 th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Technology,

More information

MULTI-POPULATION MORTALITY MODELLING: A Danish Case Study. Andrew Cairns Heriot-Watt University, and The Maxwell Institute, Edinburgh

MULTI-POPULATION MORTALITY MODELLING: A Danish Case Study. Andrew Cairns Heriot-Watt University, and The Maxwell Institute, Edinburgh MULTI-POPULATION MORTALITY MODELLING: A Danish Case Study Andrew Cairns Heriot-Watt University, and The Maxwell Institute, Edinburgh Joint work with D. Blake, K. Dowd, M. Kallestrup-Lamb, C. Rosenskjold

More information

BayesX - Software for Bayesian Inference in Structured Additive Regression

BayesX - Software for Bayesian Inference in Structured Additive Regression BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich

More information

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships

More information

Joint models for classification and comparison of mortality in different countries.

Joint models for classification and comparison of mortality in different countries. Joint models for classification and comparison of mortality in different countries. Viani D. Biatat 1 and Iain D. Currie 1 1 Department of Actuarial Mathematics and Statistics, and the Maxwell Institute

More information

How To Understand The Theory Of Probability

How To Understand The Theory Of Probability Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL

More information

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc.

An Application of the G-formula to Asbestos and Lung Cancer. Stephen R. Cole. Epidemiology, UNC Chapel Hill. Slides: www.unc. An Application of the G-formula to Asbestos and Lung Cancer Stephen R. Cole Epidemiology, UNC Chapel Hill Slides: www.unc.edu/~colesr/ 1 Acknowledgements Collaboration with David B. Richardson, Haitao

More information

Bayesian Statistics: Indian Buffet Process

Bayesian Statistics: Indian Buffet Process Bayesian Statistics: Indian Buffet Process Ilker Yildirim Department of Brain and Cognitive Sciences University of Rochester Rochester, NY 14627 August 2012 Reference: Most of the material in this note

More information

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data

More information

Parallelization Strategies for Multicore Data Analysis

Parallelization Strategies for Multicore Data Analysis Parallelization Strategies for Multicore Data Analysis Wei-Chen Chen 1 Russell Zaretzki 2 1 University of Tennessee, Dept of EEB 2 University of Tennessee, Dept. Statistics, Operations, and Management

More information

A Basic Introduction to Missing Data

A Basic Introduction to Missing Data John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

Handling attrition and non-response in longitudinal data

Handling attrition and non-response in longitudinal data Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein

More information

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach Refik Soyer * Department of Management Science The George Washington University M. Murat Tarimcilar Department of Management Science

More information

Imputing Missing Data using SAS

Imputing Missing Data using SAS ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are

More information

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT

Validation of Software for Bayesian Models using Posterior Quantiles. Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Validation of Software for Bayesian Models using Posterior Quantiles Samantha R. Cook Andrew Gelman Donald B. Rubin DRAFT Abstract We present a simulation-based method designed to establish that software

More information

Basics of Statistical Machine Learning

Basics of Statistical Machine Learning CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar

More information

EXPLORING SPATIAL PATTERNS IN YOUR DATA

EXPLORING SPATIAL PATTERNS IN YOUR DATA EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze

More information

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE

PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE PREDICTIVE DISTRIBUTIONS OF OUTSTANDING LIABILITIES IN GENERAL INSURANCE BY P.D. ENGLAND AND R.J. VERRALL ABSTRACT This paper extends the methods introduced in England & Verrall (00), and shows how predictive

More information

A distribution-based stochastic model of cohort life expectancy, with applications

A distribution-based stochastic model of cohort life expectancy, with applications A distribution-based stochastic model of cohort life expectancy, with applications David McCarthy Demography and Longevity Workshop CEPAR, Sydney, Australia 26 th July 2011 1 Literature review Traditional

More information

Handling missing data in large data sets. Agostino Di Ciaccio Dept. of Statistics University of Rome La Sapienza

Handling missing data in large data sets. Agostino Di Ciaccio Dept. of Statistics University of Rome La Sapienza Handling missing data in large data sets Agostino Di Ciaccio Dept. of Statistics University of Rome La Sapienza The problem Often in official statistics we have large data sets with many variables and

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Analyzing Clinical Trial Data via the Bayesian Multiple Logistic Random Effects Model

Analyzing Clinical Trial Data via the Bayesian Multiple Logistic Random Effects Model Analyzing Clinical Trial Data via the Bayesian Multiple Logistic Random Effects Model Bartolucci, A.A 1, Singh, K.P 2 and Bae, S.J 2 1 Dept. of Biostatistics, University of Alabama at Birmingham, Birmingham,

More information

Nonnested model comparison of GLM and GAM count regression models for life insurance data

Nonnested model comparison of GLM and GAM count regression models for life insurance data Nonnested model comparison of GLM and GAM count regression models for life insurance data Claudia Czado, Julia Pfettner, Susanne Gschlößl, Frank Schiller December 8, 2009 Abstract Pricing and product development

More information

Forecasting German mortality using panel data procedures

Forecasting German mortality using panel data procedures B. Babel, E. Bomsdorf and R. Schmidt (2006) Forecasting German mortality using panel data procedures, to appear in the Journal of Population Economics. Forecasting German mortality using panel data procedures

More information

Java Modules for Time Series Analysis

Java Modules for Time Series Analysis Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series

More information

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle

Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases. Andreas Züfle Similarity Search and Mining in Uncertain Spatial and Spatio Temporal Databases Andreas Züfle Geo Spatial Data Huge flood of geo spatial data Modern technology New user mentality Great research potential

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Bayesian Statistics in One Hour. Patrick Lam

Bayesian Statistics in One Hour. Patrick Lam Bayesian Statistics in One Hour Patrick Lam Outline Introduction Bayesian Models Applications Missing Data Hierarchical Models Outline Introduction Bayesian Models Applications Missing Data Hierarchical

More information

Analysis of Financial Time Series

Analysis of Financial Time Series Analysis of Financial Time Series Analysis of Financial Time Series Financial Econometrics RUEY S. TSAY University of Chicago A Wiley-Interscience Publication JOHN WILEY & SONS, INC. This book is printed

More information

MODELLING AND ANALYSIS OF

MODELLING AND ANALYSIS OF MODELLING AND ANALYSIS OF FOREST FIRE IN PORTUGAL - PART I Giovani L. Silva CEAUL & DMIST - Universidade Técnica de Lisboa gsilva@math.ist.utl.pt Maria Inês Dias & Manuela Oliveira CIMA & DM - Universidade

More information

Association Between Variables

Association Between Variables Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi

More information

Department of Economics

Department of Economics Department of Economics On Testing for Diagonality of Large Dimensional Covariance Matrices George Kapetanios Working Paper No. 526 October 2004 ISSN 1473-0278 On Testing for Diagonality of Large Dimensional

More information

Income inequalities in Italy: trend over time

Income inequalities in Italy: trend over time Income inequalities in Italy: trend over time Loris Vergolini Descriptive evidence In this note we aim to deepen the OECD (2011) results that highlight a critical situation for Italy, providing the image

More information

Model Selection and Claim Frequency for Workers Compensation Insurance

Model Selection and Claim Frequency for Workers Compensation Insurance Model Selection and Claim Frequency for Workers Compensation Insurance Jisheng Cui, David Pitt and Guoqi Qian Abstract We consider a set of workers compensation insurance claim data where the aggregate

More information

Predict the Popularity of YouTube Videos Using Early View Data

Predict the Popularity of YouTube Videos Using Early View Data 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

A Bayesian hierarchical surrogate outcome model for multiple sclerosis

A Bayesian hierarchical surrogate outcome model for multiple sclerosis A Bayesian hierarchical surrogate outcome model for multiple sclerosis 3 rd Annual ASA New Jersey Chapter / Bayer Statistics Workshop David Ohlssen (Novartis), Luca Pozzi and Heinz Schmidli (Novartis)

More information

A successful market segmentation initiative answers the following critical business questions: * How can we a. Customer Status.

A successful market segmentation initiative answers the following critical business questions: * How can we a. Customer Status. MARKET SEGMENTATION The simplest and most effective way to operate an organization is to deliver one product or service that meets the needs of one type of customer. However, to the delight of many organizations

More information

INTEREST RATES AND FX MODELS

INTEREST RATES AND FX MODELS INTEREST RATES AND FX MODELS 8. Portfolio greeks Andrew Lesniewski Courant Institute of Mathematical Sciences New York University New York March 27, 2013 2 Interest Rates & FX Models Contents 1 Introduction

More information

Life Settlement Pricing

Life Settlement Pricing Life Settlement Pricing Yinglu Deng Patrick Brockett Richard MacMinn Tsinghua University University of Texas Illinois State University Life Settlement Description A life settlement is a financial arrangement

More information

Challenges of the World Population in the 21st Century.

Challenges of the World Population in the 21st Century. Challenges of the World Population in the 21st Century. The changing age structure of population and its consequences for development Panel discussion Thursday, 12 October 2006 Dag Hammarskjold Library

More information

171:290 Model Selection Lecture II: The Akaike Information Criterion

171:290 Model Selection Lecture II: The Akaike Information Criterion 171:290 Model Selection Lecture II: The Akaike Information Criterion Department of Biostatistics Department of Statistics and Actuarial Science August 28, 2012 Introduction AIC, the Akaike Information

More information

Predictive Modeling. Age. Sex. Y.o.B. Expected mortality. Model. Married Amount. etc. SOA/CAS Spring Meeting

Predictive Modeling. Age. Sex. Y.o.B. Expected mortality. Model. Married Amount. etc. SOA/CAS Spring Meeting watsonwyatt.com SOA/CAS Spring Meeting Application of Predictive Modeling in Life Insurance Jean-Felix Huet, ASA June 18, 28 Predictive Modeling Statistical model that relates an event (death) with a number

More information

Contents. List of Figures. List of Tables. List of Examples. Preface to Volume IV

Contents. List of Figures. List of Tables. List of Examples. Preface to Volume IV Contents List of Figures List of Tables List of Examples Foreword Preface to Volume IV xiii xvi xxi xxv xxix IV.1 Value at Risk and Other Risk Metrics 1 IV.1.1 Introduction 1 IV.1.2 An Overview of Market

More information

VOLUME 25, ARTICLE 5, PAGES 173-214 PUBLISHED 15 JULY 2011 http://www.demographic-research.org/volumes/vol25/5/ DOI: 10.4054/DemRes.2011.25.

VOLUME 25, ARTICLE 5, PAGES 173-214 PUBLISHED 15 JULY 2011 http://www.demographic-research.org/volumes/vol25/5/ DOI: 10.4054/DemRes.2011.25. Demographic Research a free, expedited, online journal of peer-reviewed research and commentary in the population sciences published by the Max Planck Institute for Demographic Research Konrad-Zuse Str.

More information

THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE

THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE THE USE OF STATISTICAL DISTRIBUTIONS TO MODEL CLAIMS IN MOTOR INSURANCE Batsirai Winmore Mazviona 1 Tafadzwa Chiduza 2 ABSTRACT In general insurance, companies need to use data on claims gathered from

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods

Lecture 2 ESTIMATING THE SURVIVAL FUNCTION. One-sample nonparametric methods Lecture 2 ESTIMATING THE SURVIVAL FUNCTION One-sample nonparametric methods There are commonly three methods for estimating a survivorship function S(t) = P (T > t) without resorting to parametric models:

More information

A spreadsheet Approach to Business Quantitative Methods

A spreadsheet Approach to Business Quantitative Methods A spreadsheet Approach to Business Quantitative Methods by John Flaherty Ric Lombardo Paul Morgan Basil desilva David Wilson with contributions by: William McCluskey Richard Borst Lloyd Williams Hugh Williams

More information

Chenfeng Xiong (corresponding), University of Maryland, College Park (cxiong@umd.edu)

Chenfeng Xiong (corresponding), University of Maryland, College Park (cxiong@umd.edu) Paper Author (s) Chenfeng Xiong (corresponding), University of Maryland, College Park (cxiong@umd.edu) Lei Zhang, University of Maryland, College Park (lei@umd.edu) Paper Title & Number Dynamic Travel

More information

APPLIED MISSING DATA ANALYSIS

APPLIED MISSING DATA ANALYSIS APPLIED MISSING DATA ANALYSIS Craig K. Enders Series Editor's Note by Todd D. little THE GUILFORD PRESS New York London Contents 1 An Introduction to Missing Data 1 1.1 Introduction 1 1.2 Chapter Overview

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

7 Time series analysis

7 Time series analysis 7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are

More information

ACTUARIAL MATHEMATICS FOR LIFE CONTINGENT RISKS

ACTUARIAL MATHEMATICS FOR LIFE CONTINGENT RISKS ACTUARIAL MATHEMATICS FOR LIFE CONTINGENT RISKS DAVID C. M. DICKSON University of Melbourne MARY R. HARDY University of Waterloo, Ontario V HOWARD R. WATERS Heriot-Watt University, Edinburgh CAMBRIDGE

More information

Statistical Bulletin. National Life Tables, United Kingdom, 2011-2013. Key Points. Summary. Introduction

Statistical Bulletin. National Life Tables, United Kingdom, 2011-2013. Key Points. Summary. Introduction Statistical Bulletin National Life Tables, United Kingdom, 2011-2013 Coverage: UK Date: 25 September 2014 Geographical Area: Country Theme: Population Key Points A newborn baby boy could expect to live

More information

Bayesian prediction of disability insurance frequencies using economic indicators

Bayesian prediction of disability insurance frequencies using economic indicators Bayesian prediction of disability insurance frequencies using economic indicators Catherine Donnelly Heriot-Watt University, Edinburgh, UK Mario V. Wüthrich ETH Zurich, RisLab, Department of Mathematics,

More information

QUALITY ENGINEERING PROGRAM

QUALITY ENGINEERING PROGRAM QUALITY ENGINEERING PROGRAM Production engineering deals with the practical engineering problems that occur in manufacturing planning, manufacturing processes and in the integration of the facilities and

More information

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg

SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg SPSS TRAINING SESSION 3 ADVANCED TOPICS (PASW STATISTICS 17.0) Sun Li Centre for Academic Computing lsun@smu.edu.sg IN SPSS SESSION 2, WE HAVE LEARNT: Elementary Data Analysis Group Comparison & One-way

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach

Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach Paid and Unpaid Work inequalities 1 Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach

More information

Multiple Imputation for Missing Data: A Cautionary Tale

Multiple Imputation for Missing Data: A Cautionary Tale Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust

More information

Hierarchical Insurance Claims Modeling

Hierarchical Insurance Claims Modeling Hierarchical Insurance Claims Modeling Edward W. (Jed) Frees, University of Wisconsin - Madison Emiliano A. Valdez, University of Connecticut 2009 Joint Statistical Meetings Session 587 - Thu 8/6/09-10:30

More information

Optimizing Prediction with Hierarchical Models: Bayesian Clustering

Optimizing Prediction with Hierarchical Models: Bayesian Clustering 1 Technical Report 06/93, (August 30, 1993). Presidencia de la Generalidad. Caballeros 9, 46001 - Valencia, Spain. Tel. (34)(6) 386.6138, Fax (34)(6) 386.3626, e-mail: bernardo@mac.uv.es Optimizing Prediction

More information

Bootstrapping Big Data

Bootstrapping Big Data Bootstrapping Big Data Ariel Kleiner Ameet Talwalkar Purnamrita Sarkar Michael I. Jordan Computer Science Division University of California, Berkeley {akleiner, ameet, psarkar, jordan}@eecs.berkeley.edu

More information

Financial Mathematics and Simulation MATH 6740 1 Spring 2011 Homework 2

Financial Mathematics and Simulation MATH 6740 1 Spring 2011 Homework 2 Financial Mathematics and Simulation MATH 6740 1 Spring 2011 Homework 2 Due Date: Friday, March 11 at 5:00 PM This homework has 170 points plus 20 bonus points available but, as always, homeworks are graded

More information

Probabilistic user behavior models in online stores for recommender systems

Probabilistic user behavior models in online stores for recommender systems Probabilistic user behavior models in online stores for recommender systems Tomoharu Iwata Abstract Recommender systems are widely used in online stores because they are expected to improve both user

More information

Centre for Central Banking Studies

Centre for Central Banking Studies Centre for Central Banking Studies Technical Handbook No. 4 Applied Bayesian econometrics for central bankers Andrew Blake and Haroon Mumtaz CCBS Technical Handbook No. 4 Applied Bayesian econometrics

More information

Validation of Software for Bayesian Models Using Posterior Quantiles

Validation of Software for Bayesian Models Using Posterior Quantiles Validation of Software for Bayesian Models Using Posterior Quantiles Samantha R. COOK, Andrew GELMAN, and Donald B. RUBIN This article presents a simulation-based method designed to establish the computational

More information

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

More information

The primary goal of this thesis was to understand how the spatial dependence of

The primary goal of this thesis was to understand how the spatial dependence of 5 General discussion 5.1 Introduction The primary goal of this thesis was to understand how the spatial dependence of consumer attitudes can be modeled, what additional benefits the recovering of spatial

More information

THE MATHEMATICS OF LIFE INSURANCE THE NET SINGLE PREMIUM. - Investigate the component parts of the life insurance premium.

THE MATHEMATICS OF LIFE INSURANCE THE NET SINGLE PREMIUM. - Investigate the component parts of the life insurance premium. THE MATHEMATICS OF LIFE INSURANCE THE NET SINGLE PREMIUM CHAPTER OBJECTIVES - Discuss the importance of life insurance mathematics. - Investigate the component parts of the life insurance premium. - Demonstrate

More information

Nonlinear Regression:

Nonlinear Regression: Zurich University of Applied Sciences School of Engineering IDP Institute of Data Analysis and Process Design Nonlinear Regression: A Powerful Tool With Considerable Complexity Half-Day : Improved Inference

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data

Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data Modeling the Distribution of Environmental Radon Levels in Iowa: Combining Multiple Sources of Spatially Misaligned Data Brian J. Smith, Ph.D. The University of Iowa Joint Statistical Meetings August 10,

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University caizhua@gmail.com

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University caizhua@gmail.com Bayesian Machine Learning (ML): Modeling And Inference in Big Data Zhuhua Cai Google Rice University caizhua@gmail.com 1 Syllabus Bayesian ML Concepts (Today) Bayesian ML on MapReduce (Next morning) Bayesian

More information

LONG TERM CARE INSURANCE IN ITALY

LONG TERM CARE INSURANCE IN ITALY XXVII ICA - Cancun, 2002 Health Seminar Critical Issues in Managing Long Term Care Insurance LONG TERM CARE INSURANCE IN ITALY Ermanno Pitacco University of Trieste Outline The scenario The need for LTC

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

CHAPTER 2 Estimating Probabilities

CHAPTER 2 Estimating Probabilities CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2016. Tom M. Mitchell. All rights reserved. *DRAFT OF January 24, 2016* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is a

More information

Supplement to Call Centers with Delay Information: Models and Insights

Supplement to Call Centers with Delay Information: Models and Insights Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290

More information

Monte Carlo Methods and Models in Finance and Insurance

Monte Carlo Methods and Models in Finance and Insurance Chapman & Hall/CRC FINANCIAL MATHEMATICS SERIES Monte Carlo Methods and Models in Finance and Insurance Ralf Korn Elke Korn Gerald Kroisandt f r oc) CRC Press \ V^ J Taylor & Francis Croup ^^"^ Boca Raton

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Fundamentals of Actuarial Mathematics. 3rd Edition

Fundamentals of Actuarial Mathematics. 3rd Edition Brochure More information from http://www.researchandmarkets.com/reports/2866022/ Fundamentals of Actuarial Mathematics. 3rd Edition Description: - Provides a comprehensive coverage of both the deterministic

More information

Marketing Mix Modelling and Big Data P. M Cain

Marketing Mix Modelling and Big Data P. M Cain 1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored

More information

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear

More information

Insurance: Mathematics and Economics

Insurance: Mathematics and Economics Insurance: Mathematics and Economics 45 (2009) 255 270 Contents lists available at ScienceDirect Insurance: Mathematics and Economics journal homep: www.elsevier.com/locate/ime On -period-cohort parametric

More information

IMPORTANCE OF QUANTITATIVE TECHNIQUES IN MANAGERIAL DECISIONS

IMPORTANCE OF QUANTITATIVE TECHNIQUES IN MANAGERIAL DECISIONS IMPORTANCE OF QUANTITATIVE TECHNIQUES IN MANAGERIAL DECISIONS Abstract The term Quantitative techniques refers to the methods used to quantify the variables in any discipline. It means the application

More information

PS 271B: Quantitative Methods II. Lecture Notes

PS 271B: Quantitative Methods II. Lecture Notes PS 271B: Quantitative Methods II Lecture Notes Langche Zeng zeng@ucsd.edu The Empirical Research Process; Fundamental Methodological Issues 2 Theory; Data; Models/model selection; Estimation; Inference.

More information

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing! MATH BOOK OF PROBLEMS SERIES New from Pearson Custom Publishing! The Math Book of Problems Series is a database of math problems for the following courses: Pre-algebra Algebra Pre-calculus Calculus Statistics

More information