Probabilistic concepts of risk classification in insurance
|
|
|
- Clemence Summers
- 10 years ago
- Views:
Transcription
1 Probabilistic concepts of risk classification in insurance Emiliano A. Valdez Michigan State University East Lansing, Michigan, USA joint work with Katrien Antonio* * K.U. Leuven 7th International Workshop on Applied Probability (IWAP2014) June 2014, Antalya, TURKEY E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
2 Introduction Ratemaking and risk classification Insurance ratemaking and risk classification Ratemaking (or pricing): a major task of an actuary calculate a predetermined price in exchange for the uncertainty probability of occurrence, timing, financial impact Risk classification the art and science of grouping insureds into homogeneous (similar), independent risks the same premium cannot be applied for all insured risks in the portfolio good risks may feel paying too much and leave the company; bad risks may favor uniform price and prefer to stay spiral effect of having a disproportionate number of bad risks to stay in business, you keep increasing premium E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
3 Introduction Ratemaking and risk classification Risk classification Risk classification system must: lead to fairness among insured individuals ensure the financial soundness of the insurance company What risk classification is not: about predicting the experience for an individual risk: impossible and unnecessary should not reward or penalize certain classes of individuals at the expense of others See American Academy of Actuaries (AAA) Risk Classification Statement of Principles E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
4 Introduction Ratemaking and risk classification Statistical or actuarial considerations Constructing a risk classification system involves the selection of classifying or rating variables which must meet certain actuarial criteria: the rating variable must be accurate in the sense that it has a direct impact on costs the rating variable must meet homogeneity requirement in the sense that the resulting expected costs within a class are reasonably similar the rating variable must be statistically credible and reliable E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
5 Introduction a priori vs a posteriori a priori vs a posteriori With a priori risk classification, the actuary lacks (individual) measurable information about the policyholder to make a more informed decision: unable to identify all possible important factors especially the unobservable or the unmeasurable makes it more difficult to achieve a more homogeneous classification With a posteriori risk classification, the actuary makes use of an experience rating mechanism: premiums are re-evaluated by taking into account the history of claims of the insured the history of claims provide additional information about the driver s unobservable factors E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
6 Introduction statistical techniques Statistical techniques of risk classification a priori techniques: (ordinary) linear regression, e.g. Lemaire (1985) on automobile insurance Generalized Linear Models (GLMs) Generalized Additive Models (GAMs) Generalized count distribution models and heavy-tailed regression a posteriori techniques: experience rating schemes: No Claim Discounts, Bonus-Malus models for clustered data (panel data, multilevel data models) estimation methods: likelihood-based, Bayesian use of Markov chain models E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
7 A priori methods Observable data for a priori rating For existing portfolios, insurers typically keep track of frequency and severity data: Policyholder file: underwriting information about the insured and its coverage (e.g. age, gender, policy information such as coverage, deductibles and limitations) Claims file: information about claims filed to the insurer together with amounts and payments made For each insured i, we can write the observable data as {N i, E i, y i, x i } where N i is the number of claims and the total period of exposure E i during which these claims were observed, y i = (y i1,..., y ini ) is the vector of individual losses, and x i is the set of potential explanatory variables. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
8 A priori methods pure premium Pure premium: claim frequency and claim severity Define the aggregate loss as L i = y i1 + + y ini so that frequency and severity data can be combined into a pure premium as P i = L i = N i L i = F i S i, E i E i N i where F i refers to the claim frequency per unit of exposure and S i is the claim severity for a given loss. To determine the price, some premium principle can be applied (e.g. expected value): π[p i ] = E[P i ] = E[F i ] E[S i ]. For each frequency and severity component, the explanatory variables will be injected. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
9 A priori methods GLM Current practice: generalized linear models Canonical density from the exponential family: [ ] yθ ψ(θ) f(y) = exp + c(y, φ), φ where ψ( ) and c( ) are known functions, θ and φ are the natural and scale parameters, respectively. Members include, but not limited to, the Normal, Poisson, Binomial and the Gamma distributions. May be used to model either the frequency (count) or the severity (amount). The following are well-known: µ = E[Y ] = ψ (θ) and Var[Y ] = φψ (θ) = φv (µ), where the derivatives are with respect to θ and V ( ) is the variance function. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
10 A priori methods GLM Claim frequency models The Poisson distribution model: Pr(N i = n i ) = exp ( λ i)λ n i i, n i! Risk classification variables can be introduced through the mean parameter The Negative Binomial model: Pr(N i = n i ) = Γ(α + n i) Γ(α)n i! λ i = E i exp (x iβ). ( ) α α ( ) ni λi, λ i + α λ i + α where α = τ/µ. Risk classification variables can be built through µ i = E i exp (x iβ), or through the use of a Poisson mixture with N i Poi(λ i θ) with λ i = E i exp (x iβ) and θ Γ(τ/µ, τ/µ). E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
11 A priori methods GLM Illustration for claim counts Claim counts are modeled for an automobile insurance data set with 159,947 policies. No classification variables considered here. No. of Claims Observed Frequency Poisson Frequency NB Frequency 0 145, , , ,910 13,902 12, , , > log Lik. 101, ,314 AIC 101, ,318 E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
12 A priori methods generalized count Generalized count distributions Mixtures The NB distribution is indeed a mixture of Poisson. Other continuous mixtures of the Poisson include the Poisson-Inverse Gaussian ( PIG ) distribution and the Poisson-LogNormal ( PLN ) distribution. Panjer and Willmot (1992). Zero-inflated models Here, N = 0 with probability p and N has distribution Pr(N = n θ) with probability 1 p. This gives the following ZI distributional specification: { p + (1 p)pr(n = 0 θ), n = 0, Pr ZI (N = n p, θ) = (1 p)pr(n = n θ), n > 0. Hurdle models For hurdle models, Pr Hur (N = 0 p, θ) = p, Pr Hur (N = n p, θ) = 1 p Pr(N = n θ), n > 0 1 Pr(0 θ) E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
13 A priori methods generalized count Illustration with ZI and hurdle Poisson models Using the same set of data earlier introduced. Still no classification variables considered here. No. of Claims Observed NB ZI Poisson Hurdle Poisson 0 145, , , , ,910 12,899 12,858 13, ,234 1,225 1,295 1, > log Lik. 101, , ,910 AIC 101, , ,914 E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
14 A priori methods risk classification Introducing risk classification in ZI and hurdle models The common procedure is to introduce regressor variables through the mean parameter using for example µ i = E i exp (x iβ) and for the zero-part, use a logistic regression of the form p i = exp (z i γ) 1 + exp (z i γ) where x i and z i are sets of regressor variables. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
15 A priori methods risk classification Risk classification variables For the automobile insurance data, description of covariates used: Covariate Vehicle Age Cubic Capacity Tonnage Private CompCov SexIns AgeIns Experience NCD TLength Description The age of the vehicle in years. Vehicle capacity for cars and motors. Vehicle capacity for trucks. 1 if vehicle is used for private purpose, 0 otherwise. 1 if cover is comprehensive, 0 otherwise. 1 if driver is female, 0 if male. Age of the insured. Driving experience of the insured. 1 if there is no No Claims Discount, 0 if discount is present. This is based on previous accident record of the policyholder. The higher the discount, the better the prior accident record. (Exposure) Number of calendar years during which claim counts are registered. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
16 A priori methods risk classification Parameter estimates for various count models Poisson NB ZIP Parameter Estimate (s.e.) Estimate (s.e.) Estimate (s.e.) Regression Coefficients: Positive Part Intercept (0.0621) (0.0635) (0.1311) Sex Insured female (0.022) (0.0226) not used male ref. group Age Vehicle 2 years (0.0195) (0.02) (0.02) > 2 and 8 years ref. group > 8 years (0.0238) (0.024) (0.0244) Age Insured 28 years (0.0265) (0.027) 0.34 (0.0273) > 28 years and 35 years (0.0203) (0.0209) (0.0208) > 35 and 68 years ref. group > 68 years (0.0882) (0.0897) (0.0895) Private Car Yes (0.0542) (0.0554) (0.0554) Capacity of Car 1500 ref. group > (0.0168) (0.0173) (0.0172) Capacity of Truck 1 ref. group > (0.0635) (0.065) (0.065) Comprehensive Cover Yes (0.0321) (0.0327) (0.1201) No Claims Discount No (0.0175) (0.0181) (0.018) Driving Experience of Insured 5 years (0.0251) (0.0259) (0.0258) > 5 and 10 years (0.0202) (0.0207) (0.0207) > 10 years ref. group Extra Par. ˆα = Regression Coefficients: Zero Part Intercept (0.301)) Comprehensive Cover Yes (0.3057) Sex Insured female (0.068) male ref. group Summary -2 Log Likelihood 98,326 98,161 98,167 AIC 98,356 98,191 98,199 E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
17 A priori methods risk classification Additive regression models Generalized additive models (GAMs) allow for more flexible relations between the response and a set of covariates. For example: log µ i = η i = Exposure + β 0 + β 1 I(Sex = F) + β 2 I(NCD = 0) + β 3 I(Cover = C) + β 4 I(Private = 1) + f 1 (VAge) + f 2 (VehCapCubic) + f 3 (Experience) + f 4 (AgeInsured). E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
18 A priori methods risk classification Additive effects in a Poisson GAM - illustration E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
19 A priori methods severity models Some claim severity models Distribution Density f(y) Conditional Mean E[Y ] Gamma Inverse Gaussian Lognormal 1 Γ(α) βα y α 1 e βy ( ) λ 1/2 [ ] λ(y µ) 2 2πy 3 exp 2µ 2 y [ 1 exp 1 ( ) ] log y µ 2 2πσy 2 σ α β = exp (x γ) µ = exp (x γ) exp (µ + 12 ) σ2 with µ = exp (x γ) E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
20 A priori methods severity models Parameter estimates for various severity models Gamma Inverse Gaussian Lognormal Parameter Estimate (s.e.) Estimate (s.e.) Estimate (s.e.) Intercept (0.0339) (0.0682) (0.0391) Sex Insured female not sign. not. sign. not sign. male Age Vehicle 2 years ref. group > 2 and 8 years ref. group > 8 years (0.02) (0.0428) (0.0229) Age Insured 28 years not sign. not sign. not sign. > 28 years and 35 years > 35 and 68 years > 68 years Private Car Yes (0.0348) (0.0697) (0.04) Capacity of Car 1500 ref. group ref. group ref. group > 1500 and (0.0183) (0.04) (0.021) > (0.043) (0.1016) (0.0498) Capacity of Truck 1 not sign. not sign. not sign. > 1 Comprehensive Cover Yes not sign. not sign. not sign. No Claims Discount No (0.0178) (0.039) (0.0205) Driving Experience of Insured 5 years not sign. not sign. not sign. > 5 and 10 years > 10 years ref. group Extra Par. ˆα = ˆλ = ˆσ = Summary -2 Log Likelihood 267, , ,633 AIC 267, , ,647 E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
21 A priori methods severity models Other flexible parametric models for claim severity The cumulative distribution functions for the Burr Type XII and the GB2 distribution are given, respectively by ( ) β λ F Burr,Y (y) = 1 β + y τ, y > 0, β, λ, τ > 0, and ( ) (y/b) a F GB2,Y (y) = B 1 + (y/b) a ; p, q, y > 0, a 0, b, p, q > 0, where B(, ) is the incomplete Beta function. If the available covariate information is denoted by x, it is straightforward to allow one or more of the parameters to vary with x. The result can be called a Burr or a GB2 regression model. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
22 A priori methods severity models Fire insurance portfolio Burr (τ) Burr (β) GB2 (b) GB2 (a) Parameter Estimate (s.e.) Estimate (s.e.) Estimate (s.e.) Estimate (s.e.) Intercept 0.46 (0.073) (0.316) (0.349) (0.002) Type (0.058) (0.326) -2.5 (0.327) (0.002) (0.06) (0.325) (0.317) (0.002) (0.17) (0.627) (0.682) (0.003) (0.055) (0.303) (0.3) (0.002) (0.067) (0.376) (0.37) (0.003) Type 1*SI (0.025) (0.152) (0.154) (0.001) 2*SI (0.028) (0.174) (0.18) (0.001) 3*SI (0.067) (0.345) (0.326) (0.001) 4*SI (0.464) (1.429) (1.626) (0.006) 5*SI (0.027) (0.17) (0.169) (0.001) 6*SI (0.037) (0.223) (0.235) (0.001) β ( ) λ (0.04) (0.037) τ (0.071) a (0.045) b (0.114) p (0.12) (0.099) q (0.12) 357 (0.132) E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
23 A priori methods severity models Fire insurance portfolio: residual QQ plots Burr Regression in tau Burr Regression in beta Res Res Quant Quant GB2 Regression in b GB2 Regression in a Res Res Quant Quant E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
24 A posteriori methods A posteriori risk classification When constructing an a priori tariff structure, not all important risk factors may be observable. usually the situation for either a new policyholder or an existing one with insufficient information the result is lack of many important risk factors to meet the homogeneity requirement For a posteriori risk classification, the premiums are adjusted to account for the available history of claims experience. use of an experience rating mechanism - a long tradition in actuarial science the premise is that the claims history reveals more of the factors or characteristics that were previously unobservable the challenge is to optimally mix the individual claims experience and that of the group to which the individual belongs credibility theory - a well developed area of study in actuarial science E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
25 A posteriori methods GLMM Generalized linear mixed models GLMMs are extensions to GLMs allowing for random, or subject-specific, effects in the linear predictor. Consider M subjects with each subject i (1 i M), T i observations are available. Given the vector b i, the random effects for subject (or cluster) i, the repeated measurements Y i1,..., Y iti are assumed independent with density from the exponential family ( ) yit θ it ψ(θ it ) f(y it b i, β, φ) = exp + c(y it, φ), t = 1,..., T i, φ and the following (conditional) relations hold µ it = E[Y it b i ] = ψ (θ it ) and Var[Y it b i ] = φψ (θ it ) = φv (µ it ) where g(µ it ) = x it β + z it b i. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
26 A posteriori methods GLMM The random effects Specification of the GLMM is completed by assuming that b i (i = 1,..., M) are mutually independent and identically distributed with density f(b i α). α denotes the unknown parameters in the density. common to assume the random effects have a (multivariate) normal distribution with zero mean and covariance matrix determined by α dependence between observations on the same subject arises because they share the same random effects b i. The likelihood function for the unknown parameters is M L(β, α, φ; y) = f(y i α, β, φ) = i=1 M i=1 T i t=1 f(y it b i, β, φ)f(b i α)db i. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
27 A posteriori methods GLMM Poisson GLMM Let N it be the claim frequency in year t for policyholder i. Assume that, conditional on b i, N it follows a Poisson with mean E[N it b i ] = exp (x it β + b i) and that b i N(0, σ 2 b ). Straightforward calculations lead to and Var(N it ) = Var(E(N it b i )) + E(Var(N it b i )) = E(N it )(exp (x itβ)[exp (3σb 2 /2) exp (σ2 b /2)] + 1), Cov(N it1, N it2 ) = Cov(E(N it1 b i ), E(N it2 b i )) + E(Cov(N it1, N it2 b i )) = exp (x it 1 β) exp (x it 2 β)(exp (2σ 2 b ) exp (σ2 b )). We used the expressions for the mean and variance of a Lognormal distribution. For the covariance we used the fact that, given the random effect b i, N it1 and N it2 are independent. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
28 A posteriori methods GLMM Poisson GLMM - continued Now, if we assume that, conditional on b i, N it follows a Poisson distribution with mean E[N it b i ] = exp (x it β + b i) and that b i N( σ2 b 2, σ 2 b ). This re-parameterization is commonly used in ratemaking. Indeed, we now get ( ) E[N it ] = E[E[N it b i ]] = exp x itβ σ2 b 2 + σ2 b = exp (x 2 itβ), and E[N it b i ] = exp (x inβ + b i ). This specification shows that the a priori premium, given by exp (x itβ), is correct on the average. The a posteriori correction to this premium is determined by exp (b i ). E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
29 A posteriori methods GLMM Numerical illustration Data consist of 12,893 policyholders observed during (fractions of) the period Let N it be the number of claims registered for policyholder i in period t. The model specification: N it b i Poi(µ it b i ) and µ it b i = e it exp (x itβ + b i ) b i N( σ 2 /2, σ 2 ), The a priori premium is given by (a priori) E[N it ] = e it exp (x itβ). The a posteriori premium is given by: (a posteriori) E[N it b i ] = e it exp (x itβ + b i ). The ratio of the two is called the theoretical Bonus-Malus Factor (BMF). It reflects the extent to which the policyholder is rewarded or penalized for past claims. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
30 A posteriori methods GLMM Figure 5 Conditional distribution b_i, given Y_i1,...,Y_in_i A priori (red) and a posteriori (grey) premiums Effect b_i Policy holder i=1,..., Policy holder i=1,...,20 Left panel: Boxplot of the conditional distribution of b i, given the history N i1,..., N ini, for a random selection of 20 policyholders. Right panel: For the same selection of policyholders: boxplots with simulations from the a priori (red) and a posteriori (grey) premium. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
31 Remarks Remarks This paper makes several distinctions in the modeling aspects involved in ratemaking: a priori vs a posteriori risk classification in ratemaking claim frequency and claim severity make up for the calculation of a pure premium the form of the data that may be recorded, become available to the insurance company and are used for calibrating models: a priori: the data usually are cross-sectional a posteriori: the recorded data may come in various layers: multilevel (e.g. panel, longitudinal) or other types of clustering, transitions for bonus-malus schemes E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
32 Life insurance Risk classification in life insurance Gschlossl, S., Schoenmaekers, P., Denuit, M., 2011, Risk classification in life insurance: methodology and case study, European Actuarial Journal, 1: Start with n invidiuals all aged x, observed a period of time and during this period, each individual is either dead or alive: { 1, if individual i dies, δ i = 0, otherwise Let τ i be the time spent by the individual i during the period. In summary, we observe n independent and identically distributed observations (δ i, τ i ) for i = 1, 2,..., n. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
33 Life insurance Poisson model Poisson model If the individual is alive, his contribution to the likelihood is exp( τ i µ x ). If dead, his contribution is µ x exp( τ i µ x ). Thus the aggregate likelihood contribution of all individuals observed can be expressed as L(µ x ) = n (µ x ) δ i exp( τ i µ x ) = (µ x ) dx exp( E x µ x ), i=1 where d x = n i=1 δ i is the total number of deaths and E x = n i=1 τ i is the total exposure. This likelihood is proportional to the likelihood of a Poisson number of deaths: D x Poisson(E x µ x ). E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
34 Life insurance Poisson regression Poisson regression model There is usually heterogeneity among the individual lives (age, gender, lifestyle, income, etc.) and this can be accounted for using a Poisson regression model. In this context, we would assume we have a set of covariates say x i = (1, x i1, x i2,..., x ik ), which here we include an intercept. We link these covariates to the death rates through a log-linear function as follows: log(µ i ) = x iβ The β coefficients in this case have the interpretation of a percentage change, in the case of a continuous covariate, or a percentage difference in the case of a binary covariate. E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
35 References Selected reference Antonio, K. and E.A. Valdez, 2010, Statistical concepts of a priori and a posteriori risk classification in insurance, Advances in Statistical Analysis, forthcoming special issue on actuarial statistics. Denuit, M., Marechal, X., Pitrebois, S. and J.-F. Walhin, 2007, Actuarial Modelling of Claim Counts: Risk Classification, Credibility and Bonus-Malus Systems, John Wiley, England. Finger, R.J., 2001, Risk Classification in Foundations of Casualty Actuarial Science, chapter 6, pages , Casualty Actuarial Society. Gschlossl, S., Schoenmaekers, P. and M. Denuit, 2011, Risk classification in life insurance: methodology and case study, European Actuarial Journal, 1: E.A. Valdez (Mich State Univ) IWAP2014, Antalya June / 35
A revisit of the hierarchical insurance claims modeling
A revisit of the hierarchical insurance claims modeling Emiliano A. Valdez Michigan State University joint work with E.W. Frees* * University of Wisconsin Madison Statistical Society of Canada (SSC) 2014
A Multilevel Analysis of Intercompany Claim Counts
A Multilevel Analysis of Intercompany Claim Counts Katrien Antonio Edward W. Frees Emiliano A. Valdez May 15, 2008 Abstract For the construction of a fair tariff structure in automobile insurance, insurers
Underwriting risk control in non-life insurance via generalized linear models and stochastic programming
Underwriting risk control in non-life insurance via generalized linear models and stochastic programming 1 Introduction Martin Branda 1 Abstract. We focus on rating of non-life insurance contracts. We
GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE
ACTA UNIVERSITATIS AGRICULTURAE ET SILVICULTURAE MENDELIANAE BRUNENSIS Volume 62 41 Number 2, 2014 http://dx.doi.org/10.11118/actaun201462020383 GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE Silvie Kafková
Hierarchical Insurance Claims Modeling
Hierarchical Insurance Claims Modeling Edward W. Frees University of Wisconsin Emiliano A. Valdez University of New South Wales 02-February-2007 y Abstract This work describes statistical modeling of detailed,
Statistical Analysis of Life Insurance Policy Termination and Survivorship
Statistical Analysis of Life Insurance Policy Termination and Survivorship Emiliano A. Valdez, PhD, FSA Michigan State University joint work with J. Vadiveloo and U. Dias Session ES82 (Statistics in Actuarial
Multivariate Negative Binomial Models for Insurance Claim Counts
Multivariate Negative Binomial Models for Insurance Claim Counts Peng Shi Division of Statistics Northern Illinois University DeKalb, IL 0 Email: [email protected] Emiliano A. Valdez Department of Mathematics
ESTIMATION OF CLAIM NUMBERS IN AUTOMOBILE INSURANCE
Annales Univ. Sci. Budapest., Sect. Comp. 42 (2014) 19 35 ESTIMATION OF CLAIM NUMBERS IN AUTOMOBILE INSURANCE Miklós Arató (Budapest, Hungary) László Martinek (Budapest, Hungary) Dedicated to András Benczúr
Longitudinal Modeling of Singapore Motor Insurance
Longitudinal Modeling of Singapore Motor Insurance Emiliano A. Valdez University of New South Wales Edward W. (Jed) Frees University of Wisconsin 28-December-2005 Abstract This work describes longitudinal
Lecture 8: Gamma regression
Lecture 8: Gamma regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Models with constant coefficient of variation Gamma regression: estimation and testing
Introduction to Predictive Modeling Using GLMs
Introduction to Predictive Modeling Using GLMs Dan Tevet, FCAS, MAAA, Liberty Mutual Insurance Group Anand Khare, FCAS, MAAA, CPCU, Milliman 1 Antitrust Notice The Casualty Actuarial Society is committed
Own Damage, Third Party Property Damage Claims and Malaysian Motor Insurance: An Empirical Examination
Australian Journal of Basic and Applied Sciences, 5(7): 1190-1198, 2011 ISSN 1991-8178 Own Damage, Third Party Property Damage Claims and Malaysian Motor Insurance: An Empirical Examination 1 Mohamed Amraja
How To Model A Car Accident For A Car Insurance Rating Model
Linear credibility models based on time series for claim counts Oana Purcaru 1 and Montserrat Guillén 2 and Michel Denuit 3 Abstract. In this paper, several models for longitudinal analysis of claim frequencies
Predictive Modeling Techniques in Insurance
Predictive Modeling Techniques in Insurance Tuesday May 5, 2015 JF. Breton Application Engineer 2014 The MathWorks, Inc. 1 Opening Presenter: JF. Breton: 13 years of experience in predictive analytics
Chapter 9 Experience rating
0 INTRODUCTION 1 Chapter 9 Experience rating 0 Introduction The rating process is the process of deciding on an appropriate level of premium for a particular class of insurance business. The contents of
Nonnested model comparison of GLM and GAM count regression models for life insurance data
Nonnested model comparison of GLM and GAM count regression models for life insurance data Claudia Czado, Julia Pfettner, Susanne Gschlößl, Frank Schiller December 8, 2009 Abstract Pricing and product development
Travelers Analytics: U of M Stats 8053 Insurance Modeling Problem
Travelers Analytics: U of M Stats 8053 Insurance Modeling Problem October 30 th, 2013 Nathan Hubbell, FCAS Shengde Liang, Ph.D. Agenda Travelers: Who Are We & How Do We Use Data? Insurance 101 Basic business
A Deeper Look Inside Generalized Linear Models
A Deeper Look Inside Generalized Linear Models University of Minnesota February 3 rd, 2012 Nathan Hubbell, FCAS Agenda Property & Casualty (P&C Insurance) in one slide The Actuarial Profession Travelers
The zero-adjusted Inverse Gaussian distribution as a model for insurance claims
The zero-adjusted Inverse Gaussian distribution as a model for insurance claims Gillian Heller 1, Mikis Stasinopoulos 2 and Bob Rigby 2 1 Dept of Statistics, Macquarie University, Sydney, Australia. email:
Computer exercise 4 Poisson Regression
Chalmers-University of Gothenburg Department of Mathematical Sciences Probability, Statistics and Risk MVE300 Computer exercise 4 Poisson Regression When dealing with two or more variables, the functional
Lecture 6: Poisson regression
Lecture 6: Poisson regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction EDA for Poisson regression Estimation and testing in Poisson regression
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
GLM I An Introduction to Generalized Linear Models
GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial
BayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
Statistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
Stochastic programming approaches to pricing in non-life insurance
Stochastic programming approaches to pricing in non-life insurance Martin Branda Charles University in Prague Department of Probability and Mathematical Statistics 11th International Conference on COMPUTATIONAL
Combining Linear and Non-Linear Modeling Techniques: EMB America. Getting the Best of Two Worlds
Combining Linear and Non-Linear Modeling Techniques: Getting the Best of Two Worlds Outline Who is EMB? Insurance industry predictive modeling applications EMBLEM our GLM tool How we have used CART with
GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,
Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus
Package insurancedata
Type Package Package insurancedata February 20, 2015 Title A Collection of Insurance Datasets Useful in Risk Classification in Non-life Insurance. Version 1.0 Date 2014-09-04 Author Alicja Wolny--Dominiak
**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.
**BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,
NUMBER OF ACCIDENTS OR NUMBER OF CLAIMS? AN APPROACH WITH ZERO-INFLATED POISSON MODELS FOR PANEL DATA
NUMBER OF ACCIDENTS OR NUMBER OF CLAIMS? AN APPROACH WITH ZERO-INFLATED POISSON MODELS FOR PANEL DATA Jean-Philippe Boucher [email protected] Département de Mathématiques Université du Québec
Offset Techniques for Predictive Modeling for Insurance
Offset Techniques for Predictive Modeling for Insurance Matthew Flynn, Ph.D, ISO Innovative Analytics, W. Hartford CT Jun Yan, Ph.D, Deloitte & Touche LLP, Hartford CT ABSTRACT This paper presents the
Actuarial Applications of a Hierarchical Insurance Claims Model
Introduction Applications of a Hierarchical Insurance Claims Edward W. (Jed) University of Wisconsin Madison and Insurance Services Office joint work with Peng Shi - Northern Illinois University Emiliano
More Flexible GLMs Zero-Inflated Models and Hybrid Models
More Flexible GLMs Zero-Inflated Models and Hybrid Models Mathew Flynn, Ph.D. Louise A. Francis FCAS, MAAA Motivation: GLMs are widely used in insurance modeling applications. Claim or frequency models
Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
Mixing internal and external data for managing operational risk
Mixing internal and external data for managing operational risk Antoine Frachot and Thierry Roncalli Groupe de Recherche Opérationnelle, Crédit Lyonnais, France This version: January 29, 2002 Introduction
STATISTICAL METHODS IN GENERAL INSURANCE. Philip J. Boland National University of Ireland, Dublin [email protected]
STATISTICAL METHODS IN GENERAL INSURANCE Philip J. Boland National University of Ireland, Dublin [email protected] The Actuarial profession appeals to many with excellent quantitative skills who aspire
A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA
REVSTAT Statistical Journal Volume 4, Number 2, June 2006, 131 142 A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA Authors: Daiane Aparecida Zuanetti Departamento de Estatística, Universidade Federal de São
A GENERALIZATION OF AUTOMOBILE INSURANCE RATING MODELS: THE NEGATIVE BINOMIAL DISTRIBUTION WITH A REGRESSION COMPONENT
WORKSHOP A GENERALIZATION OF AUTOMOBILE INSURANCE RATING MODELS: THE NEGATIVE BINOMIAL DISTRIBUTION WITH A REGRESSION COMPONENT BY GEORGES DIONNE and CHARLES VANASSE Universit~ de MontrEal, Canada * ABSTRACT
Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]
Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance
Approximation of Aggregate Losses Using Simulation
Journal of Mathematics and Statistics 6 (3): 233-239, 2010 ISSN 1549-3644 2010 Science Publications Approimation of Aggregate Losses Using Simulation Mohamed Amraja Mohamed, Ahmad Mahir Razali and Noriszura
Fitting the Belgian Bonus-Malus System
Fitting the Belgian Bonus-Malus System S. Pitrebois 1, M. Denuit 2 and J.F. Walhin 3 Abstract. We show in this paper how to obtain the relativities of the Belgian Bonus-Malus System, including the special
Automated Biosurveillance Data from England and Wales, 1991 2011
Article DOI: http://dx.doi.org/10.3201/eid1901.120493 Automated Biosurveillance Data from England and Wales, 1991 2011 Technical Appendix This online appendix provides technical details of statistical
Bayesian credibility methods for pricing non-life insurance on individual claims history
Bayesian credibility methods for pricing non-life insurance on individual claims history Daniel Eliasson Masteruppsats i matematisk statistik Master Thesis in Mathematical Statistics Masteruppsats 215:1
Logistic Regression (a type of Generalized Linear Model)
Logistic Regression (a type of Generalized Linear Model) 1/36 Today Review of GLMs Logistic Regression 2/36 How do we find patterns in data? We begin with a model of how the world works We use our knowledge
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
Quest for Optimal Bonus-Malus in Automobile Insurance in Developing Economies: An Actuarial Perspective
Quest for Optimal Bonus-Malus in Automobile Insurance in Developing Economies: An Actuarial Perspective Ade Ibiwoye, I. A. Adeleke & S. A. Aduloju Department of Actuarial Science & Insurance, University
i=1 In practice, the natural logarithm of the likelihood function, called the log-likelihood function and denoted by
Statistics 580 Maximum Likelihood Estimation Introduction Let y (y 1, y 2,..., y n be a vector of iid, random variables from one of a family of distributions on R n and indexed by a p-dimensional parameter
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
FAIR TRADE IN INSURANCE INDUSTRY: PREMIUM DETERMINATION OF TAIWAN AUTOMOBILE INSURANCE. Emilio Venezian Venezian Associate, Taiwan
FAIR TRADE IN INSURANCE INDUSTRY: PREMIUM DETERMINATION OF TAIWAN AUTOMOBILE INSURANCE Emilio Venezian Venezian Associate, Taiwan Chu-Shiu Li Department of Economics, Feng Chia University, Taiwan 100 Wen
Logistic Regression (1/24/13)
STA63/CBB540: Statistical methods in computational biology Logistic Regression (/24/3) Lecturer: Barbara Engelhardt Scribe: Dinesh Manandhar Introduction Logistic regression is model for regression used
Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring
Non-life insurance mathematics Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring Overview Important issues Models treated Curriculum Duration (in lectures) What is driving the result of a
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar
Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Prepared by Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com [email protected]
Innovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research
Innovations and Value Creation in Predictive Modeling David Cummings Vice President - Research ISO Innovative Analytics 1 Innovations and Value Creation in Predictive Modeling A look back at the past decade
UNIVERSITY OF OSLO. The Poisson model is a common model for claim frequency.
UNIVERSITY OF OSLO Faculty of mathematics and natural sciences Candidate no Exam in: STK 4540 Non-Life Insurance Mathematics Day of examination: December, 9th, 2015 Examination hours: 09:00 13:00 This
Classification Problems
Classification Read Chapter 4 in the text by Bishop, except omit Sections 4.1.6, 4.1.7, 4.2.4, 4.3.3, 4.3.5, 4.3.6, 4.4, and 4.5. Also, review sections 1.5.1, 1.5.2, 1.5.3, and 1.5.4. Classification Problems
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING Xavier Conort [email protected] Session Number: TBR14 Insurance has always been a data business The industry has successfully
Model Selection and Claim Frequency for Workers Compensation Insurance
Model Selection and Claim Frequency for Workers Compensation Insurance Jisheng Cui, David Pitt and Guoqi Qian Abstract We consider a set of workers compensation insurance claim data where the aggregate
Probability Calculator
Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that
Generalized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
EMPIRICAL RISK MINIMIZATION FOR CAR INSURANCE DATA
EMPIRICAL RISK MINIMIZATION FOR CAR INSURANCE DATA Andreas Christmann Department of Mathematics homepages.vub.ac.be/ achristm Talk: ULB, Sciences Actuarielles, 17/NOV/2006 Contents 1. Project: Motor vehicle
Poisson Models for Count Data
Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the
Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models
Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income
Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model
Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
How To Build A Predictive Model In Insurance
The Do s & Don ts of Building A Predictive Model in Insurance University of Minnesota November 9 th, 2012 Nathan Hubbell, FCAS Katy Micek, Ph.D. Agenda Travelers Broad Overview Actuarial & Analytics Career
Development Period 1 2 3 4 5 6 7 8 9 Observed Payments
Pricing and reserving in the general insurance industry Solutions developed in The SAS System John Hansen & Christian Larsen, Larsen & Partners Ltd 1. Introduction The two business solutions presented
Introduction to mixed model and missing data issues in longitudinal studies
Introduction to mixed model and missing data issues in longitudinal studies Hélène Jacqmin-Gadda INSERM, U897, Bordeaux, France Inserm workshop, St Raphael Outline of the talk I Introduction Mixed models
PRICING MODELS APPLIED IN NON-LIFE INSURANCE
ALEXANDRU IOAN CUZA UNIVERSITY OF IASI FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION DOCTORAL SCHOOL OF ECONOMICS AND BUSINESS ADMINISTRATION DOCTORAL THESIS PRICING MODELS APPLIED IN NON-LIFE INSURANCE
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
Exam C, Fall 2006 PRELIMINARY ANSWER KEY
Exam C, Fall 2006 PRELIMINARY ANSWER KEY Question # Answer Question # Answer 1 E 19 B 2 D 20 D 3 B 21 A 4 C 22 A 5 A 23 E 6 D 24 E 7 B 25 D 8 C 26 A 9 E 27 C 10 D 28 C 11 E 29 C 12 B 30 B 13 C 31 C 14
TABLE OF CONTENTS. GENERAL AND HISTORICAL PREFACE iii SIXTH EDITION PREFACE v PART ONE: REVIEW AND BACKGROUND MATERIAL
TABLE OF CONTENTS GENERAL AND HISTORICAL PREFACE iii SIXTH EDITION PREFACE v PART ONE: REVIEW AND BACKGROUND MATERIAL CHAPTER ONE: REVIEW OF INTEREST THEORY 3 1.1 Interest Measures 3 1.2 Level Annuity
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided
Math 370, Actuarial Problemsolving Spring 2008 A.J. Hildebrand. Practice Test, 1/28/2008 (with solutions)
Math 370, Actuarial Problemsolving Spring 008 A.J. Hildebrand Practice Test, 1/8/008 (with solutions) About this test. This is a practice test made up of a random collection of 0 problems from past Course
SUGI 29 Statistics and Data Analysis
Paper 194-29 Head of the CLASS: Impress your colleagues with a superior understanding of the CLASS statement in PROC LOGISTIC Michelle L. Pritchard and David J. Pasta Ovation Research Group, San Francisco,
An Extension Model of Financially-balanced Bonus-Malus System
An Extension Model of Financially-balanced Bonus-Malus System Other : Ratemaking, Experience Rating XIAO, Yugu Center for Applied Statistics, Renmin University of China, Beijing, 00872, P.R. China Phone:
MODELLING CRITICAL ILLNESS INSURANCE DATA
MODELLING CRITICAL ILLNESS INSURANCE DATA Howard Waters Joint work with: Erengul Dodd (Ozkok), George Streftaris, David Wilkie University of Piraeus, October 2014 1 Plan: 1. Critical Illness Insurance
CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
EDUCATION AND EXAMINATION COMMITTEE SOCIETY OF ACTUARIES RISK AND INSURANCE. Copyright 2005 by the Society of Actuaries
EDUCATION AND EXAMINATION COMMITTEE OF THE SOCIET OF ACTUARIES RISK AND INSURANCE by Judy Feldman Anderson, FSA and Robert L. Brown, FSA Copyright 25 by the Society of Actuaries The Education and Examination
1. A survey of a group s viewing habits over the last year revealed the following
1. A survey of a group s viewing habits over the last year revealed the following information: (i) 8% watched gymnastics (ii) 9% watched baseball (iii) 19% watched soccer (iv) 14% watched gymnastics and
A credibility method for profitable cross-selling of insurance products
Submitted to Annals of Actuarial Science manuscript 2 A credibility method for profitable cross-selling of insurance products Fredrik Thuring Faculty of Actuarial Science and Insurance, Cass Business School,
One-year reserve risk including a tail factor : closed formula and bootstrap approaches
One-year reserve risk including a tail factor : closed formula and bootstrap approaches Alexandre Boumezoued R&D Consultant Milliman Paris [email protected] Yoboua Angoua Non-Life Consultant
Combining GLM and datamining techniques for modelling accident compensation data. Peter Mulquiney
Combining GLM and datamining techniques for modelling accident compensation data Peter Mulquiney Introduction Accident compensation data exhibit features which complicate loss reserving and premium rate
Does smoking impact your mortality?
An estimated 25% of the medically underwritten, assured population can be classified as smokers Does smoking impact your mortality? Introduction Your smoking habits influence the premium that you pay for
CS 688 Pattern Recognition Lecture 4. Linear Models for Classification
CS 688 Pattern Recognition Lecture 4 Linear Models for Classification Probabilistic generative models Probabilistic discriminative models 1 Generative Approach ( x ) p C k p( C k ) Ck p ( ) ( x Ck ) p(
Pricing Alternative forms of Commercial Insurance cover
Pricing Alternative forms of Commercial Insurance cover Prepared by Andrew Harford Presented to the Institute of Actuaries of Australia Biennial Convention 23-26 September 2007 Christchurch, New Zealand
Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model
Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model Xavier Conort [email protected] Motivation Location matters! Observed value at one location is
Linear Classification. Volker Tresp Summer 2015
Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong
SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS
SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS Copyright 005 by the Society of Actuaries and the Casualty Actuarial Society
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
A Model of Optimum Tariff in Vehicle Fleet Insurance
A Model of Optimum Tariff in Vehicle Fleet Insurance. Bouhetala and F.Belhia and R.Salmi Statistics and Probability Department Bp, 3, El-Alia, USTHB, Bab-Ezzouar, Alger Algeria. Summary: An approach about
