EVALUATION OF PROBABILITY MODELS ON INSURANCE CLAIMS IN GHANA


E. J. Dadey
SSNIT, Research Department, Accra, Ghana

S. Ankrah, PhD Student
PGIA, University of Peradeniya, Sri Lanka

Abstract

This study investigates which probability distribution best fits the number of insurance claims. In particular, it compares the Poisson and negative binomial distributions to determine which better fits claim data obtained from two insurance companies in Ghana. Data on the number of claims under a funeral policy from 2006 to 2010 were used for the study. Probability distribution models and the parametric bootstrap method were employed in analyzing the data. The negative binomial distribution was found to be superior to the Poisson distribution in fitting the claims data. The results also revealed no significant difference between the estimates obtained from the probability models and the parametric bootstrap estimates.

Keywords: Poisson, Negative Binomial, Insurance Claims, Parametric Bootstrap

Introduction

Statistical methods are paramount in insurance because of the risk involved in allocating insurance funds. Insurance funds may be invested in assets such as bonds and equities, which helps to increase real investment returns in order to meet claim payments and other financial obligations. Appropriate statistical estimation is needed to obtain concrete information about the uncertain liabilities. This supports sound decisions on asset allocation, expected monthly claims and payment targets, as well as future insurance pricing. Policy holders expect a cushion in the event of economic loss, as stipulated in an insurance contract. The challenge of meeting the payment terms therefore becomes a major concern to the insurer.
A statistical estimate of the expected claim liabilities of the insurance policies enables decisions on asset allocation and claim payment to be taken with less error.

The main objective of the study is to explore probability models that adequately model the number of claims occurring under funeral insurance policies in Ghana, in order to estimate the expected number of claims. Specific objectives include, but are not limited to:

- To identify and explain seasonal variation in the number of claims.
- To compare the probability distribution estimates of the number of claims.
- To derive the probability distribution model that best fits the number of claims for the funeral policies.
- To construct bootstrap confidence intervals for the expected number of claims and compare them with the estimates obtained from the models.

METHODOLOGY

Secondary Data

The data were collected from Star Life and Metropolitan Life Insurance Company. They consist of the monthly recorded number of claims under a Family Funeral Insurance Policy over a period of five years (2006 to 2010). This policy was chosen because it is one of the most patronised insurance policies in Ghana. The two companies represent a major controlling force in Ghana's insurance sector.

Shapiro-Wilk test for Normality

The Shapiro-Wilk test, proposed by Shapiro and Wilk (1965), calculates a statistic $W$ that tests whether a random sample $x_1, x_2, \ldots, x_n$ comes from a normal distribution. Small values of $W$ are evidence of departure from normality.

$$W = \frac{\left(\sum_{i=1}^{n} a_i x_{(i)}\right)^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2}$$

where the $x_{(i)}$ are the ordered sample values ($x_{(1)}$ is the smallest), $\bar{x}$ is the sample mean, and the $a_i$ are constants generated from the means, variances and covariances of the order statistics of a sample of size $n$ from a normal distribution. The null hypothesis is rejected if $W$ is too small, that is, if the p-value is less than the significance level $\alpha$.

The Kolmogorov-Smirnov Test

The two-sided Kolmogorov-Smirnov test tests the null hypothesis that two samples $x_1, x_2, x_3, \ldots$ and $y_1, y_2, y_3, \ldots$ come from the same distribution. The test statistic is

$$D_{n_1, n_2} = \sup_x \left| F_{1,n_1}(x) - F_{2,n_2}(x) \right|$$

where $F_{1,n_1}$ and $F_{2,n_2}$ are the empirical distribution functions of the first and second sample respectively. The null hypothesis is rejected at level $\alpha$ if the p-value is less than $\alpha$.

The Poisson Process and Distribution Function

A Poisson process $N$ is a stochastic process, that is, a collection of random variables $N(t)$, one for each $t$ in some specified set. More specifically, Poisson processes are counting processes: for each $t > 0$ we count the number of "events" that happen between time 0 and time $t$. The kind of event depends on the application.
Examples include the number of insurance claims filed by a particular driver, the number of callers phoning a help line, or the number of people retiring from a particular employer. Whatever is meant by an "event", $N(t)$ denotes the number of events that occur after time 0 up to and including time $t > 0$.

The Poisson distribution arises from independent and identically exponentially distributed inter-arrival times between events and is defined as follows. Let $X$ be a random variable with a discrete distribution defined over $\mathbb{N} \cup \{0\} = \{0, 1, 2, 3, \ldots\}$. $X$ has a Poisson distribution with parameter $\lambda$, written $X \sim \text{Poisson}(\lambda)$, if and only if the probability function is given by

$$P(X = k) = \frac{e^{-\lambda} \lambda^{k}}{k!}, \qquad \lambda \in \mathbb{R}^{+}, \; k = 0, 1, 2, \ldots$$

The Poisson distribution has expected value $E(X) = \lambda$ and variance $Var(X) = \lambda$.
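As a numerical check of the definition above, the following minimal sketch evaluates the Poisson probability function and confirms that the mean and variance both equal $\lambda$. The rate `lam = 3.0` and the truncation of the support at 60 are illustrative assumptions, not values taken from the claims data.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k) for X ~ Poisson(lam), evaluated on the log scale for stability."""
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))


lam = 3.0               # illustrative rate (assumption, not an estimate from the paper)
support = range(0, 60)  # truncated support; the omitted tail is negligible for lam = 3

probs = [poisson_pmf(k, lam) for k in support]
total = sum(probs)
mean = sum(k * p for k, p in zip(support, probs))
var = sum((k - mean) ** 2 * p for k, p in zip(support, probs))

print(f"total={total:.6f} mean={mean:.6f} var={var:.6f}")
```

For a larger rate the truncation point would need to be raised so that the omitted tail stays negligible.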

The equality of the mean and variance, referred to as equidispersion, is characteristic of the Poisson distribution and serves as the reference point for modeling count data. Modeling count data with the Poisson distribution requires randomness and homogeneity of the data. If $X$ and $Y$ are independently Poisson distributed with $X \sim \text{Poisson}(\lambda)$ and $Y \sim \text{Poisson}(\mu)$, it follows that the random variable $Z = X + Y$ is Poisson distributed with $Z \sim \text{Poisson}(\lambda + \mu)$.

Negative Binomial Distribution

A discrete variable is negative binomially distributed if it is generated by an occurrence or duration dependence process, or if the rate at which events occur is heterogeneous. A random variable $X$ has a negative binomial distribution with parameters $\alpha > 0$ and $\theta > 0$, written $X \sim \text{Negbin}(\alpha, \theta)$, if the probability function is given by

$$P(X = k) = \frac{\Gamma(\alpha + k)}{\Gamma(\alpha)\,\Gamma(k + 1)} \left(\frac{1}{1 + \theta}\right)^{\alpha} \left(\frac{\theta}{1 + \theta}\right)^{k}, \qquad k = 0, 1, 2, \ldots$$

where $\Gamma(\cdot)$ denotes the gamma function, $\Gamma(s) = \int_0^{\infty} z^{s-1} e^{-z}\,dz$ for $s > 0$. The probability generating function is

$$P(s) = \left[1 + \theta(1 - s)\right]^{-\alpha}$$

The mean and variance are given by

$$E(X) = \alpha\theta \quad \text{and} \quad Var(X) = \alpha\theta(1 + \theta) = E(X)(1 + \theta)$$

The value of $\theta$ is called the dispersion parameter and measures the dispersion in the count data. Since $\theta > 0$, the variance of the negative binomial distribution exceeds its mean, hence it is overdispersed. If $X$ and $Y$ are independently negative binomial distributed with $X \sim \text{Negbin}(\lambda, \theta)$ and $Y \sim \text{Negbin}(\mu, \theta)$, it follows that the random variable $Z = X + Y$ is negative binomial distributed, i.e. $Z \sim \text{Negbin}(\lambda + \mu, \theta)$. The negative binomial is preferable to the Poisson distribution in claim modeling because it allows for overdispersion, which actual experience shows is commonly observed in insurance.

Goodness of Fit Test

Two statistics employed in assessing the goodness of fit of a given distribution are the scaled deviance and Pearson's chi-square statistic.
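Before turning to the fit statistics in detail, the negative binomial probability function defined earlier in this section can be checked numerically. The sketch below is a minimal illustration under assumed parameter values (`alpha = 2.5`, `theta = 1.5`, chosen for illustration only) and verifies that the mean is $\alpha\theta$ and the variance is $\alpha\theta(1 + \theta)$.

```python
import math


def negbin_pmf(k: int, alpha: float, theta: float) -> float:
    """P(X = k) for X ~ Negbin(alpha, theta) in the paper's parameterization."""
    log_p = (
        math.lgamma(alpha + k) - math.lgamma(alpha) - math.lgamma(k + 1)
        - alpha * math.log1p(theta)                     # (1 / (1 + theta)) ** alpha
        + k * (math.log(theta) - math.log1p(theta))     # (theta / (1 + theta)) ** k
    )
    return math.exp(log_p)


alpha, theta = 2.5, 1.5   # illustrative values (assumptions, not fitted estimates)
support = range(0, 400)   # truncated support; the geometric tail is negligible here

nb_probs = [negbin_pmf(k, alpha, theta) for k in support]
nb_total = sum(nb_probs)
nb_mean = sum(k * p for k, p in zip(support, nb_probs))
nb_var = sum((k - nb_mean) ** 2 * p for k, p in zip(support, nb_probs))

print(f"total={nb_total:.6f} mean={nb_mean:.6f} var={nb_var:.6f}")
```

Because the variance equals the mean multiplied by $(1 + \theta)$, any positive $\theta$ gives a variance strictly larger than the mean, which is the overdispersion property used throughout the paper.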
For a fixed value of the dispersion parameter $\theta$, the scaled deviance is defined to be twice the difference between the maximum achievable log-likelihood and the log-likelihood at the maximum likelihood estimates of the parameters. If $l(y, \mu)$ is the log-likelihood function expressed as a function of the predicted mean values $\mu$ and the vector $y$ of responses, then the scaled deviance is defined by

$$D^{*}(y, \mu) = 2\left[\,l(y, y) - l(y, \mu)\,\right]$$

which can be expressed for specific distributions as

$$D^{*}(y, \mu) = \frac{D(y, \mu)}{\theta}$$

The deviances $D(y, \mu)$ for the Poisson and negative binomial distributions are given respectively by

$$2 \sum_i w_i \left[ y_i \log\left(\frac{y_i}{\mu_i}\right) - (y_i - \mu_i) \right]$$

and

$$2 \sum_i w_i \left[ y_i \log\left(\frac{y_i}{\mu_i}\right) - \left(y_i + \frac{1}{k}\right) \log\left(\frac{y_i + 1/k}{\mu_i + 1/k}\right) \right]$$

where $w_i$ is the weight of observation $i$ and $k$ is the negative binomial dispersion parameter. The scaled deviance is approximately chi-square distributed with $n - 1$ degrees of freedom, where $n$ is the number of observations.

Parametric Bootstrap Estimation

The procedure for the bootstrap estimation is outlined as follows:

1. Given a random sample $X = (x_1, \ldots, x_n)$, estimate the appropriate probability distribution and calculate the desired parameter $\hat{\theta}$.
2. Draw a sample of size $n$ from the estimated probability distribution to obtain $X^{b} = (X_1^{b}, \ldots, X_n^{b})$.
3. Calculate the same statistic using the bootstrap sample from step 2 to get $\hat{\theta}^{b}$.
4. Repeat steps 2 through 3 $B$ times (i.e. the number of resamples desired).
5. Use this estimate of the distribution of $\hat{\theta}$ (i.e. the bootstrap replicates) to obtain the desired characteristics as follows:

$$\hat{\theta}^{*} = \frac{1}{B} \sum_{b=1}^{B} \hat{\theta}^{b}$$

$$SE_B(\hat{\theta}) = \sqrt{\frac{1}{B - 1} \sum_{b=1}^{B} \left(\hat{\theta}^{b} - \hat{\theta}^{*}\right)^{2}}$$

$$\widehat{bias}_B = \hat{\theta}^{*} - \hat{\theta}$$

and the bias-corrected estimator is given by

$$\hat{\theta}_c = \hat{\theta} - \widehat{bias}_B = 2\hat{\theta} - \hat{\theta}^{*}$$

Results And Conclusions

Seasonal Analysis

A study of the seasonal changes in the number of claims revealed an increasing pattern for both portfolios over the period. The highest number of claims was observed in July 2010 for StarLife and in March 2010 for MetLife. Several months between 2006 and 2007 recorded no claims. This is attributed to the policy having been introduced in 2005, so that by the end of 2006 few policies had been sold. Figure 4.1 is a box plot of the monthly recorded number of claims on a yearly basis for the years under study. The yearly distributions of the number of claims were all tailed and skewed to either the left or the right in the various years.
The annual distributions of the number of claims for MetLife were skewed to the right, while those of StarLife were skewed to the left in 2010 and to the right in the other years under review. The annual distributions were therefore not bell shaped (normally distributed) in any of the years but were all tailed and skewed.

Figure 4.1: Boxplot of Yearly Number of Claims on Funeral Policy from StarLife and MetLife.

Considering the annual distributions during the active years, by which time many of the policies had been sold, the 2008 distribution of the number of claims from MetLife was heavily tailed, right skewed and fat arched (platykurtic), while that of StarLife was slightly tailed, right skewed and slender arched (leptokurtic). The 2009 distribution was tailed, right skewed and slender arched (leptokurtic) for MetLife, and tailed, right skewed and normally peaked (mesokurtic) for StarLife. Finally, in 2010 the distributions for both portfolios were similar: tailed, asymmetric and slender arched (leptokurtic). Overall, the number of claims showed an increasing pattern across the years, with rising annual averages (see Table 4.1 for details).

Table 4.1: Trend of Annual Averages of Number of Claims from StarLife and MetLife (columns: Year, StarLife, MetLife).

Figure 4.2: Quantile-Quantile plot of Number of Claims on Funeral Policy from StarLife and MetLife.

Comparison of the Distributions

The preliminary analysis of the number of claims suggests that the distributions of both portfolios were not normal. To confirm this, the Shapiro-Wilk test rejected the null hypothesis of normality, with p-values of .28 x 0-8 for StarLife and .89 x 0-4 for MetLife. To answer the question of whether the number of claims for both insurance companies came from the same distribution, the QQ plot displayed in Figure 4.2 was constructed; its fairly linear trend suggests, as a preliminary finding, that the number of claims for both companies belongs to the same distribution. Moreover, the two-sided Kolmogorov-Smirnov test, with a p-value of 2.3 x 0 -, does not reject the null hypothesis that the probability distributions of the number of claims are the same.

Fitting Probability Distribution to Claims

The density estimates displayed in Figure 4.3 for both portfolios are not markedly different. Both have a similar distribution, but the StarLife curve is steeper and more slender than that of MetLife.

Figure 4.3: Density Estimate of Number of Claims on Funeral Policy from StarLife and MetLife.

Figure 4.4: Negative Binomial and Poisson Distribution Fit to the Number of Claims on Funeral Policy from StarLife and MetLife.
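The two-sample Kolmogorov-Smirnov statistic used in the comparison above can be computed directly from the two empirical distribution functions. The sketch below is a minimal illustration on made-up counts (`sample_a` and `sample_b` are hypothetical, not the StarLife or MetLife data) and returns only the statistic $D$; obtaining a p-value would need a statistics library and is not shown.

```python
from bisect import bisect_right


def ecdf(sample):
    """Return the empirical distribution function of a sample as a callable."""
    ordered = sorted(sample)
    n = len(ordered)
    return lambda x: bisect_right(ordered, x) / n


def ks_two_sample_statistic(a, b):
    """D = sup_x |F_a(x) - F_b(x)| for two samples."""
    f_a, f_b = ecdf(a), ecdf(b)
    # Both ECDFs are step functions that change only at observed values,
    # so the supremum is attained at one of the pooled observations.
    return max(abs(f_a(x) - f_b(x)) for x in set(a) | set(b))


# Hypothetical monthly claim counts, for illustration only.
sample_a = [0, 1, 1, 2, 3, 5, 8]
sample_b = [0, 2, 2, 4, 6, 9, 12]

d_stat = ks_two_sample_statistic(sample_a, sample_b)
print(f"D = {d_stat:.4f}")
```

A small value of $D$ is consistent with both samples sharing one distribution, which is the reading the paper gives to its test result.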

Furthermore, the sample distribution, though unimodal, has several turning points, which at a glance is not typical of a conventional probability distribution. Ascribing this irregularity to the presence of outliers in the data set, smoothing (fitting) may be warranted to depict a true probability distribution. Distributions that can be fitted to the observed data include the mixed Poisson distributions, starting with the Poisson distribution, as discussed in the literature. Figures 4.4 and 4.5 were produced by fitting the number of claims for StarLife and MetLife respectively with the negative binomial and Poisson distributions. A critical look at the charts (Figures 4.4 and 4.5) reveals that the Poisson distribution does not fit the number of claims well for either StarLife or MetLife. The negative binomial distribution, however, fits the number of claims reasonably well.

The maximum likelihood estimates of the Poisson mean for StarLife and MetLife were $\hat{\lambda}_s$ = 3.05 and $\hat{\lambda}_m$ = 6.65 respectively. The 95% confidence intervals for $\hat{\lambda}_s$ and $\hat{\lambda}_m$ were (2.672; ) and (5.6489; 7.752), and the log-likelihoods were and . For the negative binomial distribution, the means were estimated to be and , and the dispersion parameters to be and .4323, for the StarLife and MetLife number of claims. The variances of the random effect for StarLife and MetLife were estimated to be $V_s(\Theta)$ = and $V_m(\Theta)$ = respectively. The 95% confidence intervals were (.2633; ) and (0.892; .9734) for the dispersion parameters, and (0.360; 0.796) and (0.5067; .22) for the variance $V(\Theta)$. The log-likelihoods of the negative binomial distribution were and for StarLife and MetLife respectively, far better than those of the Poisson distribution for both companies.
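The log-likelihood comparison reported above can be illustrated with a short sketch. It fits the Poisson model by maximum likelihood (the sample mean) and, as a simplifying assumption, fits the negative binomial by the method of moments rather than by maximum likelihood as in the paper. The `counts` are synthetic overdispersed values, not the companies' data.

```python
import math


def poisson_loglik(counts, lam):
    """Poisson log-likelihood of the counts at rate lam."""
    return sum(k * math.log(lam) - lam - math.lgamma(k + 1) for k in counts)


def negbin_loglik(counts, alpha, theta):
    """Negbin(alpha, theta) log-likelihood in the paper's parameterization."""
    return sum(
        math.lgamma(alpha + k) - math.lgamma(alpha) - math.lgamma(k + 1)
        - alpha * math.log1p(theta)
        + k * (math.log(theta) - math.log1p(theta))
        for k in counts
    )


counts = [0, 0, 1, 2, 3, 5, 8, 13, 21, 30]  # synthetic, strongly overdispersed
n = len(counts)
mean = sum(counts) / n
var = sum((k - mean) ** 2 for k in counts) / (n - 1)

lam_hat = mean                # Poisson maximum likelihood estimate
theta_mom = var / mean - 1.0  # moment estimate from Var = mean * (1 + theta)
alpha_mom = mean / theta_mom  # moment estimate from mean = alpha * theta

ll_pois = poisson_loglik(counts, lam_hat)
ll_nb = negbin_loglik(counts, alpha_mom, theta_mom)
print(f"Poisson loglik = {ll_pois:.2f}, negative binomial loglik = {ll_nb:.2f}")
```

The moment estimates exist only when the sample variance exceeds the sample mean; for equidispersed or underdispersed data `theta_mom` would be zero or negative and this shortcut would not apply.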
Finally, the confidence interval estimates of the monthly expected number of claims from the negative binomial model were approximately (9; 9) and (2; 23) claims for StarLife and MetLife (see Table 4.3 for details). Figure 4.6 provides summary statistics of the goodness of fit of the negative binomial distribution for both StarLife and MetLife. The scaled deviances of and for StarLife and MetLife, compared with the asymptotic chi-square distribution with 59 degrees of freedom, yielded a p-value of about 0.5, which implies that we cannot reject the null hypothesis that the specified negative binomial model is the correct model. Again, the dispersion parameter estimates were and .4323 for StarLife and MetLife respectively.

Figure 4.5: PP Plot of the Distribution of Number of Claims, Negative Binomial and Poisson Distribution.
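The deviance behind this goodness-of-fit assessment can be sketched as follows. The functions assume unit weights ($w_i = 1$) and take `k` to be the negative binomial dispersion in the $1/k$ form of the deviance expression from the methodology; the observations and the fitted mean are hypothetical. Converting the deviance into a chi-square p-value would need a statistics library and is left out.

```python
import math


def xlogy(x, y):
    """x * log(y), using the convention 0 * log(0) = 0."""
    return 0.0 if x == 0 else x * math.log(y)


def poisson_deviance(ys, mus):
    """2 * sum[y log(y / mu) - (y - mu)] with unit weights."""
    return 2.0 * sum(xlogy(y, y / mu) - (y - mu) for y, mu in zip(ys, mus))


def negbin_deviance(ys, mus, k):
    """2 * sum[y log(y / mu) - (y + 1/k) log((y + 1/k) / (mu + 1/k))], unit weights."""
    inv_k = 1.0 / k
    return 2.0 * sum(
        xlogy(y, y / mu) - (y + inv_k) * math.log((y + inv_k) / (mu + inv_k))
        for y, mu in zip(ys, mus)
    )


ys = [0, 2, 5, 9]          # hypothetical observed counts
mus = [4.0] * len(ys)      # hypothetical fitted mean (the sample mean)

dev_pois = poisson_deviance(ys, mus)
dev_nb = negbin_deviance(ys, mus, k=0.5)
print(f"Poisson deviance = {dev_pois:.3f}, negative binomial deviance = {dev_nb:.3f}")
```

As `k` approaches zero the negative binomial deviance approaches the Poisson deviance, which mirrors the Poisson model being the limiting case with no extra dispersion.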

Table 4.2: Interval Estimates of the Number of Claims from StarLife and MetLife

Company    Poisson Estimate    N. Binomial Estimate
StarLife   2.672;              ;
MetLife    ;                   ;

The deduction is that the number of claims from StarLife and MetLife is not purely independent, as required by the classical Poisson process. The number of claims is duration and occurrence dependent because the composition of the portfolios is constantly changing (either increasing or decreasing) as a result of continuous sales of the policy. In effect, the assumption of independence of the random process is violated, hence the inadequacy of the Poisson model. The negative binomial distribution, however, produced considerably good fits, in the sense that it is the limiting form of the distributions that arise under occurrence and duration dependence, where it is known as the Pólya-Eggenberger distribution.

Figure 4.6: Criteria for assessing goodness of fit of the Negative Binomial model of the Number of claims.

Moreover, the claim number process is not homogeneous, as required by the Poisson distribution. The occurrence of death, which drives the number of claims, varies with every policy holder as a result of unobservable social, moral, economic and health factors. The Poisson assumption of homogeneity is therefore also violated, and the negative binomial distribution, which arises as the limiting distribution in this setting, is adequate.

Bootstrap Estimates

Table 4.3: Summary statistics of Bootstrap replicates of Number of Claims from StarLife and MetLife (columns: Company, $\hat{\theta}$, $\hat{\theta}^{*}$, Bias, $\hat{\theta}_c$, $SE_B(\hat{\theta})$; rows: StarLife, MetLife).

Table 4.3 shows the summary statistics of the bootstrap estimates from 00 resamples drawn from the estimated probability distribution of the number of claims from StarLife and MetLife. A comparison between the expected monthly number of claims $\hat{\theta}$ obtained from the probability models and the bias-corrected bootstrap estimate $\hat{\theta}_c$ showed no significant variation, as both were set at 3 and 7 claims for StarLife and MetLife respectively.

Conclusion

The negative binomial distribution appears to be superior to the Poisson distribution for fitting insurance claims and therefore provides reasonably reliable estimates for planning, decision making and estimation in insurance administration. The bootstrap estimates did not vary from the estimates obtained from the probability models.

This research focuses only on choosing between the Poisson and the negative binomial distribution for fitting insurance claims and estimating the monthly expected number of claims. Bootstrap estimates should be obtained and compared with the estimates from the probability models to authenticate them. Further work should be conducted using other models, including mixed Poisson probability models.
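The parametric bootstrap procedure outlined in the methodology, whose results are summarised in Table 4.3, can be sketched as follows. This is a minimal illustration under stated assumptions: a Poisson model stands in for "the estimated probability distribution" (the paper prefers the negative binomial, to which the same steps apply), the statistic is the sample mean, the standard error uses the $B - 1$ divisor, and `claims`, `n_boot` and the seed are hypothetical choices.

```python
import math
import random


def draw_poisson(lam, rng):
    """One Poisson(lam) draw by Knuth's multiplication method (suitable for small lam)."""
    threshold = math.exp(-lam)
    k, product = 0, rng.random()
    while product > threshold:
        k += 1
        product *= rng.random()
    return k


def parametric_bootstrap(sample, n_boot, rng):
    n = len(sample)
    theta_hat = sum(sample) / n                    # step 1: fit the model, estimate theta
    replicates = []
    for _ in range(n_boot):                        # step 4: repeat B times
        resample = [draw_poisson(theta_hat, rng) for _ in range(n)]  # step 2
        replicates.append(sum(resample) / n)       # step 3: same statistic on the resample
    theta_star = sum(replicates) / n_boot          # step 5: bootstrap mean
    se = math.sqrt(sum((r - theta_star) ** 2 for r in replicates) / (n_boot - 1))
    bias = theta_star - theta_hat
    theta_c = theta_hat - bias                     # bias-corrected: 2*theta_hat - theta_star
    return {"theta_hat": theta_hat, "theta_star": theta_star,
            "se": se, "bias": bias, "theta_c": theta_c}


claims = [2, 5, 3, 4, 6, 1, 4, 7, 3, 5, 4, 4]      # hypothetical monthly claim counts
result = parametric_bootstrap(claims, n_boot=500, rng=random.Random(2024))
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

When the bias is small relative to the standard error, the bias-corrected estimate stays close to the model estimate, which is the pattern the paper reports for both companies.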


More information

THE KRUSKAL WALLLIS TEST

THE KRUSKAL WALLLIS TEST THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON

More information

Sales forecasting # 2

Sales forecasting # 2 Sales forecasting # 2 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators... MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

How To Test For Significance On A Data Set

How To Test For Significance On A Data Set Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.

More information

Logistic Regression (a type of Generalized Linear Model)

Logistic Regression (a type of Generalized Linear Model) Logistic Regression (a type of Generalized Linear Model) 1/36 Today Review of GLMs Logistic Regression 2/36 How do we find patterns in data? We begin with a model of how the world works We use our knowledge

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

Elements of statistics (MATH0487-1)

Elements of statistics (MATH0487-1) Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1. **BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,

More information

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015. Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x

More information

Lecture 8: Gamma regression

Lecture 8: Gamma regression Lecture 8: Gamma regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Models with constant coefficient of variation Gamma regression: estimation and testing

More information

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples

More information

Is the Basis of the Stock Index Futures Markets Nonlinear?

Is the Basis of the Stock Index Futures Markets Nonlinear? University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2011 Is the Basis of the Stock

More information

Introduction to Predictive Modeling Using GLMs

Introduction to Predictive Modeling Using GLMs Introduction to Predictive Modeling Using GLMs Dan Tevet, FCAS, MAAA, Liberty Mutual Insurance Group Anand Khare, FCAS, MAAA, CPCU, Milliman 1 Antitrust Notice The Casualty Actuarial Society is committed

More information

UNIVERSITY OF OSLO. The Poisson model is a common model for claim frequency.

UNIVERSITY OF OSLO. The Poisson model is a common model for claim frequency. UNIVERSITY OF OSLO Faculty of mathematics and natural sciences Candidate no Exam in: STK 4540 Non-Life Insurance Mathematics Day of examination: December, 9th, 2015 Examination hours: 09:00 13:00 This

More information

Hierarchical Insurance Claims Modeling

Hierarchical Insurance Claims Modeling Hierarchical Insurance Claims Modeling Edward W. (Jed) Frees, University of Wisconsin - Madison Emiliano A. Valdez, University of Connecticut 2009 Joint Statistical Meetings Session 587 - Thu 8/6/09-10:30

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric

More information

HYPOTHESIS TESTING: POWER OF THE TEST

HYPOTHESIS TESTING: POWER OF THE TEST HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

START Selected Topics in Assurance

START Selected Topics in Assurance START Selected Topics in Assurance Related Technologies Table of Contents Introduction Some Statistical Background Fitting a Normal Using the Anderson Darling GoF Test Fitting a Weibull Using the Anderson

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity

More information

The Best of Both Worlds:

The Best of Both Worlds: The Best of Both Worlds: A Hybrid Approach to Calculating Value at Risk Jacob Boudoukh 1, Matthew Richardson and Robert F. Whitelaw Stern School of Business, NYU The hybrid approach combines the two most

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

1 Another method of estimation: least squares

1 Another method of estimation: least squares 1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Managing uncertainty in call centers using Poisson mixtures

Managing uncertainty in call centers using Poisson mixtures Managing uncertainty in call centers using Poisson mixtures Geurt Jongbloed and Ger Koole Vrije Universiteit, Division of Mathematics and Computer Science De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands

More information

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?

More information

Description. Textbook. Grading. Objective

Description. Textbook. Grading. Objective EC151.02 Statistics for Business and Economics (MWF 8:00-8:50) Instructor: Chiu Yu Ko Office: 462D, 21 Campenalla Way Phone: 2-6093 Email: kocb@bc.edu Office Hours: by appointment Description This course

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

Module 2 Probability and Statistics

Module 2 Probability and Statistics Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution

More information

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Oriana Zacaj Department of Mathematics, Polytechnic University, Faculty of Mathematics and Physics Engineering

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

The Study of Chinese P&C Insurance Risk for the Purpose of. Solvency Capital Requirement

The Study of Chinese P&C Insurance Risk for the Purpose of. Solvency Capital Requirement The Study of Chinese P&C Insurance Risk for the Purpose of Solvency Capital Requirement Xie Zhigang, Wang Shangwen, Zhou Jinhan School of Finance, Shanghai University of Finance & Economics 777 Guoding

More information

P(every one of the seven intervals covers the true mean yield at its location) = 3.

P(every one of the seven intervals covers the true mean yield at its location) = 3. 1 Let = number of locations at which the computed confidence interval for that location hits the true value of the mean yield at its location has a binomial(7,095) (a) P(every one of the seven intervals

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial

More information

Testing Market Efficiency in a Fixed Odds Betting Market

Testing Market Efficiency in a Fixed Odds Betting Market WORKING PAPER SERIES WORKING PAPER NO 2, 2007 ESI Testing Market Efficiency in a Fixed Odds Betting Market Robin Jakobsson Department of Statistics Örebro University robin.akobsson@esi.oru.se By Niklas

More information

Estimation and attribution of changes in extreme weather and climate events

Estimation and attribution of changes in extreme weather and climate events IPCC workshop on extreme weather and climate events, 11-13 June 2002, Beijing. Estimation and attribution of changes in extreme weather and climate events Dr. David B. Stephenson Department of Meteorology

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

Exam P - Total 23/23 - 1 -

Exam P - Total 23/23 - 1 - Exam P Learning Objectives Schools will meet 80% of the learning objectives on this examination if they can show they meet 18.4 of 23 learning objectives outlined in this table. Schools may NOT count a

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Use of deviance statistics for comparing models

Use of deviance statistics for comparing models A likelihood-ratio test can be used under full ML. The use of such a test is a quite general principle for statistical testing. In hierarchical linear models, the deviance test is mostly used for multiparameter

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring Non-life insurance mathematics Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring Overview Important issues Models treated Curriculum Duration (in lectures) What is driving the result of a

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Practice problems for Homework 11 - Point Estimation

Practice problems for Homework 11 - Point Estimation Practice problems for Homework 11 - Point Estimation 1. (10 marks) Suppose we want to select a random sample of size 5 from the current CS 3341 students. Which of the following strategies is the best:

More information

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090

Advanced Statistical Analysis of Mortality. Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc. 160 University Avenue. Westwood, MA 02090 Advanced Statistical Analysis of Mortality Rhodes, Thomas E. and Freitas, Stephen A. MIB, Inc 160 University Avenue Westwood, MA 02090 001-(781)-751-6356 fax 001-(781)-329-3379 trhodes@mib.com Abstract

More information

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions A Significance Test for Time Series Analysis Author(s): W. Allen Wallis and Geoffrey H. Moore Reviewed work(s): Source: Journal of the American Statistical Association, Vol. 36, No. 215 (Sep., 1941), pp.

More information

171:290 Model Selection Lecture II: The Akaike Information Criterion

171:290 Model Selection Lecture II: The Akaike Information Criterion 171:290 Model Selection Lecture II: The Akaike Information Criterion Department of Biostatistics Department of Statistics and Actuarial Science August 28, 2012 Introduction AIC, the Akaike Information

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 3 Problems with small populations 8 II. Why Random Sampling is Important 9 A myth,

More information

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College. The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss

More information