Tilted Normal Distribution and Its Survival Properties

Size: px
Start display at page:

Download "Tilted Normal Distribution and Its Survival Properties"

Transcription

1 Journal of Data Science 10(2012), Tilted Normal Distribution and Its Survival Properties Sudhansu S. Maiti 1 and Mithu Dey 2 1 Visva-Bharati University and 2 Asansol Engineering College Abstract: To analyze skewed data, skew normal distribution is proposed by Azzalini (1985). For practical problems of estimating the skewness parameter of this distribution, Gupta and Gupta (2008) suggested power normal distribution as an alternative. We search for another alternative, named tilted normal distribution following the approach of Marshall and Olkin (1997) to add a positive parameter to a general survival function and taking survival function is of normal form. We have found out different properties of this distribution. Maximum likelihood estimate of parameters of this distribution have been found out. Comparison of tilted normal distribution with skew normal and power normal distribution have been made. Key words: Akaike information criterion, compound distribution, failure rate, Kolmogorov discrepancy measure, maximum likelihood. 1. Introduction The skew normal distribution, proposed by Azzalani (1985), can be a suitable model for the analysis of data exhibiting a unimodal density having some skewness present, a structure often occurring in data analysis. The proposed distribution is a generalization of the standard normal distribution and the probability density function (pdf) is given by φ(z; λ) = 2φ(z)Φ(λz), < z <, (1.1) where φ(x) and Φ(x) denote the N(0, 1) density and distribution function respectively. The parameter λ regulates the skewness and λ = 0 corresponds to the standard normal case. The density given by (1.1) enjoys a number of formal properties which resemble those of the normal distribution, for example, if X has the pdf, given by (1.1), then X 2 has a chi-square distribution with one degree of freedom. That is, all even moments of X are exactly the same as the Corresponding author.

2 226 Sudhansu S. Maiti and Mithu Dey corresponding even moments of the standard normal distribution. For more information, see Azzalini and Capitanio (1999) and Genton (2004). A motivation of the skew normal distribution has been elegantly exhibited by Arnold et al. (1993). This model can naturally arise in applications as hidden function and/or selective reporting model, see Arnold and Beaver (2002). Gupta and Gupta (2008) observed that the estimation of the skewness parameter of model (1.1) is problematic when the sample size is not large enough. Monti (2003) noted that the estimate is λ = ±, even when the data are generated by a model with finite λ. In the case of a skew normal model the moment estimator of λ is the solution of 2 λ π = X. (1.2) 1 + λ 2 Therefore, the solution exists if and only if X < 2/π. The exact distribution of X is not known and it is not easy to establish. Gupta and Gupta (2008) estimated the value of P( X > 2/π) based on simulation data for different values of n and λ. They have shown that as the sample size increases, the probability of the feasibility of the moment estimator increases. On the other hand, as the skewness parameter increases, the chances of obtaining the moment estimator decreases. To find out the maximum likelihood estimator(mle) of λ based on a random sample X 1, X 2,, X n, the likelihood equation is n i=0 X i φ(λx i ) Φ(λX i ) = 0. (1.3) It is clear that if all X i have the same sign then there is no solution of (1.3), see also Liseo (1990) and Azzalini and Capitanio (1999). The probability that X > 0, for λ > 0, is an increasing function of λ. When the absolute value of the skewness parameter is large, the probability that all the observations have the same sign is large. Therefore, for small to moderate sample sizes, the MLE may not be accurate enough for a practical use. The Centered Parametrization (CP) can solve these difficulties to some extent. Azzalini (1985) reparametrized the problem by writing X = µ + Z 0, where Z 0 = (Z µ z )/ z, z = (1 µ 2 z) 1 2, µ z = 2/π (λ/ 1 + λ 2 ) and considering the centred parameters CP= (µ,, γ 1 ) instead of the direct parameters DP= (µ,, λ). Here γ 1 is the usual univariate index of skewness, taken with same sign as that of λ [see for details, Azzalini and Capitanio (1999)]. The set of S-PLUS routines developed for the computation are freely available at the following World Wide Web address:

3 Tilted Normal Distribution and Its Survival Properties 227 For the above reason, Gupta and Gupta (2008) considered another skewed model for which normal distribution is a special case. They proposed the power normal model whose distribution function and density function are given by and F 2 (x; α) = [Φ(x)] α, < x <, α > 0, (1.4) f 2 (x; α) = α[φ(x)] α 1 φ(x), < x <, α > 0. (1.5) This is basically an exponentiated family of distributions with baseline distribution as normal. This is nothing but a system of adding one or more parameters to the existing normal distribution. Such an addition of parameters makes the resulting distribution richer and more flexible for modeling data. The power normal distribution has several nice properties. For α = 1 it reduces to the standard normal distribution. This distribution is a unimodal density which is skewed to the right if α > 1 and to the left if 0 < α < 1. Gupta and Gupta (2008) considered α as skewness parameter of the distribution. But for 0 < α < 1, skewness to the left is not so clear as is to be seen from the Figure 1 and modeling of left skewed data set will be misfit. This motivates us to think for another skewed distribution for which normal distribution is a specified case and at the same time it will be fit for modeling both kind of skewed data. We take Marshall and Olkin (1997) extended model to add a positive parameter to a general survival function. f 2 (x;α) α=1 α=0.5 α=0.1 α= x Figure 1: Plot of power normal distribution for different values of α

4 228 Sudhansu S. Maiti and Mithu Dey Let X be arandom variable with survival function and the probability density function as F 3 (x; β) = βf 0 (x), < x <, β > 0, (1.6) 1 (1 β)f 0 (x) and f 3 (x; β) = βf 0 (x), < x <, β > 0, (1.7) [1 (1 β)f 0 (x)] 2 respectively. Here F 0 (x) is the baseline distribution function and f 0 (x) is the corresponding probability density function. Taking F 0 (x) = Φ(x) and f 0 (x) = φ(x), we get the tilted normal (standard) distribution as f 3 (x; β) = βφ(x), < x <, β > 0. (1.8) [1 (1 β){1 Φ(x)}] 2 Here β = 1 indicates a standard normal density function. The tilted normal density is a unimodal density which is skewed to the left if β > 1 and to the right if 0 < β < 1. Figure 2 displays a few pdf graphs for various values of β. f 3 (x;β) β=1 β=0.5 β=0.01 β= x Figure 2: Plot of tilted normal distribution for different values of β

5 f 3 (x; β, µ, ) = Tilted Normal Distribution and Its Survival Properties 229 If the normal density has mean µ and variance 2, then the form of the tilted normal distribution will be β φ ( x µ ) [1 (1 β){1 Φ ( x µ ), < x, µ <, β, > 0. (1.9) }] 2 We will denote this tilted normal distribution with parameters µ, and β as TN(µ,, β). The paper is organized as follows. In Section 2, we have shown that the TN distribution can be obtained as a compound distribution with mixing exponential distribution. The basic structural properties of the proposed model including the reliability properties are presented in Section 3. In Section 4, we discuss the estimation of parameters of the TN distribution. Section 5 is devoted for studying closeness of skew normal, power normal and tilted normal distributions. A data set has been analyzed using the TN model in Section 6. Section 7 concludes. 2. Compounding Let Ḡ(x θ), < x, θ <, be a survival function (SF) of a random variable X given θ. Let θ be a random variable with pdf m(θ). Then a distribution with survival function Ḡ(x) = Ḡ(x θ)m(θ)dθ, < x <, is called a compound distribution with mixing density m(θ). Compound distributions provide a tool to obtain new parametric families of distributions from existing ones. They represent heterogeneous models where populations individuals have different risks. The following theorem show that the TN distribution can be obtained as a compound distribution. Theorem 2.1 Suppose that the conditional SF of a continuous random variable X given Θ = θ is given by [ Φ ( x µ ) ] Ḡ(x θ) = exp θ 1 Φ ( x µ ), < x, µ <, θ, > 0. Let θ have an exponential distribution with pdf m(θ) = βe βθ, θ, β > 0. Then the compound distribution of X is the TN(µ,, β) distribution.

6 230 Sudhansu S. Maiti and Mithu Dey Proof: For < x <, < µ <, β > 0, the SF of X is given by Ḡ(x) = = β = Ḡ(x θ)m(θ)dθ 0 Φ( x µ e θ ) 1 Φ( x µ 0 β [ 1 Φ ( x µ ) e βθ dθ )] 1 (1 β) [ 1 Φ ( x µ which is the SF of the TN(µ,, β) distribution. For fixed θ > 0, the distribution with survival function Ḡ(x θ) gives a class of non-standard distributions. As a result of Theorem 2.1 above, compounding any of the distributions belonging this class with an exponential mixing distribution for θ will lead to a certain form of TN distribution. This theorem also provides another way to interpret the TN distribution in addition to the geometric-extreme stability property given by Marshall and Olkin (1997). 3. Basic Structural and Survival Properties The expressions for the mean, variance and skewness of the TN(µ,, β) model seem to be not available in compact form. For the proposed model, we have, β φ ( x µ ) E (X r ) = x r )], [1 (1 β){1 Φ ( x µ ) }] 2 dx. Using this expression, with the help of computer, we can find measures of central tendency, dispersion and skewness index of the TN(µ,, β). We have presented measures of central tendency (mean, median, mode), and variance and skewness index, β 1 = µ 3 /µ (where µ 2 and µ 3 are second and third order central moments) for the TN(0, 1, β) in Figure 3 and Figure 4 respectively. The median M of the TN(0, 1, β) distribution is given by ( ) β M = Φ 1, 1 + β which is the β/(1 + β)th quantile of standard normal distribution. The mode is a solution of the equation 2(1 β)φ(x) x = 1 (1 β)[1 Φ(x)]. From Figure 3 and Figure 4, it is found that, as expected, median is less than mean for β < 1. Variance is increasing for β < 1 and is decreasing for β > 1, and skewness index is lying between -1 and 1 for the TN(0, 1, β) variate.

7 Tilted Normal Distribution and Its Survival Properties 231 value mean median mode β Figure 3: Plot of mean, median and mode of tilted normal distribution for different values of β value variance skewness β Figure 4: Plot of variance and skewness index of tilted normal distribution for different values of β

8 232 Sudhansu S. Maiti and Mithu Dey The closed form expressions for basic structural properties are not available. Therefore, we now concentrate to get survival properties of the TN distribution. The failure rate function of X is given by h T N (x) = with the integrated failure rate function = H T N (x) = ln 1 1 (1 β)[1 Φ(x)] φ(x) 1 Φ(x) 1 1 (1 β)[1 Φ(x)] h N(x), β [1 Φ(x)] 1 (1 β) [1 Φ(x)]. The reversed failure rate function of X is given by r T N (x) = = β 1 (1 β)[1 Φ(x)] φ(x) Φ(x) β 1 (1 β)[1 Φ(x)] r N(x). Here h N (x) and r N (x) are failure rate and reversed failure rate function of the N(0, 1) distribution respectively. Figure 5 and Figure 6 show failure rates and reversed failure rates of the tilted normal distribution for different values of β respectively. h TN (x) β=1 β=2 β=3 β= x Figure 5: Plot of failure rate of tilted normal distribution for different values of β

9 Tilted Normal Distribution and Its Survival Properties 233 r TN (x) β=1 β=2 β=3 β= x Figure 6: Plot of reversed failure rate of tilted normal distribution for different values of β Theorem 3.1 If X T N(0, 1, β 1 ), Y T N(0, 1, β 2 ), and β 1 > β 2, then Y < LR X Y < HAZ X Y < ST X and Y < HAZ X Y < MRL X. Here Y < W X means Y is smaller than X in W ordering. Proof: The logarithm of the likelihood ratio v(x) = ln f 3(x, β 1 ) f 3 (x, β 2 ) = ln β 1 β [ln{1 (1 β 2 )(1 Φ(x))} ln{1 (1 β 1 )(1 Φ(x))}] is an increasing function of x if β 1 > β 2, since v (x) = 2φ(x)(β 1 β 2 ) [1 (1 β 1 )(1 Φ(x))][1 (1 β 2 )(1 Φ(x))] > 0, for all x. Therefore, the tilted normal distribution has the likelihood ratio ordering, which implies it has the failure rate ordering as well as the stochastic ordering and the mean residual life ordering, see Gupta and Kirmani (1987). 4. Maximum Likelihood Estimation Let X i, i = 1, 2,, n, be the ith observation and let Y i be the corresponding censoring point. The X i s and Y i s are assumed independent. The X i s are assumed to have the TN(µ,, β) distribution and the Y i s are assumed to have a

10 234 Sudhansu S. Maiti and Mithu Dey non-informative distribution i.e. a distribution that does not involve the parameters µ, and β. One observes only the pair (T i, δ i ), where T i = min(x i, Y i ) and δ i = I(X i Y i ) is the censoring indicator. Given the data {(t i, δ i ), i = 1, 2,, n}, the likelihood function is given by L n (µ,, β) = = n {f 3 (t i ; µ,, β)} δ i {1 F 3 (t i ; µ,, β)} 1 δ i n [ 1 φ ( t i µ ) 1 Φ ( t i µ ) [ β { 1 Φ ( t i µ 1 (1 β) { 1 Φ ( t i µ 1 1 (1 β) { 1 Φ ( t i µ )} )} ] Maximizing the likelihood function is equivalent to maximizing its logarithm. Hence the log-likelihood l(µ,, β) = ln L n (µ,, β) is given by n n ( ) ti µ l(µ,, β) = ln δ i + n ln β + δ i ln φ n { ( )} ti µ + (1 δ i ) ln 1 Φ n [ { ( )}] ti µ (1 + δ i ) ln 1 (1 β) 1 Φ. Maximum likelihood estimates (MLEs) of the parameters β, µ, are obtained by solving the non-linear equations l/ β = 0, l/ µ = 0, l/ = 0. Hence, the iterative solutions are n β = [ { )} ], µ = and [ 1 n δ i n δ i t i + n )} ( ti µ (1+δ i ) 1 Φ { ( )} ti µ 1 (1 β) 1 Φ n (1 δ i )φ ( t i µ ) 1 Φ ( t i µ ) +. ] δi n (1 + δ i )(1 β)φ ( t i µ ) ] 1 (1 β) { 1 Φ ( t i µ )}, [ n 2 1 n = n δ δ i (t i µ) 2 (1 δ i )(t i µ)φ ( t i µ ) + i 1 Φ ( t i µ ) n (1 + δ i )(t i µ)(1 β)φ ( t i µ )] + 1 (1 β) { 1 Φ ( t i µ )},

11 Tilted Normal Distribution and Its Survival Properties 235 respectively. For the complete sample, δ i = 1, for all i, and for the type-i censored sample, Y i = t 0, for all i. The large-sample variance-covariance matrix of the MLEs is obtained by inverting the matrix of minus second order derivatives of the log-likelihood evaluated at the MLEs. The likelihood ratio test will be used to test the null hypothesis H 0 : β = 1 (Normal distribution). When H 0 is true, the deviance test statistic d n = 2[l(ˆµ, ˆ, 1) l(ˆµ, ˆ, ˆβ)], where ˆµ, ˆ are the MLEs of µ, under H 0 : β = 1, has approximately a chi-square distribution with 1 degree of freedom. In addition, for model selection, we use the Akaike Information Criterion (AIC) defined as: AIC = -2 log-likelihood + 2k, where k is the number of parameters in the model and n is the sample size. For more details about the AIC, see Akaike (1969). The model with minimum AIC is the one that better fits the data. 5. Closeness of Skew Normal, Power Normal and Tilted Normal Distribution It should be noted that the cumulative distribution function of the skew normal, power normal and tilted normal distributions are and F 1 (x, λ) = 2 x F 2 (x, α) = Φ α (x), α > 0, F 3 (x, β) = Φ(λt)φ(t)dt, < λ <, Φ(x) 1 (1 β){1 Φ(x)}, β > 0, respectively. F 1 (x, λ), F 2 (x, α) and F 3 (x, β) are exactly equal to normal distribution when λ = 0, α = 1 and β = 1. F 1 (x, λ) and F 2 (x, α) are also exactly equal when λ = 1 and α = 2. Since all the models can be used to model positively or negatively skewed data, it might be possible to approximate F 1 (x, λ) by F 2 (x, α) or F 3 (x, β). To investigate this, we apply the following procedure: 1. Set a value of λ. { 2. Find a and b such that a = max x : 2 } x Φ(λt)φ(t)dt and { b = min x : 2 x }. Φ(λt)φ(t)dt 0.999

12 236 Sudhansu S. Maiti and Mithu Dey 3. Divide the range [a, b] by 100 parts, say, a = x 0 x 1 x 100 = b, and set y i = F 1 (x i, λ); i = 0, 1, 2,, Find α by minimizing S 2 1 = 100 [y i F 2 (x i, α)] 2, to fit the model y F 2 (x, α). 5. Find the Kolmogorov discrepancy measure D 12 = max xi F 2 (x i, ˆα) F 1 (x i, λ). 6. Find β by minimizing S 2 2 = 100 [y i F 3 (x i, β)] 2, to fit the model y F 3 (x, β). 7. Find the Kolmogorov discrepancy measure D 13 = max xi F 3 (x i, ˆβ) F 1 (x i, λ). In Table 1, we represent the values of ˆα and D 12 corresponding to various values of λ, and the values of β and D 13 corresponding to various values of α (assuming ˆα as α in this case). Table 1: Kolmogorov discrepancy measure for various values of λ and corresponding values of α and β λ ˆα D 12 ˆβ D From the table it is clear that the discrepancy among the skew normal, power normal and tilted normal is minimal when λ = 0, α = 1 and β = 1 and the three models coincide with the standard normal distribution and there is, obviously, no discrepancy among the three. The skew normal and power normal distributions are also identical when λ = 1 and α = 2. The power normal distribution provides a better approximation to the skew normal when λ is positive as compared to the negative values of λ and the Kolmogorov discrepancy measure does not exhibit symmetric behaviour for values of λ moving away from zero in both directions. On the other hand, the tilted normal distribution approximates with equal efficiency for values of λ moving away from zero in both directions as the

13 Tilted Normal Distribution and Its Survival Properties 237 Kolmogorov discrepancy measure exhibits symmetric behaviour. It provides a better approximation than the power normal to the skew normal when λ is negative. The relative Kolmogorov discrepancy of tilted normal and power normal is less for the positive values of λ as compared to that for the negative values of λ. 6. Data Analysis In this section, we consider a data set to fit with the tilted normal distribution. The data related to failure time of bus motor is taken from Davis (1952). We consider initial bus motor failures data of Davis (1952, p. 145). Data contain distance intervals (in thousands of miles) and corresponding observed number of failures and have been shown in Table 2. Histogram shows that the data set is negatively (left-) skewed with coefficient of skewness, b 1 = m 3 / m 3 2 = (where m 2 and m 3 are sample second and third order central moments). Davis fitted this data to the normal distribution with observed χ 2 = 9.93 having p-value We have fitted this data with the normal, the skew normal, the power normal and the tilted normal distributions. The results are summarized in Table 3. Histogram and fitted tilted normal, skew normal, power normal and normal curves to data have been shown in Figure 7. Table 2: Initial bus motor failures Distance Interval, Thousands of Miles Observed Number of Failures Less than up 2 Total 191 Table 3: Summarized results of fitting different distributions to data set of Davis (1952) Distribution ˆµ ˆ Estimate of tilt parameter log-likelihood AIC Normal Power normal ˆα = Skew normal ˆλ = Tilted normal ˆβ =

14 238 Sudhansu S. Maiti and Mithu Dey Tilted normal Skew normal Power normal Normal Figure 7: Plot of histogram and fitted tilted normal, skew normal, power normal and normal curves to data of Davis (1952) Here for tilted normal fitting, the deviance test statistic d n = with p- value These information justify better fitting of the tilted normal distribution to the data. Since the data are related to failure times, we have presented the estimated failure rate functions for the normal, the power normal, the skew normal and the tilted normal model in Figure 8. The estimated failure rate shows an increasing pattern. x r(x) Tilted Normal Skew Normal Power Normal Normal x Figure 8: Plot of estimated failure rate functions of fitted tilted normal, skew normal, power normal and normal curves to data of Davis (1952)

15 7. Concluding Remarks Tilted Normal Distribution and Its Survival Properties 239 In this article, the tilted normal distribution has been studied which could be an alternative model to skew normal distribution for left skewed data. The practical problems in estimating the skewness parameter for small to moderate sample sizes of skew normal distribution has been discussed and to alleviate these problems, an alternative model called the Power normal model has been proposed by Gupta and Gupta (2008). It has been observed that the Power normal model is not a good fit for data skewed to the left. As an alternative this tilted normal distribution has been proposed. The normal distribution is widely and comfortably used by the practitioners as well as the theoreticians for analyzing symmetric data. Since skewed data could be analyzed by the tilted normal distribution that is obtained by adding a parameter, it is to be easy and comprehensive to the users. The structural and survival properties of this distribution have been studied and inference on parameters have also been mentioned. The appropriateness of fitting the tilted normal distribution has been established by analyzing a data set. Acknowledgements The authors would like to thank the referee for a very careful reading of the manuscript and making a number of nice suggestions which improved the earlier version of the manuscript. Authors are also thankful to Dr. M. M. Panja, Department of Mathematics, Visva-Bharati University for computational help. References Akaike, H. (1969). Fitting autoregressive models for prediction. Annals of the Institute of Statistical Mathematics 21, Arnold, B. C. and Beaver, R. J. (2002). Skewed multivariate models related to hidden truncation and/or selective reporting. Test 11, Arnold, B. C., Beaver, R. J., Groeneveld, R. A. and Meeker, W. Q. (1993). The nontruncated marginal of a truncated bivariate normal distribution. Psychometrika 58, Azzalini, A. (1985). A class of distributions which includes the normal ones. Scandinavian Journal of Statistics 12, Azzalini, A. and Capitanio, A. (1999). Statistical applications of the multivariate skew-normal distribution. Journal of the Royal Statistical Society, Series B 61,

16 240 Sudhansu S. Maiti and Mithu Dey Davis, D. J. (1952). An analysis of some failure data. Journal of the American Statistical Association 47, Genton, M. G. (2004). Skew-Elliptical Distributions and Their Applications: A Journey Beyond Normality. Chapman and Hall, New York. Gupta, R. C. (2006). Variance residual life function in reliability studies. Metron - International Journal of Statistics LXIV, Gupta, R. C. and Kirmani, S. N. U. A. (1987). On the order relations between reliability measures of two distributions. Stochastic Models 3, Gupta, R. D. and Gupta, R. C. (2008). Analyzing skewed data by power normal model. Test 17, Liseo, B. (1990). La classe delle densità normali sghembe: aspetti inferenziali da un punto di vista bayesiano (in Italian). Statistica L, Marshall, A. W. and Olkin, I. (1997). A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84, Monti, A. C. (2003). A note on the estimation of the skew normal and the skew exponential power distributions. Metron - International Journal of Statistics LXI, Received July 18, 2011; accepted October 23, Sudhansu S. Maiti Department of Statistics Visva-Bharati University Santiniketan , India dssm1@rediffmail.com Mithu Dey Department of Basic Sciences (Mathematics) Asansol Engineering College Vivekananda Sarani, Kanyapur , India mithu.dey@gmail.com

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

Construction of Skew-Normal Random Variables: Are They Linear. Combinations of Normal and Half-Normal? Mohsen Pourahmadi. pourahm@math.niu.

Construction of Skew-Normal Random Variables: Are They Linear. Combinations of Normal and Half-Normal? Mohsen Pourahmadi. pourahm@math.niu. Construction of Skew-Normal Random Variables: Are They Linear Combinations of Normal and Half-Normal? Mohsen Pourahmadi pourahm@math.niu.edu Division of Statistics Northern Illinois University Abstract

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Inference on the parameters of the Weibull distribution using records

Inference on the parameters of the Weibull distribution using records Statistics & Operations Research Transactions SORT 39 (1) January-June 2015, 3-18 ISSN: 1696-2281 eissn: 2013-8830 www.idescat.cat/sort/ Statistics & Operations Research c Institut d Estadstica de Catalunya

More information

Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

More information

Lecture 8: Gamma regression

Lecture 8: Gamma regression Lecture 8: Gamma regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Models with constant coefficient of variation Gamma regression: estimation and testing

More information

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]

Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012] Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance

More information

Multivariate Normal Distribution

Multivariate Normal Distribution Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne Applied Statistics J. Blanchet and J. Wadsworth Institute of Mathematics, Analysis, and Applications EPF Lausanne An MSc Course for Applied Mathematicians, Fall 2012 Outline 1 Model Comparison 2 Model

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Properties of Future Lifetime Distributions and Estimation

Properties of Future Lifetime Distributions and Estimation Properties of Future Lifetime Distributions and Estimation Harmanpreet Singh Kapoor and Kanchan Jain Abstract Distributional properties of continuous future lifetime of an individual aged x have been studied.

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

E3: PROBABILITY AND STATISTICS lecture notes

E3: PROBABILITY AND STATISTICS lecture notes E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

SUMAN DUVVURU STAT 567 PROJECT REPORT

SUMAN DUVVURU STAT 567 PROJECT REPORT SUMAN DUVVURU STAT 567 PROJECT REPORT SURVIVAL ANALYSIS OF HEROIN ADDICTS Background and introduction: Current illicit drug use among teens is continuing to increase in many countries around the world.

More information

FITTING INSURANCE CLAIMS TO SKEWED DISTRIBUTIONS: ARE

FITTING INSURANCE CLAIMS TO SKEWED DISTRIBUTIONS: ARE FITTING INSURANCE CLAIMS TO SKEWED DISTRIBUTIONS: ARE THE SKEW-NORMAL AND SKEW-STUDENT GOOD MODELS? MARTIN ELING WORKING PAPERS ON RISK MANAGEMENT AND INSURANCE NO. 98 EDITED BY HATO SCHMEISER CHAIR FOR

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

Dongfeng Li. Autumn 2010

Dongfeng Li. Autumn 2010 Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis

More information

Multivariate Logistic Regression

Multivariate Logistic Regression 1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation

More information

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION Chin-Diew Lai, Department of Statistics, Massey University, New Zealand John C W Rayner, School of Mathematics and Applied Statistics,

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions.

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions. Chapter 1 Vocabulary identity - A statement that equates two equivalent expressions. verbal model- A word equation that represents a real-life problem. algebraic expression - An expression with variables.

More information

Some Essential Statistics The Lure of Statistics

Some Essential Statistics The Lure of Statistics Some Essential Statistics The Lure of Statistics Data Mining Techniques, by M.J.A. Berry and G.S Linoff, 2004 Statistics vs. Data Mining..lie, damn lie, and statistics mining data to support preconceived

More information

Interpreting Kullback-Leibler Divergence with the Neyman-Pearson Lemma

Interpreting Kullback-Leibler Divergence with the Neyman-Pearson Lemma Interpreting Kullback-Leibler Divergence with the Neyman-Pearson Lemma Shinto Eguchi a, and John Copas b a Institute of Statistical Mathematics and Graduate University of Advanced Studies, Minami-azabu

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Distribution (Weibull) Fitting

Distribution (Weibull) Fitting Chapter 550 Distribution (Weibull) Fitting Introduction This procedure estimates the parameters of the exponential, extreme value, logistic, log-logistic, lognormal, normal, and Weibull probability distributions

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Time Series Analysis

Time Series Analysis Time Series Analysis hm@imm.dtu.dk Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby 1 Outline of the lecture Identification of univariate time series models, cont.:

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

Penalized regression: Introduction

Penalized regression: Introduction Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood

More information

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania

Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Oriana Zacaj Department of Mathematics, Polytechnic University, Faculty of Mathematics and Physics Engineering

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Use of deviance statistics for comparing models

Use of deviance statistics for comparing models A likelihood-ratio test can be used under full ML. The use of such a test is a quite general principle for statistical testing. In hierarchical linear models, the deviance test is mostly used for multiparameter

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

Microeconomic Theory: Basic Math Concepts

Microeconomic Theory: Basic Math Concepts Microeconomic Theory: Basic Math Concepts Matt Van Essen University of Alabama Van Essen (U of A) Basic Math Concepts 1 / 66 Basic Math Concepts In this lecture we will review some basic mathematical concepts

More information

171:290 Model Selection Lecture II: The Akaike Information Criterion

171:290 Model Selection Lecture II: The Akaike Information Criterion 171:290 Model Selection Lecture II: The Akaike Information Criterion Department of Biostatistics Department of Statistics and Actuarial Science August 28, 2012 Introduction AIC, the Akaike Information

More information

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes

Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Bias in the Estimation of Mean Reversion in Continuous-Time Lévy Processes Yong Bao a, Aman Ullah b, Yun Wang c, and Jun Yu d a Purdue University, IN, USA b University of California, Riverside, CA, USA

More information

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models

Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Modeling the Claim Duration of Income Protection Insurance Policyholders Using Parametric Mixture Models Abstract This paper considers the modeling of claim durations for existing claimants under income

More information

Confidence Intervals for Exponential Reliability

Confidence Intervals for Exponential Reliability Chapter 408 Confidence Intervals for Exponential Reliability Introduction This routine calculates the number of events needed to obtain a specified width of a confidence interval for the reliability (proportion

More information

Sections 2.11 and 5.8

Sections 2.11 and 5.8 Sections 211 and 58 Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1/25 Gesell data Let X be the age in in months a child speaks his/her first word and

More information

Lecture 2: ARMA(p,q) models (part 3)

Lecture 2: ARMA(p,q) models (part 3) Lecture 2: ARMA(p,q) models (part 3) Florian Pelgrin University of Lausanne, École des HEC Department of mathematics (IMEA-Nice) Sept. 2011 - Jan. 2012 Florian Pelgrin (HEC) Univariate time series Sept.

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Nonparametric adaptive age replacement with a one-cycle criterion

Nonparametric adaptive age replacement with a one-cycle criterion Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: Pauline.Schrijner@durham.ac.uk

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives. The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution

More information

Probability Calculator

Probability Calculator Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that

More information

Regression III: Advanced Methods

Regression III: Advanced Methods Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models

More information

Tests for Two Proportions

Tests for Two Proportions Chapter 200 Tests for Two Proportions Introduction This module computes power and sample size for hypothesis tests of the difference, ratio, or odds ratio of two independent proportions. The test statistics

More information

Math 120 Final Exam Practice Problems, Form: A

Math 120 Final Exam Practice Problems, Form: A Math 120 Final Exam Practice Problems, Form: A Name: While every attempt was made to be complete in the types of problems given below, we make no guarantees about the completeness of the problems. Specifically,

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1. **BEGINNING OF EXAMINATION** 1. You are given: (i) The annual number of claims for an insured has probability function: 3 p x q q x x ( ) = ( 1 ) 3 x, x = 0,1,, 3 (ii) The prior density is π ( q) = q,

More information

GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE

GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE ACTA UNIVERSITATIS AGRICULTURAE ET SILVICULTURAE MENDELIANAE BRUNENSIS Volume 62 41 Number 2, 2014 http://dx.doi.org/10.11118/actaun201462020383 GENERALIZED LINEAR MODELS IN VEHICLE INSURANCE Silvie Kafková

More information

Objective. Materials. TI-73 Calculator

Objective. Materials. TI-73 Calculator 0. Objective To explore subtraction of integers using a number line. Activity 2 To develop strategies for subtracting integers. Materials TI-73 Calculator Integer Subtraction What s the Difference? Teacher

More information

Multivariate Analysis of Ecological Data

Multivariate Analysis of Ecological Data Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

CURVE FITTING LEAST SQUARES APPROXIMATION

CURVE FITTING LEAST SQUARES APPROXIMATION CURVE FITTING LEAST SQUARES APPROXIMATION Data analysis and curve fitting: Imagine that we are studying a physical system involving two quantities: x and y Also suppose that we expect a linear relationship

More information

Convex Hull Probability Depth: first results

Convex Hull Probability Depth: first results Conve Hull Probability Depth: first results Giovanni C. Porzio and Giancarlo Ragozini Abstract In this work, we present a new depth function, the conve hull probability depth, that is based on the conve

More information

START Selected Topics in Assurance

START Selected Topics in Assurance START Selected Topics in Assurance Related Technologies Table of Contents Introduction Some Statistical Background Fitting a Normal Using the Anderson Darling GoF Test Fitting a Weibull Using the Anderson

More information

Review of Random Variables

Review of Random Variables Chapter 1 Review of Random Variables Updated: January 16, 2015 This chapter reviews basic probability concepts that are necessary for the modeling and statistical analysis of financial data. 1.1 Random

More information

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University

A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses. Michael R. Powers[ 1 ] Temple University and Tsinghua University A Coefficient of Variation for Skewed and Heavy-Tailed Insurance Losses Michael R. Powers[ ] Temple University and Tsinghua University Thomas Y. Powers Yale University [June 2009] Abstract We propose a

More information

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Chapter 6: Multivariate Cointegration Analysis

Chapter 6: Multivariate Cointegration Analysis Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration

More information

Lecture 3: Linear methods for classification

Lecture 3: Linear methods for classification Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for

More information

GLM, insurance pricing & big data: paying attention to convergence issues.

GLM, insurance pricing & big data: paying attention to convergence issues. GLM, insurance pricing & big data: paying attention to convergence issues. Michaël NOACK - michael.noack@addactis.com Senior consultant & Manager of ADDACTIS Pricing Copyright 2014 ADDACTIS Worldwide.

More information

Principle of Data Reduction

Principle of Data Reduction Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then

More information

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Stat 5102 Notes: Nonparametric Tests and. confidence interval Stat 510 Notes: Nonparametric Tests and Confidence Intervals Charles J. Geyer April 13, 003 This handout gives a brief introduction to nonparametrics, which is what you do when you don t believe the assumptions

More information

Maximum likelihood estimation of mean reverting processes

Maximum likelihood estimation of mean reverting processes Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. jcpollo@onwardinc.com Abstract Mean reverting processes are frequently used models in real options. For

More information

TEACHER NOTES MATH NSPIRED

TEACHER NOTES MATH NSPIRED Math Objectives Students will understand that normal distributions can be used to approximate binomial distributions whenever both np and n(1 p) are sufficiently large. Students will understand that when

More information

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1 Hypothesis testing So far, we ve talked about inference from the point of estimation. We ve tried to answer questions like What is a good estimate for a typical value? or How much variability is there

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

Parametric Survival Models

Parametric Survival Models Parametric Survival Models Germán Rodríguez grodri@princeton.edu Spring, 2001; revised Spring 2005, Summer 2010 We consider briefly the analysis of survival data when one is willing to assume a parametric

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Lecture 6: Poisson regression

Lecture 6: Poisson regression Lecture 6: Poisson regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction EDA for Poisson regression Estimation and testing in Poisson regression

More information

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September

More information

1 Prior Probability and Posterior Probability

1 Prior Probability and Posterior Probability Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which

More information

Software for Distributions in R

Software for Distributions in R David Scott 1 Diethelm Würtz 2 Christine Dong 1 1 Department of Statistics The University of Auckland 2 Institut für Theoretische Physik ETH Zürich July 10, 2009 Outline 1 Introduction 2 Distributions

More information

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples

More information

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

More information

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i. Chapter 3 Kolmogorov-Smirnov Tests There are many situations where experimenters need to know what is the distribution of the population of their interest. For example, if they want to use a parametric

More information

2DI36 Statistics. 2DI36 Part II (Chapter 7 of MR)

2DI36 Statistics. 2DI36 Part II (Chapter 7 of MR) 2DI36 Statistics 2DI36 Part II (Chapter 7 of MR) What Have we Done so Far? Last time we introduced the concept of a dataset and seen how we can represent it in various ways But, how did this dataset came

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information